Reproducible bioinformatics pipelines
Challenge:
Omics analyses often rely on ad hoc scripts, uncontrolled environments and workflows that are difficult to reproduce. This limits traceability, complicates result validation and makes it harder to scale or reuse analyses in future projects.
Solution:
We design and develop reproducible, modular and scalable bioinformatics pipelines for genomic, transcriptomic and other omics data. We build workflows that can run consistently across different environments, including local infrastructure, HPC and cloud, with version control, documentation and automation from the start.
We work on:
End-to-end pipelines for omics analysis
Germline and somatic variant calling
Genome assembly using short and long reads
Development of polishing tools for long-read sequencing
Gene expression analysis
Workflow automation
Containerization of tools and environments
Integration with HPC, cloud and reproducible systems
Impact:
Consistent and reproducible results, with documented code and maintainable workflows that can be audited, scaled and transferred to other teams, reducing errors and execution time in future projects.