| Literature DB >> 31185495 |
Mathieu Bourgey1,2, Rola Dali1,2, Robert Eveleigh1,2, Kuang Chung Chen3,4, Louis Letourneau1,2, Joel Fillon5, Marc Michaud2, Maxime Caron1,2,5, Johanna Sandoval6, Francois Lefebvre1,2, Gary Leveque1,2, Eloi Mercier1,2, David Bujold1,2, Pascale Marquis1,2, Patrick Tran Van7, David Anderson de Lima Morais8, Julien Tremblay9, Xiaojian Shao1,2, Edouard Henrion1,2, Emmanuel Gonzalez1,2, Pierre-Olivier Quirion1,2, Bryan Caron3,4, Guillaume Bourque1,2,5.
Abstract
BACKGROUND: With the decreasing cost of sequencing and the rapid developments in genomics technologies and protocols, the need for validated bioinformatics software that enables efficient large-scale data processing is growing.Entities:
Keywords: bioinformatics; frameworks; genomics; pipeline; workflow; workflow management systems
Mesh:
Year: 2019 PMID: 31185495 PMCID: PMC6559338 DOI: 10.1093/gigascience/giz037
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:General workflow of GenPipes. Diagram showing how the information flows from the user command line input through the 4 different objects (Pipeline, Step, Job, and Scheduler) in order to generate system-specific executable outputs. cmds: commands.
Figure 2:GenPipes’ properties and growth. A, Diagram showing GenPipes’ features, compatible computing platforms, and available pipelines. B, GenPipes’ available pipelines and maintained servers since the release of GenPipes in 2014. C, Bar plot showing the number of GenPipes runs per year since its release. RRBS: reduced-representation bisulfite sequencing; WGS: whole-genome seqencing.
Figure 3:GenPipes DNASeq pipeline diagram. Schematic representation of GenPipes’ dnaseq.py pipeline. Hexagons represent steps in the pipeline. White hexagons represent steps that process input from a single sample, while grey ones represent steps that process input from several samples. Arrows show step dependencies.
Comparison of available solutions for NGS analysis.
| Features | Pipelines | |||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Solution | Language | Software license | Published | Free | Open source | Cloud/Container | HPC | Workflow manager | Progress Monitoring | Package Manager | GUI | Reports | Config Validation | Germline | Somatic | RNA-Seq | RNA-Seq De novo | ChIP-seq | Metagenome | Methyl-Seq | Hi-C | PacBio assembly |
|
| Python | GNU LGPL | Pending |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Genome Modeling System | Perl | GNU LGPLv3 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Galaxy | Python | Academic Free L3.0 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| bcbio-nextgen | Python | MIT License | No |
|
|
|
|
|
|
|
|
| N/A |
|
|
|
|
|
|
|
|
|
| Omics Pipe | Python | MIT License | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Gene Pattern | Java | Custom | [ |
|
|
|
|
|
|
|
|
|
|
| N/A |
|
|
|
|
|
|
|
| Illumina BaseSpace | bash | Custom | No |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| BINA Genomic Analysis System | Java/Python | Custom | No |
|
|
|
|
|
|
| N/A | N/A | N/A |
|
|
| N/A |
|
|
|
|
|
| SeqWare | Java | GNU GPLv3 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| DNA Nexus Platform | Python/bash | Custom | No |
| Partial |
|
|
|
|
|
|
| N/A |
|
|
|
|
|
|
|
|
|
| gkno | Python | MIT License | No |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| NGSANE | bash | BSD3 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| GATK's Queue | Scala | MIT License & Broad Institute | No | Partial | Partial |
| N/A |
|
|
|
| N/A | N/A |
|
|
|
|
|
|
|
|
|
| CGA's Firehose | Java | N/A | No |
|
| N/A |
|
|
|
|
|
| N/A | N/A |
|
|
|
|
|
|
|
|
| MIT STAR | Python | GNU GPLv3 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| CromWell/WDL | Scala | BSD 3-Clause | No | Partial |
|
|
|
|
|
|
| N/A |
|
|
|
|
|
|
|
|
|
|
| BigDataScript | BDS | Apache License V2 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Kronos | Python | MIT license | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Nextflow | Java | GNU GPLv3 | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| SnakeMake | Python | MIT License | [ |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Modified from Griffith & Griffith et al. [64]. Note that community-built pipelines are not considered in the Pipelines section of the table. It is also worth noting that the following table is meant to provide the reader with an overview of the features of several tools in the field but not necessarily an exhaustive list. For a full description of each tool's capabilities, please consult their official documentation.