| Literature DB >> 24710047 |
Alberto Magi1, Matteo Benelli2, Alessia Gozzini3, Francesca Girolami4, Francesca Torricelli5, Maria Luisa Brandi6.
Abstract
The emergence of next-generation sequencing (NGS) platforms imposes increasing demands on statistical methods and bioinformatic tools for the analysis and the management of the huge amounts of data generated by these technologies. Even at the early stages of their commercial availability, a large number of softwares already exist for analyzing NGS data. These tools can be fit into many general categories including alignment of sequence reads to a reference, base-calling and/or polymorphism detection, de novo assembly from paired or unpaired reads, structural variant detection and genome browsing. This manuscript aims to guide readers in the choice of the available computational tools that can be used to face the several steps of the data analysis workflow.Entities:
Year: 2010 PMID: 24710047 PMCID: PMC3954090 DOI: 10.3390/genes1020294
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Summary of the main features of the four HTS technologies.
| Roche 454 | Illumina Genome Analyzer | ABI SOLiD | Helicos Heliscope | |
|---|---|---|---|---|
| Sequencing method | Pyrosequencing | Reversible dye terminators | Sequencing by ligation | Single Molecule Sequencing |
| Read lengths | 400 bases | 100 bases | 50 bases | 35 bases |
| Sequencing run time | 10 h | 10 days | 11-12 days | 30 days |
| Total bases per run | 500 Mb | 20 Gb | 100 Gb | 35 Gb |
| Error Rate | 0.1% | 1.5% | 4% | 2-7% |
A selection of short reads alignment tools. The platform compatibility depends on the maximum read length supported by the program.
| Program | Author(s) | Website | Platform | Aligned Gbp per CPU day |
|---|---|---|---|---|
| Maq [ | Li H | Illumina, SOLiD (partial) | ~ 0.2 | |
| Bowtie [ | Langmead B Et, al. | Illumina | ~ 7 | |
| SSAHA2 [ | Ning Z Et, al. | Illumina, SOLiD, 454 | ~ 0.5 | |
| BWA [ | Li H and Durbin R | Illumina, SOLiD, 454 | ~ 7 | |
| SOAP2 [ | Li R Et, al. | Illumina | ~ 7 |
A list of tools for the visualization of alignments or assemblies of short read data.
| Program | Author(s) | Website | Distribution |
|---|---|---|---|
| EagleView | Huang W and Marth G | Binary version for Windows, Mac OS X and Linux | |
| MapView | Bao H Et, al. | Binary version for Windows and Linux | |
| MaqView | Li H Et, al. | Source Code (C, Java) and Binary version for Linux and Mac OS X | |
| Tablet | Milne I Et, al. | Binary version for Windows, Mac OS X and Linux | |
| IGV | Broad Institute | Binary version for Windows, Mac OS X and Linux |
Figure 1A screenshot of Tablet, an Alignment / Assembly visualization software. Figure taken from [60].
A selection of PEM-based algorithms for the detection of Structural Variants.
| Program | Author(s) | Website | Detectable Events |
|---|---|---|---|
| PEMer | Korbel J Et, al. | basic deletion, basic insertion, basic inversion, linking, linked insertion | |
| VariationHunter | Hormozdiari F Et, al. | basic deletion, basic insertion, basic inversion, everted duplication | |
| MoDIL | Lee S Et, al. | basic deletion, basic insertion | |
| BreakDancer | Chen K Et, al. | The software package is available as supplementary information at Nature Methods Online | basic deletion, basic insertion, basic inversion, hanging insertion |