| Literature DB >> 19452216 |
Daniel J Turner1, Thomas M Keane, Ian Sudbery, David J Adams.
Abstract
Next-generation sequencing technologies are revolutionizing biology by allowing for genome-wide transcription factor binding-site profiling, transcriptome sequencing, and more recently, whole-genome resequencing. While it is currently not possible to generate complete de novo assemblies of higher-vertebrate genomes using next-generation sequencing, improvements in sequence read lengths and throughput, coupled with new assembly algorithms for large data sets, will soon make this a reality. These developments will in turn spawn a revolution in how genomic data are used to understand genetics and how model organisms are used for disease gene discovery. This review provides an overview of the current next-generation sequencing platforms and the newest computational tools for the analysis of next-generation sequencing data. We also describe how next-generation sequencing may be applied in the context of vertebrate model organism genetics.Entities:
Mesh:
Year: 2009 PMID: 19452216 PMCID: PMC2714443 DOI: 10.1007/s00335-009-9187-4
Source DB: PubMed Journal: Mamm Genome ISSN: 0938-8990 Impact factor: 2.957
Comparison of second-generation sequencing technologies
| Sequencing platform | Sample requirements | Length of library prep/feature generation (days) | Method of feature generation | Sequencing chemistry | Read length (bases) | Run time | Throughput/run (Gb) | Throughput/day (Gb) |
|---|---|---|---|---|---|---|---|---|
| Roche 454 (FLX-Titanium) | 1 μg for shotgun library, 5 μg for paired end | 3–4 | Bead-based/emulsion PCR | Pyrosequencing | 400–500 | 10 h | 0.4–0.5 | ~1 |
| Illumina Genome Analyzer (GAII) | <1 μg for single or paired-end libraries | 2 | Isothermal ‘bridge amplification’ on flowcell surface | Reversible terminator SBS | 35–75 | 2 days for 36-cycle single-end run, 4 days for 36-cycle paired-end run | 3–6 | 1.5 |
| ABI SOLiD | <2 μg for shotgun library, 5–20 μg for paired end | 2–4.5 | Bead-based/emulsion PCR | Ligation | 25–75 | 6–7 days for fragment libraries, 8 days for 2 × 25 base paired-end libraries | 10–20 | 1.7–2 |
| Helicos tSMS | <2 μg, single end only | 1 | N/A (single molecule sequencing) | Virtual terminator SBS | 25–50 | 8–9 days | 21–28 | 2.5 |
Fig. 1The proportion of unique sequence in the Streptococcus suis (squares) and Mus musculus (triangles) genomes for varying read lengths. This graph indicates that read length has a critical affect on the ability to place reads uniquely to the genome
A summary of short-read alignment tools
| Illumina | 454 | SOLiD | S | I | URL | |
|---|---|---|---|---|---|---|
| Bowtie | Y | Y | N | Y | N | |
| ELAND | Y | N | N | N | N | |
| Exonerate | Y | Y | N | N | Y | |
| GMAP | Y | N | N | N | N | |
| MOSAIK | Y | Y | Y | Y | Y | |
| MAQ | Y | N | Y | Y | Y | |
| MUMer | Y | Y | N | Y | Y | |
| Novocraft | Y | N | N | Y | Y | |
| RMAP | Y | N | N | N | N | |
| SeqMap | Y | N | N | N | I | |
| SHRiMP | Y | Y | Y | Y | Y | |
| SOAP | Y | N | N | Y | Y | |
| SSAHA2 | Y | Y | N | Y | Y |
S outputs SNPs, I outputs short insertion deletions (indels)
A summary of assemblers developed for use with next-generation sequencing data
| Illumina | 454 | SOLiD | D | M | E | URL | |
|---|---|---|---|---|---|---|---|
| Abyss | Y | N | N | Y | N | Y | |
| ALLPATHS | Y | N | N | Y | N | N | |
| MAQ | Y | N | Y | N | Y | N | |
| MIRA2 | N | Y | N | Y | N | N | |
| Newbler | N | Y | N | Y | Y | N | |
| SSAKE | Y | N | N | Y | N | N | |
| SHARCGS | Y | N | N | Y | N | N | |
| VCAKE | Y | N | Y | Y | N | N | |
| Velvet | Y | N | N | Y | N | N |
D capable of de novo assembly, M mapped assembly, E can assemble experimental organisms