| Literature DB >> 25150023 |
El Mustapha Bahassi1, Peter J Stambrook2.
Abstract
Demand for new technologies that deliver fast, inexpensive and accurate genome information has never been greater. This challenge has catalysed the rapid development of advances in next-generation sequencing (NGS). The generation of large volumes of sequence data and the speed of data acquisition are the primary advantages over previous, more standard methods. In 2013, the Food and Drug Administration granted marketing authorisation for the first high-throughput NG sequencer, Illumina's MiSeqDx, which allowed the development and use of a large number of new genome-based tests. Here, we present a review of template preparation, nucleic acid sequencing and imaging, genome assembly and alignment approaches as well as recent advances in current and near-term commercially available NGS instruments. We also outline the broad range of applications for NGS technologies and provide guidelines for platform selection to best address biological questions of interest. DNA sequencing has revolutionised biological and medical research, and is poised to have a similar impact on the practice of medicine. This tool is but one of an increasing arsenal of developing tools that enhance our capabilities to identify, quantify and functionally characterise the components of biological networks that keep us healthy or make us sick. Despite advances in other 'omic' technologies, DNA sequencing and analysis, in many respects, have played the leading role to date. The new technologies provide a bridge between genotype and phenotype, both in man and model organisms, and have revolutionised how risk of developing a complex human disease may be assessed. The generation of large DNA sequence data sets is producing a wealth of medically relevant information on a large number of individuals and populations that will potentially form the basis of truly individualised medical care in the future.Entities:
Mesh:
Year: 2014 PMID: 25150023 PMCID: PMC7318892 DOI: 10.1093/mutage/geu031
Source DB: PubMed Journal: Mutagenesis ISSN: 0267-8357 Impact factor: 3.000
The most currently used platforms and comparison of their specifications
| Platform | Ion Torrent PGM | PacBio RS | Illumina HiSeq 2000 | Illumina MiSeq | llumina NextSeq 500 | Illumina HiSeq X10 |
|---|---|---|---|---|---|---|
| Instrument cost | $80 K | $695 K | $654 K | $128 K | $250 K | $10 million |
| Sequence yield per run | 20–50Mb on 314 chip, 100–200Mb on 316 chip, 1 Gb on 318 chip | 100 Mb | 600 Gb | 1.5–2 Gb | 120 Gb | 1.6–1.8 Tb |
| Sequencing cost per Gb | $1000 (318 chip) | $2000 | $41 | $502 | $40 | $10 |
| Run time | 2 h | 2 h | 11 days | 27 h | 30 h | <3 days |
| Reported accuracy | Q20 | <Q10 | >Q30 | >Q30 | >Q30 | >Q30 |
| Observed raw error rate | 1.71% | 12.86% | 0.26% | 0.80% | 0.80% | 0.50% |
| Read length | ~200 bases | Average 1500 bases | Up to 150 bases | Up to 150 bases | 2×150 bases | 2×150 bases |
| Paired reads | Yes | No | Yes | Yes | Yes | Yes |
| Insert size | Up to 250 bases | Up to 10 kb | Up to 700 bases | Up to 700 bases | 350 bp | 350 bp |
| Typical DNA requirements | 100–1000 ng | 1000 ng | 50–1000 ng | 50–1000 ng | 50–1000 ng | 50–1000 ng |
Fig. 1.NGS technologies: template preparation, sequencing and imaging and data analysis. For WGS, gDNA is sheared by sonication or nebulisation to form fragments of 300–500 bp. Library amplification can be done by either emPCR or solid-phase amplification. In emPCR (A), a reaction mixture consisting of an oil–aqueous emulsion is created to encapsulate bead–DNA complexes into single aqueous droplets. PCR amplification is performed within these droplets to create beads containing several thousand copies of the same template sequence. EmPCR beads can be chemically attached to a glass slide or deposited into PicoTiterPlate wells. Solid-phase amplification (B) is composed of two basic steps: initial priming and extending of the single-stranded, single-molecule template, and bridge amplification of the immobilised template with immediately adjacent primers to form clusters. (C) Sequencing and imaging using one of the platforms described above. (D) Data analysis using the available software or an integrated workflow such as the GATK pipeline described below.
Fig. 2.The GATK workflow for NGS data analysis.
Fig. 3.Visualisation of NGS mutations in normal versus tumour tissue using IGV and Circos. (A) Detection of a SNP in a proneural brain tumour patient inIDH1 gene but not in normal DNA. A mutation C (blue) to T (red) is flagged in red in the tumour (top) but not in the normal (bottom). (B) Detection of CNVs inEGFR gene in a classic GBM patient. The increase in copy number in the tumour is indicated by the large increase in the number of reads (top panel) compared with the low number of reads in the normal (bottom panel). (C) Detection of EGFR-vIII deletion in the tumour but not in the normal tissue of a GBM patient. The start and finish of the deletion are flagged in red in the tumour (bottom panel) but the deletion is absent in the normal tissue (upper panel). (D) A Circos diagram of a GBM patient. The outmost spine-like histogram (dark red) shows the coverage (10×) at 10 kb bin width. The numbered ring is the chromosomal ideogram, with each number indicating the position of an individual chromosome. The yellow grid ring shows the CNVs (at 10 kb bin width). The normal CNV values (defined by T1/C1 ratio) are represented by grey dots. Mono and bi-allelic deletions are represented by green and red circles, respectively. If the value of T1/C1 is >1.5 but <2.0, it is represented by a blue circle. Any T1/C1 value >2 is represented by a black circle. The CVN track clearly shows there is a chr7 trisomy. In addition, mono- and bi-allelic deletion at chr9 is also very prominent. The CNV pattern of chr10 indicates part of T1 tumour may have aneuploidy in chr10. The orange ring illustrates detected SNPs (8× coverage), validated by IGV analysis. The light yellow ring illustrates detected indels (8× coverage). Non-functional indels are represented by grey circles. Indels with functional impact (located in exon or UTR) are represented by red circles. The innermost circle shows various genomic rearrangements. Red, black, green and blue curves represent deletions (DEL), inversions (INV), inter- and intra-chromosomal translocation (CTX and ITX), respectively.