| Literature DB >> 30684135 |
Aleksandra Kozińska1, Paulina Seweryn2, Izabela Sitkiewicz3.
Abstract
For the last 40 years, "Sanger sequencing" allowed to unveil crucial secrets of life. However, this method of sequencing has been time-consuming, laborious and remains expensive even today. Human Genome Project was a huge impulse to improve sequencing technologies, and unprecedented financial and human effort prompted the development of cheaper high-throughput technologies and strategies called next-generation sequencing (NGS) or whole genome sequencing (WGS). This review will discuss applications of high-throughput methods to study bacteria in a much broader context than simply their genomes. The major goal of next-generation sequencing for a microbiologist is not really resolving another circular genomic sequence. NGS started its infancy from basic structural and functional genomics, to mature into the molecular taxonomy, phylogenetic and advanced comparative genomics. Today, the use of NGS expended capabilities of diagnostic microbiology and epidemiology. The use of RNA sequencing techniques allows studying in detail the complex regulatory processes in the bacterial cells. Finally, NGS is a key technique to study the organization of the bacterial life-from complex communities to single cells. The major challenge in understanding genomic and transcriptomic data lies today in combining it with other sources of global data such as proteome and metabolome, which hopefully will lead to the reconstruction of regulatory networks within bacterial cells that allow communicating with the environment (signalome and interactome) and virtual cell reconstruction.Entities:
Keywords: Microbiome; Next-generation sequencing; Phylogenetic analysis; Sanger sequencing; Structural genomics; Transcriptomics
Mesh:
Substances:
Year: 2019 PMID: 30684135 PMCID: PMC6373407 DOI: 10.1007/s13353-019-00482-2
Source DB: PubMed Journal: J Appl Genet ISSN: 1234-1983 Impact factor: 3.240
Fig. 1Differences of experimental approach between classical (a) and experiments involving high-throughput analyses such as NGS (b)
Comparison of sequencing technologies available currently on the market, based on Liu et al. (2012), Quail et al. (2012), Pevsner (2015), Levy and Myers (2016)) and information from system manufacturers (https://www.thermofisher.com, https://nanoporetech.com/, http://sequencing.roche.com, https://www.illumina.com/, https://www.pacb.com/)
| Method | Instrument manufacturer | Read length | Accuracy (single read not consensus) | Reads per run | Time per run (not including library preparation) | Cost per 1 million bases (in US$) | Advantages | Disadvantages |
|---|---|---|---|---|---|---|---|---|
| Chain termination (Sanger sequencing) | ThermoFisher | 400 to 900 bp | 99.9% | N/A | 20 min to 3 h | $2400 | Useful for many smaller applications | Very expensive |
| Ion semiconductor (ion torrent sequencing) | ThermoFisher | Up to 600 bp | 98% | Up to 80 million | 2 h | $1 | Less expensive NGS equipment. Fast | Homopolymer errors |
| Nanopore sequencing | Oxford Nanopore | Depends on the library preparation, not the device | ~ 92–97% | Dependent on read length selected by user | Data streamed in real time. No fixed running time | $500–999 per flow cell | Long individual reads. Portable (palm-sized) | Lower throughput than other machines, low single read accuracy |
| Pyrosequencing | Roche | 700 bp | 99.9% | 1 Million | 24 h | $10 | Long reads. Fast | Runs are expensive. Homopolymer errors |
| Sequencing by synthesis | Illumina | 50–500 bp depending on the instrument | 99.9% | 1 Million–3 billion depending on the instrument | 1 to 11 days, depending upon sequencer and specified read length | $0.05 to $0.15 | High output. Low costs of sequencing | Expensive equipment |
| Sequencing by ligation (SOLiD sequencing) | ThermoFisher | 50 + 35 or 50 + 50 bp | 99.9% | 1.2 to 1.4 Billion | 1 to 2 weeks | $0.13 | Low cost per base | Slower than other methods. Problems with palindromic sequences |
| Single-molecule real-time sequencing | Pacific Biosciences | On average 14,000 bp maximum read length > 40,000 bases | 87% Single-read accuracy | 50,000 per SMRT cell, or 500–1000 megabases | 30 min to 4 h | $0.13–$0.60 | Fast. Long reads | Moderate throughput. Equipment is very expensive |
Fig. 2A schematic structure of a pan-genome that includes genes shared by all strains/isolates, genes shared by all genomes (core genome), and strain-specific genes that are present only in individual
Fig. 3A virtual cell reconstruction that will be possible in the future based on a combination of multiple levels of the “-omics” data
Fig. 4Overview of the different steps involved in the use of NGS technologies for data gathering and utilization. After Angers-Loustau et al. (2018), modified