| Literature DB >> 28090077 |
Charlotte J Houldcroft1, Mathew A Beale2, Judith Breuer3.
Abstract
Whole-genome sequencing (WGS) of pathogens is becoming increasingly important not only for basic research but also for clinical science and practice. In virology, WGS is important for the development of novel treatments and vaccines, and for increasing the power of molecular epidemiology and evolutionary genomics. In this Opinion article, we suggest that WGS of viruses in a clinical setting will become increasingly important for patient care. We give an overview of different WGS methods that are used in virology and summarize their advantages and disadvantages. Although there are only partially addressed technical, financial and ethical issues in regard to the clinical application of viral WGS, this technique provides important insights into virus transmission, evolution and pathogenesis.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28090077 PMCID: PMC7097211 DOI: 10.1038/nrmicro.2016.182
Source DB: PubMed Journal: Nat Rev Microbiol ISSN: 1740-1526 Impact factor: 60.633
Advantages and disadvantages of different viral sequencing methods
PowerPoint slide
| Method | Advantages | Disadvantages |
|---|---|---|
| Metagenomic sequencing | • Simple, cost-effective sample preparation • Can sequence novel or poorly characterized genomes • Effective in 'fishing' approaches to identify a potential underlying pathogen • Lower required number of PCR cycles causes few amplification mutations • Preservation of minor variant frequencies reflects • No primer or probe design required, which enables a rapid response to novel pathogens or sequence variants | • High sequencing cost to obtain sufficient data • Relatively low sensitivity to target pathogen • Coverage is proportional to viral load • High proportion of non-pathogen reads increases computational challenges • Incidental sequencing of human and off-target pathogens raises ethical and diagnostic issues |
| PCR amplification sequencing | • Tried and trusted well-established methods and trained staff • Highly specific; most sequencing reads will be pathogen-specific, which decreases sequencing costs • Highly sensitive, with good coverage even at low pathogen load • Relatively straightforward design and application of new primers for novel sequences | • Labour-intensive and difficult to scale for large genomes • Iterating standard PCRs across large genomes requires high sample volume • PCR reactions are subject to primer mismatch, particularly in poorly characterized or highly diverse pathogens, or those with novel variants • Limited ability to sequence novel pathogens • High number of PCR cycles may introduce amplification mutations • Uneven amplification of different PCR amplicons may influence minor variant and haplotype reconstruction |
| Target enrichment sequencing | • Single tube sample preparation that is suited to high-throughput automation and the sequencing of large genomes • Higher specificity than metagenomics decreases sequencing costs • Overlapping probes increases tolerance for individual primer mismatches • Fewer PCR cycles (than PCR amplification) limits the introduction of amplification mutations • Preservation of minor variant frequencies reflects | • High cost and technical expertise for sample preparation • Unable to sequence novel pathogens and requires well-characterized reference genomes for probe design • Sensitivity is comparable to PCR, but coverage is proportional to pathogen load; low pathogen load yields low or incomplete coverage • Cost and time to generate new probe sets limit a rapid response to emerging and novel viruses |
Limitations of viral sequencing compared with bacterial sequencing
PowerPoint slide
| Feature | Bacteria | Viruses | Challenges |
|---|---|---|---|
| Genome | dsDNA | dsDNA, ssDNA, partially dsDNA, ssRNA or dsRNA | Different extraction protocols for different viruses. RNA viruses require cDNA synthesis and ssDNA second strand synthesis |
| Gene conservation | Highly conserved, essential genes (for example, 16s rRNA) enabling broad microbiome studies and surveys of taxa | No homologous genes between viruses of different phyla | Lack of conserved homology between viral phyla prevents universal primer-based surveys of viromes |
| Culture | Often straightforward to culture and obtain pure, highly enriched bacterial DNA and RNA | Challenging to culture, and require a host cell for replication | Cultured viruses are heavily contaminated with host cell nucleic acids, which decreases viral sequencing output |
| Clinical specimens | Hardy bacterial cells with cell walls can often be separated from human cells in clinical specimens using differential lysis methods or flow cytometry144 prior to extraction | Viruses are intracellular pathogens, and although separation from the host is possible (for example, by filtration or antibody pull-down), viruses cannot easily be separated from clinical samples prior to extraction | Clinical specimens are heavily contaminated with host nucleic acids, which decreases viral sequencing output |
| Methylation patterns | Bacteria use different methylation patterns from eukaryotes; host DNA can be depleted post-extraction using restriction endonucleases that are directed against CpG methylation145 | DNA viruses are often methylated by the host intracellular machinery, and may have similar methylation patterns | DNA digestion according to methylation patterns is less effective as a means of host depletion for viral sequencing |
cDNA, complementary DNA; dsDNA, double-stranded DNA; dsRNA, double-stranded RNA; rRNA, ribosomal RNA; ssDNA, single-stranded DNA; ssRNA, single-stranded RNA.
Figure 1Methods for sequencing viral genomes from clinical specimens.
All specimens originally comprise a mix of host (in blue) and pathogen (in red) DNA sequences. For pathogens that have RNA genomes, RNA in the sample is converted into complementary DNA (cDNA) before PCR and library preparation. Direct metagenomic sequencing provides an accurate representation of the sequences in the sample, although at high sequencing and data analysis and storage costs. PCR amplicon sequencing uses many discrete PCR reactions to enrich the viral genome, which increases the workload for large genomes substantially but decreases the costs. Target enrichment sequencing uses virus-specific nucleotide probes that are bound to a solid phase, such as beads, to enrich the viral genome in a single reaction, which reduces workload but increases the cost of library preparation compared with PCR.
PowerPoint slide