| Literature DB >> 31660953 |
Sanzhima Garmaeva1, Trishla Sinha1, Alexander Kurilshikov1, Jingyuan Fu1,2, Cisca Wijmenga1, Alexandra Zhernakova3.
Abstract
The human gut harbors a complex ecosystem of microorganisms, including bacteria and viruses. With the rise of next-generation sequencing technologies, we have seen a quantum leap in the study of human-gut-inhabiting bacteria, yet the viruses that infect these bacteria, known as bacteriophages, remain underexplored. In this review, we focus on what is known about the role of bacteriophages in human health and the technical challenges involved in studying the gut virome, of which they are a major component. Lastly, we discuss what can be learned from studies of bacteriophages in other ecosystems.Entities:
Mesh:
Year: 2019 PMID: 31660953 PMCID: PMC6819614 DOI: 10.1186/s12915-019-0704-y
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Viruses can be classified based on various characteristics. These terms are used continuously throughout this manuscript. While all characters are important in determining taxonomic relationships, sequence comparisons using both pairwise sequence similarity and phylogenetic relationships have become one of the primary sets of characters used to define and distinguish virus taxa [6]
Fig. 2Size distributions of genomes and virions of the most prevalent virus families in the gut. Values are given for the prototype virus of each family. Prokaryotic viruses are shown in red, eukaryotic viruses in blue. Structural information as well as genome sizes have been exported from the ICTV Online Report [24]. The prevalence of each family in the human gut has been inferred from the following studies: Inoviridae [20, 25], Circoviridae, Adenoviridae, Microviridae, Podoviridae, Myoviridae, Siphoviridae [26], Anelloviridae [25–27], CrAss-like [28, 29]. dsDNA double-stranded DNA. ssDNA single-stranded DNA
Selection of studies on gut virome changes in humans in various disease states
| Disease | Study population | Major finding | Authors |
|---|---|---|---|
| Malnutrition | Healthy twins ( | Bacteriophage as well as members of the | Reyes et al. 2015 [ |
| CDI patients ( | Treatment response in FMT associated with a high colonization level of donor-derived | Zuo et al. 2018 [ | |
| Inflammatory bowel disease (IBD) | Crohn’s disease ( | Enteric virome richness was increased in Crohn’s disease and ulcerative colitis, and both forms of IBD were associated with a significant expansion of | Norman et al. 2015 [ |
| Colorectal cancer (CRC) | CRC cases ( | Dysbiosis of the gut virome was associated with early- and late-stage CRC. A combination of four taxonomic markers was associated with reduced survival of patients with CRC | Nakatsu et al. 2018 [ |
| Acquired immune deficiency syndrome (AIDS) | HIV-negative ( | Alterations in the enteric virome and bacterial microbiome were associated with low peripheral CD4 T cell counts rather than HIV infection alone | Monaco et al. 2016 [ |
| Type 1 diabetes (T1D) | 11 infants from Finland and Estonia recruited at birth based on their HLA risk genotype and followed for 36 months | Significant enrichment of | Zhao et al. 2017 [ |
| Type 2 diabetes (T2D) | T2D patients ( | Observed a significant increase in the number of gut phages in the T2D group and identified seven phage operational taxonomic units specific to T2D. Significant alterations of the gut phageome not explained by co-variation with the altered bacterial hosts | Ma et al. 2018 [ |
| Hypertension | Healthy controls ( | Noted that certain viruses can be selected as biomarkers to distinguish healthy people, pre-hypertension people, and hypertension patients. Viruses had superior resolution and better discrimination power than bacteria for identifying hypertension samples | Han et al. 2018 [ |
| Parkinson’s disease (PD) | PD patients ( | Identified shifts of the phage/bacteria ratio in lactic acid bacteria known to produce dopamine and regulate intestinal permeability, both major factors implicated in PD pathogenesis | Tetz et al. 2018 [ |
Fig. 3The steps in metagenomic study of the virome. Nucleic acid extraction: the virome can be studied by extraction of nucleic acids from both fractions of the total microbial community which includes bacteria and viruses (left) and purified viral-like particles (VLPs; right), and different types of VLP-enriching techniques might be applied to obtain the latter fraction (see main text for details). Genomic library preparation: the extracted viral genetic material is subjected to sequencing after genomic library preparation. Both the choice of genomic library preparation technique and the sequencing coverage can affect the representation of specific members of the viral community in the sample (see discussion in the main text). Quality control: the raw sequencing reads are further trimmed of sequencing adapters, and low-quality and overrepresented reads are discarded. Virome annotation: there are two main ways of studying viral communities—read-mapping to closed reference databases or de novo assembly of viral genomes with optional, but advised, validation of contigs via reference databases
Challenges of studying human gut virome and possible solutions
| Steps | Challenges | Possible solutions |
|---|---|---|
| Nucleic acid extraction | • Existence of active and silent fractions of viromes • Total nucleic acid isolation protocols (TNAI): • Viral-like particle (VLP) isolation protocols: – Usually require multiple time-consuming steps of VLP and nucleic acid precipitation [ | • Combination of TNAI and VLP isolation protocol approaches [ |
| Genomic library preparation | • Limited amount of viral genetic material available | • Use of more sensitive genomic library preparation kits |
| • MDA may lead to overrepresentation of circular ssDNA viruses [ | • Restricted use of MDA | |
• Studying RNA viruses requires additional effort due to the relative instability of RNA genetic material: - Use of reverse transcriptase to convert RNA to cDNA - Restricted usage of RNase in protocols handling both DNA and RNA viruses [ - May require separate isolation protocol (arising from the previous point) and, therefore, increase of the starting material | • Metatranscriptomics approaches • Use of reverse transcription step | |
• Studying ssDNA viruses requires additional effort: - Some of the WGA techniques that precede the genomic library preparation procedure might introduce biases into the representation of ssDNA viruses [ - The majority of current genomic library preparation procedures cannot handle ssDNA genomes due to the use of dsDNA adapters - ssDNA viruses have been shown to have higher mutation rates than dsDNA viruses [ | • Use of ssDNA adaptors in adaptor-ligation reaction at the genomic library preparation step [ | |
| • Selection of an appropriate cut-off for coverage is complicated | • Studies report discoveries of a huge number of viruses at a depth of 1–15 × 106 reads per sample [ | |
| Quality control | • Removal of bacterial sequences is complicated by the viral signals from prophages (both cryptic and inducible) carried by bacterial genomes | • Use of tools for identification of prophages in bacterial genomes [ |
| Data analysis | • Existing databases do not fully represent viral diversity [ | • Use of de novo assembly approaches |
| • Rapid evolution and diversity of viral genomes limits reference-based approaches | • Use of reference databases that include both cultured viruses and computationally identified viral contigs [ • Use of a protein-based search • Use of a profile hidden Markov model based on protein domains allows the identification of remote homologs [ | |
• De novo assembly approach is sensitive to biases introduced during genomic library preparation and sequencing: - Low DNA input for genomic library preparation decreases the percentage of reads that map back to the corresponding assemblies [ - Use of a DNA amplification step might affect the distribution of read coverage [ - Shifts in GC content during genomic library preparation [ | • Adjustment of the assembly pipeline according to applied genomic library preparation procedure [ • Use of genomic library preparation protocols without any amplification procedure (needs high DNA input, probably not applicable for viromics) [ | |
| • Reproducibility of assembly results when combining different assemblers is complicated by technical challenges [ |