| Literature DB >> 31141902 |
René Kallies1, Martin Hölzer2,3, Rodolfo Brizola Toscan4, Ulisses Nunes da Rocha5, John Anders6,7, Manja Marz8,9,10, Antonis Chatzinotas11,12.
Abstract
Viral ecology of terrestrial habitats is yet-to be extensively explored, in particular the terrestrial subsurface. One problem in obtaining viral sequences from groundwater aquifer samples is the relatively low amount of virus particles. As a result, the amount of extracted DNA may not be sufficient for direct sequencing of such samples. Here we compared three DNA amplification methods to enrich viral DNA from three pristine limestone aquifer assemblages of the Hainich Critical Zone Exploratory to evaluate potential bias created by the different amplification methods as determined by viral metagenomics. Linker amplification shotgun libraries resulted in lowest redundancy among the sequencing reads and showed the highest diversity, while multiple displacement amplification produced the highest number of contigs with the longest average contig size, suggesting a combination of these two methods is suitable for the successful enrichment of viral DNA from pristine groundwater samples. In total, we identified 27,173, 5,886 and 32,613 viral contigs from the three samples from which 11.92 to 18.65% could be assigned to taxonomy using blast. Among these, members of the Caudovirales order were the most abundant group (52.20 to 69.12%) dominated by Myoviridae and Siphoviridae. Those, and the high number of unknown viral sequences, substantially expand the known virosphere.Entities:
Keywords: AquaDiva; aquifer; groundwater; sequencing library preparation; viral metagenome
Year: 2019 PMID: 31141902 PMCID: PMC6631259 DOI: 10.3390/v11060484
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
P-values of analysis of variance (ANOVA) of raw sequencing read and read cluster numbers between the different library preparation methods.
|
| ||||||||
| Library Preparation | NASL | LASL | SISPA | MDA | ||||
| NASL | n/a | >0.05 | 0.008 | 0.002 | ||||
| LASL | n/a | >0.05 | 0.023 | |||||
| SISPA | n/a | >0.05 | ||||||
| MDA | n/a | |||||||
|
| ||||||||
| Relative proportion | Number of clusters | |||||||
| Library Preparation | NASL | LASL | SISPA | MDA | NASL | LASL | SISPA | MDA |
| NASL | n/a | <0.001 | >0.05 | >0.05 | n/a | 0.018 | >0.05 | 0.008 |
| LASL | n/a | <0.001 | <0.001 | n/a | >0.05 | >0.05 | ||
| SISPA | n/a | >0.05 | n/a | >0.05 | ||||
| MDA | n/a | n/a | ||||||
NASL: non-amplified shotgun library; LASL: linker amplification shotgun libraries; SISPA: single-primer amplification; MDA: multiple displacement amplification.
Figure 1Comparison of Hainich groundwater viromes diversity and coverage as function of sequencing effort using Nonpareil curves [53,54,55]. Estimated coverage is shown as dotted lines, true coverage as solid lines. Estimated diversity is shown with arrows on the x-axis. Horizontal dotted line shows 95% coverage. Viral metagenome coverage, actual sequencing effort, required sequencing effort and kmer-based diversity for each library are shown in the right panel.
Figure 2Overlap of sequencing read cluster (90% identity) information identified by library preparation methods, independently shown for each sampling site. Non amplified sequencing libraries (NASL) were not included in the analysis due to the homopolymeric and repetitive nature of sequences obtained from these libraries.
Figure 3Number of contigs (A) and N50 (B) produced by sequence library preparation methods and assembly tools. Differences between library preparation methods were tested using analysis of variance (ANOVA). SOAPd: SOAPdenovo-Trans, NASL: non-amplified shotgun library, LASL: linker amplification shotgun libraries, SISPA: single-primer amplification, MDA: multiple displacement amplification.
Figure 4The venn diagram presents numbers of unique and shared viral contigs among the different viromes. Cross-assembled viral contigs (as identified by VirSorter, VirFinder and VrAP) were compared between sites by an all-versus-all clustering approach (95% identity) with CD-hit-est-2D [44].
Figure 5Taxonomic assignment of viral contigs identified from cross-assemblies. (A) pie charts present relative and absolute abundance of viral contigs after blastp analysis. (B) Taxonomic profile of viral contigs as classified by blastp (viral contigs with blast hits in figure A). Data were visualized with Pavian [71].
Figure 6A network analysis of shared predicted protein content between viral RefSeq database and Hainich viral populations. Nodes (circles) indicate contigs and shared edges (lines) indicate shared protein content. Data were analysed using vConTACT2 [61,62] and displayed with cytoscape [64].