| Literature DB >> 34184913 |
Ankita Kothari1, Simon Roux2, Hanqiao Zhang1, Anatori Prieto1, Drishti Soneja1, John-Marc Chandonia3, Sarah Spencer4,5,6, Xiaoqin Wu7, Sara Altenburg8, Matthew W Fields8,9, Adam M Deutschbauer3,10, Adam P Arkin3,11,12, Eric J Alm4,5,13,6, Romy Chakraborty7, Aindrila Mukhopadhyay1,3.
Abstract
Viruses are ubiquitous microbiome components, shaping ecosystems via strain-specific predation, horizontal gene transfer and redistribution of nutrients through host lysis. Viral impacts are important in groundwater ecosystems, where microbes drive many nutrient fluxes and metabolic processes; however, little is known about the diversity of viruses in these environments. We analyzed four groundwater plasmidomes (the entire plasmid content of an environment) and identified 200 viral sequences, which clustered into 41 genus-level viral clusters (approximately equivalent to viral genera) including 9 known and 32 putative new genera. We used publicly available bacterial whole-genome sequences (WGS) and WGS from 261 bacterial isolates from this groundwater environment to identify potential viral hosts. We linked 76 of the 200 viral sequences to a range of bacterial phyla, the majority associated with Proteobacteria, followed by Firmicutes, Bacteroidetes, and Actinobacteria. The publicly available WGS enabled mapping bacterial hosts to several viral sequences. The WGS of groundwater isolates increased the depth of host prediction by allowing host identification at the strain level. The latter included 4 viruses that were almost entirely (>99% query coverage, >99% identity) identified as integrated in the genomes of Pseudomonas, Acidovorax, and Castellaniella strains, resulting in high-confidence host assignments. Lastly, 21 of these viruses carried putative auxiliary metabolite genes for metal and antibiotic resistance, which might drive their infection cycles and/or provide selective advantage to infected hosts. Exploring the groundwater virome provides a necessary foundation for integration of viruses into ecosystem models where they are key players in microbial adaption to environmental stress. IMPORTANCE To our knowledge, this is the first study to identify the bacteriophage distribution in a groundwater ecosystem shedding light on their prevalence and distribution across metal-contaminated and background sites. Our study is uniquely based on selective sequencing of solely the extrachromosomal elements of a microbiome followed by analysis for viral signatures, thus establishing a more focused approach for phage identifications. Using this method, we detected several novel phage genera along with those previously established. Our approach of using the whole-genome sequences of hundreds of bacterial isolates from the same site enabled us to make host assignments with high confidence, several at strain levels. Certain phage genes suggest that they provide an environment-specific selective advantage to their bacterial hosts. Our study lays the foundation for future research on directed phage isolations using specific bacterial host strains to further characterize groundwater phages, their life cycles, and their effects on groundwater microbiome and biogeochemistry.Entities:
Keywords: antibiotic resistance; extrachromosomal DNA; groundwater; metal resistance; phage; plasmidome; viral genome; viral host; viral sequences; virus
Year: 2021 PMID: 34184913 PMCID: PMC8269241 DOI: 10.1128/mSystems.00537-21
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1Overview of the study. Groundwater from the Oak Ridge Field research site from background (B) and contaminated (C) areas was filtered and subjected to circular DNA extraction. Sequencing, assembly, and annotation resulted in identification of both plasmids and viral genomes. The viral genomes were subjected to viral cluster analysis to study the virus types, host association analysis to get a prediction of bacteria they might infect, and auxiliary metabolite analysis (AMG) to study what functional genes they carry.
FIG 2vConTACT-generated viral cluster map depicting clustering of 85 viral sequences from background (light blue) and contaminated (dark blue) groundwater, along with known virus reference genomes (white). The 9 viral clusters that contain known viruses are annotated on the figure as follows: 1, Microviridae; 2, Podoviridae (Caudovirales); 3, Myoviridae (Caudovirales); 4, Myoviridae (Caudovirales); 5, Podoviridae (Caudovirales); 6, Siphoviridae (Caudovirales); 7, Podoviridae (Caudovirales); 8, Inoviridae; 9, Myoviridae (Caudovirales). The green lines show vContact pairwise similarity scores. The order and distance between different viruses are arbitrarily selected values.
FIG 3Size distribution of viruses from the background and contaminated groundwaters. The circular viral sequences are depicted in blue, while the rest are in red.
FIG 4Viral host predictions based on BLAST, high-stringency BLAST (BLAST99), tetranucleotide frequency (4-mer), and CRISPR methods using whole-genome sequence (WGS) information from 261 ORFRC bacterial isolates. The details of the 20 viruses (“a” to “t”) are provided in Table S4. The viruses “h” and “p” are assigned to hosts in the class Betaproteobacteria and the family Comamonadaceae. The rest of the viruses are assigned to the indicated genera. The phylogenetic tree was made from 16S rRNA sequence of 261 ORFRC strains. The viral sequence “s” appears twice because it was predicted to infect two different genera based on the different prediction methods.
FIG 5Compilation of viral sequences across the background and contaminated groundwater sites based on availability of bacterial host prediction (green indicates that the bacterial host was predicted, while blue indicates a lack of available bacterial-host prediction).
FIG 6Example of a viral contig carrying auxiliary metabolite genes. Map of the virus (gw456_c_scaffold_130) from background groundwater with phage-related genes highlighted in green (darker green represents true hallmark genes of viruses), metal (copper, cobalt, zinc, cadmium, lead, mercury, and arsenic) resistance genes highlighted blue, antibiotic (spectinomycin and fosfomycin) resistance genes highlighted in pink, and the metabolism (lactate dehydrogenase) gene in yellow. The viral contig was annotated via Prokka in KBase and the annotations for virus-associated genes were updated on the map using VirSorter predictions; details are in Table S6.