| Literature DB >> 32661357 |
Jessica K Jarett1,2,3, Mária Džunková4,5, Frederik Schulz1,2, Simon Roux1,2, David Paez-Espino1,2, Emiley Eloe-Fadrosh1,2, Sean P Jungbluth1,2, Natalia Ivanova1,2, John R Spear6, Stephanie A Carr7, Christopher B Trivedi8, Frank A Corsetti9, Hope A Johnson10, Eric Becraft11,12, Nikos Kyrpides1,2, Ramunas Stepanauskas12, Tanja Woyke13,14,15.
Abstract
Our current knowledge of host-virus interactions in biofilms is limited to computational predictions based on laboratory experiments with a small number of cultured bacteria. However, natural biofilms are diverse and chiefly composed of uncultured bacteria and archaea with no viral infection patterns and lifestyle predictions described to date. Herein, we predict the first DNA sequence-based host-virus interactions in a natural biofilm. Using single-cell genomics and metagenomics applied to a hot spring mat of the Cone Pool in Mono County, California, we provide insights into virus-host range, lifestyle and distribution across different mat layers. Thirty-four out of 130 single cells contained at least one viral contig (26%), which, together with the metagenome-assembled genomes, resulted in detection of 59 viruses linked to 34 host species. Analysis of single-cell amplification kinetics revealed a lack of active viral replication on the single-cell level. These findings were further supported by mapping metagenomic reads from different mat layers to the obtained host-virus pairs, which indicated a low copy number of viral genomes compared to their hosts. Lastly, the metagenomic data revealed high layer specificity of viruses, suggesting limited diffusion to other mat layers. Taken together, these observations indicate that in low mobility environments with high microbial abundance, lysogeny is the predominant viral lifestyle, in line with the previously proposed "Piggyback-the-Winner" theory.Entities:
Mesh:
Year: 2020 PMID: 32661357 PMCID: PMC7490370 DOI: 10.1038/s41396-020-0705-4
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Fig. 1Experimental workflow.
a Overview of Cone Pool hot spring; b section through microbial mat, showing dendritic cones and layers of the mat; c delineation of layers of the mat; and d Sequencing workflow for single-cell genomics (left, layer C) and shotgun metagenomics (right, layers C, E, and H). The numbers at the bottom of the figure indicate the number of resulting high quality (HQ), medium quality (MQ) and low quality (LQ) genomes, as based on MISAG/MIMAG standards.
Fig. 2Summary of microbes and viruses found in this study.
a Bacterial and archaeal species recovered by metagenomics and single-cell analysis. Phylum-level cluster representatives are displayed in a phylogenetic tree based on concatenated alignment of 56 universal single copy marker proteins. Each row represents a species based on 95% ANI. The first two columns represent the number of genomes in each species cluster and the source of the genomes of the given species (SAGs only, MAGs only or both MAGs and SAGs). The following three columns show read coverage of each species in metagenomics samples from layers C, E and H. The last two columns indicate the number of CRISPR spacers and of viral contigs detected for each species. b Alluvial plot of virus and host connection. The left panel represents host species, colored by phylum, and the right panel shows viral clusters separated by horizontal black lines and singletons. The black dots indicate viruses detected on MAGs, while other viruses were detected on SAGs. Full results of this analysis, including the viruses with unknown host information, are shown in Supplementary Fig. S3.
Fig. 3Host–virus genome read coverage ratios.
a Possible scenarios for the interpretation of genome read coverage results. If there is a low rate of viral replication, we expect the genome coverage ratio of the virus and host to be nearly the same in a metagenome. Higher viral coverage could result from a higher number of virions compared to host cells, or more copies of the viral genome in each infected cell, but could also mean that a lysogenic virus has more than the single predicted host species. Higher coverage of a bacterial genome suggests that single-cell genomics captured a very rare infection event or that the virus infected only a subset of cells (i.e. only certain strains). b Detection of the 59 de-replicated host–virus pairings in the three layers. The gray portions indicate the pairs in which virus, host, or both genomes were below the detection threshold. For 35 pairings (purple), detection was possible in at least one of the layers (>75% of the genome length covered). c Number of host genomes in each phylum for which the host–virus genome coverage ratio could (Detectable) or could not (Not detectable) be calculated. Virus and host icons indicate which one from the host–virus pair was above the detection threshold. d Fold-change of host and virus genome coverage for the four pairs detected in two or three layers. The dashed line indicates the 1.5× fold-change range. Dots positioned on the right from the host-baseline in the middle indicate higher coverage of the viral genome, while on the left indicate lower coverage of the viral genome compared to the host. e Fold-change of host and virus genome coverage of all 35 host-virus pairs (dots) grouped by host phyla. The distribution of points relative to the x-axis is described in (d).
Fig. 4Predicted virus diffusion across the mat layers.
a Possible scenarios of virus diffusion across the layers. Given that the layers differ by bacterial composition, a layer-specific viral composition suggests limited diffusion across the layers. If the viruses can move across layers, their abundances would vary across layers. b Genome coverage of the 323 virus sequences with no host information across the mat layers. The red, blue, and green stripes above the heatmap indicate whether the viruses were detected in one, two, or three layers, and the percentage above the stripes indicate the proportion of each of these groups. c Histograms of genome coverage fold-change of viruses detected in upper layer compared to the lower layer.