| Literature DB >> 34201311 |
Sigitas Šulčius1, Gediminas Alzbutas1, Viktorija Juknevičiūtė1, Eugenijus Šimoliūnas2, Petras Venckus1, Monika Šimoliūnienė2, Ričardas Paškauskas1.
Abstract
Little is known about the diversity and distribution of viruses infecting green sulfur bacteria (GSB) thriving in euxinic (sulfuric and anoxic) habitats, including gypsum karst lake ecosystems. In this study, we used targeted cell sorting combined with single-cell sequencing to gain insights into the gene content and genomic potential of viruses infecting sulfur-oxidizing bacteria Chlorobium clathratiforme, obtained from water samples collected during summer stratification in gypsum karst Lake Kirkilai (Lithuania). In total, 82 viral contigs were bioinformatically identified in 62 single amplified genomes (SAGs) of C. clathratiforme. The majority of viral gene and protein sequences showed little to no similarity with phage sequences in public databases, uncovering the vast diversity of previously undescribed GSB viruses. We observed a high level of lysogenization in the C. clathratiforme population, as 87% SAGs contained intact prophages. Among the thirty identified auxiliary metabolic genes (AMGs), two, thiosulfate sulfurtransferase (TST) and thioredoxin-dependent phosphoadenosine phosphosulfate (PAPS) reductase (cysH), were found to be involved in the oxidation of inorganic sulfur compounds, suggesting that viruses can influence the metabolism and cycling of this essential element. Finally, the analysis of CRISPR spacers retrieved from the consensus C. clathratiforme genome imply persistent and active virus-host interactions for several putative phages prevalent among C. clathratiforme SAGs. Overall, this study provides a glimpse into the diversity of phages associated with naturally occurring and highly abundant sulfur-oxidizing bacteria.Entities:
Keywords: Kirkilai; Lithuania; Microviridae; Natura 2000; Single Cell Genomics; chlorobium phages; green sulfur bacteria; gypsum karst ecosystem; rare ecosystems; virus diversity
Mesh:
Substances:
Year: 2021 PMID: 34201311 PMCID: PMC8226683 DOI: 10.3390/genes12060886
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Bacterial community composition of the most abundant (>1% from total reads) taxa throughout the water column of the gypsum karst Lake Kirkilai (Lithuania).
Figure 2Taxonomic (Kaiju) classification (a) and distribution (b) of viral contigs recovered from 62 C. clathratiforme single amplified genomes (SAGs). Phage contigs were ordered by detection frequency among 62 C. clathratiforme SAGs.
Figure 3Phylogenetic tree reconstruction of Microviridae family bacteriophages (RefSeq DB; accessed on March 2021) and representation of C. clathratiforme SAGs-associated Microviridae genome (NODE_396). Members of Bullavirinae are marked in blue, members of Gokushovirinae are marked in green and unclassified microviruses are marked in red. The C. clathratiforme SAGs-associated Microviridae phage genome (NODE_396) is shown in black. The tree was calculated using a neighbor joining algorithm of the conserved sites in the genomes, with the bootstrap values higher than 75% given at the nodes.
Auxiliary metabolic genes identified in C. clathratiforme SAGs-associated viral contigs.
| Metabolism | Pathway | Total AMGs | AMG KO a | AMG KO Name |
|---|---|---|---|---|
| Carbohydrate metabolism | Pentose phosphate pathway | 1 | K07404 | pgl; 6-phosphogluconolactonase |
| Fructose and mannose metabolism | 4 | K01711 | Gmd; GDPmannose 4,6-dehydratase | |
| K02377 | fcl; GDP-L-fucose synthase | |||
| Galactose metabolism | 1 | K01784 | gale; UDP-glucose 4-epimerase | |
| Amino sugar and nucleotide sugar metabolism | 7 | K01709 | rfbG; CDP-glucose 4,6-dehydratase | |
| K01711 | Gmd; GDPmannose 4,6-dehydratase | |||
| K02377 | fcl; GDP-L-fucose synthase | |||
| K13010 | rfbE; perosamine synthetase | |||
| C5-Branched dibasic acid and Butanoate metabolism | 1 | K01652 | ilvB, ilvG, ilvI; acetolactate synthase I/II/III large subunit | |
| Energy metabolism | Sulfur metabolism | 2 | K00390 | cysH; thioredoxin-dependent phosphoadenosine phosphosulfate (PAPS) reductase |
| 2 | K02439 | TST; thiosulfate sulfurtransferase (rhodanese) | ||
| Amino acid metabolism | Alanine, aspartate and glutamate metabolism | 1 | K01953 | asnB; asparagine synthase (glutamine-hydrolyzing) |
| Glycine, serine and threonine metabolism | 1 | K00613 | Glycine amidinotransferase | |
| Cysteine and methionine metabolism | 3 | K00558 | dcm; DNA (cytosine-5)-methyltransferase | |
| K00789 | metK; S-adenosylmethionine synthetase | |||
| Valine, leucine, and isoleucine biosynthesis | 1 | K01652 | ilvB, ilvG, ilvI; acetolactate synthase I/II/III large subunit | |
| Arginine and proline metabolism | 1 | K00613 | Glycine amidinotransferase | |
| Cysteine and methionine metabolism | K00558 | dcm; DNA (cytosine-5)-methyltransferase | ||
| Metabolism of cofactors and vitamins | Pantothenate and CoA biosynthesis | 1 | K01652 | ilvB, ilvG, ilvI; acetolactate synthase I/II/III large subunit |
| Porphyrin and chlorophyll metabolism | 1 | K04034 | bchE; anaerobic magnesium-protoporphyrin IX monomethyl ester cyclase | |
| Ubiquinone and other terpenoid-quinone biosynthesis | 2 | K03183 | ubiE; demethylmenaquinone methyltransferase/2-methoxy-6-polyprenyl-1,4-benzoquinol methylase | |
| Unclassified | – | 1 | K02039 | phoU; phosphate transport system protein |
a KO—KEGG Orthology database of molecular functions represented in terms of functional orthologs.
Summary of CRISPR arrays identified in the consensus genome of C. clathratiforme and spacer sequence matches C. clathratiforme SAGs-associated viral contigs and viral sequences in publicly available repositories (accessed on March 2021).
| CRISPR No. | Array Length (bp) | Repeat Sequence | Cas Proteins | Number of Spacers | Spacer No. | Viral Contig Match | ViralDB Match | NCBI Accession No. |
|---|---|---|---|---|---|---|---|---|
| CRISPR1 | 2983 | ATTATCTCCGACCTGACATATCAAAAGGGATTACGAC | Cas1, Cas2, Cas6, Cas10, Csm2, Csm3, Csm4, Csm5, RT, DExK | 40 | 13 | NODE_31 | – | – |
| 35 | NODE_77 | – | – | |||||
| 37 | NODE_31 | – | – | |||||
| CRISPR2 | 2078 | GTCGCGCCCCCTGCGGGCGCGTGGATTGAAAC | – | 31 | 1 | NODE_31 | – | – |
| 4 | NODE_113 | – | – | |||||
| 5 | NODE_396 | Gokushovirus WZ-2015a | KT264813.1 | |||||
| 16 | – | MH617588.1 | ||||||
| 18 | NODE_113 | – | – | |||||
| 19 | NODE_113 | – | – | |||||
| CRISPR3 | 5829 | GTTTCAATCCGCTATGCGTGCAATAAGATATGATG | Cas 1, Cas2 | 81 | 52 | NODE_31 | – | – |
| 54 | NODE_48 | Erythrobacter phage vB_EliS_R6L | KY006853.1 | |||||
| 62 | NODE_395 | – | – | |||||
| 66 | NODE_31 | – | – | |||||
| 73 | NODE_48 | – | – |