| Literature DB >> 34140946 |
Timothy J Williams1, Michelle A Allen1, Natalia Ivanova2, Marcel Huntemann2, Sabrina Haque1, Alyce M Hancock1, Sarah Brazendale1, Ricardo Cavicchioli1.
Abstract
Organic Lake in Antarctica is a marine-derived, cold (-13∘C), stratified (oxic-anoxic), hypersaline (>200 gl-1) system with unusual chemistry (very high levels of dimethylsulfide) that supports the growth of phylogenetically and metabolically diverse microorganisms. Symbionts are not well characterized in Antarctica. However, unicellular eukaryotes are often present in Antarctic lakes and theoretically could harbor endosymbionts. Here, we describe Candidatus Organicella extenuata, a member of the Verrucomicrobia with a highly reduced genome, recovered as a metagenome-assembled genome with genetic code 4 (UGA-to-Trp recoding) from Organic Lake. It is closely related to Candidatus Pinguicocccus supinus (163,218 bp, 205 genes), a newly described cytoplasmic endosymbiont of the freshwater ciliate Euplotes vanleeuwenhoeki (Serra et al., 2020). At 158,228 bp (encoding 194 genes), the genome of Ca. Organicella extenuata is among the smallest known bacterial genomes and similar to the genome of Ca. Pinguicoccus supinus (163,218 bp, 205 genes). Ca. Organicella extenuata retains a capacity for replication, transcription, translation, and protein-folding while lacking any capacity for the biosynthesis of amino acids or vitamins. Notably, the endosymbiont retains a capacity for fatty acid synthesis (type II) and iron-sulfur (Fe-S) cluster assembly. Metagenomic analysis of 150 new metagenomes from Organic Lake and more than 70 other Antarctic aquatic locations revealed a strong correlation in abundance between Ca. Organicella extenuata and a novel ciliate of the genus Euplotes. Like Ca. Pinguicoccus supinus, we infer that Ca. Organicella extenuata is an endosymbiont of Euplotes and hypothesize that both Ca. Organicella extenuata and Ca. Pinguicocccus supinus provide fatty acids and Fe-S clusters to their Euplotes host as the foundation of a mutualistic symbiosis. The discovery of Ca. Organicella extenuata as possessing genetic code 4 illustrates that in addition to identifying endosymbionts by sequencing known symbiotic communities and searching metagenome data using reference endosymbiont genomes, the potential exists to identify novel endosymbionts by searching for unusual coding parameters.Entities:
Keywords: Antarctic microbiology; Bacterial endosymbionts; extreme genome reduction; genetic code 4; metagenome
Year: 2021 PMID: 34140946 PMCID: PMC8204192 DOI: 10.3389/fmicb.2021.674758
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Phylogeny of Candidatus Organicella extenuata. Phylogeny of Verrucomicrobia and related bacteria based on 16S rRNA sequences, showing Ca. Organicella extenuata nested inside the newly proposed “Nucleococcus cluster”; other than Ca. Organicella extenuata, this cluster comprises the cytoplasmic endosymbiont Ca. Pinguicoccus supinus from a freshwater ciliate and intranuclear endosymbionts of amitochondriate protists resident in the hindgut of termites. The maximum likelihood tree was constructed with 59 sequences, and positions with less than 80% site coverage were eliminated, resulting in 1,415 positions in the final dataset. Bootstrap values > 70 are shown next to individual nodes. Fusobacterium varium is the outgroup. Accessions are given as NCBI Nucleotide accessions or IMG Gene IDs: for Ca. Organicella extenuata, sequences were included for the original Organic Lake MAG (contig Ga0307966_1000010, bases 107297..108828), Unnamed Lake 18 (contig Ga0400283_000007, bases 52431..53966), and “Portals” Lake (contig Ga0400669_009478, bases 1..1071 and contig Ga0400669_039189, bases 1314..1821). Sequences identical to the 16S rRNA sequence from the original Organic Lake MAG were represented in metagenome data from 19 other Organic Lake metagenomes and also in Unnamed Lake 13 (Supplementary Table 4). Note that nine-digit accessions are IMG Gene IDs, and all others are NCBI Nucleotide accessions.
FIGURE 2Co-occurrence of Candidatus Organicella extenuata and Euplotes sp. AntOrgLke in Antarctic metagenomes. The abundance of Ca. Organicella extenuata (k141_311079) and Euplotes sp. AntOrgLke (bin81 + bin14 + contigk141_859071), calculated as the sum of (contig length × contig coverage) for all contigs, was analyzed using SparCC to determine their co-occurrence (r, correlation coefficient). Across 29 metagenomes in which Ca. Organicella extenuata was detected, the abundance of Euplotes sp. AntOrgLke strongly positively correlated with the abundance of Ca. Organicella extenuata (r = 0.89, p = 0), indicating Euplotes sp. AntOrgLke was likely the host of Ca. Organicella extenuata. None of the other 187 bins representing other potential hosts exhibited a positive correlation above r = 0.54. X-axis labels: Organic Lake, metagenome IDs (see Supplementary Table 5); Other lakes, lake names (Unnamed abbreviated as UN).
FIGURE 3Phylogeny of Euplotes sp. AntOrgLke. Unrooted maximum likelihood phylogeny of RNA polymerase subunit II proteins from members of Ciliophora showing Euplotes sp. AntOrgLke clustering with members of the Euplotes genus. Within the cluster for each RNA polymerase type (RPB, RPC, and RPA), the percent identity between Euplotes sp. AntOrgLke protein and an individual protein is shown after the species name. Bootstrap values ≥ 70 are shown next to individual nodes, and protein sequences are available in Supplementary Dataset 5. A total of 41 RNA polymerase subunit II amino acid sequences were used in analysis. Positions with less than 80% site coverage were eliminated, and 944 positions remained in the final dataset, with the exception of MSTRG.29381.1_fr3, Euplotes vannus which was a partial sequence (283 aa) and is marked with an*.
FIGURE 4Depiction of function of Candidatus Organicella extenuata within Euplotes sp. AntOrgLke. The potential metabolic capacities of Ca. Organicella extenuata are limited to pyruvate oxidation; type II fatty acid biosynthesis; iron-sulfur (Fe-S) cluster assembly; and conversion, activation, and transfer of hexose and heptose sugars. The Ca. Organicella extenuata MAG lacks any identifiable genes for glycolysis, tricarboxylic acid cycle (aside from pyruvate oxidation), pentose phosphate pathway, respiration, fermentation, ATP generation (either by oxidative phosphorylation and ATP synthase, or substrate-level phosphorylation), or synthesis of phospholipids (aside from fatty acids), amino acids, nucleic acids, or vitamins. There were no identifiable transporter genes. Processes, pathways, and enzymes that were inferred to be functional in Ca. Organicella extenuata are shaded in red. OM, outer membrane; IM, inner membrane; LPS, lipopolysaccharide. Fatty acid synthesis (type II): PDH, pyruvate dehydrogenase; CoA, coenzyme A; ACP, acyl carrier protein; ACC, acetyl-CoA carboxylase complex; FabD, malonyl-CoA-ACP-transacylase; FabB and FabF, 3-oxoacyl-ACP synthase; FabG, 3-oxoacyl-ACP reductase; FabZ, 3-hydroxyacyl-ACP dehydratase; FabV, enoyl-ACP reductase. Glycan synthesis: GT, glycosyltransferase (three different GT); ColD, GDP-4-keto-6-deoxy-D-mannose 3-dehydratase; ColC, GDP-L-colitose synthase; Udg, UDP-glucose 6-dehydrogenase; LpsL, UDP-glucuronate epimerase; Tkl, transketolase; GmhA, phosphoheptose isomerase; HddA, D-glycero-α-D-manno-heptose 7-phosphate kinase; HddC, D-glycero-α-D-manno-heptose 1-phosphate guanylyltransferase. DNA replication: GyrAB, DNA gyrase; DnaG, DNA primase; DnaB, replicative DNA helicase. Transcription: RNAP, RNA polymerase. Translation: tRNA, transfer RNA; aaRS, aminoacyl tRNA synthetases. Fe-S cluster assembly: Fdx, ferredoxin; SufS, cysteine desulfurase; SUF, Fe-S cluster assembly complex (SufCBD, SufU, SufT). In this reconstruction, pyruvate is supplied by the host and the sole purpose of PDH is to provide the acetyl-CoA precursor for fatty acid synthesis. The fatty acid synthesis pathway is functionally complete in Ca. Organicella extenuata, with FabF or FabB substituting for missing FabH. There are three pathways involved in synthesis of heptose (glycero-manno-heptose) or hexose (galacturonic acid; colitose) subunits of lipopolysaccharide glycans in Ca. Organicella extenuata, but none of these pathways are complete, and all depend on exogenous precursors.