Maria C Fernandes-Martins1, Lisa M Keller1, Mason Munro-Ehrlich1, Kathryn R Zimlich1, Madelyn K Mettler2, Alexis M England1, Rita Clare3, Kevin Surya1, Everett L Shock4,5, Daniel R Colman1, Eric S Boyd1. 1. Department of Microbiology and Cell Biology, Montana State Universitygrid.41891.35, Bozeman, Montana, USA. 2. Department of Chemical and Biological Engineering, Montana State Universitygrid.41891.35, Bozeman, Montana, USA. 3. Department of Chemistry and Biochemistry, Montana State Universitygrid.41891.35, Bozeman, Montana, USA. 4. School of Molecular Sciences, Arizona State Universitygrid.215654.1, Tempe, Arizona, USA. 5. School of Earth and Space Exploration, Arizona State Universitygrid.215654.1, Tempe, Arizona, USA.
Abstract
Little is known of how the confluence of subsurface and surface processes influences the assembly and habitability of hydrothermal ecosystems. To address this knowledge gap, the geochemical and microbial composition of a high-temperature, circumneutral hot spring in Yellowstone National Park was examined to identify the sources of solutes and their effect on the ecology of microbial inhabitants. Metagenomic analysis showed that populations comprising planktonic and sediment communities are archaeal dominated, are dependent on chemical energy (chemosynthetic), share little overlap in their taxonomic composition, and are differentiated by their inferred use of/tolerance to oxygen and mode of carbon metabolism. The planktonic community is dominated by putative aerobic/aerotolerant autotrophs, while the taxonomic composition of the sediment community is more evenly distributed and comprised of anaerobic heterotrophs. These observations are interpreted to reflect sourcing of the spring by anoxic, organic carbon-limited subsurface hydrothermal fluids and ingassing of atmospheric oxygen that selects for aerobic/aerotolerant organisms that have autotrophic capabilities in the water column. Autotrophy and consumption of oxygen by the planktonic community may influence the assembly of the anaerobic and heterotrophic sediment community. Support for this inference comes from higher estimated rates of genome replication in planktonic populations than sediment populations, indicating faster growth in planktonic populations. Collectively, these observations provide new insight into how mixing of subsurface waters and atmospheric oxygen create dichotomy in the ecology of hot spring communities and suggest that planktonic and sediment communities may have been less differentiated taxonomically and functionally prior to the rise of oxygen at ∼2.4 billion years ago (Gya). IMPORTANCE Understanding the source and availability of energy capable of supporting life in hydrothermal environments is central to predicting the ecology of microbial life on early Earth when volcanic activity was more widespread. Little is known of the substrates supporting microbial life in circumneutral to alkaline springs, despite their relevance to early Earth habitats. Using metagenomic and informatics approaches, water column and sediment habitats in a representative circumneutral hot spring in Yellowstone were shown to be dichotomous, with the former largely hosting aerobic/aerotolerant autotrophs and the latter primarily hosting anaerobic heterotrophs. This dichotomy is attributed to influx of atmospheric oxygen into anoxic deep hydrothermal spring waters. These results indicate that the ecology of microorganisms in circumneutral alkaline springs sourced by deep hydrothermal fluids was different prior to the rise of atmospheric oxygen ∼2.4 Gya, with planktonic and sediment communities likely to be less differentiated than contemporary circumneutral hot springs.
Little is known of how the confluence of subsurface and surface processes influences the assembly and habitability of hydrothermal ecosystems. To address this knowledge gap, the geochemical and microbial composition of a high-temperature, circumneutral hot spring in Yellowstone National Park was examined to identify the sources of solutes and their effect on the ecology of microbial inhabitants. Metagenomic analysis showed that populations comprising planktonic and sediment communities are archaeal dominated, are dependent on chemical energy (chemosynthetic), share little overlap in their taxonomic composition, and are differentiated by their inferred use of/tolerance to oxygen and mode of carbon metabolism. The planktonic community is dominated by putative aerobic/aerotolerant autotrophs, while the taxonomic composition of the sediment community is more evenly distributed and comprised of anaerobic heterotrophs. These observations are interpreted to reflect sourcing of the spring by anoxic, organic carbon-limited subsurface hydrothermal fluids and ingassing of atmospheric oxygen that selects for aerobic/aerotolerant organisms that have autotrophic capabilities in the water column. Autotrophy and consumption of oxygen by the planktonic community may influence the assembly of the anaerobic and heterotrophic sediment community. Support for this inference comes from higher estimated rates of genome replication in planktonic populations than sediment populations, indicating faster growth in planktonic populations. Collectively, these observations provide new insight into how mixing of subsurface waters and atmospheric oxygen create dichotomy in the ecology of hot spring communities and suggest that planktonic and sediment communities may have been less differentiated taxonomically and functionally prior to the rise of oxygen at ∼2.4 billion years ago (Gya). IMPORTANCE Understanding the source and availability of energy capable of supporting life in hydrothermal environments is central to predicting the ecology of microbial life on early Earth when volcanic activity was more widespread. Little is known of the substrates supporting microbial life in circumneutral to alkaline springs, despite their relevance to early Earth habitats. Using metagenomic and informatics approaches, water column and sediment habitats in a representative circumneutral hot spring in Yellowstone were shown to be dichotomous, with the former largely hosting aerobic/aerotolerant autotrophs and the latter primarily hosting anaerobic heterotrophs. This dichotomy is attributed to influx of atmospheric oxygen into anoxic deep hydrothermal spring waters. These results indicate that the ecology of microorganisms in circumneutral alkaline springs sourced by deep hydrothermal fluids was different prior to the rise of atmospheric oxygen ∼2.4 Gya, with planktonic and sediment communities likely to be less differentiated than contemporary circumneutral hot springs.
Yellowstone National Park (YNP), Wyoming, hosts the world’s largest active continental volcanic, hydrothermal system (1), and its surficial expression forms the >10,000 hot springs, mudpots, fumaroles, and geysers distributed across the landscape (2). Each of these hydrothermal features has a unique geochemical composition (3) that supports unique and biodiverse microbial communities (4, 5). This is particularly true for hot springs with geochemical conditions that preclude photosynthesis, such as those with temperatures >73°C (6, 7). In such hot springs, microbial populations are dependent on chemical energy that is generated by dissipating disequilibrium in available electron donors and acceptors through redox-based metabolism (3, 8). As such, the availability of, and disequilibrium in, electron donors/acceptors and other nutrients exerts fundamental control on the taxonomic and functional diversity of high-temperature (>73°C) hot spring populations and their communities.While high-temperature acidic hot springs tend to be dominated by aerobes that are supported by dissimilatory iron- and sulfur-based metabolisms (5, 9), far less is known of the energy sources that support microbial metabolism in circumneutral to alkaline hot springs. To assess energy sources that may support microbial metabolism in circumneutral to alkaline springs, metagenomic data from sediments or biofilms reported from 18 circumneutral to alkaline hot springs (pH ≥ 6.5) in YNP were analyzed for the abundance of proteins involved in a variety of energy metabolism functionalities (5). Sediment- or biofilm-associated communities from circumneutral to alkaline springs were found to be enriched in functionalities involving respiration of inorganic nitrogen compounds or oxygen (5). Perhaps consistent with this, a predominant bacterium identified in circumneutral to alkaline springs is Thermocrinis (10, 11), a genus comprising microaerophilic organisms (12). However, other studies have shown that a predominant archaeal genus commonly identified in waters and/or sediments of circumneutral to alkaline springs is Pyrobaculum (13). Some Pyrobaculum strains respire oxygen (14–16), whereas other strains are anaerobes that can respire ferric iron (17) or, in the case of Pyrobaculum yellowstonensis, compounds such as elemental sulfur or arsenate (18). The prevalence of P. yellowstonensis strains putatively capable of respiring arsenic in circumneutral to alkaline high-temperature springs in YNP (13) is consistent with elevated concentrations of arsenic, including arsenate, in this spring type (19).Previous 16S rRNA gene studies have shown that communities inhabiting the water column (i.e., planktonic) and the sediments of hot springs can be differentiated by habitat type, both in YNP (13) and in other continental hot springs such as those in the Great Basin, United States (20), and Tengchong, China (21, 22). Colman et al. (13) show that this is particularly true for hot springs in YNP with circumneutral to alkaline pH that are sourced from the parent hydrothermal aquifer. Sediments in these springs often lack detectable redox active minerals (13), leading to the hypothesis that the electron donors and acceptors fueling microbial communities are in a soluble or dissolved form supplied by the volcanic system or by exogenous input. However, this hypothesis has yet to be robustly evaluated. An important consideration in evaluating such a hypothesis is that deep fluids sourced from the parent hydrothermal aquifer tend to be saturated with dissolved silica that can precipitate as sinter during their ascent and cooling as they migrate to the surface (23, 24). This is thought to seal the plumbing system feeding these springs, thereby limiting the input of dissolved organic carbon (DOC) and oxygen (O2) enriched near-surface groundwaters (25) and further constraining the spectrum of nutrients and energy sources available to microorganisms inhabiting these springs. Evidence in support of this comes from the lack of detection of tritium (half-life, 12.3 years) in hot spring waters sourced by the parent aquifer (26–28); tritium would be expected to be present in hot spring waters if recent meteoric water was input into these springs. Given the length of time that these waters have been out of contact with the atmosphere (>200 years [26]), it is likely that the waters discharging in this spring type are anoxic and lacking in electron acceptors for microbial metabolism other than SO42− and bicarbonate from parent hydrothermal fluid (2). The high temperatures of these discharging waters likely also limit extensive infusion of atmospheric O2, given the inverse relationship between oxygen solubility and temperature (8). Further, the high temperatures (>73°C) of these springs do not permit photosynthesis (6, 7) and, thus, endogenous production of photosynthetic DOC or O2. Thus, it is possible that previously observed habitat differentiation in planktonic and sediment communities inhabiting circumneutral to alkaline hot springs in YNP (13) and elsewhere (20–22) derives from differential availability of oxidants, in particular, atmospheric O2, given the limited availability of alternative oxidants in these environments.In the present study, we hypothesized that hot springs sourced by the deep hydrothermal aquifer in YNP would host distinct sediment and planktonic microbial communities that are adapted to take advantage of differentially available electron donor/acceptor pairs in their respective habitats. Specifically, we hypothesized that populations comprising the planktonic community would be primarily aerobic, given the potential infusion of O2 through atmospheric ingassing, whereas populations inhabiting sediments would be primarily anaerobic due to limited availability of dissolved O2 in that habitat and consumption of O2 in the overlying water column. Further, we hypothesized that the populations comprising both planktonic and sediment communities would be primarily autotrophic, given the low concentrations of DOC present in fluids sourcing this spring type (25). We assessed these interrelated hypotheses through analyses of the geochemistry of hot spring waters and metagenomic sequencing of sediment and planktonic microbial communities. Metagenomic sequences were binned into population-level metagenome-assembled genomes (MAGs) and subjected to analyses of community structure, inferred metabolic capabilities, and estimated genome replication rates. The results are discussed in relation to the consequences of influx of atmospheric gas (i.e., O2) into reduced waters that are DOC limited on the ecology of planktonic and sediment communities inhabiting circumneutral to alkaline hot springs in YNP. The results provide insight into how the habitability and ecology of these early Earth analog environments may have changed as O2 began to accumulate in the atmosphere ∼2.4 billion years ago (Gya) (as reviewed in reference 29).
RESULTS AND DISCUSSION
Hot spring description.
Gibbon Geyser Spring 63 (National Park Service identifier GGS063) is a large (∼2 m diameter) (Fig. 1A), shallow (∼0.5 m depth), turbulent, and circumneutral hot spring (pH 7.1; temperature, 85.9°C) (Fig. 1C) located in the northern section of the Geyser Creek area of the Gibbon Geyser Basin in YNP. Concentrations of sulfate (98.6 mg liter−1) and chloride (690 mg liter−1) are consistent with this spring being sourced by the parent hydrothermal aquifer (Fig. 1B) (2, 29). Thus, the spring is representative of >50% of hot springs in YNP that are also thought to be sourced primarily by this aquifer (9). Like many high-temperature circumneutral to alkaline springs, the sulfate and chloride in GGS063 are likely further concentrated due to boiling and evaporation (29). As such, dissolved solutes in the spring, which are delivered from the aquifer sourcing the spring, are presumed to be uniform in the pool. Concentrations of nitrate and nitrite were below the limit of detection (Fig. 1C). This is also consistent with GGS063 being sourced by the parent hydrothermal aquifer with limited input of surface waters since nitrate is a common solute in hot springs impacted by input of near-surface groundwaters (30). The concentration of DOC was low (0.4 mg liter−1) and is near the detection limit of the method used. The low concentration of DOC in GGS063 is consistent with low DOC concentrations measured in springs in YNP (and Tengchong, China) sourced by the parent hydrothermal aquifer (25), as discussed in more detail below.
FIG 1
Physical and chemical properties of Gibbon Geyser Spring 063 (GGS063), Yellowstone National Park (YNP). (A) Image of GGS063, looking northwest, with the red arrow denoting the location within the spring where planktonic and sediment biomass was collected. (B) Plot of sulfate (SO42−) and chloride (Cl−) concentrations in YNP springs sampled between the years 2003 and 2013 from U.S. Geological Survey (USGS) publications in gray (32, 90) and end member fluid compositions, based on a published model (29), denoted in black boxes. The measured concentrations for GGS063 are indicated by the black data point, which represents the sulfate and chloride values measured for GGS063. (C) Physical and select chemical measurements made for GGS063. All measured chemical data are available in Table S5 in the supplemental material. Elev., elevation; Temp., temperature; Cond., conductivity.; DO, dissolved oxygen; T, total; B.D, below detection. All chemical data are reported in mg liter−1.
Physical and chemical properties of Gibbon Geyser Spring 063 (GGS063), Yellowstone National Park (YNP). (A) Image of GGS063, looking northwest, with the red arrow denoting the location within the spring where planktonic and sediment biomass was collected. (B) Plot of sulfate (SO42−) and chloride (Cl−) concentrations in YNP springs sampled between the years 2003 and 2013 from U.S. Geological Survey (USGS) publications in gray (32, 90) and end member fluid compositions, based on a published model (29), denoted in black boxes. The measured concentrations for GGS063 are indicated by the black data point, which represents the sulfate and chloride values measured for GGS063. (C) Physical and select chemical measurements made for GGS063. All measured chemical data are available in Table S5 in the supplemental material. Elev., elevation; Temp., temperature; Cond., conductivity.; DO, dissolved oxygen; T, total; B.D, below detection. All chemical data are reported in mg liter−1.GGS063 had a low concentration of dissolved O2 (0.021 mg liter−1 or 1.3 μM) that was near the limit of detection of the probe used (0.015 mg liter−1 or 0.9 μM). Dissolved O2 was measured at a depth of ∼20 cm beneath the air-water interface, and it is possible that spatial variability exists in the measurement, given the turbulence of the spring. While spatial variation of dissolved O2 (or other dissolved components) was not assessed in this study, the low concentration of dissolved O2 measured in GGS063 is consistent with other circumneutral to alkaline hot springs in YNP (4) where temperatures are too high (>73°C) to support endogenous production of O2 through oxygenic photosynthesis (6, 7). Rather, the dissolved O2 measured in springs like GGS063 is due to atmospheric influx, which is limited at high temperatures (31).The concentration of total arsenic (3.1 mg liter−1 or 41 μM) was high in GGS063 waters (Fig. 1) and is among the highest concentration of total arsenic measured in hydrothermal waters in YNP (19, 32; https://www.usgs.gov/mission-areas/water-resources/science/water-chemistry-data-selected-springs-geysers-and-streams?qt-science_center_objects=0#qt-science_center_objects). This is consistent with the high arsenic concentrations measured in hot springs sourced by the parent hydrothermal aquifer since the rhyolitic bedrock that hosts this aquifer is thought to be the source of As in these springs (19). While the oxidation state of arsenic measured in GGS063 was not determined in the present study, the amount of total arsenic as arsenite [As(III)] in other springs with similar chemical compositions tends to be >50%, with the balance presumed to be arsenate [As(V)] (19, 32; https://www.usgs.gov/mission-areas/water-resources/science/water-chemistry-data-selected-springs-geysers-and-streams?qt-science_center_objects=0#qt-science_center_objects). As(III) is the predominant form of arsenic in hot springs sourced by the parent hydrothermal aquifer (33) since its oxidation is slow in the absence of O2 and aerobic microorganisms (34). As such, As(III) is likely to be an available electron donor in GGS063, and As(V), generated as a product of aerobic microbial As(III) oxidation, may be an available electron acceptor. Other soluble inorganic electron donors that were measured include sulfide (0.18 mg liter−1 or 5.3 μM). Other soluble electron acceptors, such as ferric iron, were below the limit of detection. Although evidence for iron oxides or elemental sulfur were not visually apparent in GGS063 (Fig. 1A) and these minerals are not typical for springs with circumneutral to alkaline pH sourced by the parent hydrothermal aquifer (13, 35), their presence and potential use as electron acceptor and/or donor by sediment communities cannot be excluded. The complete list of measured aqueous chemical data is available in Table S5 in the supplemental material.
Planktonic and sediment community composition and structure.
While the focus of the manuscript is a functional analysis of planktonic and sediment communities using metagenomics approaches, it was first necessary to ensure that the taxonomic composition of MAGs obtained from our metagenomes was reflective of the community 16S rRNA genes that were assembled from the metagenomes, as well as 16S rRNA genes from other high-temperature circumneutral to alkaline springs in YNP. Indeed, the taxonomic composition of MAGs and 16S rRNA genes extracted from metagenomic assemblies was similar in both the planktonic and sediment communities (Fig. S1A). Further, comparison of the composition of 16S rRNA genes obtained from planktonic and sediment communities from GGS063 to those compiled from the water column and sediments from other circumneutral to alkaline hot spring communities in YNP (13) shows that they are also highly similar, with the exception of Desulfurococcales, which were particularly abundant in the sediment metagenome but largely absent in the previously reported PCR-amplified 16S rRNA gene libraries (Fig. S1A and B). The recovery of Desulfurococcales MAGs and 16S rRNA genes in GGS063 versus 16S rRNA gene amplicons from other hot springs is likely attributable to the primers (515F-806R [36]) that were used to amplify 16S rRNA genes in the previous study that are now known to be biased against the amplification of key groups of Archaea (37), including members of the Desulfurococcales (38). Collectively, these data indicate that the taxonomic and functional insights generated from MAGs from planktonic and sediment communities in GGS063 are reflective of other communities inhabiting circumneutral to alkaline springs in YNP.A total of 12 and 33 MAGs were recovered from the planktonic and sediment communities in GGS063, respectively, indicating that the latter is taxonomically richer than the former. These observations were also highly consistent with 16S rRNA gene diversity observed within the entire metagenomes (16 versus 28 operational taxonomic units, respectively). Consistent with the planktonic community being less taxonomically rich, it was dominated by two MAGs, one of which is closely related (95.2% average nucleotide identity [ANI]) to the archaeon Pyrobaculum yellowstonensis WP30 (bin P11; 47.6% of community) and the other that is closely related (93.4% ANI) to the bacterium Thermocrinis ruber OC 1/4 (P10; 40.4%) (Fig. 2A; Table S2). The type strains of T. ruber OC 1/4 and P. yellowstonensis WP30 were both isolated from circumneutral to alkaline hot springs in YNP (12, 18), and close relatives are commonly detected by molecular analyses of springs sourced by the parent hydrothermal aquifer in YNP (13, 18, 39, 40). In contrast to the planktonic community that consists of both bacteria (40.5% of community) and archaea (59.5%), MAGs recovered from the sediment community were primarily affiliated with archaea (96.6%), and their abundance was more evenly distributed (Fig. 2). Moreover, the combined abundances of the top 17 most abundant MAGs in the sediment (88.3% of community) were nearly equivalent to the combined abundance (88.0% of the community) of the two most abundant MAGs in the plankton community (Fig. 2). The most abundant MAG in the sediment community (bin S34; 13.3% of community) was distantly related to the archaeon Desulfurococcus amylolyticus (76.3% ANI) (Fig. 2B). The distinctly different taxonomic structures of the GGS063 planktonic and sediment communities suggest different factors are influencing the assembly of these two communities. The dominance of the planktonic community by two taxa, Pyrobaculum (bin P11) and Thermocrinis (P10), suggests that these taxa are efficient at acquiring nutrients from the water column to fuel their metabolism and growth. Dominance of the planktonic community suggests that the spectrum of nutrients available in the water column is narrow, allowing for minimal partitioning of niche space to support additional abundant taxa (41). In contrast, the more even distribution in the abundances of MAGs in the sediment community suggests that the greater diversity of taxa may be supported by a wider spectrum of available electrons and acceptors that may include minerals (e.g., elemental sulfur), as has been suggested previously for hot spring ecosystems supported by chemical energy (13). It is equally plausible that the apparent prevalence of anaerobes in the sediment community can be attributed to those organisms being excluded from the water column community due to their sensitivity to O2.
FIG 2
Rank-abundance plot of Gibbon Geyser Spring 063 (GGS063) metagenome-assembled genomes (MAGs) from planktonic (A) and sediment (B) communities. The relative abundance of MAGs with an estimated completeness of >50% is presented (MAG completeness indicated in parentheses). MAG taxonomy was assessed using GTDB-Tk classification and reported with the average nucleotide identity (ANI) to the highest taxonomic rank available, as reported in Table S2.
Rank-abundance plot of Gibbon Geyser Spring 063 (GGS063) metagenome-assembled genomes (MAGs) from planktonic (A) and sediment (B) communities. The relative abundance of MAGs with an estimated completeness of >50% is presented (MAG completeness indicated in parentheses). MAG taxonomy was assessed using GTDB-Tk classification and reported with the average nucleotide identity (ANI) to the highest taxonomic rank available, as reported in Table S2.A common characteristic of MAGs comprising both the planktonic and sediment communities is that they are distantly related to cultivars. For the planktonic community, only 2 of 8 MAGs (ANIs could not be calculated for 4 MAGs due to incompleteness) exhibited more than 90% ANI to a cultured species, with the remainder of MAGs exhibiting ANIs to characterized species that ranged from 75 to 90% (Table S2). This trend is even more pronounced in the sediment community, with only 7 of the 25 MAGs (ANIs could not be calculated for 9 MAGs due to incompleteness) exhibiting more than 90% ANI to a characterized species. These observations could be explained by the fact that most studies of circumneutral to alkaline hot springs in YNP have been focused on biofilm (i.e., “streamers”) (4, 39, 42), and fewer have been conducted on communities inhabiting the water columns and sediments of these springs. These observations call for additional cultivation-dependent and cultivation-independent characterizations of populations inhabiting circumneutral to alkaline springs.
Phylogenetic similarity analysis.
A network was built from an amino acid identity (AAI) matrix depicting the similarities among the 45 sediment and planktonic MAGs from GGS063 (Fig. 3). The sediment MAGs (nodes), which are predominantly archaeal, define the topology of the network, a finding that is consistent with the observation that nearly all planktonic MAGs are connected by short branches to a closely related sediment MAG. The only exception is the Thermocrinis MAG (bin P10) that does not have a corresponding high-quality MAG in the sediment community. Eight predominant MAG clusters were identified in the network, including MAGs affiliated with the Thermodesulfobacteriaceae (cluster 1; bins P9 and S4; pairwise AAI, 98.1%), Fervidococcaceae (cluster 2; P5 and S20; pairwise AAI, 98.2%), Desulfurococcaceae (cluster 3; P3 and S29; pairwise AAI, 99.4%), Pyrobaculum (cluster 4; P11, S18, and S24; mean pairwise AAI, 89.1%), Thermofilaceae (cluster 5; P1, P4, S1, S6, S13, and S19; mean pairwise AAI, 64.0%), Desulfurococcaceae (cluster 6; P7, P8, S12, S14, S15, S21, S25, S34, and S36; mean pairwise AA, 67.6%), Ignisphaera (cluster 7; P2, P6, S23, and S33; mean pairwise AAI, 72.8%), and Desulfurococcales (cluster 8; 12P and 9S, mean pairwise AAI, 98.9%). These clusters comprise closely related MAGs that were identified in both the sediment and planktonic communities. Within a defined cluster, the sediment-plankton MAG pairs tend to have a higher pairwise AAI than MAGs from within the same community; planktonic MAGs do not form a cluster of their own. Collectively, this observation supports the results from the rank abundance analysis, indicating that the planktonic community is taxonomically dominated by a few phylotypes, whereas the sediment community is taxonomically more even and is comprised of a more diverse array of closely related phylotypes.
FIG 3
Correlation network of 45 archaeal and bacterial MAGs (nodes) from Gibbon Geyser Spring 063 (GGS063). The network depicts the average amino acid identity (AAI) of MAGs, with longer edges indicating lower AAI values and shorter edges indicating higher AAI values. Edges are only shown between pairs of MAGs when AAI is >40%. Edge colors in pink denote AAI of >90%. The node designations correspond to the bins in Fig. 2 and as defined in Table S2. Node colors in orange denote MAGs recovered from GGS063 sediment (sediment community), and nodes in blue denote MAGs recovered from the GGS063 water column (planktonic community). Clusters of nodes are numbered 1 to 8 and correspond to those discussed in the text.
Correlation network of 45 archaeal and bacterial MAGs (nodes) from Gibbon Geyser Spring 063 (GGS063). The network depicts the average amino acid identity (AAI) of MAGs, with longer edges indicating lower AAI values and shorter edges indicating higher AAI values. Edges are only shown between pairs of MAGs when AAI is >40%. Edge colors in pink denote AAI of >90%. The node designations correspond to the bins in Fig. 2 and as defined in Table S2. Node colors in orange denote MAGs recovered from GGS063 sediment (sediment community), and nodes in blue denote MAGs recovered from the GGS063 water column (planktonic community). Clusters of nodes are numbered 1 to 8 and correspond to those discussed in the text.The observation that the sediment community in GGS063 is more evenly structured and more biodiverse than the planktonic community is consistent with a previous study of sediment and planktonic communities in 15 high-temperature hot springs in YNP (13). In that study, it was hypothesized that the higher diversity and evenness in sediment communities, compared to planktonic communities, can be attributed to those communities being differentiated by characteristics of their local habitat (13). Specifically, based on 16S rRNA gene inference to cultivars, it was suggested that planktonic communities are supported to a greater extent by aerobic metabolisms, whereas sediment communities are largely supported by mineral-based anaerobic metabolisms, in particular, those involving elemental sulfur or ferric iron reduction. This was interpreted to reflect differential availability of O2, the preferred electron acceptor for the metabolisms of thermophiles (3, 8), in the hot spring water column versus the sediments. This motivated the hypothesis that planktonic and sediment populations in GGS063 are differentiated by oxidant availability, in particular, dissolved O2, and the metagenomics-enabled functional analysis of the encoded potential of planktonic- and sediment-associated organisms to respire O2, which was tested herein.
Metabolic differentiation of planktonic and sediment MAGs.
Roughly 47% and 50% of the planktonic community members were predicted to be capable of aerobic metabolism or to be aerotolerant, respectively, totaling roughly 97% of the community (Fig. 4A). This includes two abundant MAGs most closely related to Thermocrinis (bin P10; 40.4% of the community) and Desulfurococcaceae (P3; 6.2%) that both encode homologs of cytochrome c oxidase (Cox). The most abundant MAG in the planktonic community that is related to Pyrobaculum (P11; 47.6%) encodes a homolog of cytochrome bd oxidase (CydA), which is common in P. yellowstonensis WP30 and other Crenarchaeota. No members of Pyrobaculum are known to respire O2 using CydA, and this MAG is thus predicted to correspond to an aerotolerant anaerobe. Like P. yellowstonensis WP30 (18), the GGS063 Pyrobaculum P11 MAG encodes several homologs of dimethyl sulfoxide (DMSO) reductase that are inferred to function in elemental sulfur/polysulfide reduction and a homolog of arsenate reductase (ArrA) that is inferred to function in arsenate reduction, suggesting that elemental sulfur or As(V) may be respired by this organism. Interestingly, this MAG also encodes a homolog of [NiFe] hydrogenase and 2 homologs of arsenite oxidase (AioA), suggesting that it can use H2 or As(III) as electron donors. This would potentially help to explain how this strain could grow autotrophically via the dicarboxylate/4-hydroxybutyrate (DC/4HB) pathway (discussed below). Similar to Pyrobaculum bin P11, a MAG most closely related to Thermofilum (P1; 1.8%) is predicted to be aerotolerant based on it encoding CydABX. The detection of abundant Pyrobaculum- and Thermofilum-affiliated MAGs that are predicted to be aerotolerant anaerobes in GGS063 is consistent with the limited solubility of O2 in nearly boiling waters (boiling point is 93°C in YNP) of GGS063 (85.9°C, when sampled) (31) (Fig. 1C). Based on the detection of homologs of AioA, sulfide-quinone oxidoreductase (Sqo), and thiosulfate/sulfur oxidase (Sox), T. ruber is inferred to be capable of utilizing As(III), sulfide, or the sulfide oxidation product thiosulfate as an electron donor during O2 respiration (discussed below). It is thus possible that aerobic respiration by T. ruber in the water column of GGS063 facilitates the presence of Pyrobaculum and Thermofilum by consuming O2. In the case of Pyrobaculum, T. ruber may also be a source of elemental sulfur/polysulfide through incomplete oxidation of sulfide or As(V) via oxidation of As(III). As such, only 3.9% of the planktonic community was determined to be comprised of strict anaerobes.
FIG 4
Inferred metabolic potential of planktonic and sediment communities from Gibbon Geyser Spring 063 (GGS063). (A) The predicted ability for a population to integrate oxygen into their energy metabolism, as assessed by proteins encoded in MAGs comprising those communities, is depicted. MAGs were classified as aerobic, anaerobic, or aerotolerant using criteria outlined in the text. Oxygen preference for Thermofilum (bin P4) and Aenigmatarchaeota (S16) could not be determined and are labeled as unknown. (B) Predicted ability for a population to perform autotrophy in the planktonic and sediment communities from GGS063. Populations were designated autotrophs or heterotrophs based on proteins encoded in MAGs. The mode of carbon metabolism of Verstraetearchaeota (bins S2, S30, and S31) and Aenigmarchaeota (S16) could not be determined and are labeled as unknown.
Inferred metabolic potential of planktonic and sediment communities from Gibbon Geyser Spring 063 (GGS063). (A) The predicted ability for a population to integrate oxygen into their energy metabolism, as assessed by proteins encoded in MAGs comprising those communities, is depicted. MAGs were classified as aerobic, anaerobic, or aerotolerant using criteria outlined in the text. Oxygen preference for Thermofilum (bin P4) and Aenigmatarchaeota (S16) could not be determined and are labeled as unknown. (B) Predicted ability for a population to perform autotrophy in the planktonic and sediment communities from GGS063. Populations were designated autotrophs or heterotrophs based on proteins encoded in MAGs. The mode of carbon metabolism of Verstraetearchaeota (bins S2, S30, and S31) and Aenigmarchaeota (S16) could not be determined and are labeled as unknown.In contrast to the planktonic community that is primarily comprised of putative aerobes or aerotolerant anaerobes, 23 out of the 33 sediment MAGs, which represent ∼79% of the community, are predicted to be strict anaerobes, including the 5 most abundant MAGs, Desulfurococacceae (bin S34; 13.4% of the community), Desulfurococacceae (S11; 12.1%), Desulfurococacceae (S36; 11.3%), Verstraetearchaeota (S30; 7.6%), and Ignisphaera (S33; 6.9%). The detection of divergent homologs of sulfite/tetrathionite reductase (AsrAB) in Desulfurococacceae bins S34 and S36 and Ignisphaera S33 suggests the possibility that these populations reduce an oxyanion of sulfur (e.g., sulfite, tetrathionate) that might be produced in the water column through incomplete oxidation of sulfide (e.g., Thermocrinis). Desulfurococacceae bin S11 likely corresponds to an anaerobic heterotroph or fermenter, based on the lack of homologs allowing for other anaerobic respiratory capabilities and the lack of homologs of autotrophic pathways (described below). It is also possible that such proteins are encoded in the genome of S11 but either were not sequenced or did not assemble well. Like many environmental Verstraetearchaeota MAGs recovered from hot springs and other environments (43), all three bins (S2, S30, and S31) from GGS063 encode enzyme homologs involved in anaerobic methane or alkane metabolism (methanogenesis, methanotrophy, alkanotrophy), but only one bin (S31) encodes methyl CoM reductase subunit A (McrA), a genetic marker for these processes. Two of the three bins (S2, S30) encoded homologs of [NiFe] hydrogenase. Thus, the metabolism of these organisms is not clear, but they may be supported by oxidation of H2 or oxidation of methane or other alkanes by yet-to-be-defined pathways. Another six sediment MAGs (11.7% of the community) are predicted to be aerotolerant based on detection of CydABX, including Thermofilum (bin S1; 4.7% of community), Pyrobaculum (S18; 2.1%), Archaeoglobus (S27; 1.8%), Korarchaeum (S22; 1.5%), and Caldimicrobium (S4; 1.4%). Only three MAGs are predicted to be aerobic based on detection of Cox, and a single MAG could not be confidently assigned as aerobic, aerotolerant, or anaerobic (S16; Aenigmatarchaeota). The two aerobic MAGs with greater than 1% abundance belong to uncharacterized Desulfurococacceae (bin S29; 6.7% of community) and Armatimonadetes (S17; 1.7%).The dichotomy in the predicted metabolism of O2 or its detoxification in the planktonic community (∼97% of the community correspond to organisms capable of respiring or detoxifying O2) versus the sediment community (∼79% of the community correspond to strict anaerobes) suggests that the availability of O2 likely varies markedly between these two environment types within the hot spring. The waters sourcing GGS063 are deficient in O2 due to its consumption via water-rock interactions during the lengthy inferred residence time (>200 years) (26–28) that these waters have been out of contact with the atmosphere. Further, the geochemical composition of GGS063 waters reveals little evidence for mixing with more oxidized near-surface water (e.g., nitrate below detection, no evidence for dilution of sulfate/chloride). This is possibly due to silica armoring the conduit(s) feeding this spring (23, 24) and thus restricting near-surface inputs of groundwater. To the extent this is true and since the temperature of GGS063 is too high for oxygenic photosynthesis (6, 7), the primary source of oxygen in GGS063 waters is likely influx from the atmosphere, which provides limited O2 in waters with high temperature (85.9°C) (31), such as those in GGS063. It is suggested that the high proportion of anaerobes in the sediment community is thus due to limited availability of O2 in GGS063 (0.021 mg liter−1) due to limited influx of atmospheric gases combined with consumption by aerobes (e.g., T. ruber) comprising the planktonic community that render the sediment environment of the spring suboxic or anoxic, allowing for or facilitating habitation by anaerobes. The low concentration of dissolved O2 in waters of GGS063 (0.021 mg liter−1) is similar to other springs in YNP sourced by the parent hydrothermal aquifer (<0.042 mg liter−1) (4), suggesting that the dichotomy between planktonic and sediment communities as it relates to O2 usage is unlikely to be limited to GGS063.A previous study of 181 hot springs in YNP showed that circumneutral to alkaline springs sourced with parent hydrothermal water also tend to be limited in DOC compared to those that are sourced by mixed meteoric and hydrothermal fluids (25). The low concentration of DOC (0.4 mg liter−1) in water from GGS063 (Fig. 1) is consistent with this previously observed pattern. Like O2, the limited availability of DOC in circumneutral to alkaline springs may be attributable to silica armoring of the conduit(s) feeding this spring type that may prevent substantial input of near-surface DOC-enriched ground- or surface waters (44). Further support that circumneutral to alkaline springs tend to be limited in autochthonous DOC comes from isotopic analyses of streamer biofilms from several other alkaline springs across YNP (42, 45). These studies suggest the streamer biofilm communities primarily comprise autotrophs, but these populations are facultative and rapidly (albeit temporally) switch to heterotrophic or mixotrophic metabolisms when allochthonous organic carbon is input into springs from runoff due to summer rainstorms. Since GGS063 is sourced by deep hydrothermal parent fluids and has low concentrations of DOC, it was thus hypothesized that microbial populations at the time of sampling (July 2017) would be capable of autotrophic metabolism. Like the streamer biofilm communities, it is possible that these autotrophs are facultative and would switch to heterotrophic metabolism should allochthonous organic carbon become temporally available.To examine this hypothesis, the inferred proteomes of GGS063 planktonic and sediment MAGs were examined for the key proteins associated with the six known autotrophic pathways (46, 47). These included (i) the Calvin-Benson-Bassham (CBB) cycle, (ii) the reductive tricarboxylic acid (rTCA) cycle, (iii) the Wood-Ljungdahl (WL) pathway, (iv) the 3-hydroxypropionate (3HP) bicycle, (v) the 3HP/4HB cycle, and (vi) the DC/4HB cycle. Similarly, the proteomes of MAGs were examined for proteins that would allow for the use of other inorganic electron donors and acceptors that could potentially inform on their ecology in GGS063.Roughly 88.1% of the planktonic community is predicted to be capable of autotrophic metabolism (Fig. 4; Table S7). This includes MAGs closely related to Pyrobaculum (bin P11; 47.6% of the community), Thermocrinis (P10; 40.4%), and Thermodesulfobacterium (P9; 0.1%) that encoded the genetic potential to be autotrophic. Specifically, Pyrobaculum encoded the DC/4HB pathway, Thermocrinis encoded the rTCA cycle, and Thermodesulfobacterium encoded the WL pathway. Pyrobaculum (P11) is inferred to respire elemental sulfur/polysulfide or As(V) based on the MAG encoding a variety of DMSO reductase homologs and a homolog of arsenate reductase, ArrA. Electron donors may include H2 or As(III) based on the MAG encoding homologs of [NiFe] hydrogenase and arsenite oxidase, AioA. Thermocrinis (P10) is predicted to support autotrophic growth by coupling the oxidation of sulfide (homologs of Sqo), thiosulfate (Sox), or As(III) (AioA) to the reduction of O2 (Cox). Finally, Thermodesulfobacterium (P9) is predicted to support autotrophic growth by coupling H2 oxidation (homologs of [NiFe] hydrogenase) to SO42− reduction (homologs of Sat, AprAB, DsrAB). Thermodesulfobacterium may supplement autotrophic metabolism with organic carbon in a mixotrophic mode of metabolism or may be facultatively autotrophic, as evidenced by the MAG encoding a full glycolysis pathway. It is important to note that P. yellowstonensis WP30, which also encodes the DC/4HB pathway, could not be cultivated autotrophically when grown anaerobically on elemental sulfur or As(V) (18). However, many archaea, including sulfur- and iron-reducing archaea (48, 49), are known to harbor mixotrophic carbon and energy metabolisms, and mixotrophic metabolism was not specifically examined in the study of P. yellowstonensis WP30.Surprisingly, only two sediment MAGs comprising just 3.2% of the community encoded autotrophic pathways (Fig. 4; Table S7) This included MAGs most closely related to the bacterium Caldimicrobium (bin S4; 1.4% of the community) and the archaeon Archaeoglobus (S27; 1.8%), both of which encoded the WL pathway. Caldimicrobium and Archaeoglobus are predicted to couple H2 oxidation (homologs of [NiFe] hydrogenases) to SO42− reduction (homologs of DsrAB, Sat, and AprAB) to fuel autotrophic growth. Both the Caldimicrobium and Archaeoglobus MAGs also encode a complete glycolysis pathway, indicating the organisms are facultative autotrophs, and the Archaeoglobus MAG encodes a complete TCA cycle.Only 11.9% of the planktonic community is inferred to be obligately heterotrophic (Fig. 4; Table S7), based on the absence of genes in MAGs that encode autotrophic pathways. Of these putative heterotrophic MAGs, Desulfurococcaceae (bin P3; 6.2% of the community) is inferred to be an aerobic heterotroph based on the presence of homologs of Cox. Aerobic members of the Desulfurococcaceae are rare (50), however, this MAG shares 79.3% DNA-directed RNA polymerase subunit B (RpoB) identity with that of Aeropyrum camini (Desulfurococcaceae), which is capable of aerobic respiration (51). All other planktonic heterotrophic MAGs (Desulfurococcales, bin P12; Thermofilum, P1; Thermofilum, P4; Desulfurococcaceae, P6, P2, and P7; Fervidococcaceae, P5; and Thermosphaera in P8) are inferred to be obligately fermentative due to the apparent absence of terminal reductases and the presence of homologs of enzymes that typify fermentative organisms, such as alcohol dehydrogenases.In contrast to the planktonic community, inferred heterotrophs constituted 87.2% of the sediment community, suggesting organic carbon serves as both carbon source and electron donor for these populations (Fig. 4; Table S7). Based on the inferred proteomes of MAGs, four strategies of heterotrophic metabolism were observed: fermentation or heterotrophic respiration of O2, As(V), or a variety of sulfur compounds. Several heterotrophic MAGs in the sediment, including Desulfurococcaceae (bins S11, S14, S21, S26, S28, S34, and S36), Pyrobaculum (S24), and Thermoproteaceae (S10), encoded the potential for dissimilatory SO42− (homologs of DsrAB, AprA, Sat) and/or sulfite/tetrathionate (Asr) reduction. Heterotrophic MAGs predicted to be aerobes (homologs of Cox) included Desulfurococcaeae (bin S29), Armatimonadetes (S17), and Candidatus Caldipriscus (S7). All other sediment heterotrophic MAGs, including Ignisphaera (bin S33), Thermofilum (S1, S6, S13, and S19), Desulfurococcaeae (S3, S12, and S25), Desulfurococcales (S9), Caldarchaeales (S32), Ignisphaera (S23), Fervidococcaceae (S20), and Thermosphaera (S15) are inferred to be fermentative due to the apparent absence of terminal reductases and the presence of homologs of fermentative enzymes such as alcohol dehydrogenases. It is important to note that Pyrobaculum (bin S24) encoded many of the enzymes required for the DC/4HB pathway but did not encode a key enzyme demarcating this pathway, 4-hydroxybutyrate dehydrogenase. It is possible that this is due to the relatively low completeness (62.1%) of the genome. Nonetheless, given that it represents only 2.8% of the community, referring to this genome as autotrophic or heterotrophic would not markedly change the observation that the majority of the sediment community is comprised of putative obligate heterotrophs.
Genome replication rate estimations.
Estimates of genome replication rates (i.e., GRiD scores) were calculated for MAGs with >50% completeness and <5% contamination (Table S2). However, since phylogenetically distinct organisms may replicate their genomes at different rates independent of environmental characteristics or growth status (52), only GRiD scores calculated for pairs of phylogenetically similar MAGs in planktonic and sediment communities were compared. Planktonic and sediment MAGs that shared >95% ANI were identified (eight MAG pairs in total) (Table S4) and their GRiD scores at the time of sampling were estimated and compared (Table S7). These eight MAG pairs include those related to Thermofilum (bins P1 and S1), Pyrobaculum (P11 and S18), Desulfurococcales (P12 and S9), Fervidococcaceae (P5 and S20), Desulfuroccocaceae (P6 and S23), Desulfuroccocaceae (P7 and S25), Thermosphaera (P8 and S36), and Thermodesulfobacteraceae (P9 and S4). All of these pairwise comparisons involve MAGs from putative aerotolerant anaerobes or strict anaerobes; the MAGs from putative aerobes either did not have a closely related MAG of high quality in the sediment community (i.e., Thermocrinis bin P10), or one of the pairwise MAGs did not meet quality standards for completeness or for dif/ter ratios (i.e., Aeropyrum P3 and S29).In seven of eight pairwise comparisons, the GRiD scores for planktonic MAGs exceeded those of the sediment MAGs (Fig. 5A). In particular, the Pyrobaculum population in the planktonic community (bin P11) had a substantially higher GRiD score than the corresponding MAG in the sediment (S18). One MAG pair, Desulfurococcales (P12 and S9), had a higher, albeit not statistically significant, GRiD score in the sediment community than the planktonic community. This suggests that GRiD scores in phylogenetically closely related MAGs were higher in the plankton community than in the sediment community. To the extent that the pattern of higher genome replication rates for phylogenetically closely related MAGs of aerotolerant anaerobes and strict anaerobes in the planktonic versus the sediment community extends to MAGs of aerobes, these results indicate that populations comprising the planktonic community are likely replicating their genomes at a higher rate than are those of the sediment community. This is true even for populations (i.e., Pyrobaculum) with autotrophic capabilities that would add to the metabolic burden of the cell. Estimates of GRiD calculations for all MAGs are reported in Table S7.
FIG 5
Estimated replication rates (GRiD scores) for pairs of metagenome assembled genomes (MAGs) from the planktonic and sediment communities that exhibit average nucleotide identity (ANI) and amino acid identity (AAI) exceeding 95%. Each point represents a comparison of a MAG pair from planktonic and sediment communities, as defined in the legend. A 1:1 line representing equivalent GRiD scores for each pairwise comparison is shown. Bin designations are as presented in Fig. 2 and Table S2. GRiD scores for planktonic and sediment MAGs are presented in Table S7.
Estimated replication rates (GRiD scores) for pairs of metagenome assembled genomes (MAGs) from the planktonic and sediment communities that exhibit average nucleotide identity (ANI) and amino acid identity (AAI) exceeding 95%. Each point represents a comparison of a MAG pair from planktonic and sediment communities, as defined in the legend. A 1:1 line representing equivalent GRiD scores for each pairwise comparison is shown. Bin designations are as presented in Fig. 2 and Table S2. GRiD scores for planktonic and sediment MAGs are presented in Table S7.Higher inferred genome replication rates among planktonic versus sediment populations may reflect increased availability of high potential oxidants to support their energy metabolism and biomass generation. Based on metabolic inferences and geochemical data, one may predict that this is due to the planktonic populations having better access to O2 (e.g., Thermocrinis) and As(V) (e.g., Pyrobaculum). Access to higher concentrations of such substrates, when combined with available electron donors such as sulfide/thiosulfate (e.g., Thermocrinis), As(III) (e.g., Thermocrinis), and H2 (e.g., Pyrobaculum), could provide the energy necessary to support autotrophic growth that then could support secondary production from other heterotrophic members of the community, including those that reside in the water column or in the sediment of GGS063. The higher inferred rates of genome replication for planktonic populations compared to sediment populations, when combined with the inferred autotrophic capabilities of the planktonic populations and heterotrophic capabilities of the sediment populations, is potentially consistent with a trophic level pyramid, if the autotrophic populations are cross-feeding heterotrophic populations.Conclusions. Data presented herein from GGS063, a hot spring with a chemical composition consistent with it being sourced from the parent hydrothermal aquifer, reveal extreme dichotomy in the taxonomic composition and inferred metabolic potential of planktonic and sediment communities. Both planktonic and sediment communities are taxonomically representative of those identified in other circumneutral to alkaline hot springs, suggesting the observations made here could extend beyond GGS063. The planktonic and sediment communities in GGS063 were dominated by archaea, a finding that is unique among circumneutral to alkaline communities studied to date, likely due to the focus of previous studies on streamer biofilms that are dominated by the bacterium Thermocrinis (4, 39, 42). Further, the planktonic community is predominantly comprised of organisms capable of respiring or detoxifying O2 that are also capable of autotrophic growth, while the sediment community consists predominantly of anaerobes capable of heterotrophic growth. The prevalence of organisms capable of autotrophy is consistent with the limited amounts of DOC in hot springs sourced by the parent hydrothermal aquifer (25), including GGS063. The prevalence of putative aerobes or aerotolerant anaerobes in the planktonic community is interpreted to reflect influx of atmospheric O2 to otherwise anoxic hydrothermal waters sourcing GGS063. The higher estimated genome replication rates in the planktonic community are attributed to greater access to nutrients due to turbulence in the water column as well as a greater availability of high-potential electron acceptors, such as O2 from ingassing.The apparent role of ingassing of O2-rich atmospheric gas in driving the dichotomy in the taxonomic composition and inferred metabolic potentials of planktonic and sediment communities in this and other circumneutral to alkaline springs across YNP (13) indicates that the ecology of this spring type may have changed markedly over Earth’s history. While this type of hot spring is often considered an analog for early Earth environments (53, 54), the apparent effects of O2 on the ecology of planktonic populations in GGS063, and potentially other circumneutral to alkaline springs in YNP and elsewhere, raise the question of how well these contemporary communities and their functionalities reflect those that inhabited such springs prior to the rise of O2 in the atmosphere ∼2.4 Gya (55). Further research is thus warranted to determine if other springs sourced by anoxic waters from the parent hydrothermal aquifer host dichotomous microbial communities and to establish the extent that such springs can serve as early Earth analogs.
MATERIALS AND METHODS
Sample collection and geochemical analyses.
An unnamed hot spring (YNP thermal inventory identifier GGS063; lat 44.69047; long −110.72917; elevation, 2,291 m) in the northern Geyser Creek Group, Gibbon Geyser Basin, YNP, was selected for study and sampled on 21 June 2017. The temperature, conductivity, and pH of GGS063 were measured on-site with temperature-compensated probes (model YSI EC300; YSI Inc., Yellow Springs, OH, USA or model WTW 3100, WTW Weilheim, Germany). Dissolved Fe(II) and total sulfide (S2−) concentrations were measured in the field using Hach reagents (ferrozine pillows and sulfide reagents 1 and 2) and a Hach DR/890 spectrophotometer (Hach, Loveland, CO). Dissolved oxygen was measured in situ using a fluorescence method with a detection limit of 15 ppb and a resolution of ±4 ppb (Fibox 4 m and optical probe; PreSens Inc., Germany) following methods of St. Clair et al. (56). Dissolved oxygen measurements were made at ∼20 cm below the air-water interface. Spring waters were filtered (0.22 μm) and stored at 4°C for determination of anions via ion chromatography (Metrohm Compact Plus; Herisau, Switzerland), cations via inductively coupled plasma optical emission spectrometry (Thermo Scientific iCAP 6000 series; Waltham, MA), trace metals via inductively coupled plasma mass spectrometry (Thermo Scientific iCAP Q), and dissolved organic carbon via an OI Analytical wet oxidation total organic carbon analyzer coupled to a Thermo Delta Plus Advantage mass spectrometer as previously described (57) based on the methods of St-Jean (58). Anions, cation, and trace metal determinations were made at the Analytical Lab at the Montana Bureau of Mines and Geology.Triplicate subsamples of sediment (0.5 g each) were collected from GGS063 using a flame-sterilized spatula and were transferred to sterile 1.5-ml microcentrifuge tubes. Planktonic biomass was collected from a depth of ∼20 cm beneath the atmosphere-water interface in the hot spring with a peristaltic pump (Geotech Environmental Equipment, Denver, CO) and sterilized tubing prerinsed with ∼10 liters of hot spring water. Fluids were pumped through sterile 0.2-μm 47-mm Supor polyethersulfone (PES) membrane disc filters (Pall Corporation, Port Washington, NY) held in autoclaved in-line steel filter housings (Pall Corp.) in triplicates of 8 liters each. After filtering, filters were removed from the housings using flame-sterilized forceps and transferred aseptically into sterile 50-ml Falcon tubes. The 1.5-ml microcentrifuge tubes containing sediments and the 50-ml tubes containing filters were then placed on dry ice for transport to the lab, where they were kept at −80°C.DNA was extracted from filtered biomass and sediment-associated biomass with the FastDNA Spin Kit for Soil (MP Biomedicals, Irvine, CA) following the manufacturer’s instructions. Genomic DNA was quantified fluorometrically via the high-sensitivity Qubit assay (Thermo Fisher Scientific, Waltham, MA) and was subjected to paired-end shotgun metagenomic sequencing (Illumina NovaSeq; 2 × 151 bp) at the Department of Energy Joint Genome Institute (JGI; Walnut Creek, CA) following the Illumina regular fragment (300 bp) method of library preparation (59). Reads were quality filtered, trimmed, and cleaved of Illumina-specific sequencing adapters using BBTools and then read corrected using bbcms (within BBTools) via the standard JGI metagenome processing pipeline. Reads were then assembled into contigs using metaSPAdes v.3.13.0. Metagenome assembly statistics are reported in Table S1 in the supplemental material. The assembled contigs were binned into metagenome-assembled genomes (MAGs) using the MetaWRAP platform (60) to map reads to contigs using BWA aligner v.0.7.17 (61). MetaWRAP was also used to generate MAGs with the MetaBAT v.2.12.1 (62), MaxBin v.2.0 (63), and CONCOCT v.1.0.0 (64) binning programs and contigs >2,500 bp. Finally, MetaWRAP was used to identify the best-quality MAGs from the three aforementioned binning strategies via “contamination” and “completion” estimates from CheckM v.1.1.3 (65) and the bin_refinement module. Only MAGs that exhibited >50% estimated completeness and <10% contamination (consistent with moderate- to high-quality genomes [66]) were retained for further analyses.Binned contigs were mapped back to the JGI scaffolds. To evaluate potential abundances of populations within communities, the BWA-mapped reads to contigs data set was summed across binned MAGs using the “profile” module within CheckM. The reported relative abundances are based on overall number of reads mapped to individual MAGs, then corrected for estimated MAG sizes, and ultimately reported as a proportion of all binned MAGs for each metagenome. The MAGs were taxonomically classified using the GTDB-Tk v.1.3.0 (67) classifier and the bac120 and ar122 data sets for bacterial and archaeal classification, respectively. Taxonomic designations are given to the lowest taxonomic rank that is formally characterized. The planktonic (JGI Integrated Microbial Genomes [JGI] name, GCR.EP_P) and sediment (IMG name, GCR.EP_S) metagenomes from GGS063 are available on JGI-IMG under genome identifiers 3300033431 and 3300033490, respectively.
Comparison of GGS063 16S rRNA gene compositions to those of other circumneutral hot springs in YNP.
To evaluate the similarity of GGS063 communities to those of other circumneutral hot springs in YNP, 16S rRNA genes were retrieved from the IMG database for both the sediment and planktonic communities from GGS063 and compared against sediment and planktonic community compositions of 15 other hot springs previously reported in Colman et al. (13). Briefly, the metagenome-derived 16S rRNA genes were subject to alignment to the Silva reference database with operational taxonomic units (OTU) representatives from the other 30 communities, trimmed to the length of the V4-V5 hypervariable region used in the previous study, clustered into OTUs at the 97% nucleotide sequence identity level, and classified against the Silva database, all as described previously (38). The GGS063 OTUs were added to those from the previous data set based on clustering with the previously identified representative 16S rRNA gene sequences. Finally, the GGS063 OTUs were ascribed relative abundances based on the relative sequencing depth coverage values of the metagenomic contigs from which they derive. These data were then used to construct nonmetric multidimensional scaling (NMDS) ordinations inclusive of the GGS063 plankton and sediment 16S rRNA communities, as well as those from Colman et al. (13), as described in the latter study.To visualize the phylogenetic relationships among MAGs recovered from the sediments and plankton communities, whole-genome pairwise amino acid identity (AAI) values were calculated with the AAI comparison workflow (aai_wf module) within the CompareM v.0.0.23 software package (http://github.com/dparks1134/CompareM) with default parameters for protein comparisons (minimum identity of 30%, minimum alignment length of comparison query of 70%). The pairwise AAI values were then used as edges within the network visualization tool, Cytoscape v.3.8.2 (68). The force-directed edge-weighted layout was used to structure MAG positions based on their among-MAG AAI values while only considering pairwise values >40%.
Metabolic potential of MAGs.
The amino acid sequences of MAGS were uploaded to the Kyoto Encyclopedia of Genes and Genomes (KEGG) function database (69) with the KEGG Automatic Annotation Server (KAAS) (70). Annotations were mapped through bidirectional best hit (BBH) searches against KAAS system metabolic pathways. Initial inferences of metabolic capability were made based on KEGG Orthology (KO) mapping to the “oxidative phosphorylation” and “carbon fixation of prokaryotes” pathway groups. To explore additional functional potential, a type strain or closely related genome was identified for each MAG based on the relatedness of the housekeeping proteins DNA-directed RNA polymerase subunit b (RpoB) or the 50S ribosomal protein L2 (RPL2). Demonstrated metabolic capabilities from the related cultivars with available genomes guided further metabolic reconstruction analyses. Sequences of interest from these genomes were then used as queries for Basic Local Alignment Search Tool (BLASTp) (71) searches against MAGs using the National Center of Biotechnology Information (NCBI) tool. An E value cutoff of 1.0e−20 was used to screen homologs along with >30% amino acid identity and >60% coverage of the query sequence. Annotations of consequential sequences from both KO mapping and BLASTp analyses were verified by structural homology using SWISS-MODEL (72) and/or by alignment in Clustal Omega (v.1.2.4) (73) to identify motifs or residues that are required for functionality.The potential for planktonic and sediment-associated populations in GGS063 to integrate O2 into their energy metabolism was assessed based on MAGs encoding homologs of cytochrome c oxidase (Cox I and II; EC 7.1.1.9), which is used to respire O2 in most aerobic or facultatively anaerobic bacteria and archaea (74). MAGs that did not encode Cox homologs (comprising all subtypes) were deemed anaerobes unless those MAGs were too incomplete to make a definitive determination. In such cases, literature related to the most closely affiliated strains was used to increase confidence in whether the organisms in question were likely to be capable of aerobic respiration or not. Finally, MAGs were examined for homologs of the cytochrome bd complex (CydABX; EC 7.1.1.7). In many organisms, CydABX is used to reduce O2 as a detoxification strategy (75), while in other organisms (currently only bacteria), it can be used to respire O2 (76). Thus, MAGs that encoded CydABX were conservatively labeled as aerotolerant unless physiological or biochemical evidence indicated that this complex in a closely related taxon is used to respire O2. An example is in the Aquificae, where aerobic respiration is catalyzed by CydABX (77, 78).MAGs were also subjected to BLASTp analyses to identify homologs of proteins that may allow use of alternative electron donor and acceptor pairs that could potentially provide further insight into their ecology, including proteins involved in dissimilatory nitrate reduction (NarABG, EC 1.7.5.1, and NapAB, EC 1.9.6.1), sulfate/sulfite reduction (Sat, EC 2.7.7.4; AprAB, EC 1.8.99.2; DsrAB, EC 1.8.99.5), elemental sulfur/polysulfide reduction (DMSO reductases, EC 1.8.5.3; SreABC, no EC), sulfite/tetrathionate reduction (Asr, no EC), thiosulfate reduction (PhsA, EC 1.8.5.5), arsenate reduction (ArrA, EC 1.20.99.1), hydrogen metabolism ([NiFe] and [FeFe] hydrogenases, EC 1.12.1.2 and 1.12.99.6, respectively), arsenite oxidation (AioA, EC1.20.9.1), sulfide oxidation (Sqo, EC 1.8.5.8), thiosulfate/sulfur oxidation (Sox, EC 2.8.5.2), and methane oxidation or production (MmoX, EC 1.14.13.25; McrA, EC 2.8.4.1). A complete list of key enzymes and their EC numbers that were used to screen MAGs is provided in Table S3.MAGs were also screened for pathways that allow for autotrophy. The six major autotrophic pathways are (i) the Calvin-Benson-Bassham (CBB) cycle, (ii) the reductive tricarboxylic acid (rTCA) cycle, (iii) the Wood-Ljungdahl (WL) pathway, (iv) the 3-hydroxypropionate (3HP) bicycle, (v) the 3-hydroxypropionate/4-hydroxybutyrate (3HP/4HB) cycle, and (vi) the dicarboxylate/4-hydroxybutyrate (DC/4HB) cycle (46, 47). Diagnostic proteins for each of these pathways were used to designate MAGs as being capable of autotrophy. This includes homologs of ribulose 1,5-bisphosphate carboxylase/oxygenase (RuBisCO; EC 4.1.1.39) and phosphoribulokinase (PRK; EC 2.7.1.19) for the CBB cycle and citryl-CoA synthetase (ccs; EC 6.2.1.18), citryl-CoA lyase (cit; EC 4.1.3.34), or ATP-citrate lyase (ACLY; EC 2.3.3.8) for the rTCA cycle. Homologs of carbon monoxide dehydrogenase/acetyl-CoA synthase (CODH/ACS; EC 1.2.7.4) and formate-tetrahydrofolate ligase (fhs; 6.3.4.3) were used to identify evidence for the Wood-Ljungdahl pathway, while homologs of propionyl-CoA carboxylase (PCC; EC 6.4.1.3), malonyl-CoA reductase (mcr; EC 1.2.1.75), malyl-CoA/B-methylmalyl-CoA/citranyl-lyase (mcl; EC 4.1.3.24), and acetyl-CoA carboxylase (ACC; EC 6.4.1.2) were used to identify evidence for the 3HP pathway. Evidence for the 3HP/4HB pathway was gained by identifying homologs of the same enzymes as the 3HP pathway with the addition of 4-hydroxybutyrl-CoA dehydratase (4-BUDH; EC 4.2.1.120). No enzyme homolog is diagnostic for the DC/4HB since it uses enzymes common to the rTCA or 3HP/4B cycles above. However, the DC/4HB pathway can be differentiated from the 3HP/4HB pathway by identifying homologs of pyruvate synthase (por; EC 1.2. 7.1) (79, 80). MAGs lacking homologs of these indicator proteins for autotrophic pathways were assigned as putatively heterotrophic unless otherwise noted.
Genome replication rate.
The GRiD (Growth Rate Index) program (81) was used to estimate genome replication rates of MAGs in the GGS063 planktonic and sediment communities. GRiD estimates in situ genome replication rate by calculating genomic peak-to-trough read ratios by mapping read coverage at the inferred origin and terminus of the genome. This calculation assumes bidirectional replication of a circular genome proceeding from the origin. Quality control is assessed by read mapping to determine coverage levels of the DNA replication initiator protein-encoding gene, dnaA, and the replication termination site gene, dif. dnaA is typically localized near the origin (ori) of replication, oriC (82), and dif is typically localized near the terminus (ter) (83). The GRiD score is assumed to be accurate when dnaA/oriC and ter/dif read coverage ratios approach 1.0 (81). GRiD also estimates strain heterogeneity by determining variance in GRiD estimates across a single MAG. GRiD scores are assumed to be accurate when strain heterogeneity is less than 0.3 (81).Replication rate estimates for all MAGs were performed using the GRiD single option, which estimates a replication rate for the MAG independent of the replication rates of the other community members. GRiD results of MAGs with dif/ter ratios outside the range 0.7 to 1.3, or species heterogeneity exceeding 0.3, were discarded without further consideration, as they did not meet quality thresholds. dnaA/ori ratios were not evaluated, as dnaA does not colocalize near the origin in most archaea (84), making it an uninformative metric for archaeal MAGs. It is important to note that some of the archaeal community members are closely related to organisms that are polyploid, such as Archaeoglobus (85). However, while many archaea are polyploid or may change ploidy over the course of their life cycle (86), the peak/trough ratio would be expected to be the same regardless of ploidy. Similarly, some archaeal community members are closely related to organisms encoding multiple origins of replication, such as Pyrobaculum (87). However, the empirical determination of low strain heterogeneity estimates for all archaeal MAGs that are related to organisms that putatively encode multiple origins of replications indicates that those MAGs likely encode only a single origin of replication.GRiD score comparisons were initially generated for eight MAG pairs present in both the sediment and planktonic communities, as determined by pairwise average nucleotide identity (ANI) and amino acid identity (AAI) that each exceeded 95%. Pairwise ANI and AAI values for each MAG pair are presented in Table S4. Ninety-five percent was chosen as the cutoff because it consistently corresponds to the species level (88) and was presumed to minimize the confounding effects of differing phylogeny on growth rate (52). Pairs were only selected that met a minimum pairwise genome alignment of 20% to ensure selected pairs best represented the closest phylogenetic relatives. Consequently, some pairs, such as planktonic bin eight and sediment bin 36, were paired despite not sharing the highest pairwise ANI or AAI score. ANI scores were generated by FastANI (89). AAI scores were generated by using the AAI comparison workflow (aai_wf module) within the CompareM v.0.0.23 software package as described above. GRiD scores were then calculated for all MAGs with completeness >50%.
Data availability.
Individual MAGs (Table S2) from the planktonic and sediment communities have been deposited in the National Center for Biotechnological Information database under BioProject accession numbers PRJNA568221 and PRJNA568220, respectively.
Authors: Maximiliano J Amenabar; Daniel R Colman; Saroj Poudel; Eric E Roden; Eric S Boyd Journal: Environ Microbiol Date: 2018-08-16 Impact factor: 5.491
Authors: Johannes Alneberg; Brynjar Smári Bjarnason; Ino de Bruijn; Melanie Schirmer; Joshua Quick; Umer Z Ijaz; Leo Lahti; Nicholas J Loman; Anders F Andersson; Christopher Quince Journal: Nat Methods Date: 2014-09-14 Impact factor: 28.547
Authors: Tara Djokic; Martin J Van Kranendonk; Kathleen A Campbell; Malcolm R Walter; Colin R Ward Journal: Nat Commun Date: 2017-05-09 Impact factor: 14.919
Authors: Robert M Bowers; Nikos C Kyrpides; Ramunas Stepanauskas; Miranda Harmon-Smith; Devin Doud; T B K Reddy; Frederik Schulz; Jessica Jarett; Adam R Rivers; Emiley A Eloe-Fadrosh; Susannah G Tringe; Natalia N Ivanova; Alex Copeland; Alicia Clum; Eric D Becraft; Rex R Malmstrom; Bruce Birren; Mircea Podar; Peer Bork; George M Weinstock; George M Garrity; Jeremy A Dodsworth; Shibu Yooseph; Granger Sutton; Frank O Glöckner; Jack A Gilbert; William C Nelson; Steven J Hallam; Sean P Jungbluth; Thijs J G Ettema; Scott Tighe; Konstantinos T Konstantinidis; Wen-Tso Liu; Brett J Baker; Thomas Rattei; Jonathan A Eisen; Brian Hedlund; Katherine D McMahon; Noah Fierer; Rob Knight; Rob Finn; Guy Cochrane; Ilene Karsch-Mizrachi; Gene W Tyson; Christian Rinke; Alla Lapidus; Folker Meyer; Pelin Yilmaz; Donovan H Parks; A M Eren; Lynn Schriml; Jillian F Banfield; Philip Hugenholtz; Tanja Woyke Journal: Nat Biotechnol Date: 2017-08-08 Impact factor: 54.908
Authors: Daniel R Colman; Emily A Kraus; Patrick H Thieringer; Kaitlin Rempfert; Alexis S Templeton; John R Spear; Eric S Boyd Journal: Proc Natl Acad Sci U S A Date: 2022-10-10 Impact factor: 12.779