Literature DB >> 29099490

A dynamic microbial community with high functional redundancy inhabits the cold, oxic subseafloor aquifer.

Benjamin J Tully1, C Geoff Wheat2, Brain T Glazer3, Julie A Huber4,5.   

Abstract

The rock-hosted subseafloor crustal aquifer harbors a reservoir of microbial life that may influence global marine biogeochemical cycles. Here we utilized metagenomic libraries of crustal fluid samples from North Pond, located on the flanks of the Mid-Atlantic Ridge, a site with cold, oxic subseafloor fluid circulation within the upper basement to query microbial diversity. Twenty-one samples were collected during a 2-year period to examine potential microbial metabolism and community dynamics. We observed minor changes in the geochemical signatures over the 2 years, yet the microbial community present in the crustal fluids underwent large shifts in the dominant taxonomic groups. An analysis of 195 metagenome-assembled genomes (MAGs) were generated from the data set and revealed a connection between litho- and autotrophic processes, linking carbon fixation to the oxidation of sulfide, sulfur, thiosulfate, hydrogen, and ferrous iron in members of the Proteobacteria, specifically the Alpha-, Gamma- and Zetaproteobacteria, the Epsilonbacteraeota and the Planctomycetes. Despite oxic conditions, analysis of the MAGs indicated that members of the microbial community were poised to exploit hypoxic or anoxic conditions through the use of microaerobic cytochromes, such as cbb3- and bd-type cytochromes, and alternative electron acceptors, like nitrate and sulfate. Temporal and spatial trends from the MAGs revealed a high degree of functional redundancy that did not correlate with the shifting microbial community membership, suggesting functional stability in mediating subseafloor biogeochemical cycles. Collectively, the repeated sampling at multiple sites, together with the successful binning of hundreds of genomes, provides an unprecedented data set for investigation of microbial communities in the cold, oxic crustal aquifer.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 29099490      PMCID: PMC5739024          DOI: 10.1038/ismej.2017.187

Source DB:  PubMed          Journal:  ISME J        ISSN: 1751-7362            Impact factor:   10.302


Introduction

The largest actively flowing aquifer system on Earth is circulating through oceanic crust underlying the oceans and sediments (Sclater ; Stein and Stein, 1994; Johnson and Pruis, 2003). The movement of water through the aquifer serves as a vital conduit for exchange of both microorganisms and nutrients between the ocean basins and the subseafloor and offers a route by which organisms can extract energy from the fluids and rocks beneath the seafloor (Orcutt ; Meyer ). Our understanding of life within the marine crustal aquifer has largely been shaped by studies of anaerobic and thermophilic organisms in warm ridge flank environments (Cowen ; Huber ; Jungbluth , 2016) and crustal-source basalts exposed at the seafloor (Lysnes ; Mason ; Santelli ; Lee ). However, much of the microbial interaction with the crustal aquifer occurs within the seafloor at sites where cold, oxygenated deep ocean waters circulate through basaltic crust, entering and exiting through seafloor exposures (Fisher and Wheat, 2010; Edwards ; Wheat ). Therefore, despite advancing knowledge about microbial life in the subseafloor, our understanding is limited relative to which microorganisms live in the rocky oceanic crust, what hydrogeologic processes control subsurface fluid circulation, how these organisms harness energy in this environment, and the overall contribution to marine biogeochemical cycles is limited. To more effectively study these prevalent ocean environments, several subseafloor observatories, termed circulation obviation retrofit kits (CORKs; Davis ; Wheat ), have been deployed in oceanic crust in part to allow for sampling and monitoring of the crustal aquifer (Wheat ). Two CORK observatories are installed at the well-studied site North Pond, an isolated sediment basin (8 km × 15 km, ~4484 m water depth), just west of the Mid-Atlantic ridge on 7–8 million years old crust (22°45′ N, 46°05′ W; Edwards ). At North Pond, seawater circulates between the crust and the deep ocean through the exposed ridge flanks, while sediments within the basin act as an impermeable barrier that prevents seawater exchange. Previous studies have sought to constrain the microbial community and its activity within the basaltic aquifer at North Pond. Measurements of carbon fixation activity on basalts recovered by ocean drilling (Orcutt ) were unable to detect quantifiable rates of activity at in situ temperatures (4°C), while additions of nitrate and ammonia to crustal rocks stimulated microbial growth (Zhang ). Modeling of the subsurface at North Pond suggests that hydrogen and ferrous iron likely have an important role in maintaining microbial biomass, with ferrous iron estimated to support ~10% of the microbial biomass (Bach, 2016). In support of this hypothesis, a Marinobacter isolate capable of iron oxidation was enriched from North Pond basalts (Zhang ). PCR-based assessments of the microbial community associated with the basalts from North Pond have shown that Gammaproteobacteria are the dominant phylogenetic group (Jørgensen and Zhao, 2016), while the presence of genes involved in the carbon fixation through the Calvin-Benson-Bassham cycle are more common than the reverse citric acid cycle (Orcutt ). Additional work examining the crustal fluids from the aquifer at North Pond has shown that the geochemistry of the fluids is nearly identical to the deep Atlantic bottom water (DABW), indicating a short residence time for seawater within the crustal aquifer at North Pond (Meyer ). However, basaltic formation fluids within the aquifer have concentrations of dissolved oxygen, silica and dissolved organic carbon that are different than those of the deep bottom water (Meyer ), and assessment of the crustal fluid microbial community through 16S rRNA gene and transcript sequencing, stable-isotope incubations, and metagenomics revealed that the aquifer community was active with a distinct community structure from bottom water. The community also had the capacity to perform both autotrophy and heterotrophy (Meyer ), with low rates of activity detected using nanocalorimetry (Robador ). Together, these initial studies show a diverse and distinct microbial community living in the oligotrophic, oxic, basaltic crustal aquifer at North Pond with relatively low levels of metabolic activity. However, little is known about the metabolic potential and community dynamics in this understudied environment. Here, we present genomic reconstruction of North Pond crustal fluid samples collected over a span of two years, providing 21 samples for a detailed examination of potential microbial metabolism and community interactions within this subseafloor aquifer. Our high-resolution analysis of hundreds of genomes reveals a temporally and spatially dynamic microbial community and provides new insights into microbially-mediated biogeochemical cycling within the crustal aquifer.

Materials and methods

Sampling, cell quantification and chemical analysis

Crustal fluids were collected from the single horizon at U1382A and from the shallow, middle and deep horizons in U1383C (Edwards ) using a mobile pumping system designed for microbial sampling from CORK fluid delivery lines as described in Meyer and Cowen ; Figure 1). Deployed with the ROV system, mobile pumping system connectors are attached to the CORK wellhead via an umbilical to the hydrological zone of interest within the aquifer. Fluid systems were flushed and allowed to equilibrate before sampling, and dissolved oxygen concentrations were measured during pumping using an Aanderaa sensor (Meyer ). In 2012, 12 l of each fluid sample were filtered on to a 0.22 μm Sterivex-GP filter (Merck Millipore, Billerica, MA, USA) as described in Meyer . In 2014, 12 l of each sample was filtered in situ and immediately fixed with RNALater (Thermo Fisher Scientific, Waltham, MA, USA), as described previously (Akerman ). After sampling in 2012, a battery-powered GeoMICROBE sled was left at each CORK for time series autonomous sampling of the fluid delivery lines (Cowen ). For each filter sample, ~10 l of fluid were filtered in situ and immediately fixed with RNALater. For downstream analysis, ~500 ml of fluid were filtered into two Tedlar bags, one containing 54 ml of 37% formaldehyde for cell enumeration and the other with 4 ml of 10% HCl for inorganic chemistry analyses. Sleds were deployed in April 2012 and recovered in April 2014 with samples collected according to Table 1. Upon sled recovery, filters were transferred to fresh RNALater and stored at −80 °C, while all bag samples were stored at 4 °C (Cowen ). Deep bottom water was sampled in 2012 and 2014 via a CTD at 100 m above the seafloor and filtered in the same manner as the crustal fluids onto Sterivex filters. Total microbial biomass in fluids was enumerated with DAPI (4′,6′-diamidino-2-phenylindole; Sigma-Aldrich, St Louis, MO, USA) and epifluorescent microscopy (Porter and Feig, 1980). Fluids also were analyzed for dissolved silicon and nitrate using automated colorimetric analysis and pH was measured with an electrode before a potentiometric titration for the determination of alkalinity (Wheat ).
Figure 1

Idealized schematic of North Pond, CORK U1382A and CORK U1383C. Hypothesized flow of entrained seawater through the crust is represented by blue arrows. Seafloor and sediment boundaries are represented by black line. Basalt boundaries are represented by red, dashed lines. For each CORK horizon, the number of metagenomic samples are indicated, including the relative time sampled. CORK, continuous obviation retrofit kits; mbsf, meters below seafloor; TP, time point. (Modified from Edwards ).

Table 1

Collection details, cell enumeration and inorganic chemistry values for North Pond samples

CORK/bottom waterDepth horizonDate sampledTime pointCell counts (cells ml−1±95% confidence level)O2 (μmol l−1)NO3− (μmol l−1)Si (μmol kg−1)
1382A90–21025 Apr 2012TP01.4 × 104 (±6 × 102)244±121.156
1382A90–2106 Aug 2012TP11.1 × 104 (±1.0 × 103)N.dN.dN.d
1382A90–2105 Oct 2012TP28.1 × 103 (±1.1 × 103)N.dN.dN.d
1382A90–2104 Dec 2012TP39.2 × 103 (±8.3 × 103)N.dN.dN.d
1382A90–2102 Feb 2013TP4N.d.N.dN.dN.d
1382A90–2103 Apr 2013TP51.5 × 104 (±6.9 × 103)N.dN.dN.d
1382A90–2102 Jun 2013TP66.2 × 103 (±5.0 × 103)N.dN.dN.d
1382A90–2101 Aug 2013TP71.8 × 104 (±1.1 × 104)N.dN.dN.d
1382A90–21030 Sept 2013TP85.1 × 103 (±2.5 × 103)N.dN.dN.d
1382A90–2105 Apr 2014TP92.8 × 104 (±3.8 × 102)233±121.973
1383C70–14630 Apr 2012TP02.0 × 104 (±7 × 102)216±121.8125
1383C70–1469 Apr 2013TP4N.d.N.dN.dN.d
1383C70–1468 Jul 2013TP57.7 × 103 (±9.8 × 102)N.dN.dN.d
1383C70–1462 Apr 2014TP91.1 × 104 (±1.0 × 103)202±222.3146
1383C146–2009 Apr 2013TP45.6 × 103 (±3.0 × 102)N.dN.dN.d
1383C146–2008 Apr 2014TP9N.d.185±221.9123
1383C200–33220 Apr 2012TP02.1 × 104 (±8 × 102)213±121.8120
1383C200–3329 Apr 2013TP45.0 × 103 (±4.0 × 102)N.dN.dN.d
1383C200–33231 Mar 2014TP92.8 × 104 (±3.8 × 102)187±121.7158
Bottom water (CTD)~4400 m26 Apr 2012TP02.1 × 104 (±5 × 102)~25021.148
Bottom water (CTD)~4400 m10 Apr 2014TP91.9 × 104 (±8.0 × 102)~25021.155

Abbreviation: N.d., not determined.

DNA extraction and sequencing

Total genomic DNA was extracted from the filters using a phenol chloroform method, as previously described (Sogin ). DNA was sheared to 175 bp using a Covaris S-series sonicator. Metagenomics libraries were constructed using the Ovation Ultralow Library DR multiplex system (Nugen) following manufacturer’s instructions. Paired-end sequencing was performed on an Illumina HiSeq 1000 at the WM Keck sequencing facility at the Marine Biological Laboratory. Raw sequence reads underwent quality control using Cutadapt (Martin, 2012; v.1.7.1; -e 0.08 —discard-trimmed —overlap=3) to locate and remove Illumina adapter sequences from both ends of the of the read, followed by quality trimming using Trimmomatic (Bolger ; v.0.33 ; PE SLIDINGWINDOW:10:28 MINLEN:75).

Ribosomal rRNA identification and relative abundance

From the high-quality paired-end Illumina sequencing reads, 16S rRNA gene fragments were identified using Meta-RNA (Huang ; v.H3; -e 1e-10). Putative rRNA fragments and associated mate pairs from each sample were processed through EMIRGE (Miller , 2013); emirge_amplicon.py; -l 113 -i 163 -s 33 -a 32 —phred33) to generate full-length sequences using the SILVA (Quast ) SSURef111 reference database (https://github.com/csmiller/EMIRGE). Reconstructed 16S rRNA genes were assigned taxonomy using mothur (v1.34.4) by first aligning the sequences to the SILVA SSURef123 database (align.seqs; flip=T), removing sequences that failed to align, if necessary (remove.seqs), and classifying the sequences (classify.seqs; cutoff=80, iters=1000). Utilizing the high-quality sequence reads, each set of 16S rRNA sequences was used to recruit reads from the corresponding metagenomic sample, randomly subsampled using seqtk (v1.0-r82; https://github.com/lh3/seqtk) to the size of the smallest library (n = 22 142 100 reads). Reads were recruited using Bowtie2 (v.2.2.5; default parameters) and individual counts of reads per 16S rRNA were determined. Read counts were length normalized and used to calculate the relative abundance of each reconstructed 16S rRNA in the sample (Supplementary Data 15). Relative abundances were combined for sequences that shared the same mothur-ascribed Phylum (or Class for Proteobacteria) level.

Metagenomic assembly and binning

High-quality sequence reads were subjected to two rounds of assembly. A primary set of contigs was generated using IDBA-UD (Peng ; v.1.1.1; default parameters) utilizing the reads from each individual sample. A secondary set of contigs was generated in Geneious (Kearse ) v6.1.8; modified parameters used for ‘High Sensitivity/Slow’ Supplementary Data 11) by combining the primary set of contigs ⩾500 bp in length from samples with the same source (that is, combining all primary contigs generated from U1382A, and so on). Secondary contigs ⩾5 kb in length from U1382A and U1383C were combined with secondary contigs ⩾3 kb generated from DABW (Supplementary Data 4). The size-selected set of secondary contigs was used to recruit high-quality sequencing reads from each sample using Bowtie2 (as above). A coverage value, equivalent to recruited reads per bp, was determined for each contig in each sample, the coverage values were log(n + 1) transformed, and subjected to binning using affinity propagation (Frey and Dueck, 2007) and a pre-release version of BinSanity (Graham ; -p -1). CheckM (Parks ; v1.0.3; lineage_wf) was used to assess the results of the binning utilizing a ⩾50% completeness threshold to identify putative genomes. Multiple bins were identified above the completeness threshold that contained substantial estimated contamination (>55% contamination). For each suspect bin, the %G+C and coverage values for each contig were plotted against each other (data not shown) and manually assessed for putative cohesive groups (Supplementary Data 12).

Phylogeny

Each putative genome was assessed for the presence of 16 conserved ribosomal marker proteins (Hug ) based on a HMMER (Finn ) search (v.3.1b2; hmmsearch —cut_tc —notextw) of TIGRfam (Haft ) and Pfam (Bateman ) models corresponding to the proteins (Supplementary Data 13). If multiple copies of a ribosomal markers protein were detected, that protein was not included as a marker for that genome, and any genome with <8 markers was not included for further phylogenetic assessment. Ribosomal markers were collected from 1652 reference genomes representing the major Families and/or Genera from within the Bacteria. Each marker gene from the putative and reference genomes was aligned using MUSCLE (Edgar, 2004; v3.8.31; -maxiters 8), trimmed using trimAL (Capella-Gutiérrez ; v.1.2rev59; -automated1), imported in Geneious (Kearse ), and manually assessed and trimmed, if necessary. All of the individual alignments were concatenated and a phylogenetic tree was constructed using FastTree (Price ; v2.1.3; -lg -gamma). Genomes were assessed for the presence of full-length 16S rRNA genes utilizing RNAmmer (Lagesen ; v1.2; -S bac -m ssu). The identified rRNA sequences were aligned to the SILVA SSURef123 database using the web-based SINA aligner (Pruesse ); default setting, trailing sequences removed from alignment). Aligned rRNA sequences were added to the SSURef123 NR99 ARB tree (Ludwig ) using the ARB Parsimony (Quick) tool (default parameters). Members of the ARB tree phylogenetically related to the North Pond genome rRNA sequences were extracted. ARB-based and North Pond genome 16S rRNA sequences were-aligned, trimmed, and manually assessed (as above). The final alignment was used to construct a phylogenetic tree with FastTree (-nt -gtr -gamma). Within the genomes, 53 16S rRNAs were identified within 47 genomes. The 16S rRNA gene tree was (Supplementary Data 14) used to support or refute the assignments provided from the ribosomal marker tree (RMT) and/or CheckM (Supplementary Data 16).

Annotation and metabolic analysis

Putative CDS were predicted for the North Pond genomes using Prodigal (Hyatt ; v2.6.3; -m -p meta -q) and submitted to the GhostKoala (Kanehisa ; default parameters; genus_prokaryotes + family_eukaryotes) for annotation using the KEGG (Kanehisa ) Ontology (KO) system. Based on these KO assignments, genomes were assessed for the degree to which specific pathways and functions were complete in the individual genomes using the information on canonical pathways available as part of the KEGG Pathway Database (updated, 14 Nov 2016) and the script KEGG-decoder.py (www.github.com/bjtully/BioData/tree/master/KEGGDecoder). Beyond specific assignments to pathways and function, for this manuscript several broad functional metabolic categories were identified for genomic bins based on the presence multiple genes. Cytochromes that participate in oxygen chemistry in aerobic organisms were defined as cytochrome c oxidase, aa-type (coxABCD) and cytochrome o ubiquinol oxidase (cyoABCD), while microaerobic cytochrome metabolism was defined as cytochrome c oxidase, cbb-type (ccoPQNO) and cytochrome bd complex (cydAB; (García-Horsman ). Similarly, sulfide oxidation was determined by the presence of sulfide:quinone oxidoreductase (sqr), sulfur dioxygenase (sdo), and/or sulfite reductase (dsrA), when applicable (see below). Additionally, putative thiosulfate oxidation was assessed based on components of the SOX system (soxABCXYZ) and/or thiosulfate dehydrogenase (tsdA). Putative CDS annotated as the large subunit of ribulose-1,5-bisphosphate carboxylase (RuBisCO; K01601) were extracted, along with RuBisCO sequences representing previously described major lineages (Tabita ; Supplementary Data 17). The RuBisCO sequences were aligned, automatically trimmed, and used to construct a phylogenetic tree (as above). A similar procedure was applied to identifying molybdopterin oxidoreductases (MOBs), specifically to identify MOBs associated with Fe2+ oxidation. Putative MOBs were identified from the Prodigal-derived CDSs via HMMER (hmmsearch, bit score threshold ⩾75) using the molybdopterin Pfam (PF00384). Environmental MOBs were aligned with reference sequences (Supplementary Data 6), automatically trimmed, and used to construct a phylogenetic tree (as above). To differentiate between sulfite reductases present in sulfur reducing organisms and reverse sulfite reductase present in sulfur oxidizing organisms, putative CDS annotated as DsrA (K11180) were used to construct a phylogenetic tree (as above) with reference sequences (Loy ) (Supplementary Figure 6). Two HMM models designed using homologs of Cyc1PV-1 and Cyc2PV-1 identified in neutrophilic iron oxidizing organisms (Barco ; Tully and Heidelberg, 2016) were used to search via HMMER (hmmsearch, bit score threshold Cyc1PV-1 ⩾ 445 and Cyc2PV-1 ⩾ 55) the putative CDSs of North Pond genomes (Supplementary Data 7).

Relative abundance and community structure

Contigs composing the North Pond genomes were used to recruit high-quality sequencing reads using Bowtie2 (default parameters) from the subsampled metagenomic samples (as above). Read counts for each genome were determined using featureCounts (Liao ; v.1.5.0-p2; -F SAF) and normalized to reads per bp for the full-length of each genome. Length normalized relative abundance was determined for each sample: The relative abundance values for the genomes in all 21 sample were used to cluster the genomes and samples in Past3 using the Bray-Curtis similarity measure (Supplementary Figure 9). A separate Bray-Curtis clustering step was performed only on genomes detected in the U1382A samples above 0.05% relative abundance (Supplementary Figure 8). The observed clusters of genomes were used to determine ecological units that were observed over the course of the time series sampling. Annotations for genomes in the identified U1382A ecological units were used to predict potential function. Genomes from U1382A were assigned to six functions: carbon fixation, partial denitrification (functional assignment predicts incomplete set of denitrification genes), complete denitrification, DNRA, sulfide oxidation, partial dissimilatory sulfur redox (functional assignment predicts incomplete set of sulfur redox genes), and thiosulfate oxidation. The fraction of the observed community with a given function was determined by: This process was repeated for the complete set of genomes to compare function between U1382A, U1383C and the DABW. Genomes with relative abundance >0.05% were considered for this analysis and could be assigned to multiple samples. Instead of broad functions, as for U1382A ecological units, genome counts and fractional abundance were assigned based on the presence of key genes in the nitrogen, sulfur and carbon cycles.

Results

Assessment of microorganisms in the subseafloor aquifer

The cold, oxic Mid-Atlantic subseafloor aquifer was sampled for geochemistry, cell quantification and microbial DNA from two seafloor CORK installations at the North Pond site. Water samples were collected 10 times from hole U1382A over the course of two years, and Hole U1383C was sampled 9 times from three different depth horizons that had been sealed during CORK installation. These horizons are defined by packers that seal the borehole and limit vertical mixing, defining three distinct hydrologic zones based on formation properties (Edwards ; Figure 1; Supplementary Figure 1). Two additional background seawater samples were collected from Niskin bottles that were tripped ~50 m off bottom, above the water-sediment interface (~4450-m depth) in both 2012 and 2014. These samples proved a measure of bottom water properties using the same techniques employed on those from the crustal fluids (Figure 1). Cell counts in all 19 borehole samples ranged from 5 to 20 × 103 cells ml−1 of crustal fluid, with no discernable change during the two year period (Table 1). Geochemical data from discrete samples collected in 2012 and 2014 indicated a minor increase in silica, whereas oxygen concentrations decreased slightly at all sampling horizons. Nitrate concentrations did not change. In total, 21 metagenomic samples were sequenced, generating 1.2 billion high-quality paired-end Illumina sequencing reads (Supplementary Data 1). 2,829 approximately full-length 16S rRNA gene sequences were reconstructed from the data set (Supplementary Data 2). The full-length 16S rRNA gene sequences in the metagenome (Figure 2) provide a snapshot of community composition in the samples, revealing a temporally and spatially dynamic community, with large shifts in the relative abundance of Proteobacteria, specifically in the Alpha-, Gamma- and Deltaproteobacteria, the Epsilonbacteraeota, and the Bacteroidetes.
Figure 2

Microbial community structure in North Pond crustal fluids based on reconstructed full-length 16S rRNA gene sequences. Relative abundance for 16S rRNA gene sequences was combined at the Phyla (Class level for Proteobacteria) from holes U1383C and U13832A and the Deep Atlantic Bottom Water. Individual samples are grouped based on depth of sampling (shallow, middle, and deep in U1383C) and time (from initial sampling in 2012 to 2014 sampling time point).

After two rounds of assembly of the high-quality sequencing reads, 1.5 million contigs were produced. A subsection of contigs ⩾5 kbp in length (78 004 contigs; N50=25 932 bp; total bp=1.2 Gbp) were used to reconstruct microbial genomes (Supplementary Data 3). 195 metagenome-assembled genomes (MAGs) were reconstructed and determined to be ⩾50% complete (an additional 234 genome bins were identified that were 20–50% complete, though were not analyzed further). Throughout this manuscript, the term ‘genome’ will be used to refer to the 195 binned MAGs. The genomes were given the designation NORP, for North Pond genome (NORP1-195). With the exception of two genomes (NORP4 and -5), all of the genomes had ⩽10% cumulative contamination/redundancy (Supplementary Data 3). The genomes recruited between 9–61% of the sequencing reads (mean=37.3%) from the individual samples, with the lowest recruitment rate from the 2012 bottom water sample (Supplementary Data 4). 140 MAGs had a sufficient number (⩾ 8) of 16 ribosomal marker proteins to be included in a phylogenetic tree with genomes from IMG (Markowitz ) that represent the major bacterial Genera and/or Families (Supplementary Data 5). The North Pond genomes were assigned to 20 Phyla, including all the lineages within the Proteobacteria (including Acidithiobacillia), the Candidate Phyla Radiation (CPR; Hug ), and the Planctomycetes (Supplementary Figure 2). Based on the relative abundance of sequencing reads competitively recruited to each genome from each sample, the mean relative abundance for all genomes in all samples was 0.19% (median, 0.004%), and when examined closely, most genomes were ‘present’ in all samples at low abundance values (<0.05% on average 151 of 195 genomes were below this threshold in each sample). NORP9 had the highest relative abundance (40.4%) in the 2012 U1383C deep sample (Figure 3; Supplementary Data 9). Genomes were subjected to Bray-Curtis clustering based on the relative abundance values (Supplementary Figure 3). Several of the genomes (for example, NORP125, -161, and -172) were cosmopolitan in the subseafloor crustal fluids, present in both holes, and at several time points and depths (Figure 3). Most of the genomes associated with the bottom water samples were not present in the crustal samples, although several genomes did have low abundances; specifically NORP160 and -164, both assigned to the Nitrosopumilales, and three additional genomes detected in the 2014 samples from U1382A and the middle section of U1383C. Generally, when genomes were grouped together, these groups were abundant in one or a few samples (e.g., NORP51, -54 and -55). When groups of organisms are present in multiple samples, the samples tend to be in close spatial proximity or sequential sampling events (Figure 3). In several instances, a single organism becomes highly abundant, but is only present in a single sample (for example, NORP6 or NORP73).
Figure 3

Heat map of the percent relative abundances for the 195 high-quality genomes. Genomes are organized and clustered based on the Bray-Curtis similarity index (Supplementary Figure 3). Genomes explicitly mentioned in text are highlighted in bold and colored purple. Values at the bottom of the column represent the total observed percent relative abundance of genomes in that sample.

Metabolic potential of metagenome-assembled genomes

From the genomes, 523,212 putative coding DNA sequences (CDSs) were identified, of which, 245,902 (47%) were annotated with a KEGG ontology (KO; Kanehisa ) number/function (Supplementary Figure 4). Genomes were assessed for the presence of specific KO functions involved in numerous processes, including: carbon, nitrogen, sulfur, and hydrogen cycling, methanogenesis, motility, vitamin biosynthesis and transport, and fermentation pathways. Two of the 6 establish carbon fixation pathways (Hügler and Sievert, 2011) were identified amongst the genomes, with 32 genomes containing ribulose-1,5-bisphosphate carboxylase (RuBisCO) and elements of the Calvin-Benson-Bassham (CBB) cycle, and 7 genomes containing ATP-citrate lyase or citryl-CoA synthetase and citryl-CoA lyase part of the reverse citric acid cycle (rTCA; Table 2). A phylogenentic analysis of the 32 genomes possessing putative RuBisCO proteins, identified 5 genomes with Form IV RuBisCO-like-proteins, 15 genomes with Form II RuBisCO, and 12 genomes with Form I RuBisCO (Supplementary Figure 5).
Table 2

Genomes with carbon fixation potential and putative electron sources

IDPhylogenetic AssignmentCarbon fixation pathway [CBB or rTCA] (RuBisCO Form)Putative electron source(s)Evidence
NORP4g_MethylophagaCBB (IA+B)H2Ssqr
NORP17g_RobiginitomaculumCBB (II)H2Ssqr
NORP23f_RhodobacteraceaeCBB (II)Thiosulfate, H2S, H2soxABCXYZ, sqr, rdsrA, hoxHFUY
NORP24f_ThiotrichaceaeCBB (IA+B)Thiosulfate, H2SsoxABXYZ, sqr, rdsrA
NORP31c_ZetaproteobacteriaCBB (II)Sulfur, H2S, Fe2+sdo, sqr, cyc1PV-1, cyc2PV-1
NORP33g_MethylophagaCBB (IA+B)H2S, H2sqr, hoxHFUY
NORP48g_BlastopirellulaCBB (II)?
NORP54g_RobiginitomaculumCBB (IA+B)H2Ssqr
NORP55g_RobiginitomaculumCBB (IA+B)H2Ssqr
NORP56g_KangiellaCBB (II)Thiosulfate, H2SsoxABCXYZ, sqr
NORP60CBB (II)Thiosulfate, H2SsoxCYZ, sqr
NORP65g_MethylophagaCBB (II)ThiosulfatesoxCY
NORP78f_RhodobacteraceaeCBB (II)Thiosulfate, H2SsoxABXYZ, sqr, rdsrA
NORP93g_MethylophagaCBB (IA+B)Thiosulfate, H2SsoxABCXYZ, sqr
NORP100f_EctothiorhodospiraceaeCBB (IA+B, IC/D)Thiosulfate, H2S, Fe2+(?)soxABYZ, rdsrA, cyc1PV-1
NORP103f_ThiotrichaceaeCBB (IA+B)H2S, Fe2+sqr, actB1, cyc1PV-1
NORP104f_MethylophilaceaeCBB (IC/D)?soxY
NORP108CBB (II)ThiosulfatesoxABCXYZ
NORP109g_MarinosulfonomonasCBB (II)Thiosulfate, H2S, H2soxABCXYZ, sqr, rdsrA, hoxFUY
NORP110g_MarinosulfonomonasCBB (II)Thiosulfate, H2SsoxABCXYZ, sqr, rdsrA
NORP116f_RhodospirillaceaeCBB (II)Thiosulfate, H2S, Fe2+(?)rdsrA, actB1
NORP125g_RobiginitomaculumCBB (II)H2Ssqr
NORP128f_RhodospirillaceaeCBB (II)Thiosulfate, H2S, H2, Fe2+(?)soxABXYZ, fccB, hoxHFUY, actB1
NORP169o_RhizobialesCBB (IC/D)?
NORP178-CBB (IA+B)H2S, Fe2+sqr, actB1, cyc1PV-1
NORP181f_RhodobacteraceaeCBB (II)Thiosulfate, H2SsoxABCXYZ, sqr, rdsrA
NORP192o_RhizobialesCBB (IC/D)ThiosulfatesoxABCXYZ
NORP9g_SulfurimonasrTCAThiosulfate, H2SsoxABCXYZ, sqr
NORP14g_AcrobacterrTCAThiosulfate, H2SsoxABCXYZ, tsdA, sqr
NORP62g_SulfurovumrTCAH2soxC, hoxHFUY
NORP87g_SulfurimonasrTCAThiosulfate, H2SsoxCYZ, sqr
NORP112g_SulfurimonasrTCA?
NORP168g_SulfurimonasrTCAThiosulfate, H2SsoxCY, sqr
NORP195g_SulfurimonasrTCAThiosulfatesoxABCXYZ

Abbreviations: actB1, dissimilatory Fe2+ molybdopterin oxidoreductase; cyc1PV-1, cytochrome c, cbb-type; CBB, Calvin-Benson-Bassham cycle; hoxHFUY, NAD-reducing hydrogenase; rdsrA, reverse sulfite reductase; rTCA, reverse citric acid cycle; RuBisCO, ribulose-1,5-bisphosphate carboxylase; sqr, sulfide:quinone oxidoreductase; soxABCXYZ, thiosulfate oxidation subunits/components; sdo, sulfur dioxygenase; tsdA, thiosulfate dehydrogenase.

Analysis of the putatively carbon fixing genomes for possible electron donors revealed at least 5 sources: sulfide (HS−), sulfur (S0), thiosulfate, hydrogen (H2) and ferrous iron (Fe2+; Table 2). The putative electron donors of four of the genomes could not be identified. The most prevalent electron donor as indicated by the presence within the genomes was HS−, as 25 genomes had the potential to utilize HS- via either sulfide:quinone reductase (sqr) or reverse sulfite reductase (rdsrA; Supplementary Figure 6). Twenty-one genomes possessed either the components for the SOX system or thiosulfate dehydrogenase (tsdA), suggesting a potential for thiosulfate oxidation. The presence of NAD-reducing hydrogenase, capable of the reversible H2 redox reactions, offers an avenue for H2 as an electron donor in 5 of the genomes capable of carbon fixation. A single genome (NORP31) possessed a sulfur dioxygenase (sdo) that could utilize S0 as an electron source. Lastly, 6 genomes were identified that may mediate the oxidation of Fe2+ linked to carbon fixation, based on the presence of dissimilatory Fe2+ molybdopterin oxidoreductase (Tully and Heidelberg, 2016; Act1B; Supplementary Figure 7; Supplementary Data 6) and/or Fe2+ reactive cytochromes (Barco ; Cyc1PV-1, Cyc2PV-1; Supplementary Data 7). In assessing the potential for aerobic respiration, 20.5% of the genomes were determined to possess low-oxygen sensitivity (aerobic; aa- and/or bo-type) cytochromes, 12.3% contained high-oxygen sensitivity (microaerobic; cbb- and/or bd-type) cytochromes, and an additional 55.9% contained cytochromes for both aerobic and mircoaerobic oxygen metabolism (Supplementary Figure 4; Supplementary Data 8). An assessment of anaerobic metabolisms showed that 7.7% of genomes possessed the potential to perform complete denitrification (nirK/nirS, norBC, and nosZ). An additional 19.5% of genomes were annotated to have the potential to perform a single step in denitrification process (nitrite reduction, nitric oxide reduction, or nitrous-oxide reduction), while 9.2% of genomes could potentially perform only two of the three steps (Supplementary Figure 4). Further, one genome (NORP6) contained sulfite reductase, necessary for complete sulfate reduction (Supplementary Figure 6).

Ecological units and community metabolic function

The genomes that had an appreciable presence (>0.05%, n=134) in the U1382A samples were placed into 6 ecological units (Unit I–VI) representing 98 genomes that had distinguishing temporal patterns throughout the time series (Figure 4; Supplementary Figure 8). An additional ecological unit (Unit VII) consisted of eight genomes that were generally cosmopolitan in the U1382A samples. These ecological units represent a progression of community structure over the course of the time series, though in several instances members of an ecological unit re-occur in multiple time points (Figure 4). For example, Unit I consists of genomes originally sampled in TP0 that are not observed in TP1, but are observed 12 months later in the TP2 sample. This pattern can also be observed in Unit II (genomes in TP1 seen in TP3-5) and Unit V (genomes in TP2 seen in TP7-9) and supports the results of the sample clusters that indicate that TP1 and TP3-6 are more similar to one another than TP2 and TP7-8 (Supplementary Figure 9).
Figure 4

Ecological units and community metabolic function of U1382A. Presence (left column) and predicted function (right column) for genomes assigned to ecological units. Ecological units are ordered to illustrate the progression of community structure through time. Values at the bottom of the column represent the total relative abundance of genomes in ecological units for that time point. cyt, cytochrome; DNRA, dissimilatory nitrate reduction to ammonia; dissim., dissimilatory; TP, time point.

Each ecological unit possessed genomes capable of carbon fixation, partial and complete denitrification, DNRA, thiosulfate oxidation, and sulfur redox, with the exception of complete denitrification in Unit V and sulfur redox in Unit VII. However, analysis of the fraction of the observable community based on relative abundance with a specific functional potential changes considerably over time (Figure 5). During the time series, organisms capable of DNRA and complete denitrification are positively correlated (linear regression, R2=0.86), which is reflected in the fact that most organisms capable of complete denitrification were also capable of DNRA, though the reciprocal was not true (Figure 5). This disparity between organisms with DNRA and complete denitrification likely explains why the fraction of DNRA capable community members was always greater than complete denitrification. Similarly, thiosulfate oxidation and sulfur redox processes were positively correlated (linear regression, R2=0.80), though these functions generally occur in different genomes. The community fraction capable of sulfide oxidation tracks with the observed ecological units and sample clusters, as sulfide oxidation was the most prevalent sulfur pathway in TP1, 3–6, while thiosulfate oxidation was more prevalent in TP2, 7–9.
Figure 5

Fraction of the observed microbial community with potential to contribute to biogeochemically relevant processes. (a) Carbon fixation. (b) Nitrogen cycing. (c) Sulfur cycling. DNRA, dissimilatory nitrate reduction to ammonia; dissim., dissimilatory

Comparison of the potential function of genomes assigned to U1382A, U1383C and DABW, the number of genomes with a predicted function, and the fraction of the observed community with that function indicates that genomes associated with the DABW do not possess the capability for carbon or nitrogen fixation, denitrification or sulfide oxidation (Figure 6). Based on relative abundance patterns, U1382A and U1383C are dominated by different microbial communities (Figure 3), but the number of genomes capable of the various steps in the nitrogen, sulfur, and carbon cycles do not vary (Figure 6). Further, there was no statistical difference (Student’s t-test and Wilcoxon rank sum, P<0.5) between U1382A and U1383C based on the fraction of the observed community capable of an ascribed metabolic reaction, with the exception of ammonia oxidation (Student’s t-test and Wilcoxon rank sum, P=0.005).
Figure 6

Comparison of putative microbial functionality between U1382A, U1383C and Deep Atlantic Bottom Water. (a) Metabolic pathways for nitrogen, sulfur and carbon fixation, with numbers indicating the number of genomes assigned to a particular sample type with that predicted function. (b) For each metabolic step represented in A, the fraction of the observed community from each sample that possesses that metabolic step. Abbreviations: DNRA, dissimilatory nitrate reduction to ammonia; TP, time point.

Discussion

Despite being the largest actively flowing aquifer on Earth, our understanding of microbial communities and their role in biogeochemical cycling in subseafloor crustal fluids is largely unknown. The bulk of our understanding is from studies of fluids from warm environments, including the Juan de Fuca Ridge flank in the NE Pacific Ocean and hydrothermal vents around the globe (Takai and Horikoshi, 1999; Huber ; Reveillaud ). These environments are characterized by high temperature (25–80 °C), low-oxygen fluids that are usually dominated by mesophilic and (hyper)thermophilic microorganisms with microaerobic and anaerobic metabolisms (Cowen ; Huber ; Jungbluth ; 2016). This is in contrast to North Pond, which represents a common, but understudied type of ridge flank region, where circulating fluids are cold (4–15 °C) and oxygenated (Edwards ; Meyer ). Previous work at North Pond showed that the fluids in the basaltic crust have similar chemistry to the oceanic bottom water, but that the microbial community has a distinct population structure with potential for both heterotrophic and autotrophic activity (Meyer ). Using the increased temporal and spatial sampling offered by our metagenomic time series at North Pond, we verified that the microbial community composition of the crustal fluid samples is fundamentally different from the DABW, and extended this finding to microbial communities and their genomic functional potential using MAGs (Figures 2 and 5,Supplementary Data 10). Further, we also found that the microbial communities within the crustal fluids show shifts in the dominant phyla (and proteobacterial classes) over time within a single hole and between the two holes (Figure 2). Gammaproteobacteria are dominant in 10 of the crustal fluid samples, but several other phylogenetic groups, Alpha- and Deltaproteobacteria, Epsilonbacteraeota and Bacteroidetes, are abundant in other samples. The initial samples were collected in 2012, approximately six months after the holes were drilled and the CORK systems were installed, therefore it is possible that the observed shifts are due to the holes returning to a natural state after the perturbation of drilling, during which surface water is pumped into the borehole to clear cuttings, inevitably pumping surface waters into the formation. Such shifts in subseafloor crustal fluid community structure have been documented in samples collected shortly after drilling and for several years afterwards on the flanks of the Juan de Fuca Ridge, a younger, warmer crustal system (Jungbluth , 2016), highlighting the importance of time series for understanding such ecosystems and potential stresses. However, the magnitude of chemical shifts observed in discrete samples collected in 2012 and 2014 suggests only minor changes in geochemistry, including a decrease in dissolved oxygen concentrations and increase in dissolved silica concentrations at all four sampling horizons. Increases in dissolved silica may result from either diffusive exchange with sediment pore waters or water-rock reactions at low temperatures, whereas the decrease in oxygen concentrations indicates continued consumption of oxygen (Ziebis ; Meyer ), such as that inferred from a similar cool ridge flank setting at Dorado Outcrop (Wheat ). The high-resolution analysis, provided by the relative abundance of the reconstructed genomes, reveals that the microbial communities of U1382A, U1383C, and the DABW are composed of distinct MAGs (Figure 3). Importantly, genomes from the DABW form a cohesive group of organisms that were not present (or had a limited presence) in the crustal fluids, and conversely none of the crustal-originating genomes were detected in the DABW. From these results, it is clear that the genomes we reconstructed represent residential subseafloor bacteria and archaea from North Pond crustal fluids, thus allowing for detailed examination of microbial metabolic functions and community dynamics and interactions within the North Pond crustal habitat. It is important to note, however, that the reconstructed genomes only represent a subset of the total microbial community from any one of the metagenomic samples, thus we can only interpret results from the observed community members (Supplementary Data 4). It is likely, though, that due to the dynamics of assembly and binning that these genomes represent many of the most abundant organisms in the environment.

Carbon fixation

Previous results from North Pond samples in 2012 showed lower concentrations of dissolved organic carbon in the crustal fluids compared to seawater, as well as the potential for carbon fixation, with higher potential rates of autotrophy in the crust compared to seawater, especially at warmer temperatures (25°C) and deeper in the crust (Meyer ). In addition, limited metagenomic analysis of three samples from 2012 showed the presence of some genes associated with carbon fixation (Meyer ). Our assessment of genomes for the presence of genes representative of autotrophic carbon fixation resulted in the identification of two carbon fixation pathways: the CBB cycle and the reverse citric acid (rTCA) cycle (Table 2). All instances of the rTCA cycle were identified within the Epsilonbacteraeota, and the CBB cycle was identified in several different groups, including the Alpha-, Gamma-, and Zetaproteobacteria, as well as the Planctomycetes. Each of the genomes with potential for carbon fixation was also analyzed for pathways that could provide a lithotrophic source of reducing potential necessary for carbon fixation (Table 2). Results indicate that the most prevalent electron source identified amongst the putative carbon fixing genomes was sulfide, but several other electron sources were also identified, including thiosulfate, ferrous iron, sulfur, and hydrogen. These electron sources are likely coupled to the reduction of oxygen, as all but one of the genomes with predicted carbon fixation possess aerobic or microaerobic terminal oxidases. Possible additional terminal electron acceptors include nitrate and the intermediates of denitrification, as all but two of the carbon fixation genomes possess components of the denitrification or DNRA pathways (Supplementary Figure 4). While a majority of the genomes with carbon fixation potential are linked to the oxidation of sulfur compounds, a group of genomes have the potential to utilize both H2 and Fe2+ to drive biomass production in support of the model proposed by Bach (2016). These putative energy couples are congruent with the hypothesis of subseafloor microbial communities that can take advantage of the redox gradient created by the presence of reduced material in volcanic-derived basalt rocks and the oxygenated aquifer fluids (Bach and Edwards, 2003; Bach, 2016). Hydrogen sulfide and iron species have not been detected in the crustal fluids at North Pond (Meyer ), but the oxidation of the iron in sulfide complexes in crustal rocks (via biotic or abiotic process) would increase access to sulfide compounds for microorganisms (Barco ) and for the abiotic oxidation of sulfide to thiosulfate (Moses ). In this manner, it would be possible to sustain carbon fixation through multiple lithotrophic pathways, which are likely important due to the oligotrophic nature of the crustal fluids. This is similar to the prevailing theory in regards to terrestrial crustal systems (Hallbeck and Pedersen, 2008), where lithoautotrophic growth in microorganisms via the CBB cycle has been found in deep terrestrial aquifers in the Fennoscandian shield (Wu ).

Genomic evidence for the prevalence of hypoxic conditions

All measurements at North Pond show that the aquifer fluids at North Pond are oxygenated, with O2 concentrations equal to or slightly less (185–244 μm) than that of the DABW (~250 μm; Table 1; Meyer ). Therefore, it was unexpected to find that many of the North Pond genomes had genes that suggest hypoxic or potentially anoxic conditions. More than half of the genomes (56%) had terminal c-type cytochromes for both aerobic (aa- and bo-type) and microaerobic (cbb- and bd-type) metabolisms, with an additional 13% of genomes only possessing the microaerobic cytochromes (Supplementary Figure 4). There was substantial evidence that the organisms in this environment were capable of the reduction of nitrate via both dissimilatory nitrate reduction to ammonia (DNRA; 36%) and denitrification (36% Supplementary Figure 4). Further, NORP6 possessed the canonical sulfite reductase, necessary for the anaerobic conversion of sulfite to sulfide (Supplementary Figure 4). The role that these genes, commonly associated with anaerobic metabolisms, play in the environment is unclear. It is possible that, similar to sub-oxic microenvironments encountered in the oxic surface ocean (Ploug ), the subseafloor hosts microenvironments in which anaerobic metabolisms are ecologically viable. Like the surface ocean, one possible source of such microenvironments may be organic-rich particles, that can be readily colonized by heterotrophic microorganisms. In 2012, samples collected from North Pond crustal fluids showed a high heterogeneity of particles as detected on GFF filters (Meyer ). Another possibility may be that the complex and fractured structure of the crustal aquifer provides both oxic and sub-oxic conditions. For example, hydrogeological studies of the Juan de Fuca Ridge flank indicated that fluid flow through the crust likely only occurs through small, discrete channels, restricted to a small volume (<1%) of the crust (Fisher and Becker, 2000). Consequently fluid flow would be highly channelized through a small volume of the crustal rock. While measurements at North Pond CORKs show abundant oxygen, it is possible there are regions where fluid flow slows down and fluids could become stagnant, and anaerobic metabolisms may be more significant to the community as oxygen is consumed by heterotrophic activity or abiotic reactions. However, such stagnant fluids would likely not be indicative of the large crustal flow. Overall, the lack of an appreciable signal in the geochemical data may be the result of the extremely low biomass (~104 cells ml−1) and relatively recent entrainment of the formation fluids, especially in U1382A.

Variable inter- and intra-borehole metabolic diversity

The microbial community observed in U1382A can be effectively assigned to seven ecological units with distinct occurrence patterns (Figure 4). These ecological units generally progress in sequential order, though several genomes within an ecological unit were detected in multiple time points, with up to 11 months between samples (TP2 vs TP7). This re-occurrence of members of the community suggest that there is mechanism for organisms to persist in the aquifer, either locally or transported from elsewhere within the subseafloor. Patterns may also be related to local geochemical conditions, where growth, and thus relative abundance, is tied to specific metabolic processes. Despite these changes in community structure over time, the genomes that are present in the ecological units are functionally redundant, with various metabolisms related to carbon fixation and nitrogen and sulfur cycling present in each of the measured time points (Figure 4). While the ecological units as a whole are functionally redundant, the fraction of the observed community capable of a specific metabolic potential shifts over the course of the time series (Figures 5a–c). Shifts in genomes capable of nitrate reduction (DNRA and complete denitrification) and sulfur oxidation (thiosulfate oxidation and sulfur redox) processes were positively correlated, suggesting that these metabolic pairs are linked to the same environmental change. Further, shifts in the fraction of the community capable of sulfide oxidation is linked to a microbial community structure that overlaps TP1 and TP3-6, while thiosulfate oxidation is linked to overlaps in TP2 and TP7-8 (Figures 5a–c; Supplementary Figure 9). This suggests that changes in availability of sulfide and thiosulfate are responsible for the changes in microbial community structure, or conversely, that microbial community metabolic potential impacts the availability of sulfide and thiosulfate. In comparing U1382A and U1383C, several large, cohesive microbial groups were present in both boreholes (Figure 3), with organisms more abundant in U1383C clustering together, to the exclusion of organisms more abundant in U1382A. However, it was common for a group of MAGs to be more abundant in one hole and also have a reduced or minimal abundance in the other hole (Figure 3). While this result suggests there is some connectivity between the two subseafloor environments sampled by the CORKs, it is also clear that there are distinct, dominant populations within each hole, likewise there are distinct chemical signatures in both. However, the variation in community structure does not result in differences in metabolic potential, with functional redundancy in all queried processes, except for nitrogen fixation (Figure 6). This functional redundancy is further reflected in the fraction of the observed microbial community capable of participating in each metabolic step, with no statistically significant difference between the boreholes, except for ammonia oxidation (Figure 6). These results indicate that the observed differences in community structure are not related to carbon fixation or nitrogen and sulfur cycling, and are likely governed by environmental parameters that structure spatially distinct communities with a high degree of functional redundancy. A top–down control on community structure could be susceptibility to viral predation (Nigro ), while a bottom-up control may involve limits in trace nutrients or vitamin availability. Continued analysis of these data and future sampling efforts will help to elucidate the extent of these controls on the microbial community.

Concluding remarks

The microbial community in the crustal fluids of North Pond is temporally and spatially dynamic. The putative genomes extracted from our time series reveal a microbial community capable of impacting subseafloor biogeochemical cycles for carbon, nitrogen, sulfur and iron. These potential functions are redundant as community membership varies in time and space, suggesting that the communities present in both boreholes are poised to utilize the redox potential of the oceanic crust by exploiting reduced sulfur compounds and ferrous iron to drive autotrophic growth. Further research will elucidate the extent to which these organisms drive global biogeochemical processes.

Data availability

This project has been deposited at DDBJ/ENA/GenBank under the BioProject accession no. PRJNA391950, drafts of metagenome-assembled genomes are available with accession no. NVQK00000000-NVXW00000000, and raw sequence reads are available with accession no. SRX3143886-SRX3143902. Raw sequence reads from Meyer constituting the metagenomic samples from 2012, are available under the BioProject accession no. PRJNA280201. Additional files have been provided and are available through figshare (https://figshare.com/s/939160bb2d4156022558), such as: all primary and secondary contigs; MAGs and bins not analyzed as part of this research; and, all files described as Supplementary Data 1–17.
  56 in total

1.  ARB: a software environment for sequence data.

Authors:  Wolfgang Ludwig; Oliver Strunk; Ralf Westram; Lothar Richter; Harald Meier; Arno Buchner; Tina Lai; Susanne Steppi; Gangolf Jobb; Wolfram Förster; Igor Brettske; Stefan Gerber; Anton W Ginhart; Oliver Gross; Silke Grumann; Stefan Hermann; Ralf Jost; Andreas König; Thomas Liss; Ralph Lüssmann; Michael May; Björn Nonhoff; Boris Reichel; Robert Strehlow; Alexandros Stamatakis; Norbert Stuckmann; Alexander Vilbig; Michael Lenke; Thomas Ludwig; Arndt Bode; Karl-Heinz Schleifer
Journal:  Nucleic Acids Res       Date:  2004-02-25       Impact factor: 16.971

2.  FastTree 2--approximately maximum-likelihood trees for large alignments.

Authors:  Morgan N Price; Paramvir S Dehal; Adam P Arkin
Journal:  PLoS One       Date:  2010-03-10       Impact factor: 3.240

3.  Oxygen consumption rates in subseafloor basaltic crust derived from a reaction transport model.

Authors:  Beth N Orcutt; C Geoffrey Wheat; Olivier Rouxel; Samuel Hulme; Katrina J Edwards; Wolfgang Bach
Journal:  Nat Commun       Date:  2013       Impact factor: 14.919

4.  SINA: accurate high-throughput multiple sequence alignment of ribosomal RNA genes.

Authors:  Elmar Pruesse; Jörg Peplies; Frank Oliver Glöckner
Journal:  Bioinformatics       Date:  2012-05-03       Impact factor: 6.937

5.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

6.  The integrated microbial genomes (IMG) system.

Authors:  Victor M Markowitz; Frank Korzeniewski; Krishna Palaniappan; Ernest Szeto; Greg Werner; Anu Padki; Xueling Zhao; Inna Dubchak; Philip Hugenholtz; Iain Anderson; Athanasios Lykidis; Konstantinos Mavromatis; Natalia Ivanova; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  Microbial metagenomes from three aquifers in the Fennoscandian shield terrestrial deep biosphere reveal metabolic partitioning among populations.

Authors:  Xiaofen Wu; Karin Holmfeldt; Valerie Hubalek; Daniel Lundin; Mats Åström; Stefan Bertilsson; Mark Dopson
Journal:  ISME J       Date:  2015-10-20       Impact factor: 10.302

8.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Authors:  Salvador Capella-Gutiérrez; José M Silla-Martínez; Toni Gabaldón
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

9.  Identification of ribosomal RNA genes in metagenomic fragments.

Authors:  Ying Huang; Paul Gilna; Weizhong Li
Journal:  Bioinformatics       Date:  2009-04-03       Impact factor: 6.937

10.  Reverse dissimilatory sulfite reductase as phylogenetic marker for a subgroup of sulfur-oxidizing prokaryotes.

Authors:  Alexander Loy; Stephan Duller; Christian Baranyi; Marc Mussmann; Jörg Ott; Itai Sharon; Oded Béjà; Denis Le Paslier; Christiane Dahl; Michael Wagner
Journal:  Environ Microbiol       Date:  2008-09-26       Impact factor: 5.491

View more
  33 in total

1.  Circumventing kinetics in biogeochemical modeling.

Authors:  Stilianos Louca; Mary I Scranton; Gordon T Taylor; Yrene M Astor; Sean A Crowe; Michael Doebeli
Journal:  Proc Natl Acad Sci U S A       Date:  2019-05-16       Impact factor: 11.205

2.  Bacterial community structure and functional profiling of high Arctic fjord sediments.

Authors:  S Vishnupriya; T Jabir; K P Krishnan; A A Mohamed Hatha
Journal:  World J Microbiol Biotechnol       Date:  2021-07-13       Impact factor: 3.312

3.  Microbial production and consumption of hydrocarbons in the global ocean.

Authors:  Connor R Love; Eleanor C Arrington; Kelsey M Gosselin; Christopher M Reddy; Benjamin A S Van Mooy; Robert K Nelson; David L Valentine
Journal:  Nat Microbiol       Date:  2021-02-01       Impact factor: 17.745

4.  Genomic Insights into Two Novel Fe(II)-Oxidizing Zetaproteobacteria Isolates Reveal Lifestyle Adaption to Coastal Marine Sediments.

Authors:  Nia Blackwell; Casey Bryce; Daniel Straub; Andreas Kappler; Sara Kleindienst
Journal:  Appl Environ Microbiol       Date:  2020-08-18       Impact factor: 4.792

5.  Microbial Abundance and Diversity in Subsurface Lower Oceanic Crust at Atlantis Bank, Southwest Indian Ridge.

Authors:  Shu Ying Wee; Virginia P Edgcomb; David Beaudoin; Shari Yvon-Lewis; Jason B Sylvan
Journal:  Appl Environ Microbiol       Date:  2021-09-01       Impact factor: 4.792

6.  Genus-Specific Carbon Fixation Activity Measurements Reveal Distinct Responses to Oxygen among Hydrothermal Vent Campylobacteria.

Authors:  Jesse McNichol; Stefan Dyksma; Marc Mußmann; Jeffrey S Seewald; Sean P Sylva; Stefan M Sievert
Journal:  Appl Environ Microbiol       Date:  2021-11-17       Impact factor: 5.005

7.  Time-series transcriptomics from cold, oxic subseafloor crustal fluids reveals a motile, mixotrophic microbial community.

Authors:  Lauren M Seyler; Elizabeth Trembath-Reichert; Benjamin J Tully; Julie A Huber
Journal:  ISME J       Date:  2020-12-03       Impact factor: 10.302

8.  Recycling and metabolic flexibility dictate life in the lower oceanic crust.

Authors:  Jiangtao Li; Paraskevi Mara; Virginia P Edgcomb; Florence Schubotz; Jason B Sylvan; Gaëtan Burgaud; Frieder Klein; David Beaudoin; Shu Ying Wee; Henry J B Dick; Sarah Lott; Rebecca Cox; Lara A E Meyer; Maxence Quémener; Donna K Blackman
Journal:  Nature       Date:  2020-03-11       Impact factor: 69.504

Review 9.  Low Energy Subsurface Environments as Extraterrestrial Analogs.

Authors:  Rose M Jones; Jacqueline M Goordial; Beth N Orcutt
Journal:  Front Microbiol       Date:  2018-07-18       Impact factor: 5.640

10.  Unravelling the diversity of magnetotactic bacteria through analysis of open genomic databases.

Authors:  Maria Uzun; Lolita Alekseeva; Maria Krutkina; Veronika Koziaeva; Denis Grouzdev
Journal:  Sci Data       Date:  2020-07-31       Impact factor: 6.444

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.