Literature DB >> 28362721

Hidden diversity revealed by genome-resolved metagenomics of iron-oxidizing microbial mats from Lō'ihi Seamount, Hawai'i.

Heather Fullerton1, Kevin W Hager1, Sean M McAllister1, Craig L Moyer1.   

Abstract

The Zetaproteobacteria are ubiquitous in marine environments, yet this class of Proteobacteria is only represented by a few closely-related cultured isolates. In high-iron environments, such as diffuse hydrothermal vents, the Zetaproteobacteria are important members of the community driving its structure. Biogeography of Zetaproteobacteria has shown two ubiquitous operational taxonomic units (OTUs), yet much is unknown about their genomic diversity. Genome-resolved metagenomics allows for the specific binning of microbial genomes based on genomic signatures present in composite metagenome assemblies. This resulted in the recovery of 93 genome bins, of which 34 were classified as Zetaproteobacteria. Form II ribulose 1,5-bisphosphate carboxylase genes were recovered from nearly all the Zetaproteobacteria genome bins. In addition, the Zetaproteobacteria genome bins contain genes for uptake and utilization of bioavailable nitrogen, detoxification of arsenic, and a terminal electron acceptor adapted for low oxygen concentration. Our results also support the hypothesis of a Cyc2-like protein as the site for iron oxidation, now detected across a majority of the Zetaproteobacteria genome bins. Whole genome comparisons showed a high genomic diversity across the Zetaproteobacteria OTUs and genome bins that were previously unidentified by SSU rRNA gene analysis. A single lineage of cosmopolitan Zetaproteobacteria (zOTU 2) was found to be monophyletic, based on cluster analysis of average nucleotide identity and average amino acid identity comparisons. From these data, we can begin to pinpoint genomic adaptations of the more ecologically ubiquitous Zetaproteobacteria, and further understand their environmental constraints and metabolic potential.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28362721      PMCID: PMC5520029          DOI: 10.1038/ismej.2017.40

Source DB:  PubMed          Journal:  ISME J        ISSN: 1751-7362            Impact factor:   10.302


Introduction

Microbes are everywhere, and in many ecosystems they are the key drivers of biogeochemical cycles. Iron is the most abundant element in the earth and only microbes are able to utilize it as an energy source. Mineralogical evidence of iron- oxidizers has been found, dating to 1.89 Ga, making iron oxidation a very ancient metabolism (Planavsky ). Early Earth hosted a ferruginous ocean where iron oxidation may have been the dominant metabolism (Ilbert and Bonnefoy, 2013; Guilbaud ). Microbial iron oxidizers are found suspended in the water column (Field ), but extensive microbial growth by iron oxidation is limited to areas of high ferrous iron and low oxygen concentrations, such as hydrothermal vents (Emerson and Moyer, 2010; Scott ). Reduced iron released by hydrothermal vent systems fuels primary production by lithoautotrophic microbes, which in turn support additional trophic levels making hydrothermal vent systems some of the most biologically active regions of the deep-sea (Sievert and Vetriani, 2012). It is estimated that 3 × 1011 mol of Fe(II) is released each year through hydrothermal venting in Earth’s oceans (Holland, 2006), and is transported in the water column thousands of kilometers away from the source (Resing ), where it can be utilized by phototrophs in the upper ocean; however, iron is still a limiting factor for phototrophs in the upper ocean (Raven ). The abiotic oxidation of Fe(II) by O2 is rapid in fully aerated seawater (Konhauser ; Druschel ). Therefore, from a microbe’s perspective, Fe(II) is potentially a vast food source, yet it is as ephemeral as it is abundant and bioavailable. Microbial iron oxidation has been recognized in freshwater systems since the 1890s, whereas microbial iron oxidation in marine systems is just beginning to be recognized (Emerson ; Fleming ). The isolates of the newest class of Proteobacteria, the Zetaproteobacteria, are described as neutrophilic marine iron-oxidizers (Emerson ). Zetaproteobacteria have been identified throughout the Pacific and Atlantic Oceans at hydrothermal vent habitats and estuaries (McAllister ; Scott ; Field ). At sites where the predominant vent effluent is high in ferrous iron, Zetaproteobacteria are the dominant microbial mat community members, with the classes of the Gamma-, Delta- and Epsilon-proteobacteria as well as Nitrospira consistently detected in these habitats (Moyer ; Rassa ; Fleming ). Several Zetaproteobacteria operational taxonomic units have been identified and two are globally ubiquitous in iron-driven microbial mat communities (McAllister ). Zetaproteobacteria are considered ecosystem engineers due to their foundational role in the formation of the microbial mat architecture. This architecture is comprised of exopolysaccharide structures, including twisted helical stalks or tubular sheaths as observed by microscopic analysis of cultures and microbial mats (Chan ; Fleming ; Chan ). Through the production of stalks or sheaths, the Zetaproteobacteria can alter their physical and chemical environment (Chan ). Furthermore, Zetaproteobacteria are lithoautotrophs and the primary producers in iron-dominated hydrothermal vent systems (Singer ; Field ). Previous molecular analysis of microbial mats at Lō’ihi Seamount showed that Zetaproteobacteria correlate with the abundance of key functional genes, but that functional gene abundance did not vary across varying mat morphologies; furthermore, vent chemistry was found to be associated with the observed mat morphologies (Jesser ), suggesting unrealized genomic diversity within the Zetaproteobacteria. Zetaproteobacteria were first described at Lō’ihi Seamount, which is located 35 km south-east of the big island of Hawai’i and hosts a plethora of dynamic hydrothermal vents (Moyer ). In 1996, a major eruption formed Pele’s Pit, a 300 m wide caldera near the summit, with several active hydrothermal venting sites (Figure 1). Before the 1996 eruption, Lō’ihi was dominated by low-temperature diffuse-flow hydrothermal vents emitting fluids up to ~70 °C and elevated levels of Fe(II), CO2, CH4 and NH4+ (Sedwick ; Wheat ) and has now returned to these pre-eruption conditions (Glazer and Rouxel, 2009).
Figure 1

Bathymetric map (high resolution at <2 m) of sampling sites in and near Pele’s Pit caldera on the summit of Lō’ihi Seamount, Hawai’i. Precise marker locations include Pohaku (Marker 57), Hiolo North (Markers 36, 39 and 31), Hiolo South (Markers 34, 38 and Ku’kulu). Courtesy of Susan Merle, NOAA EOI/OSU.

Biogeographic patterns for marine microbes remain poorly understood in terms of distribution scale and evolutionary divergence rates. To address this, we sequenced six distinct microbial mat communities collected from Lō’ihi Seamount. From this, a shotgun metagenomics approach was used, where we were able to construct a composite assembly for genome binning. We used differential coverage analysis to reconstruct site-specific community composition and compared this to the community structure as determined by taxa specific Quantitative PCR (qPCR) analyses. Here we present genome-resolved metagenomics to further explore patterns of biodiversity and adaptation of Zetaproteobacteria populations, including two ecologically significant Zetaproteobacteria OTUs (zOTUs).

Materials and methods

Sample collection

Microbial mat samples were collected at Lō’ihi Seamount, HI in October 2009 by the remotely operated vehicle (ROV) Jason II onboard the R/V Kilo Moana. Samples were collected from within Pele’s Pit at Hiolo North (Markers 31, 36 and 39), Hiolo South (Markers 34 and 38), and Ku’kulu Base (no marker) or on the caldera rim at Pohaku (Marker 57) (Figure 1). All samples were collected using a single-action Biomat Syringe (BS) sampler (Figure 2c) as described in Fleming .
Figure 2

Photos illustrating the different mat morphologies. (a) curd-type mat from Marker 34, (b) curd-type mat from Marker 57, (c) veil-type mat from Ku’kulu, (d) veil-type mat from Marker 39, (e) streamers from Marker 31, and (f) streamers from Marker 39. Scale bars are 10 cm.

DNA extraction, T-RFLP analysis, and qPCR

Genomic DNA was extracted from samples using the FastDNA SPIN kit for soil (MP Biomedical, Santa Ana, CA, USA) according to manufacturer’s protocol. Cells were lysed by bead beating twice (stored on ice for 5 min in between) in a FastPrep instrument (MP Biomedical) at a speed setting of 5.5 for 45 s and DNA was eluted with 1 mM Tris at pH 8. Genomic DNA was quantified with a Qubit 2.0 fluorometer (ThermoFisher Scientific, Waltham, MA, USA). Samples were PCR amplified for use in terminal-restriction fragment length polymorphisms (T-RFLP) as previously described (Davis and Moyer, 2008). PCR was visualized on a 1% agarose gel before restriction digestion. The end-labeled fragments were run on an ABI model 3130XL automated DNA sequencer and the data were analyzed with the BioNumerics v7.6 software (Applied Maths, Austin, TX, USA). SSU rRNA gene clone libraries from five sampled microbial mat communities were constructed as described in McAllister , in order assess putative phylotypes in the T-RFLP dataset (Supplementary Figures 1 and 2). qPCR conditions along with Bacterial and Zetaproteobacterial primers used were the same as described by Jesser et al. (2015). Zetaproteobacteria abundance was determined using the ratio of Zetaproteobacteria to Bacteria SSU rRNA gene copies per nanogram gDNA. No qPCR data were used unless primers exhibited better than 95% efficiency and yielded single-peak amplicons upon post-PCR melt curve analysis.

Metagenomic sequencing, assembly and annotation

Extracted DNA was cleaned and concentrated using an Aurora (Boreal Genomics, Vancouver, BC, Canada) prior to sequencing; libraries were prepared with the Nextera DNA Library Kit (Illumina, San Diego, CA, USA). Sample J2-479-BS3 was run on an Illumina HiSeq 2000 using paired-end sequencing with reads of 101 bp from each end. Sample J2-483-BS63 was run using paired-end sequencing with reads of 84 bp on an Illumina MiSeq and was a combination of two samples collected from the same microbial mat. The remainder of the samples were run using an Illumina MiSeq with paired-end sequencing of 308 bp reads (Supplementary Table 1). Sequenced reads were quality checked using FastQC (Andrews, 2010) and were trimmed of adaptors, and pairs were matched using cutadapt (Martin, 2011). Trimmed reads were normalized using BBnorm (target depth: 18). The resulting reads were assembled with IDBA-UD (Peng ) (k-mer sizes: 50–240 in steps of 10 without correction). Reads from each sample were mapped, with bowtie2 (Langmead and Salzberg, 2012) to the composite assembly to get coverage information. This was then used to construct genome bins with MaxBin 2.0 (Wu ) using default parameters. The resulting genome bins were evaluated with CheckM (Parks ). The assembled composite metagenome was uploaded to Integrated Microbial Genomics (IMG) for annotation. Genome bins were separated from bulk data after annotation. Genes annotated as specific proteins identified in M. ferrooxydans PV-1 were identified by BLASTp searches of the composite metagenome with an e-value cutoff of 10−5 (Supplementary Tables 2). Cyc1PV-1 (DAA64808.1), and Cyc2PV-1 (AKN35166.1) were identified via proteomics (Barco ) and Mob (SPV1_03948) identified via fosmid library genome analysis (Singer ).

Average nucleotide and average amino acid identities

Genome bins identified as Zetaproteobacteria by CheckM were compared to genome sequences of Zetaproteobacteria single amplified genomes (SAGs) and Zetaproteobacteria isolate genomes. The average nucleotide identity (ANI) was calculated using the BLAST-based algorithm tool in JSpecies v1.2.1 (Richter and Rosselló-Móra, 2009). The average amino acid identity was calculated using the enveomics toolbox (Rodriguez-R and Konstantinidis, 2016). Hierarchical cluster analysis was calculated in R using the gplots package.

Phylogenetic analysis

All genes annotated as a ribulose 1,5-bisphosphate carboxylase (RubisCO) were further analyzed for binning and taxonomic placement. IMG phylogeny was used for the unbinned genes, whereas CheckM was used for the genes within a genome bin. The identified RubisCO Form II amino acid sequences were then aligned using the Geneious v9.1 aligner (Kearse ). The resulting alignments were manually screened, and all sequences less than 110 amino acids were removed from analysis. The resulting alignment was then used to create a phylogenic consensus tree with RAxML v7.2.8 using the gamma GTR protein model with 1000 bootstrap iterations, again with Geneious (Kearse ).

Accession numbers

Representative sequences from each operational taxonomic unit identified in the clone libraries were submitted to GenBank. Accession numbers are JQ287646−JQ287657, JX468894 (Fleming ) and KY417831−KY417866 (this study). All metagenomic contigs have been made available in the IMG Database (IMG Genome ID 3300009408). All sequence data are also available from NCBI SRA (Biosample accessions SAMN06226859-SAMN06226864).

Results and Discussion

Site description and community structure

Microbial mats vary in and around Pele’s Pit in gross morphology and color, from white-yellow to burnt orange (Figures 2a–f). In addition to variation in color, the mats had variable textures that were assigned to three specific mat morphological groups associated with variable fluid flow regimes. These were described as curds in the presence of direct flow (Figures 2a and b), veils associated with diffuse flow (Figures 2c and d), and streamers also found in direct flow (Figures 2e and f). Pohaku is the only sample site located on the outside of Pele’s Pit on the southern rim of this caldera (Figure 1), and has been characterized as highest in reduced iron, at nearly 1 mM (Glazer and Rouxel, 2009). Microscopic analysis of the curd-type mat shows the predominance of helical stalks (Chan ), whereas analysis of veil-type mats showed a prevalence of the sheathed morphology (Fleming ). A comprehensive community fingerprint analysis by T-RFLP of 25 mat communities from seven vent sites showed three distinct groups, which corresponded to the gross mat morphology of curds, veils and streamers (Figure 3); however, these groups did not correlate with location or site temperature. All of the microbial mats were collected with a Biomat Syringe sampler, allowing for precision sampling of the topmost active layer of the mat. The morphology of Group I mats are characterized as light yellow to light orange curds, Group II are yellow veiled-type mats and Group III are comprised of white to dark orange streamers attached to the vent orifice. Group I mats had the greatest abundance (56.3%±15.5%) of Zetaproteobacteria within the bacterial community, whereas Group II had significantly less Zetaproteobacteria (20.5%±2.7%) and Group III had the lowest (17.7%±13.7%) as determined by qPCR.
Figure 3

T-RFLP analysis of BioMat Samples collected from Lō’ihi microbial mats. Colored boxes represent the three different groups that resulted from the cluster analysis: Group I, orange; Group II, blue; Group III, red. The sampled mat communities in bold were selected for Illumina metagenomic sequencing.

SSU rRNA gene clone libraries were constructed from representative mat communities in an attempt to identify the microbial community members driving the T-RFLP clustering. Group I mats had a lower bacterial diversity compared to the other two groups, and exhibited high levels of zOTUs 1 and 2. Group II mats contained a higher abundance of zOTUs 4, 6 and 10 along with Gammaproteobacteria. Group III mats were dominated by sulfur- and hydrogen-metabolizing Epsilonproteobacteria, with a smaller contribution from zOTUs. These results highlight a clear difference between Fe-rich (Groups I and II) and S-rich (Group III) habitats, and between direct flow (Groups I and III) and indirect/diffuse-flow (Group II) environments (Figures 2 and 3; Supplementary Figures 1 and 2).

Assembly and annotation

Six samples, two representatives from each morphotype group, were chosen for metagenomic sequencing. The resulting composite assembly had 162 376 contigs comprised of 289 114 522 bases with an overall GC% of 51.1% and an n50 of 3483. This composite assembly was separated into genome bins based on coverage and tetranucleotide frequencies of the scaffolds with MaxBin 2.0. These bins contained 77.9% of the total composite metagenome bases and 37.4% of the scaffolds. There were no sequences in multiple bins. Genome binning of the composite metagenome resulted in 93 total bins. These genome bins were assessed for completeness and taxonomic classification using CheckM (Parks ). Two of the bins were identified as Archaea, which is consistent with previous analysis showing Archaea were either below the detection limit or less than 5% of the community, and generally derived from deep-sea archaeoplankton retention in the mats (Moyer ; Rassa ). Nine genomes were unresolved to the class level; however, one of these bins contained a full-length SSU rRNA gene identified as a Deferribacteres (LoihiBin_014). Unclassified bins were removed from further analysis. The most numerous genome bins identified belonged to the Zetaproteobacteria (Table 1). Overall, the bins had an average n50 of 12 376 in an average of 707 scaffolds. The genome bins range in completeness from 6.55 to 100%, with an average of 70.8% (±30.8%). Contamination ranged from 0.0 to 83%, with an average of 12.6% (±15.0%). On average, the Zetaproteobacteria genome bins were 62.9% (±34.1%) complete, with an average contamination level of 11.6% (±13.4%).
Table 1

Summary statistics of 77 population genomes, which have been assigned to a phylogenetic class

Bin IdClassGC Content (%)Genome size (Mbp)Gene countCompl. (%)Cont. (%)Scaffolds (no.)Longest scaffold (bp)
ZetaBin011Zetaproteobacteria58.512.14247296.642.4330550 762
ZetaBin022Zetaproteobacteria55.752.27304883.5112.68110717 841
ZetaBin030Zetaproteobacteria49.620.142029.400.00578654
ZetaBin035Zetaproteobacteria60.802.92330694.2622.3752599 453
ZetaBin037Zetaproteobacteria58.781.98292933.299.66125416 849
ZetaBin040Zetaproteobacteria47.982.91281298.740.84145144 433
ZetaBin041Zetaproteobacteria50.512.19239196.8012.7732143 694
ZetaBin042Zetaproteobacteria51.092.64270797.063.21188129 034
ZetaBin043Zetaproteobacteria50.121.90233963.985.4353825 129
ZetaBin047Zetaproteobacteria51.363.18458173.6633.61136425 119
ZetaBin049Zetaproteobacteria52.242.65374772.4126.5495518 206
ZetaBin050Zetaproteobacteria51.423.33411895.6618.2660076 915
ZetaBin052Zetaproteobacteria43.640.6068222.860.0514616 046
ZetaBin055Zetaproteobacteria43.430.7590940.870.1422111 516
ZetaBin056Zetaproteobacteria43.770.6483726.423.0924712 705
ZetaBin057Zetaproteobacteria43.160.4562519.751.7220210 079
ZetaBin058Zetaproteobacteria43.590.3343814.110.1915311 627
ZetaBin059Zetaproteobacteria42.760.7195630.099.8027511 729
ZetaBin060Zetaproteobacteria42.090.415876.550.341807595
ZetaBin062Zetaproteobacteria43.750.6278334.870.0021611 379
ZetaBin064Zetaproteobacteria43.020.7395318.260.0030416 336
ZetaBin065Zetaproteobacteria41.824.32557578.5034.27155627 653
ZetaBin066Zetaproteobacteria48.523.69426599.585.4654187 185
ZetaBin069Zetaproteobacteria42.710.79110910.270.6934210 884
ZetaBin077Zetaproteobacteria50.372.17264893.6316.4373817 579
ZetaBin078Zetaproteobacteria43.080.5070219.480.002599548
ZetaBin079Zetaproteobacteria51.172.24252186.2710.8536524 239
ZetaBin080Zetaproteobacteria47.962.96300196.2215.6428089 556
ZetaBin084Zetaproteobacteria49.434.46555595.7736.35207322 352
ZetaBin088Zetaproteobacteria52.512.94335690.9922.2782730 533
ZetaBin089Zetaproteobacteria46.847.73886198.5955.282338145 501
ZetaBin090Zetaproteobacteria44.392.66308091.2222.5783626 177
ZetaBin091Zetaproteobacteria43.811.60225759.553.329007813
ZetaBin092Zetaproteobacteria48.982.19248690.178.0557933 624
PlanctoBin028Planctomycetia70.382.53302766.1518.11115315 570
PlanctoBin046Planctomycetia56.903.93533659.9529.64239262 755
NitroBin001Nitrospira42.822.702876100.002.73145102 379
NitroBin004Nitrospira49.602.94317891.133.5242372 390
NitroBin006Nitrospira55.002.97272897.411.72107231 086
NitroBin008Nitrospira54.703.51323199.082.44130308 593
NitroBin010Nitrospira48.181.27162810.850.1652519 238
NitroBin051Nitrospira66.961.91230359.181.728739924
IgnaviBin015Ignavibacteria34.924.284351100.0022.1579262 071
GemmaBin005Gemmatimonadetes65.733.33273297.801.1093433 639
GemmaBin009Gemmatimonadetes70.003.082652100.001.10139186 026
GammaBin013Gammaproteobacteria62.843.47330897.464.5636692 648
GammaBin018Gammaproteobacteria64.563.40347498.2810.5344984 061
GammaBin021Gammaproteobacteria64.891.72212261.342.3747021 092
GammaBin025Gammaproteobacteria63.752.91307497.567.7138339 220
GammaBin034Gammaproteobacteria45.523.17401797.9321.32108037 727
GammaBin036Gammaproteobacteria55.823.34421368.9718.28115233 248
GammaBin038Gammaproteobacteria60.575.80610993.8983.17122160 894
GammaBin045Gammaproteobacteria60.121.96281672.4922.7311229308
GammaBin063Gammaproteobacteria43.851.44209722.480.79507197 530
GammaBin076Gammaproteobacteria50.353.06337895.778.6358844 851
GammaBin082Gammaproteobacteria42.313.44364788.175.2556552 724
GammaBin093Gammaproteobacteria38.544.19449697.063.9847690 958
FlavoBin054Flavobacteriia40.701.45230317.412.4781227 641
FlavoBin072Flavobacteriia30.742.76371667.0722.79113821 266
FlavoBin087Flavobacteriia29.743.28373778.8637.6354333 162
EpsilonBin027Epsilonproteobacteria31.353.20446491.1240.00139917 806
EpsilonBin032Epsilonproteobacteria38.781.83203596.676.3522662 498
EpsilonBin033Epsilonproteobacteria35.691.33188333.035.3466922 159
EpsilonBin053Epsilonproteobacteria38.021.49169495.904.1724333 985
EpsilonBin071Epsilonproteobacteria38.652.82386551.4018.20118437 928
DeltaBin002Deltaproteobacteria62.333.07271698.211.2126889 342
DeltaBin003Deltaproteobacteria56.122.31208194.192.80105128 268
DeltaBin016Deltaproteobacteria72.276.62480189.524.19561102 686
DeltaBin031Deltaproteobacteria54.462.27292464.5811.80110210 516
DeltaBin044Deltaproteobacteria51.082.66284395.2411.9244154 886
DeltaBin048Deltaproteobacteria50.602.90377477.6038.52123824 934
DeferriBin019Deferribacteres46.165.665252100.0059.31160040 272
CaldiBin024Caldilineae58.615.11497899.098.58766132 167
AnaeroBin020Anaerolineae61.544.25498696.5526.33147726 958
AlphaBin023Alphaproteobacteria61.162.39306286.319.0495016 719
AlphaBin068Alphaproteobacteria48.131.31169716.250.8651223 917
ActinoBin026Actinobacteria71.812.90322994.0212.5456635 545

Abbreviations: Compl., completeness; Cont., contamination; OTU, operational taxonomic unit.

T-RFLP and qPCR results both indicate that Group I mat communities were less diverse than Group II or Group III. This is again corroborated by coverage analysis of the genome bins, where communities from Group I have the lowest diversity and Group III had the highest diversity. Zetaproteobacteria genome bins were still present, though as minor community members, in the representative Group III communities (Figure 4). Group III also had higher coverage estimates within the Nitrospira, Gamma-, Epsilon- and Alphaproteobacteria. Zetaproteobacteria genome bins had the highest coverage in the Group I mat communities. The bacterial taxa distribution observed in the clone libraries is consistent with that estimated by genome binning from metagenomics (Figure 4; Supplementary Figure 1).
Figure 4

Relative abundance estimated by read coverage of taxonomically-classified genome bins from the sampled microbial communities. Genome bins were assessed for taxonomic classification using CheckM.

Carbon utilization

All isolates of Zetaproteobacteria grow via lithoautotrophy and encode for the RubisCO protein for carbon fixation from CO2. Mariprofundus ferrooxydans PV-1, M. ferrooxydans JV-1, M. ferrooxydans M34 and Mariprofundus DIS-1 encode for both Form I and Form II large subunit RubisCO gene, whereas Zetaproteobacterium TAG-1 and Mariprofundus sp. EFK-M39 only encodes a Form II RubisCO (Field ). In total, 87 genes were identified as the large subunit of RubisCO. Of these, 67 were Form II and 11 were Form I. Of the Form I genes, only one was binned into a Zetaproteobacteria genome bin (ZetaBin022). The majority of the RubisCO Form II genes belonged to Zetaproteobacteria and 28 of the Zetaproteobacteria genome bins encoded a Form II gene, including the bin with the Form I gene (ZetaBin022). The Gammaproteobacteria had the second highest abundance of RubisCO genes, with four Form I and sixteen Form II genes detected. Twenty-three of the RubisCO genes were not placed into genome bins, but nine of these had the highest similarity to Zetaproteobacteria genes and six were most similar to Gammaproteobacteria (Supplementary Table 5). In comparison, only seven ATP citrate lyase (encoded by aclB) genes were identified. This is a key gene in the reductive tricarboxylic acid cycle and is found in autotrophic Epsilonproteobacteria and Aquifacales (Hügler and Sievert, 2011). Five of the seven aclB genes were binned into Epsilonproteobacteria genome bins (Supplementary Table 6). The closest taxonomic hits were to Sulfurovum sp AR, Sulfurimonas autotrophica OK10 and Nitratiruptor sp SB155-2. Two of these organisms, S. autotrophica OK10 and Nitratiruptor sp. SB155-2, were isolated from Iheya North hydrothermal field sediments and chimneys, respectively (Sikorski ; Inoue ). Sulfurovum sp. AR was isolated from deep marine sediments collected near Svalbard, within the Arctic Circle (Park ). There was a high diversity of Form II RubisCO proteins recovered from Zetaproteobacteria genome bins and unbinned proteins identified as Zetaproteobacteria by IMG (Figure 5; Supplementary Table 5). Many of these RubisCO proteins were most similar to RubisCO proteins from the Zetaproteobacteria SAGs belonging to zOTU 2. This zOTU was one of the two considered as cosmopolitan because it is found throughout the Pacific Ocean (McAllister ).
Figure 5

RAxML phylogenetic tree of the Form II RubisCO proteins from Zetaproteobacteria genome bins (orange) and unbinned proteins identified as Zetaproteobacteria by IMG (blue). Numbers within parenthesis are gene identification numbers. Bootstrap values (⩾50) are representative of 1000 iterations.

Targeted qPCR on RubisCO Form II (cbbM) showed high abundance of the gene correlated strongly with a high abundance of Zetaproteobacteria (Jesser ). The abundance of Form II RubisCO genes in comparison to Form I is indicative of adaptations to high CO2 and very low O2 environments (Hernandez ; Tabita ). The prevalence of Form II RubisCO in the genome bins of the Zetaproteobacteria (Table 2) shows an adaptation to growth in very low O2 environments similar to what is found in and around Pele’s Pit (Glazer and Rouxel, 2009). Zetaproteobacteria SAGs showed a similar pattern, in that Form I RubisCO was undetected (Field ). Only a single Zetaproteobacteria genome bin contained both forms of RubisCO, suggesting that genotypes containing only Form II are the most prevalent.
Table 2

Number of genes per Zetaproteobacteria genome bins. Bins were sorted by their closest zOTU as determined by both ANI and AAI

OTU by Closest SAGGenomeBincbbMccoNccoOcoxAcyc2 (PV-1)cyc1 (PV-1)arsCnirK/nirSnapAnarGnasABnirBnirDnifHamt
1ZetaBin042143 1133  111 2
1ZetaBin066142 2232  111 2
1ZetaBin0771641 23   121 2
1ZetaBin079242  131  121 2
1ZetaBin080142 113   111 2
1ZetaBin0841621524512252 7
1ZetaBin088 5412262  32  2
1ZetaBin092 3    323 211 2
1ZetaBin041131 1135  111 2
1ZetaBin043142 11311 111 1
2ZetaBin052 1    1        
2ZetaBin05511   13        
2ZetaBin056 11  11       1
2ZetaBin05711    1        
2ZetaBin0581     1   111  
2ZetaBin059142 112   211 2
2ZetaBin060    1          
2ZetaBin062 33 1     1   3
2ZetaBin064 31 1  4  11   
2ZetaBin065242 225       5
2ZetaBin069    1 1        
2ZetaBin078              1
2ZetaBin090 33 3 3 2 311  
4ZetaBin030               
4ZetaBin04733 3 121   21 6
4ZetaBin0371  11 21    1 2
4ZetaBin0502322214    2  4
6ZetaBin040131 11311 111 3
9ZetaBin089251221211 42 112
9ZetaBin091131 11  1 211 1
10ZetaBin04912135 1    11 1
11ZetaBin01113122121   21 2
11ZetaBin02222 2  12  121 2
11ZetaBin035131 4 2 1  3 12
Bin Total34232822102219281581192319225
Gene totals 3091421841247132112303720271

Abbreviations: AAI, average amino acid identity; ANI, average nucleotide identity; OTU, operational taxonomic unit; SAG, single amplified genomes; zOTU, zetaproteobacteria OTU.

Nitrogen cycling

Biological nitrogen fixation is a key process in any ecosystem. The gene nifH encodes the nitrogenase reductase subunit, and is commonly used to track abundance and diversity among nitrogen-fixing organisms (Gaby and Buckley, 2012). Of the Zetaproteobacteria, Mariprofundus sp. EKF-M39, DIS-1 and M. ferrooxydans M34 encode a nifH gene, and qPCR estimates showed very low occurrence of nifH in microbial mat communities from Lō’ihi Seamount (Jesser ). Consistent with this notion, only eleven nifH genes were identified and only two of these were within Zetaproteobacteria genome bins (ZetaBin035 & ZetaBin089). ZetaBin089 also encodes for nifD and nifK, the nitrogenase alpha and beta subunits, respectively. These genes are encoded on the same contig and are syntenous with the other identified nifH-containing Zetaproteobacteria isolates (Supplementary Figure 3). ZetaBin035 is lacking the alpha and beta subunits, but encodes the dinitrogenase iron-molybdenum cofactor, which is involved in the synthesis of the iron-molybdenum cofactor that binds the active site of the nitrogenase enzyme. Based on these annotations, it appears that these two Zetaproteobacteria bins (ZetaBin035 and ZetaBin089) are potentially capable of nitrogen fixation. Diverse nifH genes have been identified at Axial Seamount, located along the Juan de Fuca Ridge (Mehta ) and interestingly, ammonium has been detected to similar levels as found at Lō’ihi microbial mats, where nifH genes were either below detection or at very low abundance (Jesser ). Ammonium transport proteins (amt) were found in 26 of the Zetaproteobacteria genome bins, including the genome bins that encode a nifH (Table 2). Use of nitrate and/or nitrite as a nitrogen source appears to be the most common across the Zetaproteobacteria genome bins. The majority of the Zetaproteobacteria genome bins contained genes for nitrate reduction (nasAB) and/or nitrite reductase (nirBD) for the assimilation of nitrogen. The dissimilatory nitrate reductase (napAB) and nitrite reductase (nirK/nirS) were also identified in 18 of the Zetaproteobacteria genome bins (Table 2) showing that denitrification is also possible. The prevalence of the ammonium transport proteins, presence of assimilatory nitrogen pathways, and the low recovery of nifH suggest that Zetaproteobacteria rely more on the presence of bioavailable nitrogen compounds accessed from the environment, rather than by dinitrogen fixation.

Arsenic cycling

Arsenic has been found at hydrothermal vents and the arsenic detoxification gene, arsenate reductase (encoded by arsC) has been identified in abundance in microbial mats from Lō’ihi hydrothermal habitats (Jesser ). ArsC reduces arsenate to arsenite, which can then be exported from the cell via an arsenite specific transporter. In the composite assembly, there were 195 identified arsC genes in 67 of the genome bins representing every taxonomic class. The majority of the binned arsC genes were contained within either the Zetaproteobacteria or Gammaproteobacteria genome bins, with 71 and 28 gene copies, respectively. Of the identified arsC genes that were unbinned, taxonomic placement by IMG shows these genes to again be similar to genes from Zetaproteobacteria and Gammaproteobacteria. Arsenite transport proteins were identified in 23 of the Zetaproteobacteria genome bins. All of the Zetaproteobacteria genome bins with an arsenite transport protein contained an arsenate reductase as well. At Tutum Bay, a shallow water hydrothermal vent system, ~1.5 kg of arsenic per day is released into the environment (Meyer-Dombard ). This system also releases reduced iron, and Zetaproteobacteria were shown to heavily colonize slides incubated in situ. Although arsenic geochemistry has yet to be recorded at Lō’ihi vents, the abundance of arsenic-related genes found in our composite assembly suggests that arsenate is abundant in this environment. However, to show this, further geochemical analysis targeting arsenic redox states at Lō’ihi would be required.

Electron transport chain

Zetaproteobacteria SAGs and isolate genomes encode for a cbb3-type cytochrome c oxidase (Field ; Fullerton ). M. ferrooxydans PV-1, encodes for subunits I–III (ccoNOP) and appears to be lacking subunit IV (ccoQ) according to Singer . Only the CcoNO subunits were identified in the proteomic profile of M. ferrooxydans PV-1 (Barco ). Eleven of the 34 Zetaproteobacteria bins encode all four subunits of the cbb3-type cytochrome c oxidase. Mariprofundus sp. EKF-M39, DIS-1, M. ferrooxydans JV-1 and six of the Zetaproteobacteria SAGs encode all four subunits of the cbb3-type cytochrome c oxidase. Nine of the Zetaproteobacteria genome bins encode for subunits I–III and appear to lack subunit IV. The ccoQ gene product is a membrane-spanning protein of unclear function; ccoN gene encodes for the catalytic subunit and ccoO, a monoheme c-type cytochrome. Only the ccoNO subunits are common to all gene clusters across multiple bacterial phyla (Ducluzeau ). The cbb3-type cytochrome c oxidase has a high affinity for O2 and is predominately used under microaerophilic conditions and may also be used to prevent O2 poisoning (Sievert ; Jewell ). The aa3-type cytochrome c oxidase is encoded by coxABC where expression is repressed in facultative anaerobes under low oxygen conditions (Pitcher and Watmough, 2004). Ten of the Zetaproteobacteria genome bins contain the coxA gene (aa3-type cytochrome c oxidase), and all but one of these genome bins encodes for the ccoNOP (cbb3-type cytochrome c oxidase) as well. This suggests that like other facultative anaerobes and microaerophiles, Zetaproteobacteria are able to modulate their electron transport chain to account for variable oxygen conditions. Only one of the 24 Zetaproteobacteria SAGs encodes both types of the cytochrome c oxidases (Field ). There is no direct evidence that Zetaproteobacteria can grow anaerobically using nitrate as the terminal electron acceptor; however, a number of other iron-oxidizing Proteobacteria can grow anaerobically this way (Hedrich ; Beller ). In the Zetaproteobacteria genome bins there was one bin, ZetaBin084, which encoded the respiratory nitrate reductase, NarG. This genome bin also encodes the cbb3 and aa3 type cytochrome c oxidases, that is, both the ccoNO and coxA genes. Iron oxidation is hypothesized to occur on the outer membrane and is coupled to cytoplasmic and membrane-bound electron transfer proteins (Hedrich ; Ilbert and Bonnefoy, 2013). From M. ferrooxydans PV-1 genome analysis, a molybdopterin oxidoreductase (Mob, SPV1_03948) was hypothesized to be important in Fe(II) oxidation (Singer ), and showed synteny with two contigs contained in a fosmid library generated from a suction-sample collected from Hiolo South (Singer ). This protein was also identified in the top 25 most abundant proteins of M. ferrooxydans PV-1; however, its function in iron oxidation is questionable due to high similarity to proteins found in non-iron oxidizers (Barco ). In the Zetaproteobacteria genome bins, similar proteins were detected and annotated by IMG as different molybdopterin-containing oxidoreductases (for example, nitrate reductase NapA; Supplementary Table 4). Proteomic analysis of M. ferrooxydans PV-1 revealed a membrane bound cytochrome that was highly expressed and distantly related to cytochrome c2 of Acidothiobacillus ferrooxydans (Barco ; White ). It has been proposed that this protein, referred to as Cyc2PV-1, is the site of electron transfer from iron to a cytoplasmic cytochrome (Cyc1PV-1), which was also identified as high-abundant by proteomic analysis. From Cyc1PV-1, electrons are hypothesized to be shuttled into a membrane-bound electron transport chain, terminating with the cbb3-type cytochrome c oxidase. Using the amino acid sequence of Cyc1PV-1 and Cyc2PV-1 to search the composite metagenome, 24 and 41 gene copies, respectively, were identified within the Zetaproteobacteria genome bins (Table 2). The open reading frames most similar to Cyc1PV-1 were annotated as cytochrome c553, whereas the Cyc2PV-1 genes were annotated as hypothetical proteins by IMG (Supplementary Tables 2 and 3). The identification of Cyc1PV-1 and Cyc2PV-1 in our Zetaproteobacteria genome bins, supports the hypothesis that a Cyc2-like protein is the site of iron oxidation, as opposed to the alternative hypothesis using the Mob protein (Hedrich ; Singer ; Ilbert and Bonnefoy, 2013; Barco ). These Cyc2-like proteins were identified in every zOTU detected, indicating their ubiquity across the Zetaproteobacteria, including within the ecologically significant taxa (Table 2).

Whole genome comparisons

In this composite metagenome study, there were 249 total SSU rRNA genes recovered. Of these, 41 were contained within Zetaproteobacteria genome bins as determined by CheckM and 37 SSU rRNA genes were identified as Zetaproteobacteria by the RDP classifier (Wang ; Parks ). Previous studies on Zetaproteobacteria SSU rRNA diversity identified two operational taxonomic units that were ubiquitous across the Pacific Ocean, referred to as zOTUs 1 and 2 (McAllister ). Genomes were compared at the nucleotide level to assess genomic diversity across the Zetaproteobacteria genome bins as compared to isolate genomes and SAGs (Figure 6) by ANI. Hierarchical clustering of the genomes based on ANI showed that genome bins most similar to zOTUs 1 and 2 are the most highly represented, with 10 and 13 out of the 34 Zetaproteobacteria genome bins, respectively. Based on Form II RubisCO phylogeny, these zOTUs constitute a single lineage that diverged more recently than any of those that occurred in other lineages (Figure 5). Both these zOTUs were also found to be the most abundant phylotypes detected in microbial mats from Lō’ihi hydrothermal habitats by SAGs and SSU clone library analyses (McAllister ; Field ). Based on the cluster analysis of ANI comparisons from our Zetaproteobacteria genome bins, this study has shown that zOTU 2 represents a monophyletic cluster and is distinct from all the other zOTU clusters (Figure 6), and based on estimated genome size hints, that genome streamlining may be occurring within this group. This zOTU was also the first to be identified from any hydrothermal system (Moyer ). Our whole genome cluster analysis also showed that zOTUs 1, 4, 6 and 10 have much greater genomic dissimilarity (that is, diversity) than what would be expected based on SSU rRNA identity alone. The distribution of Zetaproteobacteria genome bins across the three different groups of mat communities shows that zOTU 2 is the most abundant in both Group I and Group II (that is, both curds and veils) type mats based on gross morphology, representing twisted-stalks and sheaths, respectively. The Group III mats (streamers), which have a low Zetaproteobacterial abundance relative to the other members of the community, included zOTU 11 as the most highly represented within this mat-type (Supplementary Figure 4).
Figure 6

Hierarchical clustering heatmap and dendrogram of ANI of Zetaproteobacteria genome bins, and isolate Zetaproteobacteria genomes. Genome self-comparisons and where ANI could not be determined are presented in light gray. Genome bins were confirmed by average amino acid identity.

Using this hierarchical cluster analysis approach, patterns of metabolic potential across zOTUs can also be realized. The only two bins with a nifH gene (ZetaBin035 and ZetaBin089) were also most closely related to isolates that are able to fix nitrogen. All cultured isolates remain within the same tight cluster, including the type strain M. ferrooxydans PV-1, possibly indicating a narrow range of selection pressure resulting from our present culturing techniques. Furthermore, there were few Zetaproteobacteria genome bins with similarity to any cultured isolates, suggesting environmental parameters are poorly mimicked in the lab. In general, the RubisCO protein relationships and genome relationships identified by ANI were conserved (that is, similar). None of the genome bins within zOTU 2 contained genes for the aa3-type cytochrome c oxidase, further supporting adaptation to the low O2 levels found at Lō’ihi hydrothermal habitats.

Conclusions

Coverage analysis of our composite metagenome indicates that carbon is fixed primarily by Zetaproteobacteria containing Form II RubisCO. Through an assessment of the diversity of Form II RubisCO genes and the abundance of cbb3-type cytochrome c oxidase genes, many Zetaproteobacteria show an adaptation to life at very low oxygen levels in conjunction with high-ferrous iron and dissolved CO2 levels. Denitrification is less common, and our data indicates that bioavailable nitrogen is primarily metabolized. Nearly all of the Zetaproteobacteria genome bins contain genes for the detoxification of arsenate as well as representatives from each of the other classes that were detected in these microbial mat communities. This shows that metagenomics analyses can also provide insights into geochemical conditions. The lineage represented by zOTU 2 is monophyletic suggesting an ancestral bottleneck during its more recent evolutionary history. This zOTU is also the most prevalent of our Zetaproteobacteria genome bins, indicating it is the most ecologically successful manifestation of both sheath and stalk morphology. Through the use of genome-resolved metagenomics, we have better constrained patterns observed in metabolic potential and divergence across many of the Zetaproteobacteria growing within microbial mats at Lō’ihi Seamount.
  41 in total

1.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.

Authors:  Yu-Wei Wu; Blake A Simmons; Steven W Singer
Journal:  Bioinformatics       Date:  2015-10-29       Impact factor: 6.937

2.  Basin-scale transport of hydrothermal dissolved metals across the South Pacific Ocean.

Authors:  Joseph A Resing; Peter N Sedwick; Christopher R German; William J Jenkins; James W Moffett; Bettina M Sohst; Alessandro Tagliabue
Journal:  Nature       Date:  2015-07-09       Impact factor: 49.962

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

4.  Biodiversity and emerging biogeography of the neutrophilic iron-oxidizing Zetaproteobacteria.

Authors:  Sean M McAllister; Richard E Davis; Joyce M McBeth; Bradley M Tebo; David Emerson; Craig L Moyer
Journal:  Appl Environ Microbiol       Date:  2011-06-10       Impact factor: 4.792

5.  Lithotrophic iron-oxidizing bacteria produce organic stalks to control mineral growth: implications for biosignature formation.

Authors:  Clara S Chan; Sirine C Fakra; David Emerson; Emily J Fleming; Katrina J Edwards
Journal:  ISME J       Date:  2010-11-25       Impact factor: 10.302

Review 6.  Beyond the Calvin cycle: autotrophic carbon fixation in the ocean.

Authors:  Michael Hügler; Stefan M Sievert
Journal:  Ann Rev Mar Sci       Date:  2011

7.  Phylogenetic diversity of nitrogenase (nifH) genes in deep-sea and hydrothermal vent environments of the Juan de Fuca Ridge.

Authors:  Mausmi P Mehta; David A Butterfield; John A Baross
Journal:  Appl Environ Microbiol       Date:  2003-02       Impact factor: 4.792

8.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

9.  Draft Genome Sequence of Mariprofundus ferrooxydans Strain JV-1, Isolated from Loihi Seamount, Hawaii.

Authors:  Heather Fullerton; Kevin W Hager; Craig L Moyer
Journal:  Genome Announc       Date:  2015-10-08

10.  The Architecture of Iron Microbial Mats Reflects the Adaptation of Chemolithotrophic Iron Oxidation in Freshwater and Marine Environments.

Authors:  Clara S Chan; Sean M McAllister; Anna H Leavitt; Brian T Glazer; Sean T Krepski; David Emerson
Journal:  Front Microbiol       Date:  2016-06-01       Impact factor: 5.640

View more
  15 in total

Review 1.  Iron-oxidizing bacteria in marine environments: recent progresses and future directions.

Authors:  Hiroko Makita
Journal:  World J Microbiol Biotechnol       Date:  2018-07-04       Impact factor: 3.312

2.  Aerobic and anaerobic iron oxidizers together drive denitrification and carbon cycling at marine iron-rich hydrothermal vents.

Authors:  Sean M McAllister; Rebecca Vandzura; Jessica L Keffer; Shawn W Polson; Clara S Chan
Journal:  ISME J       Date:  2020-12-17       Impact factor: 10.302

3.  Physiological and ecological implications of an iron- or hydrogen-oxidizing member of the Zetaproteobacteria, Ghiorsea bivora, gen. nov., sp. nov.

Authors:  Jiro F Mori; Jarrod J Scott; Kevin W Hager; Craig L Moyer; Kirsten Küsel; David Emerson
Journal:  ISME J       Date:  2017-08-18       Impact factor: 10.302

4.  Genomic expansion of magnetotactic bacteria reveals an early common origin of magnetotaxis with lineage-specific evolution.

Authors:  Wei Lin; Wensi Zhang; Xiang Zhao; Andrew P Roberts; Greig A Paterson; Dennis A Bazylinski; Yongxin Pan
Journal:  ISME J       Date:  2018-03-26       Impact factor: 10.302

5.  Microbial analysis of Zetaproteobacteria and co-colonizers of iron mats in the Troll Wall Vent Field, Arctic Mid-Ocean Ridge.

Authors:  Jan Vander Roost; Ingunn Hindenes Thorseth; Håkon Dahle
Journal:  PLoS One       Date:  2017-09-20       Impact factor: 3.240

Review 6.  The Fe(II)-oxidizing Zetaproteobacteria: historical, ecological and genomic perspectives.

Authors:  Sean M McAllister; Ryan M Moore; Amy Gartman; George W Luther; David Emerson; Clara S Chan
Journal:  FEMS Microbiol Ecol       Date:  2019-04-01       Impact factor: 4.194

7.  Geochemical and Metagenomic Characterization of Jinata Onsen, a Proterozoic-Analog Hot Spring, Reveals Novel Microbial Diversity including Iron-Tolerant Phototrophs and Thermophilic Lithotrophs.

Authors:  Lewis M Ward; Airi Idei; Mayuko Nakagawa; Yuichiro Ueno; Woodward W Fischer; Shawn E McGlynn
Journal:  Microbes Environ       Date:  2019-08-14       Impact factor: 2.912

8.  Zetaproteobacteria Pan-Genome Reveals Candidate Gene Cluster for Twisted Stalk Biosynthesis and Export.

Authors:  Elif Koeksoy; Oliver M Bezuidt; Timm Bayer; Clara S Chan; David Emerson
Journal:  Front Microbiol       Date:  2021-06-18       Impact factor: 5.640

9.  Novel Pelagic Iron-Oxidizing Zetaproteobacteria from the Chesapeake Bay Oxic-Anoxic Transition Zone.

Authors:  Beverly K Chiu; Shingo Kato; Sean M McAllister; Erin K Field; Clara S Chan
Journal:  Front Microbiol       Date:  2017-07-18       Impact factor: 5.640

10.  Community Structure of Lithotrophically-Driven Hydrothermal Microbial Mats from the Mariana Arc and Back-Arc.

Authors:  Kevin W Hager; Heather Fullerton; David A Butterfield; Craig L Moyer
Journal:  Front Microbiol       Date:  2017-08-28       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.