Literature DB >> 31380022

Microbial communities across a hillslope-riparian transect shaped by proximity to the stream, groundwater table, and weathered bedrock.

Adi Lavy1,2, David Geller McGrath1, Paula B Matheus Carnevali1, Jiamin Wan2, Wenming Dong2, Tetsu K Tokunaga2, Brian C Thomas1, Kenneth H Williams2, Susan S Hubbard2, Jillian F Banfield1.   

Abstract

Watersheds are important suppliers of freshwater for human societies. Within mountainous watersheds, microbial communities impact water chemistry and element fluxes as water from precipitation events discharge through soils and underlying weathered rock, yet there is limited information regarding the structure and function of these communities. Within the East River, CO watershed, we conducted a depth-resolved, hillslope to riparian zone transect study to identify factors that control how microorganisms are distributed and their functions. Metagenomic and geochemical analyses indicate that distance from the East River and proximity to groundwater and underlying weathered shale strongly impact microbial community structure and metabolic potential. Riparian zone microbial communities are compositionally distinct, from the phylum down to the species level, from all hillslope communities. Bacteria from phyla lacking isolated representatives consistently increase in abundance with increasing depth, but only in the riparian zone saturated sediments we found Candidate Phyla Radiation bacteria. Riparian zone microbial communities are functionally differentiated from hillslope communities based on their capacities for carbon and nitrogen fixation and sulfate reduction. Selenium reduction is prominent at depth in weathered shale and saturated riparian zone sediments and could impact water quality. We anticipate that the drivers of community composition and metabolic potential identified throughout the studied transect will predict patterns across the larger watershed hillslope system.

Entities:  

Keywords:  metabolism; metagenomics; microbiology; riparian; soil; watershed

Year:  2019        PMID: 31380022      PMCID: PMC6662431          DOI: 10.1002/ece3.5254

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

Soil microbial communities impact our environment by driving biogeochemical cycles from centimeter to global scales (Rousk & Bengtson, 2014; Schimel & Schaeffer, 2012). They expedite rock weathering (Gorbushina, 2007; Krumbein, 1988) recycle organic material in the subsurface, and facilitate the growth of vegetation by altering the availability of nutrients in the soil (Wardle et al., 2004). These changes influence soil nutritional status and productivity and plant survival and biotic interactions. Mountains contribute the majority of water discharge in river basins (Viviroli, Weingartner, & Messerli, 2003) and were previously considered to be the origin of much of the world's water resources (Rodda, 1994). In recent years, studies have also addressed their contribution to subsurface carbon storage and carbon cycling (Chang et al., 2014; Hagedorn et al., 2010; Wan et al., 2018). These environments are comprised of a complex system of components, such as forests and meadows, floodplains, and glaciers. In turn, each of these accommodates various habitats including soil, bare rock, permafrost, and snow. Development of a predictive understanding of the behavior of such a heterogeneous and interconnected set of ecosystem compartments is an extremely complicated undertaking. Employing a scale‐adaptive approach in which different ecosystem compartments are considered as “systems within systems” could assist in disentangling the processes that shape overall mountain ecosystem function (Hubbard et al., 2018; Levin, 1992). A first step toward such a goal is to investigate structure and functioning within individual montane ecosystem compartments to provide a basis for future comparative studies and modeling efforts. In the long term, the “systems within systems” approach may better enable predictions accompanying natural or anthropogenic environmental perturbations. Hillslope and floodplain compartments host the majority of soils in alpine and subalpine mountain ecosystems, and biogeochemical processes that occur there impact downstream ecosystems. Runoff and groundwater transport solutes along the elevation gradient and into aquifers, rivers, and lakes. Soils on hillslopes and in floodplains, and in general, harbor considerable microbial diversity (Donhauser & Frey, 2018; Frey et al., 2016; Rime et al., 2014). Most studies of microbial communities in mountainous soils have been concerned with the microbial community structure across different climate zones on the mountain slopes (Bardelli et al., 2017; Djukic, Zehetner, Mentler, & Gerzabek, 2010; Klimek et al., 2015; Xu et al., 2014; Zhang, Liang, He, & Zhang, 2013). However, most work has focused only on shallow soil, down to 20 cm (Bardelli et al., 2017; Yuan, Si, Wang, Luo, & Zhang, 2014; Zhang et al., 2013) and sometimes only the top 5 cm (Singh et al., 2014). The shallow layer of soil is profoundly affected by low temperatures that frequently drop below 0°C and snow cover that crucially limits biological, chemical, and physical processes, and thus microbial life (Zumsteg, Bååth, Stierli, Zeyer, & Frey, 2013). In contrast, the deeper soils and weathered rock in mountain ecosystems have been little studied. While affected by events taking place in shallow layers, the microbial communities there are probably also influenced by moisture gradients and the geochemistry of the underlying bedrock (Tytgat et al., 2016). The East River headwaters catchment is a mountainous, high‐elevation watershed, dominated by the Cretaceous Mancos Shale Formation, with carbonate and pyrite contents of roughly 20% and 1%, respectively (Morrison, Goodknight, Tigar, Bush, & Gil, 2012). The watershed has a mean annual temperature of ~0°C, with average minimum and maximum temperatures of −9.2°C and 9.8°C, respectively. The watershed receives ~600 mm of precipitation per year, the bulk of which falls as snow, and is representative of many other headwaters systems within the upper Colorado River Basin (Hubbard et al., 2018; Pribulick et al., 2016). The present research focused on a lower montane hillslope through floodplain transect located within the East River, CO watershed, which is the focus of the Lawrence Berkeley National Laboratory‐led Watershed Function Project. The intensively studied site investigated in the current study is referred to as PLM (Pump House Lower Montane). The Watershed Function Project builds upon a scale‐adaptive investigation, which focuses on different spatial and temporal scales within the East River watershed, explores how mountainous watersheds retain and release water, nutrients, carbon, and metals downgradient (Hubbard et al., 2018). The current study aims to lay the groundwork for the scale‐adaptive, system within systems approach by identifying ecological niches of interest that would later be tested in a bottom‐up approach across the watershed. We hypothesize that microbial community composition and metabolic potential is similar among sites along an altitudinal transect down the hillslope and that hillslope communities differ from those of the floodplain riparian zone. Furthermore, we hypothesize that proximity to shale and groundwater will affect the composition and functionality of microbial communities, differentiating hillslope communities from other watershed microbial consortia.

METHODS

Site description and sample collection

The PLM intensive study site is located on the northeast facing slope of the East River valley near Crested Butte, Colorado, USA (38°55′12.56″N, 106°56′55.39″W) (Figures 1 and A1). Exact locations were determined at an accuracy of 0.5 m with a Trimble Geo 7X GPS. All samples were collected during three days in September 2016 from meadow sites before any intensive research activities were performed. The ground surface at each site was cleared of vegetation with a hand trawler prior to sampling. Samples were collected with a manual corer lined with 7.6 cm tall and 15.2 cm diameter bleached sterile plastic liners. Five soil profile sampling sites abbreviated PLM0, PLM1, PLM2, PLM3, and PLM4 were chosen along a 230 m hillslope transect. The profiles terminated at depth in the unsaturated zone, with the exception of PLM4, which extended below the water table. The base of PLM3 and PLM6 profiles is located near or within the weathered Mancos Shale bedrock, while the base of PLM0 was located >1 m above the weathered bedrock. PLM0 is at the top of the hill and PLM4 on the East River floodplain, 2,804 m and 2,757 m above sea level, respectively (Figure 1). One full core was taken at each sampling depth, and the soil in between sampling depths was removed with an auger. An additional site, PLM6, was sampled by drilling and provided access to weathered shale. Samples at PLM6 were taken from a split‐spoon, dry drilled core. In total, 20 samples were collected as follows: PLM0—5, 30, 60 cm; PLM1—5, 30, 60, 100 cm; PLM2—5, 30 cm; PLM3—5, 30, 60, 127 cm; PLM6—50, 170, 200 cm; PLM4—5, 32, 65, 90 cm.
Figure 1

East River Watershed hillslope‐riparian zone transect sampling sites. (a) The location of East River PLM intensive study site. (b) Five PLM sites are located across a hillslope transect. PLM0 is the highest point of the transect, and PLM4 is located in the floodplain. (c) Schematic representation of the sampling sites. Elevation of the surface, given in meters above sea level, appears below the name of the sampling site. Maximum depth at each sampling site is specified below the depiction of the sampled core in centimeters. Horizontal distances between sites are given at the bottom of the illustration. Maximum and minimum water levels are depicted by dashed blue and red lines, respectively. The PLM6 site was initially drilled for another study, 5 m from PLM3 but at the same elevation. A full view of the East River watershed is given in Figure A1

Figure A1

Illustration of the East River watershed. Star and flag indicate the town of Crested‐Butte and the location of the Pumphouse Lower Montane (PLM) sampling site, respectively

East River Watershed hillslope‐riparian zone transect sampling sites. (a) The location of East River PLM intensive study site. (b) Five PLM sites are located across a hillslope transect. PLM0 is the highest point of the transect, and PLM4 is located in the floodplain. (c) Schematic representation of the sampling sites. Elevation of the surface, given in meters above sea level, appears below the name of the sampling site. Maximum depth at each sampling site is specified below the depiction of the sampled core in centimeters. Horizontal distances between sites are given at the bottom of the illustration. Maximum and minimum water levels are depicted by dashed blue and red lines, respectively. The PLM6 site was initially drilled for another study, 5 m from PLM3 but at the same elevation. A full view of the East River watershed is given in Figure A1 Immediately after extraction, a sample from each site and depth collected within an individual sterile plastic liner was placed in a sterile Whirl‐Pak bag and manually homogenized. Aliquots of 5 g of soil from each bag were placed in 10 ml of LifeGuard Soil Preservation Solution for RNA and DNA co‐extraction, whereas the rest of the sample was used for DNA extraction. Care was taken to avoid roots and small rocks. Samples in sterile Whirl‐Pak bags and preservation solution were placed in a chilled cooler until processing at the Rocky Mountain Biological Laboratory (RMBL) later that day. In the laboratory, roots and small rocks were removed from sampling bags, and three 10 g subsamples were weighted from each sample and placed in a −80°C freezer. Samples were shipped overnight on dry ice to University of California, Berkeley for DNA and RNA extractions. Particle size analyses of samples were conducted according standard methods (Gee & Or, 2002). Geochemical measurements were made at the Earth and Environmental Sciences department's Aqueous Geochemistry Laboratory. Water soluble cation–anion composition was measured by water extraction (1:1 soil:DIW mass ratio) and ICP‐MS. Total inorganic carbon (TIC) and total organic carbon (TOC) in soil samples were determined using a Shimadzu TOC‐VCSH total inorganic and organic carbon analyzer combined with a solid sample combustion unit of SSM‐5000A. Total nitrogen (TN) was analyzed using a Shimadzu Total Nitrogen Module (TNM‐1) combined with the TOC‐VCSH analyzer. pH was measured with an uncertainty of ±0.05. For TIC/TOC and IC the uncertainty is <3% and <5%, respectively. All geochemical measurements for samples taken at PLM6, nitrate concentration for the sample from PLM0 30 cm, and sulfate concentrations for samples PLM0 40 cm, PLM1 60 cm, PLM1 90 cm, PLM2 5 cm, and PLM2 30 cm are not available.

DNA extraction and sequencing

DNA was extracted from 10 g of soil with DNeasy PowerMax Soil Kit in two batches of 5 g each which were combined during the cleaning step. Extraction process followed the manufacturer's protocol with the following modifications: Soil was vortexed at maximum speed for an additional 3 min in the sodium dodecyl sulfate reagent and then incubated for 30 min at 60°C, with intermittent shaking in place of extended bead beating, as established by Hug et al. (2015). For DNA precipitation, sodium acetate (1:10 v/v) and isopropanol (1:1 v/v) were added and samples were incubated overnight (−20°C). Following incubation, DNA was pelleted by centrifugation (15,300 g, 15 min, 4°C), washed with cold ethanol, and suspended in ddH2O. DNA was further cleaned with DNeasy PowerClean Pro Clean Up Kit following the manufacturer's protocol. DNA was also co‐extracted with RNA from 5 g of soil using RNeasy PowerSoil Total RNA Kit and Phenol:Chloroform:Isoamyl Alcohol 25:24:1 saturated with 10 mM Tris (final pH 8.0) and 1 mM EDTA. RNeasy PowerSoil DNA Elution Kit was used to collect DNA which was further cleaned using DNeasy PowerClean Pro Clean Up Kit. The co‐extraction and cleaning steps were conducted according to the manufacturer's protocol. While RNA was extracted for the purpose of another study, using co‐extraction as a second extraction method was expected to improve the detection of the total diversity of microbes in the sample (İnceoǧlu, Hoogwout, Hill, & Elsas, 2010). Overall, two DNA samples were produced from each sampling, one from DNA extraction and the second from the DNA that was co‐extracted along with RNA. A third DNA sample was extracted from the 90 cm deep PLM4 sample; thus, a total of 41 DNA samples were used for further analysis. Metagenomic libraries were prepared at the Joint Genome Institute (JGI) after validating concentrations and DNA integrity using Qubit (Thermo Fisher Scientific) and gel electrophoresis, respectively. Libraries were prepared using NEB's Ultra DNA Library Prep kit (New England Biolabs) for Illumina with Ampure XP bead selection aimed to give fragments of 500 base‐pair (bp) according to the manufacturer's protocol. The library was sequenced at JGI using an Illumina Hiseq 2500, resulting in paired‐end, 150 bp sequences.

Bioinformatic analyses

Raw reads processing followed protocols described elsewhere (Hernsdorf et al., 2017). Briefly, reads were trimmed based on quality scores with Sickle (Joshi & Fass, 2011) and assembly was accomplished with IDBA‐UD v1.1.1 (Peng, Leung, Yiu, & Chin, 2012) using kmer size range of 40–140. Only assembled scaffolds with >1 kbp were included in downstream analysis. Open reading frames were identified by Prodigal v2.6.3 (Hyatt et al., 2010) using the metagenomic setting. Microbial community structure was assessed according to the abundance of the ribosomal protein S3 (rpS3) marker gene (Brown et al., 2015) by modifying the method described by Anantharaman et al. (2016). Archaeal, eukaryotic, and bacterial rpS3 protein sequences were identified using Hidden Markov Models (HMM) (Finn et al., 2015). Ten rpS3 reference sequences which compose TIGRFam's TIG01009 model were added to the protein sequences that were identified by HMMs and aligned with MAFFT (Katoh & Standley, 2013). Positions within the alignment with >95% gaps were removed, leaving 206 amino acids in the longest, nonreference sequence. Sequences that had less than 103 nongap positions (50% of overall nongap positions) were removed from the analysis. This step ensured that only positions that are truly related to the sequence of rpS3 were included in downstream analysis. The amino acid sequences were clustered with the cluster_fast algorithm from USEARCH software (Edgar, 2010) at a 99% similarity threshold, and the following settings: query_cov = 1, target_cov = 0.5, and both max_accept and max_reject set to 0. Scaffolds of DNA sequences that matched the clusters’ open reading frames were retrieved from the metagenomes. Average coverage was used as a proxy for relative abundance of different sequence types. In this analysis, the scaffolds were trimmed to include 2 kbp flanking the rpS3 gene. If the scaffold spanned less than 2 kbp on both sides, then the entire scaffold was kept, with a minimal length of 1 kbp. The relative abundance of each trimmed scaffold was determined by mapping the reads from each sample to each trimmed scaffold with bowtie2 (Langmead & Salzberg, 2012). The average coverage and breadth of coverage of each scaffold in each sample was then calculated (Olm et al., 2017). Each scaffold is considered to be present in at least one sample (at minimum, the sample from which it was originally assembled) but could be falsely identified in other samples due to a low breadth cutoff (i.e., false positive). Therefore, we implemented a breadth cutoff of 0.72 based on iterating breadth cutoffs of 0.1 to 1, to find the lowest breadth cutoff that would retain the same number of clusters as went into the analysis. The abundance of organisms at each site was calculated as the average abundance for the two samples (or three in the case of PLM4 at 90 cm) extracted from that site. Genes involved in carbon, nitrogen, and sulfur metabolism were identified using 86 previously published HMM models (Anantharaman et al., 2016), and KEGG KOfam database (Aramaki et al., 2019) (Table A1). Additionally, srdA which encodes for a membrane‐bound catalytic subunit of selenate reductase was detected with a custom HMM model. The model was constructed by aligning 20 amino acid sequences, 934–1222 aa long, determined to be included in the srdA specific clade (Harel, Häggblom, Falkowski, & Yee, 2016). All matches from HMM search for srdA were aligned, and a threshold was decided upon according to their clustering in a phylogenetic tree. Score cutoffs for custom made and PFAM HMMs were manually validated and adjusted by aligning the HMM search results, plotting a phylogenetic tree using FastTree v2.1.9 (Price, Dehal, & Arkin, 2010), and interrogating clades with NCBI's BLASTP (Boratyn et al., 2013) against nr database. The abundance of each gene was determined by mapping the reads from each sample to each scaffold and calculating the average coverage using the same breadth cutoff as before.
Table A1

Description of Hidden Markov Models (HMMs) and their cutoffs

MetabolismGeneral functionGene symbolGene name/functionOriginHMM file nameCutoff score typeCutoff scoreE‐value cutoffLength cutoff (aa)
Aerobic respirationOxygen as electron acceptorqoxACytochrome aa3 quinol oxidase, subunit IITIGRFAMTIGR01432NC built‐in
Aerobic respirationOxygen as electron acceptorqoxBCytochrome aa3 quinol oxidase, subunit ITIGRFAMTIGR02882NC built‐in
Aerobic respirationOxygen as electron acceptorcydACytochrome bd terminal oxidase subunit IPFAMPF01654Domain251.00E‐20210
Aerobic respirationOxygen as electron acceptorcydBCytochrome d ubiquinol oxidase, subunit IITIGRFAMTIGR00203NC built‐in
Aerobic respirationOxygen as electron acceptorcoxACytochrome c oxidase, subunit ITIGRFAMTIGR02891NC built‐in
Aerobic respirationOxygen as electron acceptorcoxBCytochrome c oxidase, subunit IITIGRFAMTIGR02866NC built‐in
Aerobic respirationOxygen as electron acceptorccoNCytochrome c oxidase, cbb3‐type, subunit ITIGRFAMTIGR00780NC built‐in
Aerobic respirationOxygen as electron acceptorccoOCytochrome c oxidase, cbb3‐type, subunit IITIGRFAMTIGR00781NC built‐in
Aerobic respirationOxygen as electron acceptorccoPCytochrome c oxidase, cbb3‐type, subunit IIITIGRFAMTIGR00782NC built‐in
Aerobic respirationOxygen as electron acceptorcyoAUbiquinol oxidase, subunit IITIGRFAMTIGR01433NC built‐in
Aerobic respirationOxygen as electron acceptorcyoDCytochrome o ubiquinol oxidase subunit IVTIGRFAMTIGR02847NC built‐in
Aerobic respirationOxidation of CO to CO2 under aerobic conditionscoxLCarbon‐monoxide dehydrogenase, large subunitTIGRFAMTIGR02416NC built‐in
Aerobic respirationOxidation of CO to CO2 under aerobic conditionscoxMCarbon‐monoxide dehydrogenase, medium subunitCustom (Anantharaman et al. 2018)carbon_monoxide_dehydrogenase_coxMTotal1841.00E‐20150
Aerobic respirationOxidation of CO to CO2 under aerobic conditionscoxSCarbon‐monoxide dehydrogenase, small subunitCustom (Anantharaman et al. 2018)carbon_monoxide_dehydrogenase_coxSTotal1301.00E‐2080
C1 compoundsFormaldehyde oxidationfae‐hpsFormaldehyde‐activating enzymeTIGRFAMTIGR03126NC built‐in
C1 compoundsFormaldehyde oxidationmchMethenyltetrahydromethanopterin cyclohydrolaseTIGRFAMTIGR03120NC built‐in
C1 compoundsFormaldehyde oxidationfrmSS‐(hydroxymethyl)glutathione dehydrogenaseTIGRFAMTIGR02818NC built‐in
C1 compoundsFormaldehyde oxidationS‐(hydroxymethyl)mycothiol dehydrogenaseS‐(hydroxymethyl)mycothiol dehydrogenaseTIGRFAMTIGR03451NC built‐in
C1 compoundsFormaldehyde oxidationfghAS‐formylglutathione hydrolaseTIGRFAMTIGR02821NC built‐in
C1 compoundsMathanol oxidationmdoNDMA‐dependent methanol dehydrogenaseTIGRFAMTIGR04266NC built‐in
C1 compoundsMethane metabolismmauAMethylamine dehydrogenase light chainKofamK15228Total86.96
C1 compoundsMethane metabolismmauBMethylamine dehydrogenase heavy chainKofamK15229Total271.93
C1 compoundsMethane metabolismqhpAQuinohemoprotein amine dehydrogenaseKofamK08685Total435.83
C1 compoundsMethane oxidation, methanotrophmdh1_mxaFMethanol dehydrogenase Pyrroloquinoline quinoneCustom (Anantharaman et al. 2018)methanol_dehydrogenase_pqq_xoxF_mxaFDomain5501.00E‐20300
C1 compoundsMethane oxidation, methanotrophmmoBMethane monooxygenase regulatory protein BPFAMPF02406Total221.00E‐2080
C1 compoundsMethane oxidation, methanotrophmmoDSoluble methane monooxygenase‐binding protein MmoDTIGRFAMTIGR04550NC built‐in
C1 compoundsMethanogenesismcrAMethyl‐coenzyme M reductase, alpha subunitTIGRFAMTIGR03256NC built‐in
C1 compoundsMethanogenesismcrBMethyl‐coenzyme M reductase, beta subunitTIGRFAMTIGR03257NC built‐in
C1 compoundsMethanogenesismcrGMethyl‐coenzyme M reductase, gamma subunitTIGRFAMTIGR03259NC built‐in
C1 compoundsMethanogenesis CO2fhcDFormylmethanofuran‐‐tetrahydromethanopterin N‐formyltransferaseTIGRFAMTIGR03119NC built‐in
Carbon fixation3HP‐4HBabfD4‐hydroxybutyryl‐CoA‐dehydrataseCustom (Anantharaman et al. 2018)Four‐hydroxybutyryl‐CoA‐dehydratase1.00E‐20280
Carbon fixation3HP‐4HB4‐hydroxybutyryl‐CoA‐synthetase4‐hydroxybutyryl‐CoA‐synthetaseKofamK18593Total1233.23
Carbon fixation3HP/3HP‐4HBpropionyl‐CoA‐synthasePropionyl‐CoA‐synthaseKofamK15018Total1311.9
Carbon fixationCalvin non‐phototrophicrubisco form IRubisco form ICustom (Anantharaman et al. 2018)rubisco_form_ITotal5001.00E−20220
Carbon fixationCalvin non‐phototrophicrubisco form IIRubisco form IICustom (Anantharaman et al. 2018)rubisco_form_IITotal5001.00E−20220
Carbon fixationReductive TCAaclAATP citrate lyase ACustom (Anantharaman et al. 2018)ATP_citrate_lyase_aclATotal2151.00E−20300
Carbon fixationReductive TCAaclBATP citrate lyase BCustom (Anantharaman et al. 2018)ATP_citrate_lyase_aclBTotal1771.00E−20200
Carbon fixationWood‐Ljungdahl pathwaycodhCCO dehydrogenase/acetyl‐CoA synthase, beta subunitKofamK14138Total930.4
Carbon fixationWood‐Ljungdahl pathwaycodhDCO dehydrogenase/acetyl‐CoA synthase, delta subunitTIGRFAMTIGR00381NC built‐in
Carbon fixationWood‐Ljungdahl pathwayfdhAFormate dehydrogenase, alpha subunitTIGRFAMTIGR01591NC built‐in
Carbon fixationWood‐Ljungdahl pathwayfdhBFormate dehydrogenase, beta subunitTIGRFAMTIGR01582NC built‐in
IronMetal (Iron/Manganese) oxidation/reductionmtrCDecaheme c‐type cytochromeTIGRFAMTIGR03507NC built‐in
IronMetal (Iron/Manganese) oxidation/reductionmtrBDecaheme c‐type cytochromeTIGRFAMTIGR03509NC built‐in
NitrogenAnammoxhzoHydrazine oxidaseCustom (Anantharaman et al. 2018)hydrazine_oxidase_hzoATotal3251.00E−20160
NitrogenAnammoxhzsHydrazine synthaseCustom (Anantharaman et al. 2018)hydrazine_synthase_hzsATotal4661.00E−20400
NitrogenAnammoxnirSNitrite reductaseCustom (Anantharaman et al. 2018)nitrite_reductase_nirSDomain2001.00E−20280
NitrogenDenitrificationnirKNitrite reductase, copper‐containingTIGRFAMTIGR02376NC built‐in
NitrogenDenitrificationnorBNitric oxide reductase subunit BCustom (Anantharaman et al. 2018)nitric_oxide_reductase_norBTotal801.00E−20230
NitrogenDenitrificationnorCNitric oxide reductase subunit CCustom (Anantharaman et al. 2018)nitric_oxide_reductase_norCDomain501.00E−2075
NitrogenDenitrificationnosZNitrous‐oxide reductase, Sec‐dependentTIGRFAMTIGR04246NC built‐in
NitrogenDissimilatory nitrate reductionnapAPeriplasmic nitrate reductase, large subunitTIGRFAMTIGR01706NC built‐in
NitrogenDissimilatory nitrate reductionnapBPeriplasmic nitrate reductase, diheme cytochrome c subunitPFAMPF03892Total251.00E−2080
NitrogenDissimilatory nitrate reductionnarGNitrate reductase, alpha subunitTIGRFAMTIGR01580NC built‐in
NitrogenDissimilatory nitrate reductionnarHNitrate reductase, beta subunitTIGRFAMTIGR01660NC built‐in
NitrogenDissimilatory nitrate reductionnirBNitrite reductase [NAD(P)H], large subunitTIGRFAMTIGR02374NC built‐in
NitrogenDissimilatory nitrate reductionnirDNitrite reductase [NAD(P)H], small subunitTIGRFAMTIGR02378NC built‐in
NitrogenDissimilatory nitrate reductionnrfHCytochrome c nitrite reductase, small subunitTIGRFAMTIGR03153NC built‐in
NitrogenNitrificationpmoA‐amoAMethane/ammonia monooxygenase subunit AKofamK10944Total192.77
NitrogenNitrificationpmoB‐amoBMethane/ammonia monooxygenase subunit BKofamK10945Total161.43
NitrogenNitrificationpmoC‐amoCMethane/ammonia monooxygenase subunit CKofamK10946Total152.7
NitrogenNitrification‐comammoxnxrANitrite oxidoreductase alpha subunitCustom (Anantharaman et al. 2018)nitrite_oxidoreductase_nxrATotal3501.00E−20500
NitrogenNitrification‐comammoxnxrBNitrite oxidoreductase beta subunitCustom (Anantharaman et al. 2018)nitrite_oxidoreductase_nxrBDomain2501.00E−20200
NitrogenNitrogen FixationnifDNitrogenase molybdenum‐iron protein alpha chainTIGRFAMTIGR01282NC built‐in
NitrogenNitrogen FixationnifHNitrogenase iron proteinTIGRFAMTIGR01287NC built‐in
NitrogenNitrogen FixationnifKNitrogenase molybdenum‐iron protein beta chainTIGRFAMTIGR01286NC built‐in
NitrogenNitrous oxide reductionnosDNitrous oxide reductase family maturation proteinTIGRFAMTIGR04247NC built‐in
SulfurDissimilatory sulfate reduction and sulfide oxidationaprAAdenylylsulfate reductase, alpha subunitTIGRFAMTIGR02061NC built‐in
SulfurDissimilatory sulfate reduction and sulfide oxidationdsrASulfite reductase alpha subunitTIGRFAMTIGR02064NC built‐in
SulfurDissimilatory sulfate reduction and sulfide oxidationdsrBSulfite reductase beta subunitTIGRFAMTIGR02066NC built‐in
SulfurDissimilatory sulfate reductiondsrDDissimilatory sulfite reductase DPFAMPF08679Total501.00E−2030
SulfurDissimilatory sulfate reduction and sulfide oxidationsatSulfate adenylyltransferaseTIGRFAMTIGR00339NC built‐in
SulfurSulfate reductionasrASulfite reductase, subunit ATIGRFAMTIGR02910NC built‐in
SulfurSulfate reductionasrBSulfite reductase, subunit BTIGRFAMTIGR02911NC built‐in
SulfurSulfate reductionasrCSulfite reductase, subunit CTIGRFAMTIGR02912NC built‐in
SulfurSulfide oxidationfccFlavocytochrome c sulfide de­≠hydrogenaseKofamK17230Domain89.1
SulfurSulfide oxidationsdoSulfur dioxygenaseCustom (Anantharaman et al. 2018)sulfur_dioxygenase_sdoTotal1701.00E−20110
SulfurSulfide oxidationsqrSulfide quinone oxidoreductaseCustom (Anantharaman et al. 2018)sulfide_quinone_oxidoreductase_sqrTotal2701.00E−20200
SulfurThiosulfate OxidationsoxBThiosulfohydrolase SoxBTIGRFAMTIGR04486NC built‐in
SulfurThiosulfate OxidationsoxCSulfite dehydrogenaseTIGRFAMTIGR04555NC built‐in
SulfurThiosulfate OxidationsoxDS‐disulfanyl‐L‐cysteine oxidoreductaseKofamK22622Domain133.73
SulfurThiosulfate OxidationsoxYAhiosulfate oxidation carrier protein SoxYTIGRFAMTIGR04488NC built‐in
SulfurThiosulfate reductionphsAAhiosulfate reductase / polysulfide reductase chain AKofamK08352Total516.13
ArsenicArsenite oxidationaoxAArsenite oxidase, small subunitTIGRFAMTIGR02694NC built‐in
ArsenicArsenite oxidationaoxBArsenite oxidase, large subunitTIGRFAMTIGR02693NC built‐in
ArsenicArsenite reductionarsC( glutathione/glutaredoxin type)Arsenate reductase, glutathione/glutaredoxin type, arsCTIGRFAMTIGR02689NC built‐in
ArsenicArsenite reductionarsC (glutaredoxin)ArsC (glutaredoxin)TIGRFAMTIGR00014NC built‐in
SelenateSelenate reductionsrdASelenate reductase subunit ACustomsrdATotal7681.00E−20500

Taxonomy and phylogeny

The longest amino acid sequence from each rpS3 protein sequence cluster was selected as a representative and was compared to a database of rpS3 protein sequences (Hug, Baker, et al., 2016; Hug, Thomas, et al., 2016) using the UBLAST function in USEARCH (Edgar, 2010). Results were filtered to include only the top hits with e‐values < 1e−5. While each cluster roughly correlates with a species, not all clusters could be taxonomically identified to that level. Therefore, further investigation relied on phylogenetic distance, which enables a high‐resolution analysis. A phylogenetic tree was created by aligning only the representative amino acid sequences using MAFFT with an automated strategy (Katoh & Standley, 2013) and trimming noninformative positions. A maximum‐likelihood tree was constructed on CIPRES (Miller, Pfeiffer, & Schwartz, 2010) with RAxML (Stamatakis, 2014), using the LG substitution model and bootstrapping, allowing the software to halt bootstrapping once it reached a consensus. The Eukaryote domain branch was set as root, and the tree was manually inspected for errors. The phylogenetic tree along with rpS3 gene abundance heatmap were visualized with iTol v4.2.3 (Letunic & Bork, 2016).

Statistics

Statistical analysis was conduct in R v3.4.3 (R Development Core Team, 2012) and Rstudio v1.1.423 (Rstudio Team, 2015). Abundance plots, ordinations and UniFrac calculations were conducted with Phyloseq v1.22.3 (McMurdie & Holmes, 2013). The abundance of each rpS3 cluster was corrected for uneven sequencing depth across samples by multiplying the coverage value for each sample by a factor calculated as the ratio of the number of bp in the largest sample divided by the number of bp in that sample. Factor selection of soil chemistry was carried with BIOENV (Clarke & Ainsworth, 1993) as implemented in the bio.env function from Vegan v2.4.6 (Oksanen et al.., 2018), with a Euclidean distance method and Bray–Curtis matrix. The exhaustive search for correlation between community dissimilarities and environmental distances requires extremely long time. Therefore, dissimilarities were partialled out when inspecting variables as recommended by the bioenv user's manual (Oksanen et al., 2018). The results were evaluated with Pearson's correlation. The significance of the results was validated with Mantel test also using Pearson's correlation. Maps were retrieved from Google maps database using Google Earth v7.3.2.

RESULTS

For the hillslope samples analyzed, the soils are loamy to silty loam (Figure A2 and Table A3). Shallow samples from PLM0 and PLM1 have higher sand content than downslope PLM3 and PLM4 samples, which have higher content of clay and silt, potentially as a result of downslope fining of transported sediments. Soil moisture increases with proximity to the East River, but decreases with depth (Figure A3 and Table A4). An exception to this is at the floodplain, where moisture increases close to the water table (72 cm below the ground surface at the time of sampling). The hillslope meadow is dotted with smooth brome (Bromus inermis) and lupines (Lupine sp.); however, neither occurred within a 50 cm radius of the sampling sites (qualitative assessment on site). In contrast, the floodplain is dominated by willows and sedges that are not present on the hillslope. Gopher activity increases downslope, but does not occur at the floodplain location (W. Brown, personal communication, February 2018).
Figure A2

Soil texture. All soils are categorized as silty‐loam (A) with the greatest variability being 20‐60% sand content. PLM0 and PLM1 are found to contain more sand than in PLM3 and PLM4 (B) and sand content is correlated with the PC1 which explains almost 80% of the variability between the samples

Table A3

Soil texture

SiteDepth (cm)Sand (%)Silt (%)Clay (%)
PLM0536.84023.2
1024.852.123.2
2032.84918.2
3041.639.718.8
4350.329.720
5041.338.120.6
6047.53319.6
6945.732.521.8
7530.650.419
8524.44827.6
PLM1531.153.415.5
1533.651.714.8
2433.14917.9
3439.145.715.2
4440.247.812
5343.641.914.5
6425.251.723.1
7524.852.422.8
8524.85025.2
9535.14222.8
PLM2522.554.223.3
1522.252.425.4
2527.950.421.7
3526.352.721
4524.247.328.5
85846.245.8
PLM357.359.633.1
1527424
258.763.627.7
3513.358.528.2
4515.757.227.1
5514.854.230.9
6511.551.836.7
7521.448.230.3
8516.752.730.5
9526.446.527.1
10524.549.526
11523.151.425.5
12331.547.520.9
PLM4505545
150.856.542.7
2506238
350.163.536.4
452.562.135.3
550.555.344.2
6510.749.240.1
7542.337.120.6
Figure A3

Geochemistry measurements of elements that showed a correlation to microbial community structure. Data is missing for PLM6. Sulfate measurements are missing for PLM0 40 cm, PLM1 60 cm and 90 cm, PLM2 5 cm and 30 cm

Table A4

Soil chemistry data

PLM0PLM1PLM2PLM3PLM4
ID0.10.30.6850.050.340.641.030.050.350.050.350.651.310.050.350.650.95
Depth range, cm 5–1525–3564–730–1028–4058–70105–1100–1030–400–1030–4060–70127–1340–1030–4060–7090–100
Moisture (%)12.5610.2810.8628.9916.9415.8910.7332.0914.244.0524.7620.6618.4563.2935.3853.31saturated
pH 6.386.726.917.486.937.047.417.097.087.067.267.427.987.317.487.637.36
TIC/TOCICmg/L98.9949.5218.20144.4047.2436.8620.6244.3148.6051.8729.9424.8225.65102.4024.7634.8949.55
OCmg/L116.503.7410.29114.5051.4621.918.6428.1445.7635.7915.9712.623.2629.347.205.469.60
TNμg/L10620.00143.90766.5012930.006793.0011345.00719.502455.003001.002326.002292.001626.50944.542140.613452.341497.54935.52
ICP‐MSLippb3.301.181.303.973.253.633.140.581.015.544.324.095.142.951.711.885.32
Stdev0.470.360.780.410.570.840.270.610.330.400.580.910.450.430.770.660.56
Bppb102.4287.6154.83143.3467.0248.0034.9644.6125.0260.7628.1245.8120.8325.635.461.2313.42
Stdev2.262.921.542.422.231.941.011.140.940.971.711.251.281.410.881.541.24
Nappb1355.171133.332169.751816.301976.772735.242683.65839.473111.972658.832741.903129.775154.506459.354463.976382.019178.14
Stdev39.3215.9544.5620.7352.6037.4344.5911.7874.5851.1925.6270.14108.66106.31282.17133.33140.39
Mgppb33411.6212191.775449.1842057.4915046.9716979.047351.4910546.2014864.6715212.149823.327792.047064.3425202.625328.358832.2831457.56
Stdev536.25286.94162.25422.76288.02268.02129.02244.60191.04172.1651.0258.5884.68303.7333.8753.27242.35
Alppb1143.111488.09638.25593.642214.93234.68305.58374.28296.68466.68256.63461.61129.2993.93175.7280.4859.44
Stdev27.0515.9314.406.2810.295.322.136.733.508.052.344.942.401.094.211.600.90
Sippb17253.4011804.699636.0117851.0712967.587732.727140.038986.845724.309266.365754.885901.794709.978028.884108.634190.043793.60
Stdev305.91134.93206.3094.38145.83100.90128.97161.9054.5567.4656.6653.3848.3395.1012.5935.9135.60
Kppb69876.5235556.541730.93176493.9015073.276181.771809.7724831.162250.395233.841080.11851.573140.805944.42647.13787.932812.52
Stdev687.96461.0217.574529.23124.0784.3312.12286.688.2942.578.3916.0920.2461.574.853.2916.96
Vppb6.294.652.495.577.031.141.431.341.501.551.122.080.380.480.770.140.41
Stdev0.190.190.110.060.940.050.090.040.040.060.050.300.020.030.040.020.03
Crppb2.211.741.701.262.560.610.780.810.400.800.371.201.810.210.450.500.11
Stdev0.270.070.110.040.380.060.110.040.020.030.030.200.060.020.040.050.02
Mnppb1075.44132.50125.73692.1494.2010.3515.44199.87111.6164.243.676.931.872162.534.370.76224.09
Stdev4.560.640.734.730.640.040.151.870.550.540.050.250.046.570.090.021.20
Cappb74757.1929934.7412142.6592501.2635731.7438525.5716044.9840618.9355255.0352303.1033697.8127905.7825445.34130998.7532341.4249413.02328102.01
Stdev479.35349.9769.231988.91274.53407.09149.02312.49357.24503.92127.09153.70127.231438.3696.21215.213980.56
Feppb892.90870.632594.82430.171918.56255.761469.97253.59241.18537.94502.271105.93266.06121.72736.6940.7185.76
Stdev14.0219.85126.844.48147.8410.09156.482.967.686.5935.6055.676.170.5026.601.350.99
Seppb1.130.760.341.791.800.590.400.580.711.130.802.140.902.041.938.123.83
Stdev0.370.120.220.500.170.170.250.110.140.090.140.180.280.440.340.240.63
Tippb24.9417.2213.7716.1924.836.267.087.025.8510.855.717.964.494.484.673.353.18
Stdev1.050.440.650.530.360.450.320.320.320.530.320.400.400.210.310.270.20
Nippb8.944.405.1110.095.063.085.243.983.796.573.697.563.3614.865.815.6936.61
Stdev0.200.060.230.240.200.160.090.220.140.800.110.350.070.420.080.230.33
Coppb7.731.912.103.901.290.310.661.440.720.720.430.360.152.800.250.162.92
Stdev0.100.100.110.050.040.030.050.060.020.060.020.030.020.060.020.030.08
Cuppb12.706.577.9711.3912.264.718.624.413.9514.744.5911.906.087.0216.048.0012.89
Stdev0.150.120.100.310.370.190.330.110.161.550.170.170.130.280.330.220.18
Znppb45.0938.9038.0650.4248.2639.1747.9134.5333.1844.5135.2440.6095.6946.8078.5625.2958.99
Stdev0.880.570.730.870.821.190.720.560.676.830.550.670.910.661.291.210.78
Geppb0.020.040.040.000.060.020.020.020.020.020.010.040.010.010.030.020.07
Stdev0.020.030.010.010.020.020.020.020.020.020.020.020.010.020.020.020.03
Asppb3.231.561.394.242.210.731.511.480.902.050.541.560.381.851.120.231.21
Stdev0.070.070.040.050.210.050.060.060.040.070.050.110.020.110.050.050.06
Rbppb28.4120.281.0629.667.602.620.325.033.481.180.390.680.440.990.490.270.94
Stdev0.350.210.080.200.100.060.020.070.060.050.020.040.020.030.030.010.04
Srppb208.8284.3828.56288.78105.47113.9545.85131.05169.93194.08129.23103.4194.74442.94111.79160.94742.94
Stdev2.010.970.434.860.610.540.291.200.732.800.780.480.352.550.470.516.32
Zrppb1.160.851.230.591.490.690.760.670.680.700.660.690.460.640.480.390.82
Stdev0.220.070.230.050.100.040.040.010.030.030.030.030.070.030.020.010.04
Moppb1.350.610.343.022.220.932.521.000.731.150.691.244.003.611.250.6112.69
Stdev0.090.040.030.130.060.050.170.040.040.020.070.070.110.060.070.060.08
Agppb0.220.100.230.240.290.220.380.020.110.040.010.890.06N.D0.110.060.01
Stdev0.020.020.020.020.040.030.040.010.010.020.010.170.02N.D0.010.010.01
Cdppb0.280.110.130.390.450.180.240.190.190.290.130.240.050.780.150.320.35
Stdev0.030.020.040.060.030.020.050.040.020.070.050.060.050.140.030.050.04
Sbppb0.280.180.310.290.340.180.540.150.270.290.200.370.290.770.690.524.97
Stdev0.040.010.030.030.050.020.060.030.040.060.020.010.040.030.030.030.11
Csppb0.150.160.080.850.240.020.010.050.040.070.020.050.020.010.030.010.01
Stdev0.020.010.010.020.010.000.000.000.010.010.000.000.010.000.010.000.00
Bappb362.84144.1730.64388.07131.2990.9834.53136.4798.59129.9068.5762.3441.35216.8441.7488.78164.14
Stdev1.250.851.053.130.880.980.701.141.211.540.441.130.571.360.910.611.00
Euppb0.190.110.400.210.210.070.080.080.070.090.050.150.030.090.060.050.07
Stdev0.020.010.010.010.020.010.010.010.010.020.010.020.010.020.010.020.01
Pbppb1.391.406.713.083.310.552.120.740.381.280.741.370.360.181.890.260.18
Stdev0.040.050.130.200.140.100.080.030.030.470.040.020.020.030.060.020.01
Uppb0.410.220.220.760.560.140.140.100.240.190.150.260.203.350.130.308.28
Stdev0.020.020.030.030.030.010.010.020.010.010.010.010.020.080.020.030.18
Pppb1347.62706.52376.291965.34918.02433.73355.96438.01445.24554.69334.56381.55285.54408.68262.80340.614794.72
Stdev23.3524.3812.1633.4916.1314.996.0813.458.3117.4213.3417.1621.1416.209.4338.911246.07
ICSulfatemg/L1.711.310.904.604.13NA2.38NANA8.875.304.578.835.784.8319.93730.04
Nitratemg/L2.00NA2.311.183.5536.121.032.252.061.664.063.182.431.7710.494.741.10

1:1 (soil:DIW mass ratio) extraction. The original porewater was taken into account as a part of the total water mass.

Uncertainty for pH measurements ±0.05.

Uncrertainty for TIC/TOC <3.

Uncrertainty for anions IC <5%.

Assembling reads from 41 samples, comprising 610 Gbp of sequence data, resulted in 6.5 million scaffolds longer than 1 kbp (Table A2). On average, 27.8% (±11) of the reads could be mapped back to these scaffolds. This is an expected result given huge diversity in soil and the near flat nature of most of the rank abundance curve. The unassembled reads likely derive from the background of rare organisms in soil. Encoded on the assembled scaffolds, 3,536 rpS3 amino acid sequences were identified and clustered into 1,660 clusters (at 99% identity), representing 37 microbial phyla. In general, the microbial communities are dominated by bacteria (relative abundance 0.95 ± 0.03 SD). The most abundant phyla across all samples are Acidobacteria, Actinobacteria, Chloroflexi, and Proteobacteria, but their relative abundances vary considerably across samples and depths (Figure 2). Species of Verrucomicrobia and Gemmatimonadetes are abundant at sites high on the hillslope, but while Verrucomicrobia species abundance decreases with proximity to the river (Pearson's r = −0.707, p‐value < 0.001), the abundance of Gemmatimonadetes is correlated with both proximity to the river (Pearson's r = −0.652, p‐value < 0.001) and soil depth (Pearson's r = −0.568, p‐value < 0.001).
Table A2

Sequencing depth and assembly information

SiteSample nameDepth (cm)ExtractionSequencing depth (Gbp)# Reads% Reads mappedLongest scaffold (bp)# Scaffolds longer than 1 Kbp SRA accession #
PLM0PLM0_5_b15DNA only12.306854490488.034,76739,307SRX3939289
PLM0_5_coex5Co‐extracted14.129969433328.630,94951,831SRX3938852
PLM0_30_b130DNA only12.3628516631020.6134,772693,948SRX3939244
PLM0_30_coex30Co‐extracted12.2728424006023.1257,168111,705SRX3938851
PLM0_60_b160DNA only12.7108775866423.7562,722113,728SRX3939286
PLM0_60_coex_redo60Co‐extracted25.02416941737439.0272,482327,331SRX4020904
PLM1PLM1_5_b15DNA only14.318981550749.2153,31251,156SRX3939403
PLM1_5_coex_redo5Co‐extracted27.16918400250824.7216,442207,336SRX4020906
PLM1_30_b130DNA only10.8117500780616.0218,09163,823SRX3939421
PLM1_30_coex30Co‐extracted12.9978947946021.3198,543104,859SRX3938854
PLM1_60_b1_redo60DNA only19.13213018864443.2453,003250,231SRX4020708
PLM1_60_coex60Co‐extracted12.5838685534024.9237,742117,222SRX3939072
PLM1_100_b1100DNA only11.0547570525033.2345,507120,315SRX3939422
PLM1_100_coex100Co‐extracted7.8735592318819.6254,70655,119SRX3938897
PLM2PLM2_5_coex5Co‐extracted13.4089137705223.684,681104,106SRX4038478
PLM2_5_b15DNA only47.13331938927640.4641,460625,826SRX4394284
PLM2_30_coex30Co‐extracted21.21814282926239.0139,269266,109SRX4394281
PLM2_30_b130DNA only17.85612033570031.3103,781186,106SRX4394282
PLM3PLM3_5_b15DNA only11.6278017189021.8307,370108,605SRX3939400
PLM3_5_coex5Co‐extracted8.0355642410018.6108,69570,402SRX3938970
PLM3_30_b130DNA only12.8688858368031.5266,935155,033SRX3939453
PLM3_30_b230DNA only9.4286595568223.6139,83590,355SRX3939695
PLM3_60_b160DNA only9.6196676988427.6137,172101,753SRX3939332
PLM3_60_coex60Co‐extracted12.3058429050834.8326,827147,715SRX3938971
PLM3_127_b1127DNA only11.3807842133841.0452,947186,029SRX3939455
PLM3_127_b2127DNA only13.9139549319843.5495,994200,738SRX3939725
PLM6PLM3_1_50_b150DNA only10.6307298908228.4393,885104,399SRX3939694
PLM3_1_50_b250DNA only8.0195642159025.0168,54965,789SRX3939727
PLM3_1_170_b1170DNA only12.0018234066434.0304,240147,944SRX3939697
PLM3_1_170_b2170DNA only12.8148807603234.51,153,49216,1473SRX3939726
PLM3_1_200_b1200DNA only12.0028231892224.2168,592116,565SRX3939696
PLM3_1_200_b2200DNA only11.7178079408825.0168,582113,002SRX3939728
PLM4PLM4_5_b15DNA only14.3389832068825.2396,580145,893SRX3939604
PLM4_5_coex_redo5Co‐extracted22.55815297335840.9319,999347,329SRX4020905
PLM4_32_b132DNA only14.1609710678225.3276,908153,216SRX3939582
PLM4_32_coex_redo32Co‐extracted23.59716057310439.5126,662336,253SRX4020878
PLM4_65_b1_redo65DNA only26.28117871013659.0298,824481,900SRX4020689
PLM4_65_coex65Co‐extracted12.3628485276442.1350,418172,891SRX3938984
PLM4_90_b190DNA only13.7249386071016.0193,38289,201SRX3939618
PLM4_90_b290DNA only10.5737286963816.1239,23868,870SRX3939724
PLM4_90_coex90Co‐extracted12.3198486403414.0186,76874,487SRX3939033
Figure 2

Relative abundances of phyla. Results show that Verrucomicrobia decrease in abundance with increasing depth and proximity to the floodplain site PLM4; Rokubacteria, on the other hand, show the opposite pattern

Relative abundances of phyla. Results show that Verrucomicrobia decrease in abundance with increasing depth and proximity to the floodplain site PLM4; Rokubacteria, on the other hand, show the opposite pattern Proteobacteria species comprise 22.7% (±10.8 SD) of all microbial abundance. This dominance increases systematically with distance down the hillslope, largely irrespective of the sampling depth (Figures 2 and 3a). Gammaproteobacteria species are almost undetectable in communities higher on the hillslope, whereas alphaproteobacterial species are prevalent at all sites (Figure 3a). Deltaproteobacteria species increase in abundance with increasing proximity to the floodplain and also with increasing proximity to the water table, with the highest representation observed in samples from below the water table. Distinct Deltaproteobacteria species are found in samples close to the water table (Desulfobacca acetoxidans in clade 1, and Geobacter spp. and Desulfuromonas sp. in clades 3 and 4, see Figure A4). Some distinct species (clade 2 in Figure A4) occur only below the water table (Syntrophaceae, Figure A4, clade 2). Thaumarchaeota related to Nitrososphaera sp. are the dominant archaea at every location other than at the floodplain (Figure 3b). At the floodplain site (PLM4), Pacearchaeota are present in soil samples close to, although above the water table whereas Bathyarchaeota and Euryarchaeota are present in samples below the water table.
Figure 3

Relative abundances of proteobacterial classes (a) and archaeal phyla (b) clusters across the sampling sites. Within bars of the same color, black lines separate distinct organisms. Samples are ordered from the top to the bottom of the hillslope transect. Within each site, samples are ordered by depth

Figure A4

A Maximum‐Likelihood phylogenetic tree of rpS3 clusters classified as Deltaproteobacteria. Black circles mark branch support greater than 0.8. Grey scale bar was calculated with the square root of relative abundance of each cluster. Clades of interest are marked 1 through 5

Relative abundances of proteobacterial classes (a) and archaeal phyla (b) clusters across the sampling sites. Within bars of the same color, black lines separate distinct organisms. Samples are ordered from the top to the bottom of the hillslope transect. Within each site, samples are ordered by depth Out of the 37 microbial phyla that were identified, 20 are candidate phyla (CP) (i.e., phyla that lack an isolated representative). Of the CP, eight are part of the Candidate Phyla Radiation of Bacteria (CPR) (Figure 4). Members of CP are present at all sites along the hillslope transect, but their detection is positively correlated with depth of sampling (Pearson's r = 0.851, p‐value < 0.0001) (Figure 4a). Moreover, depth could be used as a predictor for the abundance of CP as a linear regression has an r 2 = 0.66 and slope = 5.07 (p‐value < 0.0001). Interestingly, CPR bacteria are almost exclusively found at the floodplain site and only just above (7 cm above the water table) and within groundwater‐saturated sediment (Figure 4b). Although sampling sites above and below the water table are close spatially and may experience similar conditions when groundwater level fluctuate, they harbor bacteria from completely different CPR phyla.
Figure 4

Abundances of Candidate phyla (CP) and Candidate Phyla Radiation (CPR) bacterial clusters at hillslope sites. (a) Abundance of bacteria from CP other than CPR phyla. (b) Abundance of bacteria from CPR phyla. Samples are ordered by depth and within any specific depth, from top to bottom of the hillslope transect. CPR phyla were not detected in samples other than the six depicted in this figure

Abundances of Candidate phyla (CP) and Candidate Phyla Radiation (CPR) bacterial clusters at hillslope sites. (a) Abundance of bacteria from CP other than CPR phyla. (b) Abundance of bacteria from CPR phyla. Samples are ordered by depth and within any specific depth, from top to bottom of the hillslope transect. CPR phyla were not detected in samples other than the six depicted in this figure We investigated how distance from groundwater and weathered shale impact microbial community structure. Unweighted UniFrac‐based PCoA ordination, that allows addressing phylogenetic distance without assigning taxonomic levels, reveals that soils sampled at depths of 5 cm and 30 cm from all field sites group together (Figures 5a and A5a and b). However, the weighted UniFrac PCoA analysis (considering organism abundances) differentiates these 5 cm from 30 cm soil samples. Considering distance from the river while suppressing information describing depth below ground surface, these analyses also differentiate samples taken at PLM4 from those taken at PLM0, PLM1, and PLM2 but not from PLM3, which is closer to the floodplain. Lastly, weighted UniFrac separates samples from PLM4 from above and below the water table (Figures 5b and A5c and d). Thus, for soils that contain similar types of organisms, sampling depth and proximity to weathered rock shift organism abundance relative levels. Overall, distance from groundwater at the floodplain site and weathered shale at the hillslope sites seem to be dominant factors in determining the microbial community structure across the hillslope.
Figure 5

Samples cluster based on proximity to weathered shale and groundwater‐saturated soil. (a) NMDS based on unweighted UniFrac distance computed using maximum‐likelihood phylogenetic tree. (b) NMDS based on weighted UniFrac distances computed using maximum‐likelihood phylogenetic tree and abundance of each taxon. Confidence ellipses (95% interval) are shown in Figure A4

Figure A5

Samples cluster based on depth and distance from river. (A) and (B) NMDS based on unweighted UniFrac distance computed using Maximum‐Likelihood phylogenetic tree. Ellipses mark 95% confidence interval for samples grouped by site (A) or depth (B). Similar analysis by weighted UniFrac distances is shown in (C) and (D) where ellipses mark 95% confidence interval for samples grouped by site (C) or depth (D). Confidence ellipses were not calculated depths of 50 cm, 90 cm, 100 cm, 127 cm, 170 cm and 200 cm as there were not enough data points to conduct the statistic calculation. For confidence ellipses in (B) and (D) hillslope sites were considered separately from floodplain sites due to the their apparent sepration on the NMDS plot

Samples cluster based on proximity to weathered shale and groundwater‐saturated soil. (a) NMDS based on unweighted UniFrac distance computed using maximum‐likelihood phylogenetic tree. (b) NMDS based on weighted UniFrac distances computed using maximum‐likelihood phylogenetic tree and abundance of each taxon. Confidence ellipses (95% interval) are shown in Figure A4 Forty geochemical factors were assessed in order to elucidate the factors that shape community structure in the soil profile sites. The combination of soil moisture and concentrations of Na, Se, and Zn were correlated to microbial community structure (r = 0.751) (Figure 6). The results were validated with Mantel test (Pearson's r = 0.751, p‐value = 0.001, 999 permutations). Selenium had the highest concentration in samples taken above the water table, (PLM4 65 cm, 8.119 ± 0.235 ppb) whereas zinc concentrations were the highest in samples closest to weathered shale (PLM3 127 cm, 95.694 ± 0.915 ppb), which also had the highest acidity (pH = 7.98) (Table A4). Sodium (Na) concentrations were the highest in samples taken from below the water table (PLM4 90 cm, 9,178 ppb).
Figure 6

NMDS ordination of microbial communities and correlated geochemical factors. Spearman correlation was tested using Bray–Curtis distances and Euclidean distance matrix. Out of 40 geochemical measurements (Table A4) only soil moisture, Se, Na, and Zn were correlated with microbial community composition (r = 0.751, p‐value = 0.001). Stress = 0.0788. Numbers in figure are depth in cm. Raw values are provided in Table A4

NMDS ordination of microbial communities and correlated geochemical factors. Spearman correlation was tested using Bray–Curtis distances and Euclidean distance matrix. Out of 40 geochemical measurements (Table A4) only soil moisture, Se, Na, and Zn were correlated with microbial community composition (r = 0.751, p‐value = 0.001). Stress = 0.0788. Numbers in figure are depth in cm. Raw values are provided in Table A4 Metabolic potential, as depicted by detected genes, differentiates locations along the hillslope to floodplain transect. Out of 87 Hidden Markov Models (HMMs), 78 were found to exceed our detection threshold (see Section 2). An NMDS of gene abundances reveals a clear depth gradient in samples taken from the floodplain site (Figures 7 and A6). A depth‐dependent trend in overall metabolic potential is also observed along the hillslope. In addition, gradient in overall metabolic potential correlates with elevation (i.e., position on the hillslope).
Figure 7

Abundance of key metabolic enzymes cluster samples according to depth and proximity to river. An NMDS of key metabolic genes generated using 78 HMMs of carbon, nitrogen, sulfur, and selenium metabolic enzymes

Figure A6

NMDS of samples according to abundance of key metabolic enzymes. A. NMDS of key metabolic genes generated using 79 HMMs of carbon, nitrogen, sulfur and selenium metabolic enzymes. Ellipses mark 95% confidence interval for samples grouped by site (A) or depth (B). Confidence ellipses were not calculated depths of 50 cm, 90 cm, 100 cm, 127 cm, 170 cm and 200 cm as there were not enough data points to conduct the statistic calculation. For confidence ellipses in (B) hillslope sites were considered separately from floodplain sites due to their apparent separation on the NMDS plot

Abundance of key metabolic enzymes cluster samples according to depth and proximity to river. An NMDS of key metabolic genes generated using 78 HMMs of carbon, nitrogen, sulfur, and selenium metabolic enzymes The patterns identified in the NMDS are driven in part by genes encoding enzymes involved in N2 fixation (nifDHK), denitrification (norBC and nosZ), and the Wood–Ljungdahl carbon fixation pathway (codhC and codhD) (Figure A7). The dsrA and dsrB genes that encode reversible dissimilatory sulfite reductase are found in groundwater‐saturated saprolite samples PLM4 90 cm, in samples taken 10 cm above groundwater (PLM4 65 cm), and also present in samples collected at 5 cm depth. However, dsrD which is present only in samples from below groundwater and in samples taken 10 cm above it indicates that dsrA and dsrB are potentially responsible for sulfite reduction at these locations. Sequences of asrB which encodes for anaerobic sulfite reductase B were found exclusively in samples from groundwater saprolite (PLM4 90 cm). Also enriched in samples from below the water table is the catalytic subunit of thiosulfate reductase phsA, which catalyzes the reduction of thiosulfate to sulfite and hydrogen sulfide. Selenate reductase encoded by the gene srdA, which is associated with selenate respiration, is enriched in samples from below compared to above the water table and weathered shale compared to soil. The abundance of srdA was found to be correlated to selenium concentration (Pearson's r = 0.52, p‐value = 0.0325). Unfortunately, selenium measurements for PLM3 127 cm as well as PLM6 170 cm and 200 cm, where srdA abundance is the highest, were not available. These samples were taken from fractured shale which is rich with selenium, and therefore, it is assumed that adding these measurements will result in a stronger positive correlation.
Figure A7

Spatial abundance of genes central to metabolic pathways. Samples from the floodplain (blue colored clade) are distinct from samples from across the hillslope (black colored clade), particularly with respect to carbon fixation and selenate reduction. Sample names in red denote DNA samples that were co‐extracted with RNA (see Section 2). The sources of HMMs their description and detection cutoffs are given in Table A1

DISCUSSION

We integrated metagenomics and soil chemical analyses to investigate how microbial community structure and metabolic potential vary within the subsurface across a transect from high on an East River hillslope to its adjoining floodplain. Our analyses indicate that communities are differentiated according to depth and proximity to weathered shale and groundwater, and that microbial communities of the floodplain soils and sediments differ substantially from those collected along the hillslope. Notably, the abundance of species of Archaea, Proteobacteria and CPR bacteria have distinct spatial patterns. Thaumarchaeota, the dominant archaeal taxon in soils (Bates et al., 2011), are typically aerobic ammonium oxidizers that can drive nitrification (Colman, 2017). They were detected at every depth sampled across the hillslope, as found in hillslope soil pits in Colorado by Eilers, Debenport, Anderson, and Fierer (2012). The absence of Thaumarchaeota at the floodplain may be explained by extended periods of water saturation. Low redox conditions, inferred based on abundant genes involved in sulfate and selenate reduction, apparently selected instead for Bathyarchaeota and Euryarchaeota. The decrease in relative abundance of Alphaproteobacteria and Gammaproteobacteria with depth has been previously described in soil profiles from upper montane forest east of Boulder, CO, USA (Eilers et al., 2012). However, while the relative abundance of Betaproteobacteria was reported to decline with depth in the Boulder site, it mostly increased with depth at the hillslope. A similar pattern of increased relative abundance is observed in Deltaproteobacteria. It could be that the proximity to sulfate and nitrate rich Mancos shale bedrock supports the increased abundance of these organisms. Bacteria from CP increase in abundance with depth throughout PLM sites. They may have eluded prior cultivation studies due to their low abundances in more commonly sampled shallow soils. However, CPR bacteria, which elude most cultivation efforts (Solden, Lloyd, & Wrighton, 2016), are likely dependent on other microorganisms for basic cellular building blocks (Brown et al., 2015; Kantor et al., 2013). Other than the two occurrences of Yanofskybacteria species in deep samples close to the soil‐weathered shale transition (127 cm and 170 cm from PLM3 and PLM6, respectively), bacteria from CPR phyla were detected only in the floodplain samples. CPR bacteria are often found in anaerobic environments and have streamlined genomes, lacking many genes for independent survival. Many are likely obligate symbionts, and as such they may often associate with anaerobic hosts, although the identities of their hosts remain unclear (Brown et al., 2015; Castelle & Banfield, 2018; Hug, Baker, et al., 2016). The abundance of genes encoding methanol dehydrogenase (mdh1/mxaF/xoxF in Figure A6) and the catalytic subunit of carbon monoxide dehydrogenase (coxL in Figure A7) were consistently lower in the groundwater‐saturated floodplain samples than in any hillslope samples or floodplain samples from above the water table. Methanol dehydrogenase is involved in aerobic oxidation of methanol (which could derive from plant biomass or oxidation of methane), whereas CO dehydrogenase is involved in aerobic oxidation of CO (possibly produced by plants as a signaling molecule). Sulfite reduction may be a second biogeochemical process that differentiates microbial communities at the floodplain from those on the hillslope, particularly in samples below the water table and immediately above it, where dsrD, a hallmark for the reverse dsr pathway is relatively abundant (Anantharaman et al., 2018). Further, genes encoding for key enzymes (codhC and codhD) in the anaerobic Wood–Ljungdahl pathway for carbon fixation, and genes for nitrogen fixation (nifDHK) are relatively abundant at the floodplain site, specifically below groundwater and immediate above it compared to the hillslope sites. Interestingly, these samples contained the highest abundance of genes encoding for form I and II Ribulose‐1,5‐bisphosphate carboxylase/oxygenase (RubisCO) enzymes, known to play a role microbial carbon fixation (Berg et al., 2010). These patterns support the conclusion that groundwater‐saturated regions of the watershed support largely anaerobic microbial communities. Overall, the findings indicate that floodplain site metabolic potential is depth‐stratified, with one microhabitat below the water table that is colonized by organisms with anaerobic metabolisms, a second within the zone experiencing seasonal fluctuating redox conditions, and a third closer to the surface, where communities would experience oxidizing conditions throughout most of the year. A similar stratification, with a 70 cm alternating redox zone, was observed within a sediment profile from the Rifle river riverbed (Danczak et al., 2016). As in the current work, the microbial community of the alternating redox zone is easily distinguishable from those in both the shallow and deep zones. Overall, the spatial layout of the compartments may support complete redox cycles, analogous to sulfur cycling at oxygen‐minimum zones in the ocean (Canfield et al., 2010). Selenium concentration may be a major factor that differentiates microbial communities at the floodplain from those on the hillslope. Selenium occurs in insoluble metal selenides in Mancos Shale that underlies much of the Gunnison River basin (Colorado, USA; Elrashidi, 2018), which includes the East River watershed. Oxidation of selenium to soluble selenite and selenate under mildly reducing to oxidizing conditions (Presser, 1994) leads to its mobilization and probably accounts for its presence in pore fluids. Enrichment of srdA genes, which encode the catalytic subunit of the complex required for selenate reduction, in sequences from the floodplain site suggests that dissimilatory reduction of selenate (Fakra et al., 2015; Ike, Takahashi, Fujita, Kashiwa, & Fujita, 2000; Maiers, Wichlacz, Thompson, & Bruhn, 1988; Nancharaiah & Lens, 2015; Williams et al., 2013) supports microbial growth at this site. Geobacter species, which were identified almost exclusively in floodplain samples (Figure A4, clade 3) and are sometimes capable of selenite reduction (Pearce et al., 2009), may be responsible for these reactions. The detection of srdA genes in the three deepest samples from the hillslope (127–200 cm) suggests that selenate reduction may occur periodically close to the weathered shale–soil interface where seasonally variable redox conditions induced by groundwater fluctuations may enable microbe‐catalyzed selenium transformations. Across the hillslope sites, shallow soils have relatively similar community compositions. This might be explained by the low soil moisture that these locations experience over much of the year, as well as exposure to low temperatures during late fall and early winter prior to the onset of insulating snow cover. Further, soil community compositions are homogenized at some sites, likely due to soil mixing as a result of gopher activity (Yoo, Amundson, Heimsath, & Dietrich, 2005). Bioturbation may increase soil porosity and permeability and homogenize the mineral matrix and microbial community composition within a site, particularly close to the soil surface (reviewed by Platt, Kolb, Kunhardt, Milo, & New, 2016). It is also possible that similarity in vegetation at the nonfloodplain sites contributes to community similarity. Between‐site heterogeneity, which could arise due to periodic events or local changes in vegetation, could be eliminated by microbial dispersal. However, microbial dispersal is generally very limited in soils that are not saturated with groundwater (Elsas, Trevors, & Overbeek, 1991). Although groundwater and runoff from rain and snowmelt might transport microbes downslope and into the weathered rock, hydraulic measurements show that overland and lateral underground transport is likely limited at the hillslope sites (T. K. Tokunaga, J. Wan, K. H. Williams, W. Brown, A. Henderson, Y. Kim, A. P. Tran, M. E. Conrad, M. Bill, R. W. H. Carroll, W. Dong, Z. Xu, A. Lavy, B. Gilbert, S. Romero, J. N. Christensen, B. Faybishenko, B. Arora, E. R. Siirila‐Woodburn, R. Versteeg, J. H. Raberg, J. E. Peterson, & S. S. Hubbard, Unpublished data). Soil and weathered rock are water‐saturated for only a few weeks each year, other than at the floodplain. During this period, water moves at ~ 10 to 20 m per month parallel to the surface slope (Tokunaga et al., under review), distances that are too short to connect communities at our sampling sites. Our study of a hillslope lower montane meadow to floodplain transect revealed an ecosystem comprised of distinct subsystems. Specifically, our results documenting the abundance patterns of genes involved in selenium, sulfur, carbon, and nitrogen cycles suggest that hillslope and floodplain sites constitute distinct ecosystem compartments. Further, the hillslope sites are spatially differentiated into microhabitats close to (or within) weathered shale and proximal to the surface. Similarly, the floodplain site is resolved into largely anaerobic and aerobic communities over relatively short vertical distance, raising the possibility of elemental cycling across the interface. These results clarify the scale of heterogeneity in biogeochemical processes and improve our understanding of how these processes map onto the watershed. The ability to make predictions at more than one level of resolution requires identification of the processes of interest and the parameters that affect these processes at different scales (Turner, Dale, & Gardner, 1989). For that purpose, the current work focuses on the centimeter to meters scale, serving as a starting point for a “bottom‐up” approach for exploring microbial ecology across the watershed. The microhabitats that were identified in the hillslope and floodplain compartments of the watershed may be considered as “systems within systems” at a local scale. However, the term might also be applicable at a larger scale—one that spans across the entire watershed. Once validated by sampling at other hillslope and floodplain locations across the watershed, extrapolation of this knowledge could be used to improve our understanding of ecosystem functioning.

CONFLICT OF INTERESTS

The authors declare no competing interests in this study.

AUTHOR CONTRIBUTION

A.L designed research, performed research, analyzed data, and wrote the paper. D.G.M assisted in field and laboratory work. P.B.M.C conducted fieldwork. J.W performed chemistry analysis. Assisted in designing and conducting fieldwork. T.K.T conducted hydrological measurements. Assisted in designing and conducting fieldwork. B.C.T provided computational infrastructure and written bioinformatical software used in this work. K.H.W took part in the research design. Assisted in fieldwork. S.S.H took part in the research design. J.F.B supervised the study and mentored the first author.
  3 in total

1.  Novel bacterial clade reveals origin of form I Rubisco.

Authors:  Jose H Pereira; Albert K Liu; Douglas M Banda; Douglas J Orr; Michal Hammel; Christine He; Martin A J Parry; Elizabete Carmo-Silva; Paul D Adams; Jillian F Banfield; Patrick M Shih
Journal:  Nat Plants       Date:  2020-08-31       Impact factor: 15.793

2.  Soils and sediments host Thermoplasmata archaea encoding novel copper membrane monooxygenases (CuMMOs).

Authors:  Spencer Diamond; Adi Lavy; Alexander Crits-Christoph; Paula B Matheus Carnevali; Allison Sharrar; Kenneth H Williams; Jillian F Banfield
Journal:  ISME J       Date:  2022-01-05       Impact factor: 11.217

Review 3.  The microbial dimension of submarine groundwater discharge: current challenges and future directions.

Authors:  Clara Ruiz-González; Valentí Rodellas; Jordi Garcia-Orellana
Journal:  FEMS Microbiol Rev       Date:  2021-09-08       Impact factor: 16.408

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.