| Literature DB >> 25028422 |
Y Verastegui1, J Cheng1, K Engel1, D Kolczynski2, S Mortimer2, J Lavigne2, J Montalibet2, T Romantsov1, M Hall1, B J McConkey1, D R Rose1, J J Tomashek2, B R Scott2, T C Charles3, J D Neufeld3.
Abstract
Soil microbial diversity represents the largest global reservoir of novel microorganisms and enzymes. In this study, we coupled functional metagenomics and DNA stable-isotope probing (DNA-SIP) using multiple plant-derived carbon substrates and diverse soils to characterize active soil bacterial communities and their glycoside hydrolase genes, which have value for industrial applications. We incubated samples from three disparate Canadian soils (tundra, temperate rainforest, and agricultural) with five native carbon ((12)C) or stable-isotope-labeled ((13)C) carbohydrates (glucose, cellobiose, xylose, arabinose, and cellulose). Indicator species analysis revealed high specificity and fidelity for many uncultured and unclassified bacterial taxa in the heavy DNA for all soils and substrates. Among characterized taxa, Actinomycetales (Salinibacterium), Rhizobiales (Devosia), Rhodospirillales (Telmatospirillum), and Caulobacterales (Phenylobacterium and Asticcacaulis) were bacterial indicator species for the heavy substrates and soils tested. Both Actinomycetales and Caulobacterales (Phenylobacterium) were associated with metabolism of cellulose, and Alphaproteobacteria were associated with the metabolism of arabinose; members of the order Rhizobiales were strongly associated with the metabolism of xylose. Annotated metagenomic data suggested diverse glycoside hydrolase gene representation within the pooled heavy DNA. By screening 2,876 cloned fragments derived from the (13)C-labeled DNA isolated from soils incubated with cellulose, we demonstrate the power of combining DNA-SIP, multiple-displacement amplification (MDA), and functional metagenomics by efficiently isolating multiple clones with activity on carboxymethyl cellulose and fluorogenic proxy substrates for carbohydrate-active enzymes. Importance: The ability to identify genes based on function, instead of sequence homology, allows the discovery of genes that would not be identified through sequence alone. This is arguably the most powerful application of metagenomics for the recovery of novel genes and a natural partner of the stable-isotope-probing approach for targeting active-yet-uncultured microorganisms. We expanded on previous efforts to combine stable-isotope probing and metagenomics, enriching microorganisms from multiple soils that were active in degrading plant-derived carbohydrates, followed by construction of a cellulose-based metagenomic library and recovery of glycoside hydrolases through functional metagenomics. The major advance of our study was the discovery of active-yet-uncultivated soil microorganisms and enrichment of their glycoside hydrolases. We recovered positive cosmid clones in a higher frequency than would be expected with direct metagenomic analysis of soil DNA. This study has generated an invaluable metagenomic resource that future research will exploit for genetic and enzymatic potential.Entities:
Mesh:
Year: 2014 PMID: 25028422 PMCID: PMC4161255 DOI: 10.1128/mBio.01157-14
Source DB: PubMed Journal: MBio Impact factor: 7.867
Location and physicochemical characteristics of the soil samples selected for DNA stable-isotope probing incubations[]
| Sample | Location | Latitude and | Bulk | Amt of carbon | pH | Moisture | Amt of nitrogen | Soil type | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Total | Inorganic | Organic | ||||||||
| Arctic tundra (1AT) | Daring Lake, North-West | 64°52′N, | 0.2 | 46.9 | BDL[ | 46.9 | 3.9 | 417.7 | 1.42 | Organic |
| Temperate rainforest | Pacific coastal rainforest, | 48°36′N, | 0.6 | 10.8 | BDL | 10.8 | 4.9 | 69.8 | 0.35 | Coarse sandy loam |
| Agricultural soil-wheat | Elora Research Station, | 43°38′N, | 1.1 | 1.85 | 0.12 | 1.7 | 7.4 | 17.9 | 0.19 | Silt loam |
For more details, see http://www.cm2bl.org/.
BDL, below detection limit.
FIG 1 Carbon dioxide production for Arctic tundra (1AT) (A), temperate rainforest (7TR) (B), and agricultural (11AW) (C) soils. Soil samples were amended with labeled (13C) or unlabeled (12C) substrates, and serum bottles were aerated weekly to replenish oxygen and deplete carbon dioxide. The “control” represents a soil sample incubated without substrate.
FIG 2 Principal coordinate analysis (PCoA) biplots of weighted UniFrac distances for 16S rRNA gene sequences generated by assembled paired-end Illumina reads. Samples separated by soil type and fraction (A) as well as by carbon source (B). Native soils were associated with their respective light fractions. Gray spheres represent taxonomic affiliations of OTUs that correlated most strongly within the ordination space.
FIG 3 Cleveland plot of operational taxonomic unit (OTU) abundance for OTUs possessing the highest indicator values (i.e., >70%) for an association with DNA-SIP heavy DNA (black squares [average abundance]) for all substrates and soils combined, in comparison to light DNA (gray squares [average abundance]). Taxonomic affiliations are included for phyla, with additional classifications for order (o_), family (f_), and genus (g_). For additional details, see Table S1 in the supplemental material.
FIG 4 Glycoside hydrolase (GH) families associated with pooled heavy DNA. Functional annotation of the metagenomic data revealed diverse GH gene representation within the pooled heavy DNA.
Substrate-specific activities of positive metagenomic clones from the [13C]cellulose DNA-SIP library
| Clone | Insert size | Activity (μM MU released)[ | CMC | ||||
|---|---|---|---|---|---|---|---|
| α- | β- | β- | β- | ||||
| C122 | 21.6 | 0.4 | 0.2 | 0.6 | 0.7 | 124.2 | − |
| C424 | 8.2 | 0.9 | 57.6 | 109.4 | 1.6 | 0.7 | − |
| C762 | 13.5 | 2.4 | 5.4 | 21.2 | 0.7 | 0.4 | − |
| C1024 | 16.8 | 123.8 | 6.5 | 35.8 | 1.7 | 0.5 | − |
| C1088 | 11.9 | 0.5 | 25.6 | 79.2 | 1.2 | 0.6 | − |
| C2194 | 12.9 | 0.5 | 0.3 | 0.6 | 0.4 | 39.6 | − |
| C2380 | 14.9 | 0.38 | 0.46 | 0.53 | 0.41 | 0.40 | +++ |
| C2044 | 14.7 | 0.40 | 0.40 | 0.52 | 0.39 | 0.36 | ++ |
Cellulase activity was scored by Congo red staining of clones on the LB-CMC plate. Other activities were measured in cell-free extracts using methylumbelliferone-based substrates. MU, methylumbelliferone units based on equal volumes of sample for each assay.
CMC, carboxymethyl cellulose. Plate-based clearing (high, +++; medium, ++; negative, −) was detected by Congo red stain and activity based on comparison to those of positive and negative controls.
FIG 5 Restriction of cosmid DNA with EcoRI-HindIII-BamHI. DNA sizes in kb are marked on the left and right. M, molecular size markers. The sizes of digested DNA fragments except for the cosmid backbone (the very top band) were added up to obtain the insert sizes of the cloned metagenomic DNA.
Analysis of cosmid insert end sequences
| Clone | BLASTx result for[ | |||
|---|---|---|---|---|
| Forward read | Reverse read | |||
| Description | E value (% identity | Description | E value (% identity | |
| C122 | 4e–5 (29 [40/139]) | 8e–136 (82 [131/162]) | ||
| C424 | 1e–28 (69 [66/80]) | 2e–103 (91 [157/163]) | ||
| C762 | 1e–86 (78 [151/171]) | 2e–28 (54 [83/125]) | ||
| C1024 | 2e–17 (95 [34/40]) | 2e–46 (80 [85/96]) | ||
| C1088 | 6e–61 (68 [123/150]) | 5e–44 (75 [101/114]) | ||
| C2194 | 1e–91 (95% [140/142]) | Failed sequencing | ||
| C2380 | 2e–15 (52 [51/69]) | 3e–105 (96 [162/163]) | ||
| C2044 | 1e–71 (96 [116/118]) | 9e–129 (97 [181/184]) | ||
Cosmids were end sequenced with M13 forward and reverse primers flanking the site of metagenomic DNA insertion. For each clone, two end sequences were obtained and are referred to as “reverse” and “forward” reads. Top matches for BLASTx analyses are shown. Positive results are the number of amino acids from the query that match the amino acids from the subject sequence. The total number of amino acids from the subject is shown.