| Literature DB >> 22238585 |
Coralie Damon1, Frédéric Lehembre, Christine Oger-Desfeux, Patricia Luis, Jacques Ranger, Laurence Fraissinet-Tachet, Roland Marmeisse.
Abstract
Eukaryotic organisms play essential roles in the biology and fertility of soils. For example the micro and mesofauna contribute to the fragmentation and homogenization of plant organic matter, while its hydrolysis is primarily performed by the fungi. To get a global picture of the activities carried out by soil eukaryotes we sequenced 2×10,000 cDNAs synthesized from polyadenylated mRNA directly extracted from soils sampled in beech (Fagus sylvatica) and spruce (Picea abies) forests. Taxonomic affiliation of both cDNAs and 18S rRNA sequences showed a dominance of sequences from fungi (up to 60%) and metazoans while protists represented less than 12% of the 18S rRNA sequences. Sixty percent of cDNA sequences from beech forest soil and 52% from spruce forest soil had no homologs in the GenBank/EMBL/DDJB protein database. A Gene Ontology term was attributed to 39% and 31.5% of the spruce and beech soil sequences respectively. Altogether 2076 sequences were putative homologs to different enzyme classes participating to 129 KEGG pathways among which several were implicated in the utilisation of soil nutrients such as nitrogen (ammonium, amino acids, oligopeptides), sugars, phosphates and sulfate. Specific annotation of plant cell wall degrading enzymes identified enzymes active on major polymers (cellulose, hemicelluloses, pectin, lignin) and glycoside hydrolases represented 0.5% (beech soil)-0.8% (spruce soil) of the cDNAs. Other sequences coding enzymes active on organic matter (extracellular proteases, lipases, a phytase, P450 monooxygenases) were identified, thus underlining the biotechnological potential of eukaryotic metatranscriptomes. The phylogenetic affiliation of 12 full-length carbohydrate active enzymes showed that most of them were distantly related to sequences from known fungi. For example, a putative GH45 endocellulase was closely associated to molluscan sequences, while a GH7 cellobiohydrolase was closest to crustacean sequences, thus suggesting a potentially significant contribution of non-fungal eukaryotes in the actual hydrolysis of soil organic matter.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22238585 PMCID: PMC3253082 DOI: 10.1371/journal.pone.0028967
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Taxonomic affiliation of the different environmental sequences.
PCR amplified 18S rRNA (column 1) and cDNAs (columns 2 and 3) from spruce and beech soils were attributed to one of eight eukaryotic phyla or to the Bacteria. The “multiple affiliation” category contains sequences which could not be unambiguously placed in one of the other categories. (A), Results for all sequences which had a homolog in the GenBank/EMBL/DDJB database; (B), same analysis after removing cDNA sequences from bacteria and cDNA with uncertain “multiple affiliation” for a direct comparison between 18S rRNA and cDNA datasets. For the annotation of cDNA sequences using MEGAN, two different parameters were used; the “stringent analysis” (column 2) corresponded to the “Min Support” parameter set to default value of 5 which was set to 1 in the “less stringent” analysis (column 3).
Characteristics of the sequence datasets.
| Spruce soil | Beech soil | |
| No. of sequenced clones | 10,000 | 10,000 |
| No. of contaminating rRNA squences | 684 | 765 |
| No. of “good quality cDNAs” | 8606 | 7905 |
| Average length of cDNA sequences (nucleotides) | 482 | 430 |
| cDNA GenBank/EMBL/DDJB Accession Nos. | FR697056–706058 | FR706059–714330 |
| % cDNA in clusters (No. of clusters) | 9% (248) | 9% (260) |
| No. of singletons | 7851 | 7222 |
| % cDNA with BLASTX hits (GenBank nr) | 48% | 39.5% |
| % cDNA with GO terms | 39% | 31.5% |
| % cDNA with GO “Biological process” terms | 25.5% | 23.5% |
| % cDNA with E.C. No. (No. of E.C. Nos.) | 13% (321) | 12% (313) |
| % cDNA with taxonomic annotation (MEGAN) | 46% | 37% |
values obtained using the default settings of the MEGAN software (“Min.support” parameter set to five);
values obtained using the MEGAN software with “Min.support” parameter set to one.
A list of KEGG pathways relevant to soil nutrient (C, N, S) utilisation and microbial metabolism for which different cDNA sequences from the spruce and beech soil could be affiliated.
| No. of cDNAs | ||
| KEGG pathway | Beech | Spruce |
| Nitrogen metabolism | 13 (16.4) | 17 (19.7) |
| Alanine, aspartate and glutamate metabolism | 19 (24.0) | 26 (30.2) |
| Arginine and proline metabolism | 20 (25.3) | 22 (25.6) |
| beta-Alanine metabolism | 6 (7.6) | 13 (15.1) |
| Cysteine and methionine metabolism | 10 (12.7) | 19 (22.1) |
| Glycine, serine and threonine metabolism | 10 (12.7) | 14 (16.3) |
| Histidine metabolism | 4 (5.1) | 4 (4.6) |
| Lysine biosynthesis | 6 (7.6) | 2 (2.3) |
| Lysine degradation | 7 (8.9) | 10 (11.6) |
| Phenylalanine metabolism | 16 (20.2) | 14 (116.3) |
| Phenylalanine, tyrosine and tryptophan biosynthesis | 6 (7.6) | 8 (9.3) |
| Tryptophan metabolism | 8 (10.1) | 12 (13.9) |
| Tyrosine metabolism | 10 (12.7) | 13 (15.1) |
| Valine, leucine and isoleucine biosynthesis | 3 (3.8) | 6 (7.0) |
| Valine, leucine and isoleucine degradation | 18 (22.8) | 15 (17.4) |
| Citrate cycle (TCA cycle) | 19 (24.0) | 29 (33.7) |
| Glycolysis/Gluconeogenesis | 21 (26.6) | 48 (55.8) |
| Pentose phosphate pathway | 15 (19.0) | 20 (23.2) |
| Pyruvate metabolism | 20 (25.3) | 21 (24.4) |
| Galactose metabolism | 4 (5.1) | 7 (8.1) |
| Fructose and mannose metabolism | 12 (15.2) | 18 (20.9) |
| Starch and sucrose metabolism | 16 (20.2) | 18 (20.9) |
| Sulfur metabolism | 2 (2.5) | 3 (3.5) |
A list of some of the enzymes identified in several of these pathways is given in Table S3. Between parentheses are given the figures extrapolated to a common sample size of 10,000 cDNAs for both spruce and beech.
An illustration of the diversity of potential plant cell wall and other polysaccharides (starch, chitin) degrading enzymes identified among the beech and spruce soil ESTs.
| Category | Family | Potential enzyme activity (/substrate) | beech | spruce |
|
|
|
| ||
|
|
|
|
| |
|
|
|
|
| |
| CE1 | xylan, pectin | 1 | 1 | |
|
|
|
|
| |
|
|
|
|
| |
| PL1 | pectate lyase/pectin | 0 | 1 | |
|
|
|
|
| |
| GH5 | Cellulase/cellulose | 2 | 5 | |
| GH6 | endoglucanase/cellulose | 1 | 0 | |
| GH7 | endoglucanase/cellulose | 3 | 4 | |
| GH9 | endoglucanase/cellulose | 1 | 0 | |
| GH10 | endo-1,4-β-xylanase/hemicellulose | 0 | 1 | |
| GH11 | xylanase/hemicellulose | 2 | 1 | |
| GH12 | endoglucanase/xyloglucan | 0 | 1 | |
| GH28 | galacturonase/pectin | 0 | 2 | |
| GH13 | α-amylase/starch | 0 | 3 | |
| GH15 | glucoamylase/starch | 1 | 1 | |
| GH18 | chitinase/chitin | 3 | 6 | |
| GH31 | α-glucosidase/xyloglucan | 0 | 1 | |
| GH35 | β-galactosidase/pectin | 0 | 2 | |
| GH44 | endoglucanase/cellulose | 0 | 1 | |
| GH45 | endoglucanase/cellulose | 0 | 2 | |
| GH51 | α-L-arabinofuranosidase/pectin | 0 | 1 | |
| GH54 | α-L-arabinofuranosidase/pectin | 2 | 0 | |
| GH61 | Cu-metalloenzyme/cellulose | 1 | 6 | |
| GH76 | α-1,6-mannanase/hemicellulose | 0 | 2 | |
| GH78 | α-L-rhamnosidase/pectin | 0 | 1 | |
| GH93 | exo-α-L-1,5-arabinanase/pectin | 0 | 1 | |
|
|
|
| ||
|
|
|
|
| |
| LO1 | Laccase/Catechol oxydase | 2 | 1 | |
| LO2 | peroxydase | 1 | 1 | |
| LO3 | cellobiose dehydrogenase | 0 | 1 | |
|
|
|
|
| |
| LDA1 | aryl-alcohol oxydase | 0 | 3 | |
| LDA3 | glyoxal oxydase | 0 | 7 | |
| LDA5 | galactose oxydase | 0 | 1 | |
| LDA6 | glucose oxydase | 0 | 1 | |
| LDA7 | benzoquinone reductase | 3 | 5 | |
| LDA8 | alcohol oxydase | 0 | 2 |
For the main categories, between parentheses are given the figures extrapolated to a common sample size of 10,000 cDNAs. Differences between Beech and Spruce were tested using Pearson Chi-square test;
**, P<0.01;
*, 0.01
, not significant, P>0.05.
according to data from the CAZyme database, [18], [29] and [67].
Excluding CBMs which can be associated to other CAZYmes.
These figures include all CAZYmes, CE or GH identified in the datasets; not just the families illustrated in this table.
Diversity of fungal plasma membrane transporters, potentially involved in soil nutrient uptake, identified among the beech and spruce soil cDNAs.
| TCDB | description | beech | spruce |
|
| |||
| 2.A.1.1.- | Sugar porter family | 1 | 31 |
|
| |||
| 2.A.18.- | Amino acid/auxin permease (AAAP) | 2 | 3 |
| 2.A.3.10.- | Amino Acid-Polyamine-Organocation (APC) | 4 | 6(7) |
|
| |||
| 2.A.1.14.- | Anion:Cation Symporter (ACS) | 1 | 0 |
| 2.A.67.- | Oligopeptide Transporter (OPT) | 2 | 6 |
|
| |||
| 2.A.1.9.- | Phosphate: H+ Symporter (PHS) | 2(3) | 6 |
|
| |||
| 1.A.11.- | Ammonia Channel Transporter (Amt) Family |
|
|
|
|
|
|
Figures represent the No. of unique sequences after clustering; figures between brackets give the total No. of sequences before clustering. For the total number of transporters, differences between Beech and Spruce were tested using Pearson Chi-square test;
**, P<0.01.
The Transporter Classification Database (http://www.tcdb.org/).
Figure 2Putative phylogenetic origins of three environmental Glycoside Hydrolases belonging to families GH7 (A) and GH45 (B).
Bayesian (MrBayes) phylogenetic trees include protein sequences from different taxonomic groups, each identified by a specific colour. Diagrams drawn to scale illustrate the modular structure of each of the different GH45 protein sequences. Red rectangles, potential signal peptides; blue rectangles, GH45 domains used for the phylogenetic analysis; orange rectangles, family one Carbohydrate Binding Modules (CBM1) characteristic of fungal GH45; purple rectangle, CBM2 module only found in the environmental sequence. Boxed species names indicate protein sequences for which an endoglucanase catalytic activity has been experimentally established [30]–[33]. Posterior branch probabilities above 0.8 are given; branches with less than 0.5 probability support were collapsed.