| Literature DB >> 26757703 |
Yue Jiang1, Xuejian Xiong2, Jayne Danska3,4,5, John Parkinson6,7,8.
Abstract
BACKGROUND: Metatranscriptomics is emerging as a powerful technology for the functional characterization of complex microbial communities (microbiomes). Use of unbiased RNA-sequencing can reveal both the taxonomic composition and active biochemical functions of a complex microbial community. However, the lack of established reference genomes, computational tools and pipelines make analysis and interpretation of these datasets challenging. Systematic studies that compare data across microbiomes are needed to demonstrate the ability of such pipelines to deliver biologically meaningful insights on microbiome function.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26757703 PMCID: PMC4710996 DOI: 10.1186/s40168-015-0146-x
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1Workflow and Read Processing. a Workflow of the pipeline for processing, annotation and analyses of metatranscriptome (RNA-seq). b Composition of sequence reads for twelve mouse metatranscriptome datasets and four additional microbiomes (see Methods). c Distribution of reads annotated through three complementary sequence similarity search tools: (1) BWA and (2) BLAT searches against a database of 2271 microbial genomes and (3) BLASTX searches against the protein non-redundant database. The mouse dataset represents a summary of all 12 datasets analysed in this study
Pathways enriched in transcripts displaying large (>fivefold) differences in relative expression between mouse cecal wall and cecal flush samples
| Fold change in expression | Differentially expressed genes | Matched ECs/total ECs in pathway | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Genes up-regulated in cecal wall | Genes up-regulated in cecal flush | ||||||||
| Pathway |
| 5–10 | 10–20 | >20 | 5–10 | 10–20 | >20 | ||
| Glycolysis/gluconeogenesis | 9.35E-07 | 14 | 5 | 3 | 5 | 4 | 1 | 32 | 11/45 |
| Methane metabolism | 6.44E-05 | 10 | 5 | 3 | 4 | 5 | 2 | 29 | 11/68 |
| Carbon fixation in photosynthetic organisms | 1.00E-04 | 8 | 1 | 2 | 4 | 0 | 0 | 15 | 5/25 |
| One carbon pool by folate | 2.88E-04 | 4 | 3 | 0 | 1 | 2 | 0 | 10 | 6/24 |
| Starch and sucrose metabolism | 4.59E-04 | 5 | 4 | 2 | 3 | 3 | 0 | 17 | 10/71 |
| Alanine, aspartate and glutamate metabolism | 1.43E-03 | 9 | 1 | 0 | 0 | 1 | 0 | 11 | 7/43 |
| Citrate cycle (TCA cycle) | 1.52E-03 | 4 | 0 | 2 | 2 | 1 | 0 | 9 | 5/22 |
| Pyruvate metabolism | 3.08E-03 | 7 | 0 | 2 | 1 | 1 | 1 | 12 | 8/62 |
| Amino sugar and nucleotide sugar metabolism | 6.63E-03 | 5 | 3 | 1 | 1 | 3 | 2 | 15 | 9/85 |
| Oxidative phosphorylation | 1.10E-02 | 2 | 3 | 1 | 0 | 3 | 0 | 9 | 3/12 |
| Purine metabolism | 3.08E-03 | 10 | 0 | 0 | 2 | 0 | 2 | 14 | 9/100 |
| Propanoate metabolism | 3.37E-02 | 3 | 0 | 0 | 0 | 1 | 1 | 5 | 5/45 |
| Valine, leucine and isoleucine biosynthesis | 3.40E-02 | 1 | 1 | 0 | 0 | 1 | 2 | 5 | 3/18 |
| Aminoacyl-tRNA biosynthesis | 3.85E-02 | 2 | 0 | 0 | 1 | 0 | 3 | 6 | 4/32 |
| Histidine metabolism | 4.25E-02 | 1 | 3 | 0 | 1 | 0 | 0 | 5 | 4/33 |
| Drug metabolism—other enzymes | 4.49E-02 | 3 | 0 | 0 | 0 | 0 | 0 | 3 | 3/20 |
| Other glycan degradation | 4.88E-02 | 1 | 1 | 0 | 0 | 0 | 0 | 2 | 2/9 |
Fig. 2Taxonomic composition of five metatranscriptomic datasets. a Abundance of 10 major bacterial phyla and sub-phyla across the five samples. Also shown are the observed frequency of assignments in the 2271 microbial genomes used in the BWA and BLAT searches as well as the protein non-redundant database. b Phylogenetic representation of major genera (represented by at least 100 reads) associated with the five datasets. Node size represents the relative abundance of reads mapped to the corresponding genus in each sample. For each dataset, reads were normalized by the average read count associated with each sample (see Methods). c Top ten most abundant species associated with each dataset (by number of reads; minimum 100 transcripts)
Diversity analysis within mice samples and between five samples
| Sample name | Shannon index (mRNA) | Simpson index (mRNA) | Fisher’s alpha (mRNA) | Shannon index (16S rRNA) | Chao1 index (mRNA) | Chao1 index (16S rRNA) |
|---|---|---|---|---|---|---|
| Mouse cecal wall | 3.83 | 16.51 | 23.26 | 2.00 | 1162 | 283 |
| Mouse cecal flush | 4.43 | 43.34 | 30.52 | 2.57 | 1055 | 411 |
| Mouse combined | 4.51 | 17.14 | 167.33 | 2.48 | 1709 | 523 |
| Cow rumen | 4.14 | 21.79 | 140.67 | 4.15 | 1461 | 1042 |
| Kimchi | 1.69 | 3.27 | 56.07 | 2.91 | 634 | 615 |
| Deep sea | 5.01 | 35.75 | 481.29 | 5.02 | 4408 | 4565 |
| Permafrost | 3.82 | 10.98 | 24.31 | 4.5 | 295 | 348 |
Fig. 3Metabolic composition of five metatranscriptomic datasets. a Rarefaction analysis indicating the number of unique enzymes (as defined by enzyme classification numbers) captured by increasing numbers of putative mRNA reads generated. b Overlap of enzyme complements across four datasets reveals a common core of 592 enzymes. c Global metabolic network indicating taxonomic representation of metabolic activities within the combined mouse dataset. Pie charts indicate the relative proportion of each taxon, size of pie chart indicates relative expression (see key). Indicated are specific metabolic pathways
Pathways significantly enriched in ‘core’ microbiome enzymes
| Pathway name | Pathway classa |
| Core enzymes in pathway | Total enzymes in pathway |
|---|---|---|---|---|
| Aminoacyl-tRNA biosynthesis | O | 3.57E-08 | 22 | 32 |
| Purine metabolism | NT | 1.49E-06 | 44 | 100 |
| Peptidoglycan biosynthesis | G | 4.30E-06 | 12 | 15 |
| Glycolysis/gluconeogenesis | C | 2.89E-05 | 23 | 45 |
| Alanine, aspartate and glutamate metabolism | AA | 4.24E-05 | 22 | 43 |
| Valine, leucine and isoleucine biosynthesis | AA | 8.75E-05 | 12 | 18 |
| Pyrimidine metabolism | NT | 2.89E-04 | 27 | 63 |
| Phenylalanine, tyrosine and tryptophan biosynthesis | AA | 6.27E-04 | 15 | 29 |
| Pentose phosphate pathway | C | 7.05E-04 | 17 | 35 |
| Carbon fixation pathways in prokaryotes | E | 2.22E-03 | 17 | 38 |
| One carbon pool by folate | CO | 3.21E-03 | 12 | 24 |
| Lysine biosynthesis | AA | 3.32E-03 | 13 | 27 |
| Pyruvate metabolism | C | 3.37E-03 | 24 | 62 |
| Fatty acid biosynthesis | L | 3.88E-03 | 9 | 16 |
| Citrate cycle (TCA cycle) | C | 4.76E-03 | 11 | 22 |
| Amino sugar and nucleotide sugar metabolism | C | 5.66E-03 | 30 | 85 |
| Oxidative phosphorylation | E | 8.48E-03 | 7 | 12 |
| Drug metabolism—other enzymes | X | 2.34E-02 | 9 | 20 |
| Cysteine and methionine metabolism | AA | 2.55E-02 | 21 | 61 |
| Polyketide sugar unit biosynthesis | T | 2.74E-02 | 4 | 6 |
| Streptomycin biosynthesis | S | 3.48E-02 | 8 | 18 |
| Folate biosynthesis | CO | 3.61E-02 | 7 | 15 |
aDefined according to KEGG. AA amino acid, C carbohydrate, CO co-factor, E energy, G glycan, L lipid, NT nucleotode, O other, S secondary metabolites, T terpenoids, X xenobiotics
bHere, we used the hypergeometric test to examine enrichment of pathways compared to all KEGG defined pathways
Fig. 4Detailed views of taxonomic contributions to specific components of the tricarboxylic acid (TCA) cycle for four metatranscriptomic datasets. Each schematic indicates the taxonomic representation of enzymatic activities involved in the TCA cycle for four metatranscriptome datasets: mouse, kimchi, cow and deep sea. Pie charts indicate enzymes, with coloured sectors indicating the relative proportion of each taxon, size of pie chart indicates relative expression (see key). Small triangles indicate substrates with links indicating enzyme-substrate relationships
Fig. 5Taxonomic contributions to functional modules defined through protein-interaction networks. a ABC transporters and (b) cell wall biogenesis and cell division. Protein interactions were obtained from a previously generated network of functional interactions for E. coli [49]. Pie charts indicate the relative proportion of each taxon, size of pie chart indicates relative expression (see key). c Relative representation of specific functional groups across the four well sampled datasets