| Literature DB >> 31001361 |
Ayyappa Kumar Sista Kameshwar1,2, Luiz Pereira Ramos2, Wensheng Qin1.
Abstract
Rumen is one of the most complex gastro-intestinal system in ruminating animals. With bountiful of microorganisms supporting in breakdown and consumption of minerals and nutrients from the complex plant biomass. It is predicted that a table spoon of ruminal fluid can reside up to 150 billion microorganisms including various species of bacteria, fungi and protozoa. Several studies in the past have extensively explained about the structural and functional physiology of the rumen. Studies based on rumen and its microbiota has increased significantly in the last decade to understand and reveal applications of the rumen microbiota in food processing, pharmaceutical, biofuel and biorefining industries. Recent high-throughput meta-genomic and proteomic studies have revealed humongous information on rumen microbial diversity. In this study, we have extensively reviewed and reported present-day's progress in understanding the rumen microbial diversity. As of today, NCBI resides about 821,870 records based on rumen with approximately 889 genome sequencing studies. We have retrieved all the rumen-based records from NCBI and extensively catalogued the rumen microbial diversity and the corresponding genomic and proteomic studies respectively. Also, we have provided a brief inventory of metadata analysis software packages and reviewed the metadata analysis approaches for understanding the functional involvement of these microorganisms. Knowing and understanding the present progress on rumen microbiota and performing metadata analysis studies will significantly benefit the researchers in identifying the molecular mechanisms involved in plant biomass degradation. These studies are also necessary for developing highly efficient microorganisms and enzyme mixtures for enhancing the benefits of cattle-feedstock and biofuel industries.Entities:
Keywords: Bacteria; Fungi; Genomic; Microbiota; Proteomic; Protozoa.; Ruminal fluid; Ruminating animals
Year: 2019 PMID: 31001361 PMCID: PMC6470328 DOI: 10.7150/jgen.32164
Source DB: PubMed Journal: J Genomics
Figure 1Pictorial representation of rumen and the process of digestion and absorption of the food material in ruminating animals [Note: The boxes are colored to represent the food passage from mouth to the rumen].
Figure 2Breakdown products obtained from the process of fermentation by rumen microbiota. [Note: The uncolored boxes are the macromolecules and the blue colored boxes represent the products of metabolism].
Figure 3The major microorganism species reported in the previous studies and classified based on the substrates they degrade.
The NCBI public repository and its major classification of NCBI databases.
| NCBI Literature | Bookshelf, MeSH, NLM Catalog, PubMed and PubMed Central |
|---|---|
| NCBI Genes | Gene, GEO Database, GEO Profiles, HomoloGene, PopSet UniGene |
| NCBI Genetics | ClinVar, dbGap, dbSNP, dbVar, GTR, MedGen, OMIM |
| NCBI Genomes | Assembly, BioCollections, BioProject, BioSample, Genome, Nucleotide, Probe, SRA, Taxonomy |
| NCBI Proteins | Conserved Domains, Identical Protein Groups, Proteins, Protein Clusters, Sparcle, Structure |
| NCBI Chemicals | Biosystems, PubChem BioAssay, PubChem Compound, PubChem Substance |
Figure 4Pictorial representation of the present days collection a) the number of records and datasets in NCBI database based on the term “Rumen”, b) briefly lists details of the genomic and metagenomic sequencing studies i) sequencing institution, ii) level of assembly and iii) the classification of assembled genomes into archaea and bacteria groups and c) shows the division of assembled genomes of microorganisms at genus level (A, B, D, E and G - proteobacteria represents α, β, δ, γ, £- proteobacteria). [Note: The color gradient used for the images A and B were generated using Microsoft Excel with red color represents lowest count, green color represents highest count and yellow color represents for intermediate count. Especially for figure 4A was gradient colored based for each section separately].
List of the cultured and uncultured NCBI assembled genomes of the rumen isolated microorganisms retrieved from NCBI Assembly Database.
| NCBI Assembled genomes of Microorganisms retrieved from Rumen |
|---|
| Uncultured actinobacteria: |
| Uncultured α-Proteobacteria: |
| Uncultured Bacteria: |
| Uncultured β-Proteobacteria: |
| Uncultured CFB group bacteria: |
| Uncultured γ-Proteobacteria: |
| Uncultured ε-Proteobacteria |
| Uncultured δ-Proteobacteria: |
| Mycoplasmas: |
| Planctomycetes: |
| Spirochetes: |
Figure 5The project data details of rumen microorganisms and the highly occurring genus in the rumen isolates. [Note: The color gradient used for the images A and B were generated using Microsoft Excel with red color represents lowest count, green color represents highest count and yellow color represents for intermediate count].
Lists the details of ruminal bacteria with whole genome sequences, available from the genome database of NCBI.
| Name | Subgroup | Length (Mb) | Protein | GC% |
|---|---|---|---|---|
| Spirochaetia | 3.4539 | 2837 | 53.2% | |
| Firmicutes | 3.1123 | 2805 | 50.1% | |
| Actinobacteria | 3.1650 | 2721 | 60.2% | |
| Firmicutes | 4.0483 | 3504 | 40.149% | |
| δ/ε subdivision | 2.1103 | 2040 | 48.5% | |
| Fibrobacteres | 3.8428 | 3077 | 48.05% | |
| Firmicutes | 2.9589 | 2446 | 54.3% | |
| Firmicutes | 3.9690 | 3753 | 47.2% | |
| Firmicutes | 2.5424 | 2059 | 53.35% | |
| Firmicutes | 2.2760 | 2121 | 41% | |
| Firmicutes | 4.6654 | 3764 | 39.7% | |
| β-proteobacteria | 2.4528 | 2204 | 49.06% | |
| Firmicutes | 2.5177 | 2261 | 34.4% | |
| Firmicutes | 2.0249 | 1846 | 43.4% | |
| Firmicutes | 1.0849 | 919 | 40.6% | |
| Firmicutes | 3.3450 | 2973 | 41.5% | |
| Bacteroidetes | 3.5894 | 2908 | 47.7% | |
| α-proteobacteria | 5.2628 | 4816 | 65% | |
| γ-proteobacteria | 2.3170 | 2104 | 44.9% | |
| Firmicutes | 2.8313 | 2613 | 37.56% | |
| Firmicutes | 3.8452 | 3661 | 44.5% |
Lists the details of gene expression datasets that was retrieved from the NCBI-GEO database using the term “rumen”.
| GEO-ID | Organisms | Method | Platform | Samples |
|---|---|---|---|---|
| GSE89874 | RNA-Sequencing | Illumina HiSeq 2000 | 45 | |
| GSE81847 | RNA-Sequencing | Illumina HiSeq 2000 | 62 | |
| GSE107550 | Microarray (Rumen microbiota) | Custom Agilent gene expression array | 14 | |
| GSE89162 | RNA-Sequencing | Illumina HiSeq 2000 | 38 | |
| GSE78197 | RNA-Sequencing | Illumina HiSeq 2000 | 18 | |
| GSE99066 | RNA-Sequencing | Illumina HiSeq 2500 | 9 | |
| GSE76346 | RNA-Sequencing | Illumina HiSeq 2000 | 10 | |
| GSE86323 | RNA-Sequencing | Illumina HiSeq 2000 | 8 | |
| GSE93907 | RNA-Sequencing | Illumina MiSeq | 10 | |
| GSE76501 | RNA-Sequencing | Illumina HiSeq 2000 | 20 | |
| GSE80173 | Microarray (CAZy-chip) | Custom Agilent gene expression array | 94 | |
| GSE52193 | RNA-Sequencing | Illumina HiScanSQ | 63 | |
| GSE87391 | Microarray | Affymetrix Bovine genome array | 10 | |
| GSE83813 | Microarray | Agilent Bovine custom array | 7 | |
| GSE82272 | Microarray | Agilent Bovine custom array | 8 | |
| GSE74329 | RNA-Sequencing | Illumina HiSeq 2000 | 71 | |
| GSE74379 | RNA-Sequencing | Illumina HiSeq 2000 | 24 | |
| GSE68791 | RNA-Sequencing | Illumina MiSeq | 2 | |
| GSE71153 | RNA-Sequencing | Illumina Genome Analyzer II | 16 | |
| GSE63550 | RNA-Sequencing | Illumina HiSeq 2000 | 4 | |
| GSE62624 | Microarray | CUST Rumen Bacto-array | 12 | |
| GSE50448 | Microarray | Custom Agilent gene expression array | 8 | |
| GSE39206 | SARST libraries | SARST libraries | 9 | |
| GSE35212 | Microarray | Affymetrix Bovine genome array | 14 | |
| GSE21492 | Microarray | NimbleGen Custom array | 3 | |
| GSE16747 | Microarray | EDL933 Spotted PCR array | 7 | |
| GSE19802 | Microarray | Custom Bovine array | 10 | |
| GSE21544 | RNA-Sequencing | Illumina Genome Analyzer | 95 | |
| GSE18716 | Microarray | Custom microarray | 1 | |
| GSE18382 | Microarray | Custom Bovine microarray | 18 | |
| GSE17849 | Microarray | Affymetrix Bovine Genome array | 12 | |
| GSE15916 | Microarray | Custom microarray | 4 | |
| GSE3029 | Microarray | Custom microarray | 39 | |
| GSE1842 | SARST libraries | SARST libraries | 1 |
List of the details of characterized proteins with corresponding structural details and source organism, available from the NCBI genome database.
| PDB | MMDB | Structural details | Organism |
|---|---|---|---|
| 5K9H | 143569 | Glycoside Hydrolase 29 | |
| 4DEV | 107545 | Acetyl Xylan Esterase | |
| 3U37 | 107485 | Acetyl Xylan Esterase | |
| 5WH8 | 160168 | Cellulase | |
| 5U22 | 151772 | Glycoside Hydrolase 39 | |
| 5G0R | 149432 | Methyl-coenzyme-M-Reductase | |
| 2WTN | 79382 | Feruloyl esterase | |
| 2WTM | 79381 | Feruloyl esterase | |
| 5K6O | 143776 | Glycoside Hydrolase 3 | |
| 5K6N | 143775 | Glycoside Hydrolase 3 | |
| 5K6M | 143774 | Glycoside Hydrolase 3 | |
| 5K6L | 143773 | Glycoside Hydrolase 3 | |
| 4NOV | 123603 | Glycoside Hydrolase 43 | |
| 4KCB | 117247 | Glycoside Hydrolase 5 | |
| 4KCA | 117246 | Glycoside Hydrolase 5 | |
| 5LXV | 144187 | Scaffoldin C Cohesin | |
| 5D9P | 133948 | Glycoside Hydrolase 5 | |
| 5D9O | 133947 | Glycoside Hydrolase 5 | |
| 5D9N | 133946 | Glycoside Hydrolase 5 | |
| 5D9M | 133945 | Glycoside Hydrolase 5 | |
| 4YHG | 130962 | Glycoside Hydrolase 5 | |
| 4YHE | 129494 | Glycoside Hydrolase 5 | |
| 4W8B | 127728 | Glycoside Hydrolase 5 | |
| 4W8A | 127727 | Glycoside Hydrolase 5 | |
| 4W89 | 127726 | Glycoside Hydrolase 5 | |
| 4W88 | 127725 | Glycoside Hydrolase 5 | |
| 4W87 | 127724 | Glycoside Hydrolase 5 | |
| 4W86 | 127723 | Glycoside Hydrolase 5 | |
| 4W85 | 127722 | Glycoside Hydrolase 5 | |
| 4W84 | 127721 | Glycoside Hydrolase 5 | |
| 4KC8 | 117245 | Glycoside Hydrolase 43 | |
| 4KC7 | 117244 | Glycoside Hydrolase 43 | |
| 4N2O | 116090 | Cohesin | |
| 4IU3 | 109437 | Cohesin | |
| 4IU2 | 109436 | Cohesin | |
| 4EYZ | 108431 | Cellulosomal protein | |
| 4AEM | 106860 | Carbohydrate binding module | |
| 4AEK | 106648 | Carbohydrate binding module | |
| 4AFD | 106169 | Carbohydrate binding module | |
| 4BA6 | 105916 | Carbohydrate binding module | |
| 4AFM | 105889 | Carbohydrate binding module | |
| 3VDH | 96587 | Glycoside Hydrolase 5 | |
| 2AGK | 39565 | His6 protein | |
| 5UHX | 156474 | Cellulase | |
| 2VG9 | 61544 | Glycoside Hydrolase 11 |
List of the publicly available resources in NCBI Chemicals databases that are related to the term “rumen”.
| NCBI-BioSystems |
|---|
| 9 Conserved Biosystems: L-isoleucine biosynthesis V, Pyruvate fermentation to acetate VII, Oxalate degradation II, Phytol degradation, Coenzyme M biosynthesis I, Pyruvate fermentation to butanoate, L-isoleucine biosynthesis IV, Sulfur reduction I and Fatty acid alpha-oxidation II |
| 4 Organism Specific Biosystems: Phytol degradation, Pyruvate fermentation to acetate VII, Isoleucine biosynthesis IV and Fatty acid alpha-oxidation |
| PubChem BioAssay |
| Insecticidal activity: |
Figure 6Pictorial representation of metadata analysis workflow applied for analyzing and understanding the genomic metadata analysis approach;
List of softwares (or) R-packages used for the metadata analysis of gene expression datasets. (Note: All the words written in bold are the names of the softwares (or) packages).
| Name | Software type | Operating system | Programming language |
|---|---|---|---|
| MEM | NA | Web user interface | C++, JavaScript, Perl |
| Onto-Compare, PhenoGen, ExAtlas | NA | Web user interface | NA |
| ImaGEO, ShinyMDE | NA | Web user interface | R |
| Bloader | Package (or) Module | Unix/Linux, Mac OS, Windows | Graphical user interface |
| MAAMD | Package (or) Module | Mac OS, Windows | NA |
| Package (or) Module | mixOmics, MetaOmics, CoGAPS, MetaQC, metaAnalyzeAll, metafor | ||
| A-MADMAN | Package (or) Module | Unix/Linux, Mac OS, Windows | R, Python |
| MetaKTSP | Toolkit/Suite | Unix/Linux, Mac OS, Windows | R |
Figure 7Pictorial representation of metadata analysis workflow applied for analyzing and understanding the proteomic metadata analysis approach.