| Literature DB >> 26184859 |
Rahul Shubhra Mandal1, Sudipto Saha2, Santasabuj Das3.
Abstract
Gut microbiota of higher vertebrates is host-specific. The number and diversity of the organisms residing within the gut ecosystem are defined by physiological and environmental factors, such as host genotype, habitat, and diet. Recently, culture-independent sequencing techniques have added a new dimension to the study of gut microbiota and the challenge to analyze the large volume of sequencing data is increasingly addressed by the development of novel computational tools and methods. Interestingly, gut microbiota maintains a constant relative abundance at operational taxonomic unit (OTU) levels and altered bacterial abundance has been associated with complex diseases such as symptomatic atherosclerosis, type 2 diabetes, obesity, and colorectal cancer. Therefore, the study of gut microbial population has emerged as an important field of research in order to ultimately achieve better health. In addition, there is a spontaneous, non-linear, and dynamic interaction among different bacterial species residing in the gut. Thus, predicting the influence of perturbed microbe-microbe interaction network on health can aid in developing novel therapeutics. Here, we summarize the population abundance of gut microbiota and its variation in different clinical states, computational tools available to analyze the pyrosequencing data, and gut microbe-microbe interaction networks.Entities:
Keywords: 16S rRNA; Disease; Microbial interaction network; Operational taxonomic unit; Sequencing
Mesh:
Substances:
Year: 2015 PMID: 26184859 PMCID: PMC4563348 DOI: 10.1016/j.gpb.2015.02.005
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Gut microbiota studies in different species using pyrosequencing technology
| Host | Sample source | Sequencing method | Amount of data retrieved | GenBank ID | Ref. |
|---|---|---|---|---|---|
| Mouse | Cecum | 16S rRNA-based sequencing | 5088 16S rRNA sequences | DQ014552−DQ015671; AY989911−AY993908 | |
| Mouse | Cecum and feces | 16S rRNA-based sequencing | 2878 16S rRNA sequences | GQ491120−GQ493997 | |
| Mouse | Feces | 16S rRNA-based sequencing | 4172 16S rRNA sequences | FJ032696−FJ036849 ; EU584214−EU584231 | |
| Mouse and zebrafish | Zebrafish intestine and mouse cecum | 16S rRNA-based sequencing | 5545 16S rRNA sequences | DQ813844−DQ819377 | |
| Human | Colonic mucosa and feces | 16S rRNA-based sequencing | 11,831 16S rRNA sequences | AY916135−AY916390; AY974810−AY986384 | |
| Human | Feces | 16S rRNA-based sequencing | 9773 16S rRNA sequences | FJ362604−FJ372382 | |
| Human | Feces | 16S rRNA-based sequencing | 2064 16S rRNA sequences | DQ325545−DQ327606 | |
| Cat | Feces | 454 pyrosequencing | 187,396 reads | SRA012231.1 | |
| Dog | Feces | 454 pyrosequencing | 201,642 reads | SRA012231.1 | |
| Cow | Rumen | Whole genome sequencing | 268 G of metagenomic DNA | HQ706005−HQ706094; SRA023560 | |
| Yak | Rumen | 454 pyrosequencing | 88 Mb genomic DNA | NA |
Figure 1Association of gut microbiota with disease in PubMed publications
PubMed publications on different diseases involving gut microbiota were searched on February 09, 2015. IBD, inflammatory bowel disease; T2D, type 2 diabetes; CD, Crohn’s disease.
Highly-abundant bacterial species under different disease conditions
| Disease | Name of prevalent bacteria | Ref. |
|---|---|---|
| Symptomatic atherosclerosis | ||
| Type 2 diabetes | ||
| Obesity/IBD/CD | ||
| Colorectal cancer | ||
Note: IBD, inflammatory bowel disease; CD, Crohn’s disease.
Tools/webservers related to gut microbiota studies
| Name | Platform | Website | Main features | Ref. |
|---|---|---|---|---|
| QIIME | Stand alone | Network analysis, histograms of within- or between-sample diversity | ||
| mothur | Stand alone | Fast processing of large sequence data | ||
| RAMMCAP | Stand alone | Ultra fast sequence clustering and protein family annotation | ||
| MEGAN | Stand alone | Laptop analysis of large metagenomic shotgun sequencing data sets | ||
| MetaPhlAn | Stand alone | Faster profiling of the composition of microbial communities using unique clade-specific marker genes | ||
| MetaVelvet | Stand alone | High quality metagenomic assembler | ||
| SOAPdenovo2 | Stand alone | Metagenomic assembler, specifically for Illumina GA short reads | ||
| MOCAT | Stand alone | Generate taxonomic profiles and assemble metagenomes | ||
| SmashCommunity | Stand alone | Performs assembly and gene prediction mainly for data from Sanger and 454 sequencing technologies | ||
| HUMAnN | Stand alone | Analysis of metagenomic shotgun data from the Human Microbiome Project | ||
| FANTOM | Stand alone | Comparative analysis of metagenomics abundance data integrated with databases like KEGG Orthology, COG, PFAM and TIGRFAM, | ||
| MetaCV | Stand alone | Classification short metagenomic reads (75–100 bp) into specific taxonomic | ||
| Phymm | Stand alone | Phylogenetic classification of metagenomic short reads using interpolated Markov models | ||
| PhyloPythiaS | Web server | Fast and accurate sequence composition-based classifier that utilizes the hierarchical relationships between clades | ||
| TETRA | Web server | Correlation of tetranucleotide usage patterns in DNA | ||
| METAREP | Web server | Flexible comparative metagenomics framework | ||
| CD-HIT | Web server | Identity-based clustering of sequences | ||
| METAGENassist | Web server | Performs comprehensive multivariate statistical analyses on the data from different host and environment sites | ||
| CoMet | Web server | ORF finding and subsequent Pfam domain assignment to protein sequences | ||
| WebCARMA | Web server | Unassembled reads as short as 35 bp can be used for the taxonomic classification with less false positive prediction | ||
| MG-RAST | Web server | High-throughput pipeline for functional metagenomic analysis | ||
| CAMERA | Web server | Provides list of workflows for WGS data analysis | ||
| WebMGA | Web server | Implemented to run in parallel on local computer cluster |