Literature DB >> 28071710

MicroPattern: a web-based tool for microbe set enrichment analysis and disease similarity calculation based on a list of microbes.

Wei Ma1,2, Chuanbo Huang1,3, Yuan Zhou1,2, Jianwei Li1,4, Qinghua Cui1,2.   

Abstract

The microbiota colonized on human body is renowned as "a forgotten organ" due to its big impacts on human health and disease. Recently, microbiome studies have identified a large number of microbes differentially regulated in a variety of conditions, such as disease and diet. However, methods for discovering biological patterns in the differentially regulated microbes are still limited. For this purpose, here, we developed a web-based tool named MicroPattern to discover biological patterns for a list of microbes. In addition, MicroPattern implemented and integrated an algorithm we previously presented for the calculation of disease similarity based on disease-microbe association data. MicroPattern first grouped microbes into different sets based on the associated diseases and the colonized positions. Then, for a given list of microbes, MicroPattern performed enrichment analysis of the given microbes on all of the microbe sets. Moreover, using MicroPattern, we can also calculate disease similarity based on the shared microbe associations. Finally, we confirmed the accuracy and usefulness of MicroPattern by applying it to the changed microbes under the animal-based diet condition. MicroPattern is freely available at http://www.cuilab.cn/micropattern.

Entities:  

Mesh:

Year:  2017        PMID: 28071710      PMCID: PMC5223220          DOI: 10.1038/srep40200

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


The human body houses a huge number of microorganisms which are mainly composed of bacteria, and these microorganisms inhabit a variety of human organs such as mouth, stomach, gastrointestinal tract, urogenital tract, skin and respiratory1. In recent years, with the fast development of microbiome and meta-genome sequencing technology, many studies have identified a number of differentially regulated microorganisms under a variety of conditions and these microbes could play an important role in our health and diseases234. For example, in the obese individuals, it was found that the number of the H2-producing Prevotellaceae and the H2-utilizing methanogenic archaea Methanobacteriales increased. It is known that the interspecies H2 transfer between bacterial and archaeal species is an important mechanism for increasing energy uptake by human large intestine in obese individuals5. In type 1 diabetes, the butyrate-producing and lactate-utilizing bacteria were reduced6. In type 2 diabetes, the number of butyrate-producing bacteria was decreased while the number of sulphate reduction bacteria was increased, and the ratio of Bacteroidetes to Firmicutes as well as the ratio of Bacteroides-Prevotella group to Clostridium coccoides-Eubacterium rectale group showed a significantly positive correlation with plasma glucose concentration78. Moreover, it was reported that many environmental factors could affect the components of microbiota. For example, smoking could alter gut microbiota9. Different delivery way of infants had different gut microbiota10. Different season or diet also had big effects on the components of microbiota1112. These findings provided great helps for the understanding of how microbe and human interacted under different condition. However, currently, computational methods for analyzing the differentially regulated microbes from a microbiome study are limited. Enrichment analysis is one class of important and popular bioinformatics methods in discovering valuable biological patterns and insights from a list of biological items, such as genes, microRNAs, and metabolites etc. For example, DAVID is a web-based tool for enrichment analysis of a list of genes13. TAM and MSEA are tools for enrichment analysis of a list of microRNAs and a list of metabolites, respectively1415. Currently tools for enrichment analysis of a list of microbes are still not available. We have established a web-based tool named MicroPattern (http://www.cuilab.cn/micropattern) for microbe set enrichment analysis. In addition, MicroPattern also implemented an algorithm we presented previously for the calculation of microbe-based disease similarity16.

Results

Microbe sets

In total, 47 microbe sets were collected including 37 disease sets (where microbes in the same set is associated with the same disease) and 10 position sets (where microbes in the same set is colonized on the same body position). In this work, we just keep microbes that in genus or species rank. Thus, two disease sets were abandoned due to lack of such specified microbe association. Flowchart for microbe sets integration was showed in Fig. 1. Among these sets, the size of 36 sets was in the range of 1~5(77%), 5 sets in the range of 6~10(11%), 1 set in the range of 11~15(2%), 2 sets in the range of 16~20(4%) and 3sets in the range of 21~209(6%), see also Fig. 2. All sets can be downloaded from our web server.
Figure 1

Catalog of microbe set.

We grouped microbes that associated with the same disease or colonized on the same body position into the same microbe set. Different microbe sets could overlap with each other.

Figure 2

Size distribution of microbe sets.

The pie chart indicating the proportion of microbe sets of each size.

Analysis procedure of MicroPattern

The procedure for enrichment analysis is illustrated in Fig. 3. MicroPattern works in four steps. In Step 1, a list of interested microbes needs to be inputted. Step 2 is an optional step. The list of microbes inputted in Step 2 will be treated as the background. If a background list is not provided, all microbes in all sets will be used as the background list. In Step 3, the users would choose what sets should be used for analysis according to the size of sets. By default, only the microbe set that includes at least two microbes will be considered. In Step 4, the user can click button “Run” and the result page will be automatically generated after all calculations have been done. In the result page, the microbe set, number of match microbes to this set, percent of match microbes, fold of overrepresentation, Bonferroni value and FDR value are shown. When mouse moves over the name of the microbe set, the matched microbes and non-matched microbes in this set will be listed in a pop-up box. The user can also double click the set name to download the data. Click the button “Bar plot of result” can plot a bar plot.
Figure 3

Stepwise guideline for performing the microbe set enrichment analysis.

For disease similarity calculation, two steps are need. As shown in Fig. 4, in Step 1, the list of microbe-disease association pairs need to be entered or uploaded. In Step 2, click button “Run” and the result will be shown in a new page. In the result page, the first column and the second column are two diseases and the third column is similarity between them.
Figure 4

Stepwise guideline for running the disease similarity calculating procedure.

Detailed tutorial about how to use MicroPattern are shown on the “Help” page of our web server.

Diet altering the human gut microbiome, which is associated with disease

We applied MicroPattern to 51 changed microbes (Table 1) from a study screening the changed microbes in human gut after animal-based diet17. In this study, 10 American volunteers were involved including 6 male and 4 female. These volunteers were treated with plant-based diet and animal-based diet. Changed microbes were then identified by comparing animal-based diet versus normal diet. For the purpose of investigating the meaningful patterns of these changed microbes, we identified the enriched microbe sets for the changed microbes. As a result, liver cirrhosis was significantly enriched (Table 2; FDR = 2.20 × 10−6). This prediction was supported by another study. In this study, high-fat, high-cholesterol diet, which is also common in animal diet, could induce non-alcoholic steatohepatitis and progressing to liver cirrhosis18.
Table 1

Significant changed microbes under the animal-based diet condition.

Taxonomic rankMicrobes
Species rankEubacterium biforme, Microbe MLG480*, Actinobacillus porcinus, Alistipes finegoldii, Alistipes putredinis, Bacteroides coprocola, Bacteroides fragilis, Bacteroides salyersiae, Bifidobacterium adolescentis, Bifidobacterium gallicum, Bifidobacterium longum, Bilophila wadsworthia, Blautia producta, Clostridium bolteae, Clostridium orbiscindens, Collinsella aerofaciens, Dialister invisus, Faecalibacterium prausnitzii, Megasphaera elsdenii, Mitsuokella multacida, Parabacteroides johnsonii, Prevotella copri, Raoultella, Roseburia Eubacteriumrectale, Roseburia faecis, Ruminococcus bromii, Ruminococcus callidus, Ruminococcus flavefaciens, Ruminococcus gnavus,
Genus rankAlistipes, Akkermansia, Bacteroides, Bifidobacterium, Blautia, Catenibacterium, Clostridium, Coprococcus, Dialister, Escherichia, Eubacterium, Faecalibacterium, Lachnobacterium, Lachnospira, Odoribacter, Oscillospira, Parabacteroides, Phascolarctobacterium, Roseburia, Prevotella, Ruminococcus, Sutterella

*This microbe has no formal species name.

Table 2

MicroPattern analysis result for changed microbes under the animal-based diet condition.

Microbe setsP valueFDR
Disease
 Liver cirrhosis1.38 × 10−72.20 × 10−6
 Clostridium difficile0.01320.079
 Irritable bowel syndrome0.01970.079
 Arthritis, rheumatoid0.03670.1173
Position
 Gastrointestinal tract0.01610.079

Discussion

With the rapid development of high-throughput biological techniques, more and more studies were focus on microbiome. It was important to identify the relationships between microbe and disease. MicroPattern is tool for predicting associated diseases of changed microbes and calculating disease similarity based on their shared microbe associations. Thus, MicroPattern could figure out how disease and microbe interacted. Moreover, with the accumulation of study focus on human microbiome, more associations between microbe and disease will be curated and MicroPattern will be improved greatly.

Materials and Methods

Collection of microbe sets

We searched the microbiome-related articles from Pubmed with the keyword “human microbiome” and manually curated the microbe-disease associations from the literature. In total, we have curated 483 microbe-disease associations from 61 publications. The microbe-disease association was defined as the microbe significantly increase or decrease under disease condition, as judged by the authors of original publications. To be precise and consistent, only the microbes of species and genus ranks were retained. Uncertain associations, if reported, were also omitted. The microbe-disease association dataset includes a total of 39 human diseases and 292 microbes. Here one microbe set is defined as a group of microbes that have the same meaningful association. For example, the microbes associated with one disease will be grouped into a microbe set. We used the union set of associated microbes from different studies for each disease, because current microbiome data are too variable to obtain one consensus microbe set across different studies192021. In addition to the microbe-disease dataset, we also annotated the information for the body positions where the microbes colonized. So current microbe sets were collected according to two rules, the microbe associated disease and the microbe colonized positions. In total, we collected 47 microbe sets including 37 disease-microbe sets and 10 position-microbe sets.

Enrichment analysis

We used the hypergeometric test22 to determine the significant overrepresentation of the microbe sets among a list of microbes of interest. Assuming that N represents the number of microbes included in all microbe sets, n represents the number of microbes included in the tested microbe set, M represents the number of microbes included in the interested microbe list and m represents the number of microbes that matched the tested microbe set. The statistical significance of this microbe set overrepresentation among the interest microbes are represented by the following formula: Finally, the P values for all microbe sets are adjusted by Bonferroni and Benjamini-Hochberg FDR corrections.

Disease similarity calculation

We adapted the equation for the calculation of symptoms-based disease similarity to calculate the microbe-based disease similarity23. For every disease i (39 in total) and every microbe j (292 in total), we described the w as the quantitative strength of relationship between them: E (E ∈ [−1, 1]) represents the changing direction of microbe j in disease i. E equals to 1 when microbe j is increased in disease i, while E equals to −1 when microbe j is decreased in disease i. W represents the number of associations of disease i and microbe j. N (here is 39) is the number of all disease and n is the number of diseases associated with microbe j. Thus, for every disease i, it has a vector d of length M (M is the number total microbes, here is 292). Then we took the cosine similarity value between two vectors d and d as similarity between disease i and disease j as

Additional Information

How to cite this article: Ma, W. et al. MicroPattern: a web-based tool for microbe set enrichment analysis and disease similarity calculation based on a list of microbes. Sci. Rep. 7, 40200; doi: 10.1038/srep40200 (2017). Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  23 in total

Review 1.  The gut microbiota--masters of host development and physiology.

Authors:  Felix Sommer; Fredrik Bäckhed
Journal:  Nat Rev Microbiol       Date:  2013-02-25       Impact factor: 60.633

2.  A metagenome-wide association study of gut microbiota in type 2 diabetes.

Authors:  Junjie Qin; Yingrui Li; Zhiming Cai; Shenghui Li; Jianfeng Zhu; Fan Zhang; Suisha Liang; Wenwei Zhang; Yuanlin Guan; Dongqian Shen; Yangqing Peng; Dongya Zhang; Zhuye Jie; Wenxian Wu; Youwen Qin; Wenbin Xue; Junhua Li; Lingchuan Han; Donghui Lu; Peixian Wu; Yali Dai; Xiaojuan Sun; Zesong Li; Aifa Tang; Shilong Zhong; Xiaoping Li; Weineng Chen; Ran Xu; Mingbang Wang; Qiang Feng; Meihua Gong; Jing Yu; Yanyan Zhang; Ming Zhang; Torben Hansen; Gaston Sanchez; Jeroen Raes; Gwen Falony; Shujiro Okuda; Mathieu Almeida; Emmanuelle LeChatelier; Pierre Renault; Nicolas Pons; Jean-Michel Batto; Zhaoxi Zhang; Hua Chen; Ruifu Yang; Weimou Zheng; Songgang Li; Huanming Yang; Jian Wang; S Dusko Ehrlich; Rasmus Nielsen; Oluf Pedersen; Karsten Kristiansen; Jun Wang
Journal:  Nature       Date:  2012-09-26       Impact factor: 49.962

3.  Delivery mode shapes the acquisition and structure of the initial microbiota across multiple body habitats in newborns.

Authors:  Maria G Dominguez-Bello; Elizabeth K Costello; Monica Contreras; Magda Magris; Glida Hidalgo; Noah Fierer; Rob Knight
Journal:  Proc Natl Acad Sci U S A       Date:  2010-06-21       Impact factor: 11.205

4.  Gut microbiota in human adults with type 2 diabetes differs from non-diabetic adults.

Authors:  Nadja Larsen; Finn K Vogensen; Frans W J van den Berg; Dennis Sandris Nielsen; Anne Sofie Andreasen; Bente K Pedersen; Waleed Abu Al-Soud; Søren J Sørensen; Lars H Hansen; Mogens Jakobsen
Journal:  PLoS One       Date:  2010-02-05       Impact factor: 3.240

5.  Gut microbiome metagenomics analysis suggests a functional model for the development of autoimmunity for type 1 diabetes.

Authors:  Christopher T Brown; Austin G Davis-Richardson; Adriana Giongo; Kelsey A Gano; David B Crabb; Nabanita Mukherjee; George Casella; Jennifer C Drew; Jorma Ilonen; Mikael Knip; Heikki Hyöty; Riitta Veijola; Tuula Simell; Olli Simell; Josef Neu; Clive H Wasserfall; Desmond Schatz; Mark A Atkinson; Eric W Triplett
Journal:  PLoS One       Date:  2011-10-17       Impact factor: 3.240

6.  Variability and diversity of nasopharyngeal microbiota in children: a metagenomic analysis.

Authors:  Debby Bogaert; Bart Keijser; Susan Huse; John Rossen; Reinier Veenhoven; Elske van Gils; Jacob Bruin; Roy Montijn; Marc Bonten; Elisabeth Sanders
Journal:  PLoS One       Date:  2011-02-28       Impact factor: 3.240

7.  Moving pictures of the human microbiome.

Authors:  J Gregory Caporaso; Christian L Lauber; Elizabeth K Costello; Donna Berg-Lyons; Antonio Gonzalez; Jesse Stombaugh; Dan Knights; Pawel Gajer; Jacques Ravel; Noah Fierer; Jeffrey I Gordon; Rob Knight
Journal:  Genome Biol       Date:  2011       Impact factor: 13.583

8.  Seasonal variation in human gut microbiome composition.

Authors:  Emily R Davenport; Orna Mizrahi-Man; Katelyn Michelini; Luis B Barreiro; Carole Ober; Yoav Gilad
Journal:  PLoS One       Date:  2014-03-11       Impact factor: 3.240

9.  Sex differences and hormonal effects on gut microbiota composition in mice.

Authors:  Elin Org; Margarete Mehrabian; Brian W Parks; Petia Shipkova; Xiaoqin Liu; Thomas A Drake; Aldons J Lusis
Journal:  Gut Microbes       Date:  2016-06-29

10.  Dietary Factors: Major Regulators of the Gut's Microbiota.

Authors:  Alexander R Moschen; Verena Wieser; Herbert Tilg
Journal:  Gut Liver       Date:  2012-08-07       Impact factor: 4.519

View more
  6 in total

1.  TAM 2.0: tool for MicroRNA set analysis.

Authors:  Jianwei Li; Xiaofen Han; Yanping Wan; Shan Zhang; Yingshu Zhao; Rui Fan; Qinghua Cui; Yuan Zhou
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

2.  MicrobiomeAnalyst: a web-based tool for comprehensive statistical, visual and meta-analysis of microbiome data.

Authors:  Achal Dhariwal; Jasmine Chong; Salam Habib; Irah L King; Luis B Agellon; Jianguo Xia
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

3.  Metformin Alters Gut Microbiota of Healthy Mice: Implication for Its Potential Role in Gut Microbiota Homeostasis.

Authors:  Wei Ma; Ji Chen; Yuhong Meng; Jichun Yang; Qinghua Cui; Yuan Zhou
Journal:  Front Microbiol       Date:  2018-06-22       Impact factor: 5.640

4.  "EviMass": A Literature Evidence-Based Miner for Human Microbial Associations.

Authors:  Divyanshu Srivastava; Krishanu D Baksi; Bhusan K Kuntal; Sharmila S Mande
Journal:  Front Genet       Date:  2019-09-13       Impact factor: 4.599

5.  Exploration of the Potential Relationship Between Gut Microbiota Remodeling Under the Influence of High-Protein Diet and Crohn's Disease.

Authors:  Yiming Zhao; Lulu Chen; Liyu Chen; Jing Huang; Shuijiao Chen; Zheng Yu
Journal:  Front Microbiol       Date:  2022-03-03       Impact factor: 5.640

6.  The cancer microbiome atlas: a pan-cancer comparative analysis to distinguish tissue-resident microbiota from contaminants.

Authors:  Anders B Dohlman; Diana Arguijo Mendoza; Shengli Ding; Michael Gao; Holly Dressman; Iliyan D Iliev; Steven M Lipkin; Xiling Shen
Journal:  Cell Host Microbe       Date:  2021-01-06       Impact factor: 21.023

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.