| Literature DB >> 34178718 |
Yuxia Liu1, Wenhui Li1, Hongxia Yang1, Xiaoying Zhang1, Wenxiu Wang1, Sitong Jia1, Beibei Xiang2, Yi Wang1,3, Lin Miao1,3, Han Zhang1,4, Lin Wang5, Yujing Wang5, Jixiang Song5, Yingjie Sun5, Lijuan Chai1,4, Xiaoxuan Tian1.
Abstract
Irritable bowel syndrome (IBS) is a chronic gastrointestinal disorder characterized by abdominal pain or discomfort. Previous studies have illustrated that the gut microbiota might play a critical role in IBS, but the conclusions of these studies, based on various methods, were almost impossible to compare, and reproducible microorganism signatures were still in question. To cope with this problem, previously published 16S rRNA gene sequencing data from 439 fecal samples, including 253 IBS samples and 186 control samples, were collected and processed with a uniform bioinformatic pipeline. Although we found no significant differences in community structures between IBS and healthy controls at the amplicon sequence variants (ASV) level, machine learning (ML) approaches enabled us to discriminate IBS from healthy controls at genus level. Linear discriminant analysis effect size (LEfSe) analysis was subsequently used to seek out 97 biomarkers across all studies. Then, we quantified the standardized mean difference (SMDs) for all significant genera identified by LEfSe and ML approaches. Pooled results showed that the SMDs of nine genera had statistical significance, in which the abundance of Lachnoclostridium, Dorea, Erysipelatoclostridium, Prevotella 9, and Clostridium sensu stricto 1 in IBS were higher, while the dominant abundance genera of healthy controls were Ruminococcaceae UCG-005, Holdemanella, Coprococcus 2, and Eubacterium coprostanoligenes group. In summary, based on six published studies, this study identified nine new microbiome biomarkers of IBS, which might be a basis for understanding the key gut microbes associated with IBS, and could be used as potential targets for microbiome-based diagnostics and therapeutics.Entities:
Keywords: 16S rRNA; biomarkers; gut microbiome; irritable bowel syndrome; machine learning algorithm
Year: 2021 PMID: 34178718 PMCID: PMC8231010 DOI: 10.3389/fcimb.2021.645951
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1The top frames represent the three main steps, and the rectangular frame shows the data process. DADA2, divisive amplicon denoising algorithm 2; ASV, amplicon sequence variants; LEfSe, linear discriminant analysis effect size; RF, random forest; IBS, irritable bowel syndrome.
Size and characteristics of the IBS 16S rRNA datasets included in this study.
| Dataset | Groups (n) | Age (average ± sd.) | BMI (average ± sd.) | Sex F/M | Country | Data storage (NCBI SRA) | Amplification region | Sequencing method |
|---|---|---|---|---|---|---|---|---|
| Zhuang_2018 ( | HC (13) | 30.54 ± 6.75 | 20.84 ± 1.46 20.61 | 8/5 21/9 | China | SRP150089 | V3–V4 | Illumina-MiSeq |
| IBS-D (30) | 32.1 ± 8.11 | ± 3.26 | ||||||
| Presti_2019 ( | HC (47), IBS (44) | 54, 48 | 23, 24 | 20/27, 30/14 | ||||
| IBS-D (16) | ||||||||
| IBS-C (18) | NA | NA | NA | Italy | SRP110018 | V1-V3 | 454 GS Junior | |
| IBS- A (10) | ||||||||
| Pozuelo_2015 ( | HC (66), IBS (113) | 37.6 ± 13, 42.6 ± 13 | 23.7 ± 3.4, 23.7 ± 4 | 40/26, 80/33 | ||||
| IBS-D (54) | 41.9 ± 13 | 25 ± 4.6 | 29/25 | |||||
| IBS-C (32) | 39.4 ± 10.8 | 23.3 ± 3.8 | 31/2 | Spain | SRP050404 | V4 | Illumina-MiSeq | |
| IBS-M (27) | 48.2 ± 16.4 | 23.9 ± 3.6 | 32/8 | |||||
| Zhu_2019 ( | HC (15) | 28.27 ± 1.56 | NA | 7/8 | China | SRP222428 | V4 | Illumina-HiSeq |
| IBS(15) | 47.67 ± 14.24 | 1/2 | ||||||
| Saulnier_2011 ( | HC (22), IBS (22) | 9.32 ± 1.52, 9.41 ± 1.04 | NA | 11/11, 8/14 | US | SRP002457 | 454 GS FLX | |
| IBS-D (1) | 9.38 ± 1.19 | 5/8 | ||||||
| IBS-C (13) | 10 | 0/1 | V1-V3 | |||||
| IBS-U (7) | 9.26 ± 1.89 | 3/4 | V3-v5 | |||||
| other (1) | 9 | 0/1 | ||||||
| Labus_2017 ( | HC (23), IBS (29) | 26.0 ± 6.48, 26.1 ± 5.72 | 14/9, 21/8 | |||||
| IBS-D (10) | ||||||||
| IBS-C (11) | ||||||||
| IBS-M (5) | NA | NA | NA | US | SRP099239 | V3-V5 | 454 GS FLX | |
| IBS-A (1) | ||||||||
| IBS-U (2) |
NA, data not available; IBS, irritable bowel syndrome; HC, healthy controls; IBS-D, diarrhea-predominant IBS; IBS-C, constipation-predominant IBS; IBS-M, mixed IBS; IBS-A, alternating IBS; IBS-U, unsubtyped IBS.
Figure 2Most of the studies showed microbiome changes, and potential signatures were found at genus-level in each dataset. (A) Left: Area under the ROC curve (AUC) is calculated by the AdaBoost algorithm. Right: Area under the ROC curve (AUC) is calculated by the random forest algorithm. X-axis starts at 0.5. (B) The number of genera biomarkers with p < 0.05 were identified by linear discriminant analysis effect size (LEfSe) analysis.
Figure 3The AdaBoost and random forest classifiers identified important feature genera in distinguishing IBS from healthy subjects. (A) The important features selected by the AdaBoost classifier. a1. The top 23 genus biomarkers were ranked in descending order of the most relevant features to the model. a2. 10-fold cross-validation score on the Y-axis and the number of features on the X-axis. (B) The important features contributed to random forest corresponding to AdaBoost. b1. 25 important genera sorted in descending order based on mean decrease accuracy (MDA). b2. 10-fold cross-validation error on the Y-axis and the number of features on the X-axis.
Figure 4Forest plot reporting effect sizes calculated using a meta-analysis of standardized mean differences and a fixed-effects model on nine genera abundances between carcinomas and controls. The SMD had a positive value confirming the higher number of genera in patients with IBS, and a negative value showing more abundance of genera in healthy controls. The length of the error bar depicts the 95% CIs. The fixed model indicates the overall effect sizes SMD value of all studies.