| Literature DB >> 30545401 |
Jessica D Forbes1,2,3,4,5, Chih-Yu Chen3, Natalie C Knox3, Ruth-Ann Marrie1,6, Hani El-Gabalawy1,7, Teresa de Kievit8, Michelle Alfa4, Charles N Bernstein1,2, Gary Van Domselaar9,10,11.
Abstract
BACKGROUND: Immune-mediated inflammatory disease (IMID) represents a substantial health concern. It is widely recognized that IMID patients are at a higher risk for developing secondary inflammation-related conditions. While an ambiguous etiology is common to all IMIDs, in recent years, considerable knowledge has emerged regarding the plausible role of the gut microbiome in IMIDs. This study used 16S rRNA gene amplicon sequencing to compare the gut microbiota of patients with Crohn's disease (CD; N = 20), ulcerative colitis (UC; N = 19), multiple sclerosis (MS; N = 19), and rheumatoid arthritis (RA; N = 21) versus healthy controls (HC; N = 23). Biological replicates were collected from participants within a 2-month interval. This study aimed to identify common (or unique) taxonomic biomarkers of IMIDs using both differential abundance testing and a machine learning approach.Entities:
Keywords: 16S rRNA gene amplicon sequencing; Bacteria; Gut microbiota; Immune-mediated inflammatory disease; Inflammatory bowel disease; Machine learning classifiers; Multiple sclerosis; Rheumatoid arthritis; Taxonomic biomarkers
Mesh:
Substances:
Year: 2018 PMID: 30545401 PMCID: PMC6292067 DOI: 10.1186/s40168-018-0603-4
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Patient data at time of sample procurement
| Disease | Average age, yearsa | |
|---|---|---|
| Crohn’s disease | 49.9 | 20 (14/5) |
| Ulcerative colitis | 51.2 | 19 (11/8) |
| Multiple sclerosis | 47.3 | 19 (14/4) |
| Rheumatoid arthritis | 62.3 | 21 (14/7) |
| Healthy controls | 32.4 | 23 (12/11) |
aTabulated metadata does not include information from patients whose metadata was not available
Fig. 1Principal coordinate analysis (PCoA) based on the overall structure of the stool microbiota in all samples. Each data point represents an individual sample. PCoA was calculated using Bray-Curtis distances with a multivariate t-distribution. Ellipses represent an 80% confidence level. Color/shape is indicative of cohort
Fig. 2Alpha-diversity assessed by richness (Chao1, ACE) and diversity (Shannon, Simpson). Median estimates compared across cohorts using the Kruskal-Wallis test and Dunn’s post hoc tests for multiple comparisons. Boxes represent the interquartile range, lines indicate medians, and whiskers indicate the range. p values represent the overall FDR-corrected p values. aCD/UC; bCD/MS; cCD/RA; dCD/HC; eUC/MS; fUC/RA; gUC/HC; hMS/RA; iMS/HC; jRA/HC
Fig. 3Abundance of Gram-positive phyla. Median estimates compared across cohorts using the Kruskal-Wallis test and Dunn’s post hoc tests for multiple comparisons. Boxes represent the interquartile range, lines indicate medians, diamond indicates the mean, and whiskers indicate the range. p values represent the overall FDR-corrected p values. aCD/UC; bCD/MS; cCD/RA; dCD/HC; eUC/MS; fUC/RA; gUC/HC; hMS/RA; iMS/HC; jRA/HC
Abundant† taxa in IMID microbiota relative to HC. Presence of solid color is indicative of significantly higher abundance (color) or lower abundance (gray) compared to HC
†Taxa with median abundance > 2. Taxa unable to be classified to the genus level were classified to the nearest higher taxonomic rank. Statistics were performed using the nonparametric Kruskal-Wallis test and Dunn’s post hoc tests for multiple comparisons, with FDR correction. Differences considered significant at p < 0.05.
Model performance of the binary classifiers shown in balanced accuracy (BA) and area under ROC curve (AUC) indices using Gram-positive phyla
Rows represent pairs of cohorts in alphabetical order with “diseased” showing the classifier for all disease cohorts (i.e., CD, UC, MS, and RA) versus HCs. Columns represent BA and AUC indices when using either OTUs or genera as features. Performance levels are indicated relatively by a white (low) and blue (high) color scale.
Fig. 4Feature importance from random forest classifiers for CD versus HCs in addition to feature abundance. Results from OTU and genus classifiers are shown in figures a and b, respectively. The corresponding genera of OTU features were labeled for the ease of interpretation. Each heatmap displays the abundance of the top ten features (rows) in samples (columns) according to the machine learning classifiers. The column bar colors represent the categories of the samples. Feature importance is shown on the right, and features are ordered in decreasing importance from top to bottom according to the mean decrease in Gini index