| Literature DB >> 28198673 |
Chao Liang1, Han-Chi Tseng2, Hui-Mei Chen1, Wei-Chi Wang3, Chih-Min Chiu3, Jen-Yun Chang3, Kuan-Yi Lu3, Shun-Long Weng4,5,6, Tzu-Hao Chang7, Chao-Hsiang Chang8, Chen-Tsung Weng3, Hwei-Ming Wang3, Hsien-Da Huang9,10,11.
Abstract
BACKGROUND: Gastrointestinal microbiota, particularly gut microbiota, is associated with human health. The biodiversity of gut microbiota is affected by ethnicities and environmental factors such as dietary habits or medicine intake, and three enterotypes of the human gut microbiome were announced in 2011. These enterotypes are not significantly correlated with gender, age, or body weight but are influenced by long-term dietary habits. However, to date, only two enterotypes (predominantly consisting of Bacteroides and Prevotella) have shown these characteristics in previous research; the third enterotype remains ambiguous. Understanding the enterotypes can improve the knowledge of the relationship between microbiota and human health.Entities:
Keywords: 16S rDNA; Enterotype; Gut microbiome; Next-generation sequencing; Predictive model
Mesh:
Substances:
Year: 2017 PMID: 28198673 PMCID: PMC5310273 DOI: 10.1186/s12864-016-3261-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of optimal cluster numbers
| HCac | PAMa | PAMb | Kmeansb | |
|---|---|---|---|---|
| Weighted UniFrac | 3 (0.339) | 3 (0.350) | 3 (0.353) | 3 (0.354) |
| Altgower | 2 (0.280) | 2 (0.161) | 2 (0.305) | 2 (0.309) |
| Bray | 2 (0.302) | 2 (0.297) | 2 (0.309) | 2 (0.309) |
| Jaccard | 2 (0.221) | 2 (0.216) | 2 (0.225) | 2 (0.225) |
| Kulczynski | 2 (0.302) | 2 (0.297) | 2 (0.309) | 2 (0.309) |
| Maximum | 2 (0.459) | 2 (0.451) | 2 (0.464) | 2 (0.468) |
| Pearson | 8 (0.571) | 2 (0.614) | 2 (0.622) | 2 (0.622) |
| Horn | 2 (0.494) | 2 (0.596) | 2 (0.600) | 2 (0.600) |
| Euclidean | 2 (0.267) | 2 (0.418) | 2 (0.419) | 2 (0.418) |
The first number of each cell is the optimal cluster number and the second number is the Silhouette score. The optimal cluster number corresponding to the maximum score was picked from a limited series of cluster numbers (k ≤ 10)
aBeta matrix was applied as the input
bCoordinate (PC1–PC3) of each sample was applied as the input. Coordinate was generated by the classical multidimensional scaling method from the beta matrix
cHC, Hierarchical clustering
Fig. 1PCoA results based on different algorithms: a Euclidean and b Weighted UniFrac
Fig. 2Hierarchical clustering results of the stool samples. a Phylogenetic tree of the stool samples; b enterotype of stool samples based on different algorithms (1st row: hc_beta, 2nd row: pam_beta, 3rd row: pam_coor, 4th row: kmeans_coor); c proportion of bacteria in the stool samples at the genus level (blue: Escherichia, green: Bacteroides, red: Prevotella); and d proportion of bacteria in the stool samples at the family level (blue: Enterobacteriaceae, green: Bacteroidaceae, red: Prevotellaceae)
Fig. 3Bacterial community of three enterotypes: a bacterial proportion in the three enterotypes and b major bacteria in the three enterotypes
Fig. 4Linear regression results of abundance of Bacteroides and Prevotealla (Multiple R-squares: 0.658, adjusted R-squared: 0.654, and p-value < 2.2e-16)
Fig. 5Distribution of the Shannon diversity index of three enterotypes and the correlation of richness and Shannon diversity index
Fig. 6Composition of facultative, anaerobic, and aerobic bacteria of three enterotypes
Association between enterotypes and various other factors from the questionnaire
| Enterotype | Association | |||||
|---|---|---|---|---|---|---|
| Type 1 | Type 2 | Type 3 | Contrast |
|
| |
| Gender (global | Type 1 vs Type 2 | 0.0336 | 4.52 | |||
| male | 17 (56.7%) | 10 (27.2%) | 19 (47.5%) | Type 1 vs Type 3 | 0.6046 | 0.27 |
| female | 13 (43.3%) | 26 (72.2%) | 21 (52.5%) | Type 2 vs Type 3 | 0.1258 | 2.34 |
| Type 2 vs (Type 1 + Type 3)a | 0.0200 | 5.41 | ||||
| Protein (global | Type 1 vs Type 2 | 0.0290 | 4.77 | |||
| non-red-meat | 8 (32.0%) | 15 (68.2%) | 18 (66.7%) | Type 1 vs Type 3 | 0.0264 | 4.93 |
| red-meat | 17 (68.0%) | 7 (31.8%) | 9 (33.3%) | Type 2 vs Type 3 | 1 | 0.01 |
| Type 1 vs. (Type 2 + Type 3)a | 0.0081 | 7.00 | ||||
| Shape (global | Type 1 vs Type 2 | 0.6133 | 0.98 | |||
| Hard | 10 (41.7%) | 11 (55.0%) | 3 (11.1%) | Type 1 vs Type 3 | 0.0356 | 6.67 |
| Mid | 8 (33.3%) | 6 (30.0%) | 11 (40.7%) | Type 2 vs Type 3 | 0.0320 | 11.51 |
| Watery | 6 (25.0%) | 3 (15.0%) | 13 (48.1%) | (Type 1 + Type 2) vs Type 3a | 0.0038 | 11.15 |
| Stool (global | Type 1 vs Type 2 | 0.0384 | 6.52 | |||
| D1+ | 17 (58.6%) | 30 (83.3%) | 30 (81.1%) | Type 1 vs Type 3 | 0.0711 | 6.29 |
| D05 | 11 (37.9%) | 4 (11.1%) | 5 (13.5%) | Type 2 vs Type 3 | 0.9525 | 0.10 |
| Constipation | 1 (3.4%) | 2 (5.6%) | 2 (5.4%) | Type 1 vs (Type 2 + Type 3)a | 0.0133 | 8.64 |
aCombining two types based on no significant difference between groups and closed trend
Fig. 7Decision tree model of the three enterotypes. This model provided five rules. Each rule could classify one of three enterotypes. For instance, if the bacterial abundance of Prevotellaceae was over 0.26 in one sample, the sample was considered enterotype 3 (1:37 means 37 samples were successfully classified and one sample failed)
Performance of classification model in training sets and independent testing sets
| T1 | T2 | T3 | Accuracy | ||
|---|---|---|---|---|---|
| Sensitivity | Train | 93.3% (28/30) | 97.2% (35/36) | 100% (40/40) | 97.2% (103/106) |
| Test | 93.8% (15/16) | 79.3% (23/29) | 83.3% (25/30) | 84.0% (63/75) | |
| Group-specifica specificity | Train | 98.7% (75/76) | 100% (70/70) | 97.0% (64/66) | |
| Test | 88.1% (52/59) | 89.1% (41/46) | 100% (45/45) | ||
| Group-specifica precision | Train | 96.6% (28/29) | 100% (35/35) | 95.2% (40/42) | |
| Test | 68.2% (15/22) | 82.1% (23/28) | 100% (25/25) |
aGroup-specific specificity, e.g., T1/non-T1
Significant genus lists categorized by enterotype-related metadata
| Features | Family | Genus |
| Mean separation |
|---|---|---|---|---|
| Enterotype |
|
| <0.0001 | T2 > T1 = T3 |
|
|
| <0.0001 | T3 > T1 > T2 | |
|
|
| <0.0001 | T1 > T2 = T3 | |
|
|
| 0.0006 | T1 > T2 = T3 | |
|
|
| 0.0060 | T1 > T3 | |
|
|
| 0.0394 | T1 > T3 | |
|
|
| 0.0198 | T1 > T2 | |
|
|
| 0.0017 | T1 > T2 = T3 | |
| Stool |
|
| 0.0423 | D1 + < D05 |
|
|
| 0.0012§ | D1 + < D05 | |
| Shape |
|
| 0.0106 | Hard > Watery |
|
|
| 0.0056 | Hard > Watery | |
| Protein |
|
| 0.0170 | Red > non-red |
|
|
| 0.0268 | Red < non-red | |
|
|
| 0.0042 | Red > non-red |
§p value was calculated by ANOVA (not significant in the Kolmogorov–Smirnov test)
aGenus within the same family
bGenus associated with multiple metadata