Cristina Rubio-Escudero1, Justo Valverde-Fernández2, Isabel Nepomuceno-Chamorro1, Beatriz Pontes-Balanza1, Yoedusvany Hernández-Mendoza3, Alfonso Rodríguez-Herrera2.
Abstract
OBJECTIVES: To analyze a set of hydrogen breath test data using data mining tools and to identify new patterns of H2 production.
Year: 2017 PMID: 28125620 PMCID: PMC5268498 DOI: 10.1371/journal.pone.0170385
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Data visualization by means of a heat map.
Fig 2. Data visualization of the first two principal components.
Fig 3. Results for Silhouette (4A), Davies-Bouldin (4B) and Dunn (4C) for k = 2 to k = 8.
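Of the three validity indices named in the Fig 3 caption, the Dunn index is the one least often available in standard libraries. A minimal sketch of one common variant (smallest between-cluster distance divided by largest within-cluster diameter) is given here. The function name `dunn_index` and the toy points are illustrative assumptions, not the study's code or data.

```python
from itertools import combinations, product
from math import dist  # Euclidean distance, Python 3.8+


def dunn_index(clusters):
    """Dunn index for a list of clusters, each a list of coordinate tuples.

    Assumes at least two clusters and at least one cluster with two or
    more points. Higher values indicate compact, well-separated clusters.
    """
    # Smallest distance between any two points in different clusters.
    min_separation = min(
        dist(p, q)
        for a, b in combinations(clusters, 2)
        for p, q in product(a, b)
    )
    # Largest distance between any two points in the same cluster.
    max_diameter = max(
        dist(p, q)
        for c in clusters
        for p, q in combinations(c, 2)
    )
    return min_separation / max_diameter


# Toy data (made up for illustration): two tight, well-separated clusters.
toy_clusters = [[(0, 0), (0, 1)], [(3, 0), (3, 1)]]
toy_dunn = dunn_index(toy_clusters)
print(toy_dunn)  # 3.0
```

In a comparison like the one in Fig 3, such a function would be evaluated once per candidate k, and the k with the highest value preferred.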
Selection of the most suitable number of clusters.
| Selection of k | Silhouette | Davies-Bouldin | Dunn |
|---|---|---|---|
| Best | k = 6 | k = 5 | k = 7 |
| Second best | k = 7 | k = 6 | k = 6 |
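The three indices do not agree on a single best k, so some rule is needed to reconcile them. The sketch below shows one possible rule, a simple points tally over the table's rankings. The scoring weights (2 for best, 1 for second best) and the function name `tally_votes` are assumptions made for illustration; the source does not state how the final k was chosen.

```python
# Rankings transcribed from the table: (best k, second-best k) per index.
rankings = {
    "Silhouette": (6, 7),
    "Davies-Bouldin": (5, 6),
    "Dunn": (7, 6),
}


def tally_votes(rankings, best_points=2, second_points=1):
    """Hypothetical scoring rule: sum points per candidate k across indices."""
    scores = {}
    for best, second in rankings.values():
        scores[best] = scores.get(best, 0) + best_points
        scores[second] = scores.get(second, 0) + second_points
    return scores


scores = tally_votes(rankings)
consensus_k = max(scores, key=scores.get)
print(scores)       # {6: 4, 7: 3, 5: 2}
print(consensus_k)  # 6
```

Under this rule k = 6 scores highest. It is also the only value ranked first or second by all three indices, which is consistent with the six clusters (A to F) reported in the tables of this record.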
Fig 4. Graphical representation of the boxplots and median values of the groups obtained.
Summary of ANOVA results among the time points within the clusters.
| Source | SS | Df | MS | F | Prob > F |
|---|---|---|---|---|---|
| Columns | 1.05e+06 | 5 | 210850.9 | 152.92 | 2.67e-159 |
| Error | 2.27e+07 | 16464 | 1378.8 | -- | -- |
| Total | 2.38e+07 | 16469 | -- | -- | -- |
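The F statistic in a one-way ANOVA table is the ratio of the between-group mean square to the error mean square, and each mean square is its sum of squares divided by its degrees of freedom. As a quick arithmetic check using only the values printed in the table (variable names are illustrative):

```python
# Values transcribed from the ANOVA table for time points within clusters.
ms_columns = 210850.9   # mean square, between groups ("Columns")
ms_error = 1378.8       # mean square, within groups ("Error")
ss_error = 2.27e7       # error sum of squares, as rounded in the table
df_error = 16464        # error degrees of freedom

# F = MS_columns / MS_error
f_stat = ms_columns / ms_error
print(round(f_stat, 2))  # 152.92

# MS_error = SS_error / df_error, up to the rounding of SS_error.
ms_error_check = ss_error / df_error
print(round(ms_error_check, 1))  # 1378.8
```

Both results agree with the tabulated F and MS values to the precision shown.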
Mean age and gender distribution of patients in each discovered cluster.
| Feature | Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | Cluster F |
|---|---|---|---|---|---|---|
| Mean age (years) | 4.99 | 5.05 | 7.53 | 5.24 | 5.95 | 5.09 |
| Males (%) | 50.23 | 70.59 | 54.53 | 49.79 | 52.63 | 52.08 |
| Females (%) | 49.77 | 29.41 | 45.47 | 50.21 | 47.37 | 47.92 |
Summary of ANOVA results among the age distribution within the clusters.
| Source | SS | Df | MS | F | Prob > F |
|---|---|---|---|---|---|
| Columns | 26453.8 | 5 | 5290.75 | 412.18 | 0 |
| Error | 89171.2 | 9138 | 9.76 | -- | -- |
| Total | 1156725 | 9143 | -- | -- | -- |