| Literature DB >> 34760797 |
Abstract
OBJECTIVES: The aims of this study were to explore the use of unsupervised machine learning in clustering the population based on reports of oral pain, psychological distress, and sleep problems and to compare demographic and socio-economic characteristics as well as levels of functional domains (work, social, and leisure) between clusters.Entities:
Keywords: Anxiety; depression; intrinsic sleep disorders; pain; unsupervised machine learning
Year: 2021 PMID: 34760797 PMCID: PMC8533034 DOI: 10.4103/jispcd.JISPCD_131_21
Source DB: PubMed Journal: J Int Soc Prev Community Dent ISSN: 2231-0762
Comparisons of demographic and socio-economic characteristics
| Whole sample ( | Cluster A ( | Cluster B ( | Cluster C ( | Cluster D ( | Cluster E ( | Statistics | |
|---|---|---|---|---|---|---|---|
| Sex, | |||||||
| Male | 840 (52.1) | 243 (61.2) | 252 (50.1) | 106 (45.5) | 151 (56.8) | 88 (41.1) | |
| Female | 773 (47.9) | 154 (38.8) | 251 (49.9) | 127 (54.5) | 115 (43.2) | 126 (58.9) | |
| Age, mean ± SD | 61.7 ± 11.3 | 65.4 ± 7.96a | 63.7 ± 9.9a | 59.5 ± 12.9b | 58.9 ± 12.1b,c | 56.2 ± 13.1c | |
| Race/ethnicity, | |||||||
| Mexican American | 155 (9.6) | 42 (10.6) | 49 (9.7) | 25 (10.7) | 22 (8.3) | 17 (7.9) | |
| Other Hispanic | 156 (9.7) | 46 (11.6) | 46 (9.1) | 21 (9) | 25 (9.4) | 18 (8.4) | |
| Non-Hispanic White | 652 (40.4) | 113 (28.5) | 202 (40.2) | 105 (45.1) | 118 (44.4) | 114 (53.3) | |
| Non-Hispanic Black | 389 (24.1) | 108 (27.2) | 129 (25.6) | 48 (20.6) | 65 (24.4) | 39 (18.2) | |
| Non-Hispanic Asian | 167 (10.4) | 66 (16.6) | 51 (10.1) | 22 (9.4) | 21 (7.9) | 7 (3.3) | |
| Other races | 94 (5.8) | 22 (5.5) | 26 (5.2) | 12 (5.2) | 15 (5.6) | 19 (8.9) | |
| Educational level, | |||||||
| Less than 9th grade | 134 (8.3) | 47 (11.8) | 37 (7.4) | 19 (8.2) | 15 (5.6) | 16 (7.5) | |
| 9th–11th grade | 177 (11) | 38 (9.6) | 50 (9.9) | 24 (10.3) | 31 (11.7) | 34 (15.9) | |
| High school graduate | 409 (25.4) | 101 (25.4) | 128 (25.4) | 50 (21.5) | 70 (26.3) | 60 (28) | |
| Associate degree | 544 (33.7) | 102 (25.7) | 185 (36.8) | 84 (36.1) | 92 (34.6) | 81 (37.9) | |
| College graduate or above | 349 (21.6) | 109 (27.5) | 103 (20.5) | 56 (24) | 58 (21.8) | 23 (10.7) | |
| Ratio of family income to poverty, mean ± SD | 2.5 ± 1.6 | 2.8 ± 1.6d | 2.6 ± 1.6d | 2.6 ± 1.6d | 2.4 ± 1.6d | 1.8 ± 1.3 |
SD = standard deviation. The same superscript letters indicate no statistical differences between clusters
Comparisons of oral pain, psychological distress, and sleep problems
| Whole sample | Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | Statistics | |
|---|---|---|---|---|---|---|---|
| Oral pain | |||||||
| Median (IQR) | 1 (0–2) | 0 (0-0) | 1 (0–2)a | 1 (0–1) | 1 (0–2)a | 2 (1–3) | |
| 10th–90th percentiles | 0–2 | 0–1 | 0–2 | 0–2 | 0–3 | 0–4 | |
| Depression (PHQ-9) | |||||||
| Median (IQR) | 2 (0–6) | 0 (0–1) | 2 (0–4)b | 2 (0.5–4)b | 4 (1–8) | 9 (5–13) | |
| 10th–90th percentiles | 0–10 | 0–4 | 0–7.6 | 0–8 | 0–13 | 3–17.5 | |
| Anxiety | |||||||
| Median (IQR) | 1 (0.5–3) | 0 (0–1) | 1 (1-1) | 3 (3–4) | 2 (1–3) | 4 (3–4) | |
| 10th–90th percentiles | 0–4 | 0–1 | 0–2 | 2–4 | 0–4 | 3–4 | |
| Sleep apnea | |||||||
| Median (IQR) | 0 (0–1) | 0 (0-0)c,d | 0 (0-0)d | 0 (0-0)c,d | 2 (2–3) | 0 (0-0)c | |
| 10th–90th percentiles | 0–2 | 0–1 | 0–1 | 0–1 | 1–3 | 0–1 | |
| Daytime sleepiness | |||||||
| Median (IQR) | 2 (1–3) | 0 (0–1) | 2 (2–3) | 1 (0.5–2) | 3 (2–3) | 3 (2–4) | |
| 10th–90th percentiles | 0–3 | 0–2 | 1–3 | 0–2 | 1–4 | 2–4 |
IQR = interquartile range. The same superscript letters indicate no statistical differences between clusters
Comparisons of functioning in work/social/leisure domains
| Whole sample | Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | Statistics | |
|---|---|---|---|---|---|---|---|
| Unable to work, | |||||||
| Yes | 497 (30.8) | 60 (15.1) | 130 (25.8) | 75 (32.2) | 96 (36.1) | 136 (63.6) | |
| No | 1116 (69.2) | 337 (84.9) | 373 (74.2) | 158 (67.8) | 170 (63.9) | 78 (36.4) | |
| Limited in amount of work, | |||||||
| Yes | 752 (46.6) | 109 (27.5) | 213 (42.3) | 111 (47.6) | 155 (58.3) | 164 (76.6) | |
| No | 861 (53.4) | 288 (72.5) | 290 (57.7) | 122 (52.4) | 111 (41.7) | 50 (23.4) | |
| Difficulty doing job due to oral problems | |||||||
| Median (IQR) | 0 (0-0) | 0 (0-0)a | 0 (0-0)b | 0 (0-0)a,b | 0 (0-0)b | 0 (0-1) | |
| 10th–90th percentiles | 0–1 | 0-0 | 0–1 | 0–1 | 0–1 | 0–2.5 | |
| Difficulty attending social event | |||||||
| Median (IQR) | 1 (1-1) | 1 (1-1) | 1 (1-1)c | 1 (1-1)c | 1 (1–2) | 2 (1–2) | |
| 10th–90th percentiles | 1–2 | 1-1 | 1–2 | 1–2 | 1–2 | 1–3 | |
| Difficulty with home leisure activity | |||||||
| Median (IQR) | 1 (1-1) | 1 (1-1)d | 1 (1-1)d,e | 1 (1-1)d,e | 1 (1-1)e | 1 (1–1.25) | |
| 10th–90th percentiles | 1-1 | 1-1 | 1-1 | 1-1 | 1–2 | 1–2 |
IQR = interquartile range. The same superscript letters indicate no statistical differences between clusters
Figure 1Visualization of the elbow method showing the optimal number of clusters (k=5)
Figure 2Comparisons of overall cluster profiles