| Literature DB >> 34789827 |
Benson Kung1, Maurice Chiang2, Gayan Perera3, Megan Pritchard3,4, Robert Stewart3,4.
Abstract
Current criteria for depression are imprecise and do not accurately characterize its distinct clinical presentations. As a result, its diagnosis lacks clinical utility in both treatment and research settings. Data-driven efforts to refine criteria have typically focused on a limited set of symptoms that do not reflect the disorder's heterogeneity. By contrast, clinicians often write about patients in depth, creating descriptions that may better characterize depression. However, clinical text is not commonly used to this end. Here we show that clinically relevant depressive subtypes can be derived from unstructured electronic health records. Five subtypes were identified amongst 18,314 patients with depression treated at a large mental healthcare provider by using unsupervised machine learning: severe-typical, psychotic, mild-typical, agitated, and anergic-apathetic. Subtypes were used to place patients in groups for validation; groups were found to be associated with future outcomes and characteristics that were consistent with the subtypes. These associations suggest that these categorizations are actionable due to their validity with respect to disease prognosis. Moreover, they were derived with automated techniques that might theoretically be widely implemented, allowing for future analyses in more varied populations and settings. Additional research, especially with respect to treatment response, may prove useful in further evaluation.Entities:
Mesh:
Year: 2021 PMID: 34789827 PMCID: PMC8599474 DOI: 10.1038/s41598-021-01954-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic information for the total sample as well as within patient groups.
| Full sample | Groups 3–5 | 1 | 2 | 3 | 4 | 5 | |||
|---|---|---|---|---|---|---|---|---|---|
| 18,314 | 12,115 | 3,140 | 3,059 | 4,844 | 4,291 | 2,980 | |||
| < 0.001 | < 0.001 | ||||||||
| Female | 11,377 (62.1) | 7825 (64.6) | 1849 (58.9) | 1703 (55.7) | 3441 (71.0) | 2500 (58.3) | 1884 (63.2) | ||
| Male | 6926 (37.8) | 4283 (35.4) | 1290 (41.1) | 1353 (44.2) | 1401 (28.9) | 1789 (41.7) | 1093 (36.7) | ||
| < 0.001 | 0.69 | ||||||||
| Asian | 915 (5.0) | 573 (4.7) | 151 (4.8) | 191 (6.2) | 227 (4.7) | 218 (5.1) | 128 (4.3) | ||
| Black | 2728 (14.9) | 1709 (14.1) | 448 (14.3) | 571 (18.7) | 670 (13.8) | 603 (14.1) | 436 (14.6) | ||
| Mixed | 400 (2.2) | 274 (2.3) | 62 (2.0) | 64 (2.1) | 111 (2.3) | 95 (2.2) | 68 (2.3) | ||
| Other | 1833 (10) | 1236 (10.2) | 305 (9.7) | 292 (9.5) | 506 (10.4) | 449 (10.5) | 281 (9.4) | ||
| White | 10,458 (57.1) | 6956 (57.4) | 1849 (58.9) | 1653 (54.0) | 2787 (57.5) | 2449 (57.1) | 1720 (57.7) | ||
| 1980 (10.8) | 1367 (11.3) | 325 (10.4) | 288 (9.4) | 543 (11.2) | 477 (11.1) | 347 (11.6) | |||
| < 0.001 | < 0.001 | ||||||||
| < 18 | 2352 (12.8) | 1750 (14.4) | 345 (11.0) | 257 (8.4) | 772 (15.9) | 664 (15.5) | 314 (10.5) | ||
| 18–34 | 5951 (32.5) | 3954 (32.6) | 1032 (32.9) | 965 (31.5) | 1580 (32.6) | 1289 (30.0) | 1085 (36.4) | ||
| 35–49 | 4513 (24.6) | 2923 (24.1) | 833 (26.5) | 757 (24.7) | 1175 (24.3) | 1033 (24.1) | 715 (24) | ||
| 50–64 | 2561 (14) | 1576 (13) | 480 (15.3) | 505 (16.5) | 620 (12.8) | 590 (13.7) | 366 (12.3) | ||
| 65& + | 2934 (16) | 1910 (15.8) | 449 (14.3) | 575 (18.8) | 696 (14.4) | 714 (16.6) | 500 (16.8) | ||
| 25.1 (10.2) | 25.1 (10.3) | 24.8 (10.2) | 25.4 (10.1) | 25.0 (10.0) | 25.2 (10.4) | 25.2 (10.2) | 0.16 | 0.05 |
Figure 1Probabilities of symptoms per patient group. The top two most likely symptoms per group were included.
Strength of the association, adjusted, between various outcomes with each symptom group versus the rest of the sample.
| Severe, typical | Psychotic | Mild, typical | Agitated | Anergic, apathetic | |
|---|---|---|---|---|---|
| Survival (HRa) time to death | < 0.001 | 1.05 (0.95, 1.16) .34 | < 0.001 | 0.97 (0.88, 1.06) 0.51 | 0.96 (0.87, 1.07) 0.46 |
| Crisis event (ORb) | 1.14 (0.98, 1.33) 0.08 | < 0.001 | 0.96 (0.86, 1.13) 0.82 | < 0.001 | |
| Emergency presentations (OR) | 1.16 (1.05, 1.29), 0.01 | < 0.001 | < 0.001 | 1.01 (0.91, 1.13) 0.83 | |
| Number of days active in SLAM (IRRc) | < 0.001 | < 0.001 | < 0.001 | < 0.001 | 1.00 (0.99, 1.01) 0.35 |
| Number of face-to-face contacts (IRR) | 1.00 (0.98, 1.02) 0.68 | < 0.001 | < 0.001 | < 0.001 |
*Adjusted for age, gender, ethnicity, and index of multiple deprivation score. aHazard ratio. bOdds ratio. cIncidence rate ratio.
Statistically significant results are bolded. [HR/ OR/ IRR (95% CI), P-value].
Prevalence of HoNOS problems compared by depression sub-group.
| Scale | Total 18,314 | Group 1, ‘severe, typical’ | Group 2, ‘psychotic’ | Group 3, ‘mild, typical’ | Group 4, ‘agitated’ | Group 5, ‘anergic, apathetic” | Total | Group 3, 4, and 5 |
|---|---|---|---|---|---|---|---|---|
| Agitation | 1397 (7.6) | 180 (5.7) | 282 (5.8) | 358 (8.3) | < 0.001 | < 0.001 | ||
| Self-Injury | 2624 (14.3) | 490 (16.0) | 561 (11.6) | 623 (14.5) | < 0.001 | < 0.001 | ||
| Drug Misuse | 1403 (7.7) | 261 (8.3) | 327 (6.8) | 329 (7.7) | .01 | .08 | ||
| Cognition | 1,328 (7.3) | 193 (6.1) | 289 (6.7) | 196 (6.6) | < 0.001 | .43 | ||
| Physical Illness | 3,846 (21.0) | 696 (22.2) | 890 (20.7) | 613 (20.6) | .06 | .61 | ||
| Hallucinations | 1178 (6.4) | 119 (3.8) | 179 (4.2) | 87 (2.9) | < 0.001 | < 0.001 | ||
| Depressed | 9,063 (49.5) | 1616 (51.5) | 2033 (47.4) | 1537 (51.6) | < 0.001 | < 0.001 | ||
| Relationship | 3,685 (20.1) | 691 (22.0) | 925 (19.1) | 822 (19.2) | < 0.001 | .03 | ||
| Daily Living | 3,130 (17.1) | 553 (17.6) | 726 (16.9) | 527 (17.7) | < 0.001 | < 0.001 | ||
| Living Conditions | 1,714 (9.4) | 355 (11.3) | 347 (8.1) | 258 (8.7) | < 0.001 | .53 | ||
| Occupational | 3,304 (18) | 619 (19.7) | 750 (17.5) | 531 (17.8) | < 0.001 | .01 | ||
| HoNOS Missing | 10,704 (58.4) | 1798 (57.3) | 244 (57) | 1751(58.8) | < 0.001 | < 0.001 |
aChi-squared test with 4 degrees of freedom. bChi-squared test with 2 degrees of freedom.
Groups with the highest likelihood of problems are presented in bold font and the lowest likelihood in italicized font.