| Literature DB >> 35316132 |
Erik C Nook1, Thomas D Hull2,3, Matthew K Nock1, Leah H Somerville1.
Abstract
SignificanceUsing language to "distance" ourselves from distressing situations (i.e., by talking less about ourselves and the present moment) can help us manage emotions. Here, we translate this basic research to discover that such "linguistic distancing" is a replicable measure of mental health in a large set of therapy transcripts (N = 6,229). Additionally, clustering techniques showed that language alone could identify participants who differed on both symptom severity and treatment outcomes. These findings lay the foundation for 1) tools that can rapidly identify people in need of psychological services based on language alone and 2) linguistic interventions that can improve mental health.Entities:
Keywords: internalizing symptoms; language; linguistic distance; psychotherapy; treatment outcomes
Mesh:
Year: 2022 PMID: 35316132 PMCID: PMC9060508 DOI: 10.1073/pnas.2114737119
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 12.779
Fig. 1.Effect size plot for mixed-effects regressions depicting relations between internalizing symptoms, linguistic distance, and time in treatment within the exploratory (black) and validation (gray) datasets. All 95% CIs do not include zero, indicating significant associations. Ss = Subjects.
Fig. 2.Bayesian mediation models testing whether within-person variance in client linguistic distance mediated changes in internalizing symptoms across time in the (A) exploratory and (B) validation datasets. The 95% CR for the indirect effect included zero for the exploratory but not the validation dataset, providing mixed evidence for the proposed mediation model. Median regression estimates are reported from Bayesian regression models, with their corresponding 95% CRs.
Fig. 3.Results of finite mixture regressions clustering participants based on the linguistic distance in their texts over the course of therapy for the (A–D) exploratory and (E–G) validation datasets. (A and E) Four clusters were identified, which differed in starting and ending linguistic distance as well as the slope of changes in linguistic distance over the course of treatment. (B and F) Clusters differed significantly in baseline internalizing symptoms such that the clusters that started with higher linguistic distance (i.e., clusters 1 and 2) had lower symptoms at the start of treatment than those that started with lower linguistic distance (i.e., clusters 3 and 4). (C and G) Clusters differed significantly in final internalizing symptoms, with clusters 1 and 2 also reporting significantly fewer symptoms than clusters 3 and 4. (D and H) Estimated marginal means of changes in internalizing symptoms across clusters (accounting for baseline symptom levels). Cluster 2, which had both a high starting level of linguistic distance and the strongest increase over time, achieved the best treatment response, significantly stronger than clusters 3 and 4. ***P < 0.001, **P < 0.01, *P < 0.05.
Sample and platform description
| Full sample | Exploratory subsample | Validation subsample | |
|---|---|---|---|
| Gender, No. (%) | |||
| Female | 4,742 (77.4) | 2,857 (77.7) | 1,885 (77.0) |
| Male | 1,306 (21.3) | 766 (20.8) | 540 (22.1) |
| Transgender female | 11 (0.2) | 8 (0.2) | 3 (0.1) |
| Transgender male | 11 (0.2) | 5 (0.1) | 6 (0.2) |
| Gender queer | 27 (0.4) | 21 (0.6) | 6 (0.2) |
| Gender variant | 6 (0.1) | 3 (0.1) | 3 (0.1) |
| Other | 20 (0.3) | 16 (0.4) | 4 (0.2) |
| No response | 106 | 53 | 53 |
| Age, No. (%) | |||
| 18–25 | 1,031 (22.2) | 606 (21.8) | 425 (22.8) |
| 26–35 | 2,536 (54.6) | 1,517 (54.6) | 1,019 (54.8) |
| 36–49 | 871 (18.8) | 526 (18.9) | 345 (18.5) |
| 50+ | 203 (4.4) | 131 (4.7) | 72 (3.9) |
| No response | 1,588 | 949 | 639 |
| Race, No. (%) | |||
| Caucasian | 1,172 (60.4) | 698 (60.3) | 474 (60.5) |
| African American | 284 (14.6) | 179 (15.5) | 105 (13.4) |
| Asian | 140 (7.2) | 82 (7.1) | 58 (7.4) |
| Hispanic | 120 (6.2) | 66 (5.7) | 54 (6.9) |
| Native American | 5 (0.3) | 2 (0.2) | 3 (0.4) |
| Other | 195 (10.1) | 115 (9.9) | 80 (10.2) |
| Declined to identify | 24 (1.2) | 15 (1.3) | 9 (1.1) |
| No response | 4,289 | 2,572 | 1,717 |
| Education level, No. (%) | |||
| Less than high school | 28 (0.5) | 14 (0.4) | 14 (0.7) |
| High school | 808 (15.7) | 477 (15.3) | 331 (16.2) |
| Associate’s degree | 78 (1.5) | 40 (1.3) | 38 (1.9) |
| Some college no degree | 200 (3.9) | 126 (4.0) | 74 (3.6) |
| Bachelor’s degree | 3,683 (71.4) | 2,238 (71.9) | 1,445 (70.7) |
| Master’s degree | 260 (5.0) | 163 (5.2) | 97 (4.7) |
| Professional degree | 43 (0.8) | 23 (0.7) | 20 (1.0) |
| Doctoral degree | 56 (1.1) | 31 (1.0) | 25 (1.2) |
| No response | 1,073 | 617 | 456 |
| Symptom measures, mean (SD) | |||
| Baseline internalizing symptoms | 22.21 (9.90) | 21.99 (9.89) | 22.54 (9.92) |
| Final internalizing symptoms | 15.16 (9.87) | 14.95 (9.79) | 15.48 (10.00) |
| Baseline depression symptoms | 11.03 (5.82) | 10.93 (5.84) | 11.18 (5.80) |
| Final depression symptoms | 7.56 (5.58) | 7.44 (5.54) | 7.73 (5.62) |
| Baseline anxiety symptoms | 11.18 (5.04 | 11.06 (5.02) | 11.36 (5.06) |
| Final anxiety symptoms | 7.60 (4.90 | 7.50 (4.83) | 7.75 (4.99) |
| Therapy and text qualities | |||
| Text-only subscription, No. (%) | 6,108 (98.1) | 3,659 (98.1) | 2,449 (98.0) |
| No. of client messages | 759,706 | 455,379 | 304,327 |
| Length of client messages (words), mean (SD) | 80.81 (146.80) | 81.61 (147.02) | 79.61 (146.46) |
| No. of therapist messages | 461,911 | 273,208 | 188,703 |
| Length of therapist messages (words), mean (SD) | 82.69 (107.30) | 84.32 (108.00) | 80.33 (106.23) |
| Present-tense verbs per message, mean (SD) | 8.73 (15.10) | 8.81 (15.11) | 8.61 (15.09) |
| Past-tense verbs per message, mean (SD) | 3.87 (9.34) | 3.89 (9.34) | 3.83 (9.34) |
| Future-tense verbs per message, mean (SD) | 0.82 (1.80) | 0.83 (1.81) | 0.81 (1.79) |
| First-person singular pronouns per message, mean (SD) | 8.13 (14.50) | 8.17 (14.49) | 8.06 (14.51) |
| Other pronouns per message, mean (SD) | 3.75 (9.38) | 3.78 (9.38) | 3.71 (9.38) |
| Number of therapists, mean (SD) | 1.21 (0.51) | 1.21 (0.51) | 1.22 (0.52) |
| Number of symptom measures, mean (SD) | 3.45 (0.51) | 3.45 (0.51) | 3.45 (0.51) |
| Exactly 3 measures, No. (%) | 3,453 (55) | 2,058 (55) | 1,395 (56) |
| Exactly 4 measures, No. (%) | 2,741 (44) | 1,651 (44) | 1,090 (44) |
| Exactly 5 measures, No. (%) | 35 (1) | 20 (1) | 15 (1) |
| Days between start of therapy and final symptom measure, mean (SD) | 63.93 (11.90) | 63.94 (11.89) | 63.91 (11.92) |
Percentages ignore clients who did not respond to each demographic question.
*Participants selected from predefined age bins. Age is given in years.