| Literature DB >> 31436785 |
Michael P Ewbank1, Ronan Cummins1, Valentin Tablan1, Sarah Bateup1, Ana Catarino1, Alan J Martin1, Andrew D Blackwell1.
Abstract
Importance: Compared with the treatment of physical conditions, the quality of care of mental health disorders remains poor and the rate of improvement in treatment is slow, a primary reason being the lack of objective and systematic methods for measuring the delivery of psychotherapy. Objective: To use a deep learning model applied to a large-scale clinical data set of cognitive behavioral therapy (CBT) session transcripts to generate a quantifiable measure of treatment delivered and to determine the association between the quantity of each aspect of therapy delivered and clinical outcomes. Design, Setting, and Participants: All data were obtained from patients receiving internet-enabled CBT for the treatment of a mental health disorder between June 2012 and March 2018 in England. Cognitive behavioral therapy was delivered in a secure online therapy room via instant synchronous messaging. The initial sample comprised a total of 17 572 patients (90 934 therapy session transcripts). Patients self-referred or were referred by a primary health care worker directly to the service. Exposures: All patients received National Institute for Heath and Care Excellence-approved disorder-specific CBT treatment protocols delivered by a qualified CBT therapist. Main Outcomes and Measures: Clinical outcomes were measured in terms of reliable improvement in patient symptoms and treatment engagement. Reliable improvement was calculated based on 2 severity measures: Patient Health Questionnaire (PHQ-9) and Generalized Anxiety Disorder 7-item scale (GAD-7), corresponding to depressive and anxiety symptoms respectively, completed by the patient at initial assessment and before every therapy session (see eMethods in the Supplement for details).Entities:
Year: 2020 PMID: 31436785 PMCID: PMC6707006 DOI: 10.1001/jamapsychiatry.2019.2664
Source DB: PubMed Journal: JAMA Psychiatry ISSN: 2168-622X Impact factor: 21.596
Factors Associated With Reliable Improvement–All Sessions
| Feature | No. of Words, Mean (SD) | Sessions, % | Odds Ratio (95% CI) | ||
|---|---|---|---|---|---|
| Hello | 12 (22.7) | 99.6 | 0.92 (0.88-0.96) | −3.57 | <.001 |
| Mood check | 5.6 (7) | 97.9 | 0.99 (0.95-1.03) | −0.34 | .73 |
| Obtain update | 16.4 (14.5) | 59.0 | 1.03 (0.99-1.08) | 1.56 | .12 |
| Bridge | 12.2 (17.9) | 27.9 | 0.95 (0.91-0.98) | −2.76 | .006 |
| Risk check | 13.6 (31.5) | 21.0 | 0.85 (0.81-0.89) | −7.54 | <.001 |
| Set agenda | 47.2 (43.5) | 71.3 | 1.08 (1.02-1.14) | 3.02 | .002 |
| Review homework | 18.5 (19.2) | 44.5 | 1.04 (1.00-1.09) | 2.00 | .04 |
| Set goals | 15.9 (30.8) | 19.4 | 1.00 (0.96-1.05) | 0.40 | .69 |
| Formulation | 30.3 (63.9) | 18.2 | 0.96 (0.92-1.00) | −1.89 | .06 |
| Give feedback | 33.6 (40) | 52.1 | 1.05 (1.00-1.10) | 2.20 | .02 |
| Change methods | 477.1 (236) | 97.9 | 1.11 (1.06-1.17) | 4.37 | <.001 |
| Perceptions of change | 1.6 (4.8) | 5.8 | 1.11 (1.06-1.16) | 4.59 | <.001 |
| Set homework | 63.2 (48.9) | 69.1 | 0.96 (0.92-1.00) | −1.68 | .09 |
| Planning for future | 1.1 (6) | 2.4 | 1.12 (1.06-1.19) | 4.01 | <.001 |
| Elicit feedback | 15.3 (16.4) | 55.3 | 1.06 (1.02-1.11) | 2.82 | .004 |
| Summarize session | 0.25 (2.6) | 0.4 | 0.99 (0.95-1.03) | −0.52 | .60 |
| Arrange next session | 30 (21.3) | 82.5 | 1.00 (0.96-1.04) | 0.05 | .96 |
| Goodbye | 15.4 (10.4) | 90.7 | 0.95 (0.91-0.99) | −2.34 | .02 |
| Socratic questioning | 24.1 (31.1) | 47.4 | 1.02 (0.98-1.06) | 0.95 | .34 |
| Therapeutic thanks | 5.4 (13.3) | 13.3 | 0.97 (0.93-1.01) | −1.48 | .14 |
| Therapeutic empathy | 21 (31.3) | 38.0 | 0.84 (0.81-0.88) | −8.21 | <.001 |
| Therapeutic praise | 30.6 (39.4) | 52.6 | 1.21 (1.15-1.27) | 7.18 | <.001 |
| Collaboration | 41 (45.9) | 61.9 | 0.97 (0.93-1.02) | −1.09 | .27 |
| Other | 121.1 (81) | 96.0 | 0.88 (0.85-0.92) | −5.82 | <.001 |
| Variable, mean/prevalence (SD) | |||||
| Total sessions, No. | 6.2 (2.9) | NA | 1.22 (1.17-1.27) | 9.01 | <.001 |
| Session duration, min | 62.4 (7.5) | NA | 0.95 (0.91-0.99) | −2.34 | .02 |
| Start PHQ-9 | 14.7 (5.4) | NA | 0.95 (0.91-0.99) | −2.41 | .03 |
| Start GAD-7 | 8.3 (5.7) | NA | 1.29 (1.23-1.34) | 11.8 | <.001 |
| Patient age, y | 34.8 (12.0) | NA | 1.16 (1.12-1.22) | 7.47 | <.001 |
| Patient sex, No. (%) | |||||
| Male | 3493 (26.7) | NA | 0.96 (0.88-1.05) | −0.89 | .50 |
| Female | 9537 (72.9) | NA | |||
| Unknown/not stated | 43 (0.4) | NA | 0.92 (0.49-1.78) | −0.24 | .74 |
| Long-term condition, No. (%) | |||||
| No | 6056 (46.4) | NA | |||
| Yes | 3632 (27.8) | NA | 0.72 (0.66-0.80) | −6.55 | <.001 |
| Unknown/not stated | 3383 (25.8) | NA | 0.78 (0.71-0.86) | −5.08 | <.001 |
| Psychotropic medication, No. (%) | |||||
| Prescribed not taking | 1116 (8.6) | NA | |||
| Not prescribed | 5971 (45.7) | NA | 1.23 (1.06-1.41) | 2.84 | .004 |
| Prescribed taking | 5535 (42.3) | NA | 0.98 (0.84-1.13) | −0.27 | .78 |
| Unknown/not stated | 451 (3.4) | NA | 0.85 (0.67-1.08) | −1.28 | .20 |
Abbreviations: GAD-7, Generalized Anxiety Disorder 7-item scale; NA, not applicable; PHQ-9, Patient Health Questionnaire.
Output of logistic regression investigating association between reliable improvement and mean number of words per feature across treatment. Standardized odds ratios indicate the association of an increase of 1 SD of a feature with the odds of improvement. Percentage of sessions indicates the percentage of the total number of sessions that contained utterances categorized as that feature. Female sex, no long-term conditions, and prescribed not taking psychotropic medication were reference classes for the categorical variables.
First-Session Factors Associated With IAPT Engagement
| Feature | No. of Words, Mean (SD) | Sessions, % | Odds Ratio (95% CI) | z Value | |
|---|---|---|---|---|---|
| Hello | 14.4 (34.7) | 99.7 | 0.93 (0.88-0.99) | −2.45 | .01 |
| Mood check | 5.6 (10.5) | 48.1 | 0.98 (0.93-1.03) | −0.96 | .33 |
| Obtain update | 12.3 (19.6) | 46.1 | 0.96 (0.92-1.01) | −1.54 | .11 |
| Bridge | 9.7 (24.8) | 22.7 | 0.94 (0.90-0.98) | −2.63 | .008 |
| Risk check | 22.8 (54.7) | 30.4 | 0.98 (0.94-1.03) | −0.69 | .48 |
| Set agenda | 61.3 (68.7) | 74.9 | 0.99 (0.94-1.05) | −0.27 | .79 |
| Review homework | 15.2 (27.3) | 39.4 | 0.96 (0.91-1.01) | −1.47 | .14 |
| Set goals | 28.3 (57.9) | 35.9 | 1.03 (0.98-1.09) | 1.07 | .28 |
| Formulation | 53.2 (126) | 30.4 | 1.10 (1.04-1.17) | 3.33 | <.001 |
| Give feedback | 17.4 (57.2) | 49.3 | 1.00 (0.95-1.07) | 0.31 | .75 |
| Change methods | 426.5 (279.5) | 97.6 | 1.20 (1.12-1.27) | 5.56 | <.001 |
| Perceptions of change | 1.13 (7.4) | 3.6 | 0.97 (0.93-1.01) | −1.42 | .14 |
| Set homework | 75.8 (74.4) | 78.4 | 1.09 (1.03-1.16) | 2.97 | <.002 |
| Planning for future | 0.56 (8.5) | 1.0 | 0.93 (0.89-0.96) | −3.77 | <.001 |
| Elicit feedback | 17.4 (25) | 60.9 | 1.09 (1.03-1.16) | 2.97 | .002 |
| Summarize session | 0.24 (4.67) | 0.3 | 1.00 (0.94-1.09) | 0.01 | .98 |
| Arrange next session | 33.1 (32.6) | 84.0 | 1.17 (1.10-1.24) | 5.30 | <.001 |
| Goodbye | 16.2 (15.6) | 90.9 | 1.02 (0.97-1.08) | 0.83 | .40 |
| Socratic questioning | 20 (39.5) | 40.8 | 0.94 (0.89-0.99) | −2.28 | .02 |
| Therapeutic thanks | 8.5 (24.3) | 19.4 | 1.13 (1.06-1.20) | 3.73 | <.001 |
| Therapeutic empathy | 25.5 (51.1) | 44.0 | 0.93 (0.88-0.97) | −3.20 | .001 |
| Therapeutic praise | 23.3 (47) | 41.8 | 1.05 (0.98-1.11) | 1.47 | .15 |
| Collaboration | 45.2 (72.8) | 60.4 | 1.01 (0.94-1.07) | 0.26 | .79 |
| Other | 141.1 (117.4) | 96.9 | 0.88 (0.84-0.92) | −5.12 | <.001 |
| Variable, mean/prevalence (SD) | |||||
| Session duration, min | 63.1 (9.9) | NA | 1.26 (1.20-1.33) | 8.89 | <.001 |
| Start PHQ-9 | 14.9 (5.5) | NA | 0.87 (0.82-0.92) | −4.81 | <.001 |
| Start GAD-7 | 8.8 (5.9) | NA | 1.00 (0.95-1.06) | −0.01 | .99 |
| Patient age | 34.8 (12.0) | NA | 1.07 (1.02-1.13) | 2.64 | .008 |
| Patient sex, % | |||||
| Male | 3967 (26.7) | NA | 1.02 (0.91-1.01) | 0.28 | .78 |
| Female | 10 882 (73.0) | NA | NA | NA | NA |
| Unknown/not stated | 50 (0.3) | NA | 0.95 (0.45-2.34) | −0.11 | .91 |
| Long-term condition, % | |||||
| No | 6860 (46.0) | NA | |||
| Yes | 4129 (27.7) | NA | 1.02 (0.90-1.15) | 0.24 | .81 |
| Unknown/not stated | 3910 (26.3) | NA | 0.90 (0.80-1.02) | −1.68 | .09 |
| Psychotropic medication, % | |||||
| Prescribed not taking | 1304 (8.8) | ||||
| Not prescribed | 6755 (45.3) | NA | 1.21 (1.02-1.44) | 2.19 | .03 |
| Prescribed taking | 6320 (42.4) | NA | 1.20 (1.01-1.47) | 2.06 | .04 |
| Unknown/not stated | 520 (3.5) | NA | 1.10 (1.01-1.43) | 0.64 | .52 |
Abbreviations: GAD-7, Generalized Anxiety Disorder 7-item scale; IAPT, Improving Access to Psychological Therapies; NA, not applicable; PHQ-9, Patient Health Questionnaire.
Output of logistic regression investigating association between patient engagement and number of words per feature in the first treatment session. Standardized odds ratios indicate the effect of an increase of 1 SD of a feature on the odds of engagement. Percentage of sessions indicates the percentage of the total number of first treatment sessions that contained utterances categorized as that feature. Female sex, no long-term conditions, and prescribed not taking psychotropic medication were reference classes for the categorical variables.
Figure 1. Factors Associated With Reliable Improvement–All Sessions
Forest plot of logistic regression model investigating association between mean number of words per feature across treatment and reliable improvement. Standardized odds ratios and 95% confidence intervals are shown (and listed in the right column). Adjusted for total number of sessions, symptom severity, patient sex, age, medication status, presence of long-term condition, and session duration.
aP < .001.
bP < .01.
cP < .05.
Figure 2. First-Session Factors Associated With IAPT Engagement
Forest plot of logistic regression model investigating association between mean number of words per feature in the first treatment session and patient engagement. Standardized odds ratios and 95% confidence intervals are shown (and listed in the right column). Adjusted for symptom severity, patient sex, age, medication status, presence of long-term condition, and session duration.
aP < .001.
bP < .01.
cP < .05.