| Literature DB >> 31591400 |
M L Birnbaum1,2,3, S K Ernala4, A F Rizvi5,6,7, E Arenare5,6,7, A R Van Meter5,6,7, M De Choudhury4, J M Kane5,6,7.
Abstract
Although most patients who experience a first-episode of psychosis achieve remission of positive psychotic symptoms, relapse is common. Existing relapse evaluation strategies are limited by their reliance on direct and timely contact with professionals, and accurate reporting of symptoms. A method by which to objectively identify early relapse warning signs could facilitate swift intervention. We collected 52,815 Facebook posts across 51 participants with recent onset psychosis (mean age = 23.96 years; 70.58% male) and applied anomaly detection to explore linguistic and behavioral changes associated with psychotic relapse. We built a one-class classification model that makes patient-specific personalized predictions on risk to relapse. Significant differences were identified in the words posted to Facebook in the month preceding a relapse hospitalization compared to periods of relative health, including increased usage of words belonging to the swear (p < 0.0001, Wilcoxon signed rank test), anger (p < 0.001), and death (p < 0.0001) categories, decreased usage of words belonging to work (p = 0.00579), friends (p < 0.0001), and health (p < 0.0001) categories, as well as a significantly increased use of first (p < 0.0001) and second-person (p < 0.001) pronouns. We additionally observed a significant increase in co-tagging (p < 0.001) and friending (p < 0.0001) behaviors in the month before a relapse hospitalization. Our classifier achieved a specificity of 0.71 in predicting relapse. Results indicate that social media activity captures objective linguistic and behavioral markers of psychotic relapse in young individuals with recent onset psychosis. Machine-learning models were capable of making personalized predictions of imminent relapse hospitalizations at the patient-specific level.Entities:
Year: 2019 PMID: 31591400 PMCID: PMC6779748 DOI: 10.1038/s41537-019-0085-9
Source DB: PubMed Journal: NPJ Schizophr ISSN: 2334-265X
Participant demographics
| Participant Demographics | Mean (years) | SD |
|---|---|---|
| Age | 23.96 | 4.59 |
| Sex |
|
|
|
| 36 | 70.58 |
|
| 15 | 29.41 |
| Race | ||
|
| 5 | 9.80 |
|
| 28 | 54.90 |
|
| 11 | 21.56 |
|
| 7 | 13.72 |
| Diagnosis | ||
|
| 34 | 66.66 |
|
| 13 | 25.49 |
|
| 4 | 7.84 |
Wilcoxon-signed rank test results comparing linguistic differences between a period of relapse and period of relative health per participant
| Feature | Mean during periods of relapse | Mean during periods of relative health | ||
|---|---|---|---|---|
| Co-tagging noon | 0.4074 | 0.1481 | 18.0 | <0.0001 |
| Posting morning | 21.8518 | 4.1481 | 74.5 | <0.01 |
| Co-tagging | 2.5185 | 0.7777 | 44.0 | <0.001 |
| Co-tagging midnight | 0.8148 | 0.2222 | 13.5 | <0.0001 |
| Co-tagging morning | 0.9630 | 0.2222 | 5.5 | <0.0001 |
| Friending | 0.2593 | 0.2222 | 2.5 | <0.0001 |
| Co-tagging night | 0.3333 | 0.1851 | 4.0 | <0.0001 |
| Posting midnight | 12.1481 | 4.4814 | 44.5 | <0.01 |
| Exclusive | 0.0063 | 0.0079 | 92.0 | <0.05 |
| Family | 0.0028 | 0.0021 | 18.0 | <0.0001 |
| Inclusive | 0.0253 | 0.0152 | 101.0 | <0.05 |
| Feel | 0.0059 | 0.0057 | 59.0 | <0.01 |
| Money | 0.0022 | 0.0020 | 18.0 | <0.0001 |
| Causation | 0.0028 | 0.0113 | 37.0 | <0.001 |
| Insight | 0.0073 | 0.0076 | 69.0 | <0.01 |
| Humans | 0.0044 | 0.0043 | 66.0 | <0.01 |
| Anger | 0.0049 | 0.0032 | 36.5 | <0.001 |
| Home | 0.0019 | 0.0045 | 28.5 | <0.001 |
| Sexual | 0.0032 | 0.0033 | 30.0 | <0.001 |
| Future tense | 0.0036 | 0.0016 | 10.0 | <0.0001 |
| Death | 0.0008 | 0.0004 | 16.5 | <0.0001 |
| Negation | 0.0075 | 0.0091 | 98.0 | <0.05 |
| Discrepancies | 0.0064 | 0.0051 | 41.0 | <0.001 |
| Religion | 0.0029 | 0.0025 | 29.0 | <0.001 |
| Verbs | 0.0635 | 0.0856 | 104.0 | <0.05 |
| Health | 0.0023 | 0.0092 | 11.0 | <0.0001 |
| First-person plural | 0.0016 | 0.0005 | 8.0 | <0.0001 |
| Bio | 0.0098 | 0.0180 | 78.0 | <0.01 |
| Tentativeness | 0.0070 | 0.0067 | 93.5 | <0.05 |
| Body | 0.0041 | 0.0045 | 54.5 | <0.01 |
| Inhibition | 0.0023 | 0.0012 | 12.0 | <0.0001 |
| Hear | 0.0066 | 0.0024 | 25.0 | <0.0001 |
| Second-person | 0.0081 | 0.0070 | 71.0 | <0.01 |
| Quantifier | 0.0052 | 0.0028 | 55.0 | <0.01 |
| Friends | 0.0042 | 0.0071 | 26.0 | <0.0001 |
| Achievement | 0.0061 | 0.0067 | 85.0 | <0.05 |
| Negative affect | 0.0029 | 0.0023 | 42.5 | <0.001 |
| Anxiety | 0.00078 | 0.0008 | 9.0 | <0.0001 |
| Certainty | 0.0040 | 0.0091 | 51.0 | <0.001 |
| Work | 0.0082 | 0.0164 | 74.5 | <0.01 |
| Indefinite pronoun | 0.0151 | 0.0235 | 91.0 | <0.05 |
| Sadness | 0.0016 | 0.0019 | 19.0 | <0.0001 |
| Swear | 0.0034 | 0.0017 | 16.0 | <0.0001 |
Descriptive statistics on Facebook timeline data comprising participant generated posts
| Data configuration | # Periods of relapse | # Periods of relative health | Total number of posts (periods of relapse) | Total number of posts (periods of relative health) | Mean, SD of number of posts (periods of relapse) | Mean, SD of number of posts (periods of relative health) |
|---|---|---|---|---|---|---|
| 1-Month periods of relapse and relative health | 49 | 719 | 1708 | 51,107 | 34.86, 136.25 | 71.08, 366.78 |
| 1-Month periods of relapse and 2-month periods of relative health | 49 | 421 | 1708 | 50,161 | 34.86, 136.25 | 119.15, 640.52 |
| 1-Month periods of relapse and 3-month periods of relative health | 49 | 312 | 1708 | 50,108 | 34.86, 136.25 | 160.60, 911.37 |
Class distributions and model performance on unseen test data for the one-class SVM models
| # Periods of relative health | # Periods of relapse | Sensitivity | Specificity | Positive predictive value | Negative predictive value | |
|---|---|---|---|---|---|---|
| 1-Month model | 719 | 49 | 0.47 | 0.65 | 0.66 | 0.46 |
| 2-Month model | 419 | 49 | 0.57 | 0.28 | 0.41 | 0.44 |
| 3-Month model | 312 | 49 | 0.90 | 0.04 | 0.37 | 0.4 |
| Ensemble model | 719 | 49 | 0.38 | 0.71 | 0.66 | 0.44 |
Fig. 1Flowchart of the relapse prediction machine-learning methodology
Fig. 2Distributions of number of participants, and number of posts for a 1-month, b 2-month, and c 3-month model
Fig. 3Descriptive statistics on Facebook activity-based features. Orange box plots correspond to the left-hand side y-axis and green box plots correspond to right-hand side y-axis