| Literature DB >> 28246066 |
Danielle Mowery1, Hilary Smith2, Tyler Cheney2, Greg Stoddard3, Glen Coppersmith4,5, Craig Bryan2, Mike Conway1.
Abstract
BACKGROUND: With a lifetime prevalence of 16.2%, major depressive disorder is the fifth biggest contributor to the disease burden in the United States.Entities:
Keywords: Twitter messaging; data annotation; machine learning; major depressive disorder; natural language processing; social media
Mesh:
Year: 2017 PMID: 28246066 PMCID: PMC5350450 DOI: 10.2196/jmir.6895
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Major depressive disorder scheme (parent categories).
Linguistic Inquiry and Word Count (LIWC) concepts and associated keywordsa.
| Depression categories | Linguistic Inquiry and Word Count | |
| Depressed mood | pain | |
| Weight change or change in appetite | appetite | |
| Disturbed sleep | insomnia | |
| Psychomotor agitation or retardation | restless, jitter*, groggy, dazed | |
| Fatigue or loss of energy | tired | |
| Feelings of worthlessness or excessive inappropriate guilt | guilt*, burden | |
| Diminished ability to think or concentrate, indecisiveness | concentrat*, focus*, indeci* | |
| Recurrent thoughts of death, suicidal ideation | suicid*, kill | |
| Problems with primary support group | death, die*, funeral, cremat*, | |
| divorc, abus*, neglect* | ||
| Occupational problems | fired, unemploy* | |
| Housing problems | homeless* | |
| LIWC “sad” keyword | abandon*, ache*, aching, agoni*, | |
| alone, broke*, cried, cries, crushed, | ||
| cry, damag*, defeat*, depress*, | ||
| depriv*, despair*, devastat*, | ||
| disadvantage*, disappoint*, | ||
| discourag*, dishearten*, disillusion*, | ||
| dissatisf*, doom*, dull*, | ||
| empt*, fail*, fatigu*, flunk*, | ||
| gloom*, grave*, grief, griev*, | ||
| grim*, heartbr*, helpless*, homesick*, | ||
| hopeless*, hurt*, inadequa*, inferior*, | ||
| isolat*, lame*, lone*, longing*, | ||
| lose, loser*, loses, losing, | ||
| loss*, lost, melanchol*, miser*, | ||
| mourn*, neglect*, overwhelm* | ||
| pathetic*, pessimis*, piti*, pity* , | ||
| regret*, reject*, remorse*, resign*, | ||
| ruin*, sad, sobbed, sobbing, sobs, | ||
| solemn*, sorrow*, suffer*, tears*, | ||
| traged*, tragic* , unhapp*, | ||
| unimportant, unsuccessful*, useless*, | ||
| weep*, wept, whine*, whining, | ||
| woe*, worthless*, yearn* |
aDepressive symptom anhedonia and psychosocial stressors such as problems with expected life course with respect to self, problems related to the social environment, educational problems, economic problems, problems with access to health care, problems related to the legal system and crime, other psychosocial and environmental problems, weather, and media do not have associated keywords.
Figure 2SAD corpus annotation in phases. A#=Annotator eg, A1=Annotator 1. SAD: Depressive Symptom and Psychosocial Stressors Acquired Depression.
Comparison of characteristics by corpus.
| Characteristic | SAD | CLPsych |
| Query-level | tweet-level | user-level |
| Number of unique tweets | 9300 | 1019 |
| Number of unique words | 19,822 | 3258 |
| Average number of words per tweet (SD) | 14.56 (7.40) | 15.44 (8.07) |
For the SAD corpus, interannotator agreement (F scores) between annotators according to depressive symptoms and psychosocial stressors. — means category not observed by annotators.
| Depression categories | A1/A2, (%) | A2/A3, | A1/A3, (%) |
| Overall | 81 | 78 | 76 |
| No evidence of depression | 89 | 86 | 87 |
| Symptoms | |||
| Depressed mood | 38 | 60 | 48 |
| Anhedonia | – | – | – |
| Weight change or change in appetite | – | 0 | 100 |
| Disturbed sleep | 100 | 50 | 0 |
| Psychomotor agitation or retardation | – | – | – |
| Fatigue or loss of energy | 74 | 78 | 94 |
| Feelings of worthlessness or excessive inappropriate guilt | 0 | 29 | 68 |
| Diminished ability to think or concentrate, indecisiveness | 100 | – | 0 |
| Recurrent thoughts of death, suicidal ideation | 100 | 100 | 75 |
| Stressors | |||
| Problems with expected life course with respect to self | 0 | 0 | 0 |
| Problems with primary support group | 0 | 40 | 36 |
| Problems related to the social environment | 23 | 42 | 58 |
| Educational problems | – | 50 | 0 |
| Occupational problems | – | 0 | – |
| Housing problems | – | 0 | – |
| Economic problems | – | 67 | 50 |
| Problems with access to health care | – | 0 | – |
| Problems related to the legal system and crime | – | 0 | 0 |
| Other psychosocial and environmental problems | – | 0 | 0 |
| Weather | – | 100 | – |
| Media | 50 | 0 | 67 |
Figure 3Distribution of tweet hits by precision with LIWC Keyword counts for each corpus. Black bars=SAD corpus; Gray bars= CLPsych corpus. SAD: Depressive Symptom and Psychosocial Stressors Acquired Depression.
Figure 4Prevalence of categories by corpus. Light purple: depressive symptoms, medium purple: psychosocial stressors, dark purple: no evidence of clinical depression.
Figure 5Most informative terms classified with associated depressive symptoms and psychosocial stressors. Shared terms occur at the intersect of the circled lists.
Figure 6SAD heat map of tweet-level, depressive symptom, and psychosocial stressor cooccurrences. Darker means larger measure of Cohen effect size; lighter means smaller measure of Cohen effect size. The number that indexes the category on the y-axis also corresponds to the category for the x-axis. For example, if “Depressed mood=1” appears on the y-axis, then “1” on the x-axis corresponds to the category “Depressed mood.” SAD: Depressive Symptom and Psychosocial Stressors Acquired Depression.