| Literature DB >> 30066665 |
Jingcheng Du1, Yaoyun Zhang1, Jianhong Luo1,2, Yuxi Jia1,3, Qiang Wei1, Cui Tao1, Hua Xu4.
Abstract
BACKGROUND: Suicide has been one of the leading causes of deaths in the United States. One major cause of suicide is psychiatric stressors. The detection of psychiatric stressors in an at risk population will facilitate the early prevention of suicidal behaviors and suicide. In recent years, the widespread popularity and real-time information sharing flow of social media allow potential early intervention in a large-scale population. However, few automated approaches have been proposed to extract psychiatric stressors from Twitter. The goal of this study was to investigate techniques for recognizing suicide related psychiatric stressors from Twitter using deep learning based methods and transfer learning strategy which leverages an existing annotation dataset from clinical text.Entities:
Keywords: Deep learning; Mental health; Named entity recognition; Psychiatric stressors; Social media; Suicide
Mesh:
Year: 2018 PMID: 30066665 PMCID: PMC6069295 DOI: 10.1186/s12911-018-0632-8
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Pipeline of suicides related psychiatric stressor recognition from Twitter
Examples of suicide-related keywords as queries for tweets retrieval and stop keywords used for irrelevant tweets filtering
| Keywords | Stop Keywords |
|---|---|
| “suicide”, “suicidal”, “suic”, “self-harm”, “self-injury”, “self harm”, “self injury”, “hang myself”, “hung myself”, “kill myself”, “kills myself”, “killed myself”, “take my life”, “takes my life”, “want to die”, “wanted to die”, “wants to die”, “want death”, “wants death”, “wanted death”, “to be dead” | “bomb”, “suicide attack”, “suicide attacks”, “car attack”, “car attacks”, “suicide hotline”, “https://”, “http://” |
Examples of tweets labeled with Positive or Negative in terms of relatedness to suicide
| Positive | i don’t know why my dad always comments on how much i’m eating because it makes me want to die |
| i’m tired of losing friends and people close to me cause of being suicidal | |
| i want to kill my self | |
| i’m in pain, wanna put ten shots in my brain i’ve been tripped by some things i can’t change suicidal | |
| Negative | in the uk the biggest killer for men is suicide. Good job feminists ignoring their issues |
| a dear friend of mine committed suicide with a shotgun two years ago | |
| i don’t say this lightly - hemingway’s life ended by suicide. His life was actually a loss | |
| these r not ur problems dear!! these r ur x bf’s commitng suicide |
Examples of psychiatric stressors annotation from the suicide-related tweets. Bold refers to the annotated stressors
| honestly every time i think about me | |
| i just realized that i was completely | |
| well i guess it’s too bad that i’m just | |
Fig. 2The architecture of CNN based binary classifier for suicide related labels prediction
Fig. 3The architecture of the RNN framework for NER
Experimental performance of suicide-related tweets classification, using the CNN based algorithm and word embedding features of different dimensions. Bold number denotes the largest number in that row
| D = 50 | D = 100 | D = 200 | ||
|---|---|---|---|---|
| Precision | Positive | 0.78 | 0.76 |
|
| Negative | 0.69 |
| 0.65 | |
| Recall | Positive | 0.88 |
| 0.84 |
| Negative | 0.51 | 0.45 |
| |
| F-1 measure | Positive |
| 0.82 | 0.81 |
| Negative | 0.59 | 0.55 |
|
Performance comparison of the CNN model with other algorithms. SVM: Support Vector Machine; ET: Extra Trees; RF: Random Forest; LR: Logistics Regression; Bi-LSTM: Bi-directional Long Short-Term Memory. Bold number denotes the largest number in that row
| CNN | SVM | ET | RF | LR | Bi-LSTM | ||
|---|---|---|---|---|---|---|---|
| Precision | Positive |
| 0.7 | 0.69 | 0.69 | 0.7 | 0.73 |
| Negative | 0.69 |
| 0.58 | 0.5 | 0.67 | 0.65 | |
| Recall | Positive | 0.88 |
| 0.94 | 0.88 | 0.94 | 0.9 |
| Negative |
| 0.21 | 0.17 | 0.24 | 0.23 | 0.37 | |
| F-1 measure | Positive |
| 0.81 | 0.79 | 0.77 | 0.8 | 0.81 |
| Negative |
| 0.33 | 0.27 | 0.33 | 0.34 | 0.47 | |
| Accuracy |
| 0.703 | 0.689 | 0.665 | 0.697 | 0.72 | |
Experimental performance metric of stressors recognition on different types of word embedding for both exact and inexact match. CRF: Conditional Random Fields. Bold number denotes the largest number in that column
| Precision | Recall | F-1 measure | ||||
|---|---|---|---|---|---|---|
| exact | inexact | exact | inexact | exact | inexact | |
| GloVe Twitter 50 | 0.4868 | 0.6843 | 0.4765 |
| 0.4816 |
|
| GloVe Twitter 100 | 0.5822 | 0.7123 | 0.4906 | 0.6057 |
| 0.6546 |
| GloVe Twitter 200 | 0.5248 | 0.6808 |
| 0.6484 | 0.5108 | 0.6642 |
| CRF |
|
| 0.398 | 0.572 | 0.478 | 0.661 |
Fig. 4Impact of transfer learning on the size of training data measured on F-1 measure by exact match
Fig. 5Impact of transferring the parameters up to each layer of the RNN model for stressors recognition measured by exact match
Fig. 6Impact of transferring the parameters up to each layer of the RNN model for stressors recognition measured by inexact match
Fig. 7Exact match F-1 measure by each epoch from the model transferring up to character LSTM layer
Common types of prediction errors. Bold: annotated entity; Underline: predicted entity
| Boundary | • when the |
| Annotation error: missing annotation | • |
| Negation | • im confident in my |
| Entity in wrong context | • gonna kill myself after |
| False negative | • when will my |