| Literature DB >> 34255687 |
Jiacheng Li1, Shaowu Zhang1, Yijia Zhang1, Hongfei Lin1, Jian Wang1.
Abstract
BACKGROUND: Suicide has become the fifth leading cause of death worldwide. With development of the internet, social media has become an imperative source for studying psychological illnesses such as depression and suicide. Many methods have been proposed for suicide risk assessment. However, most of the existing methods cannot grasp the key information of the text. To solve this problem, we propose an efficient method to extract the core information from social media posts for suicide risk assessment.Entities:
Keywords: attention mechanism; infodemiology; neural networks; social media; suicide risk assessment
Year: 2021 PMID: 34255687 PMCID: PMC8304127 DOI: 10.2196/28227
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Architecture of the multifeature fusion recurrent attention network. LSTM: long short-term memory; MLP: multilayer perceptron.
Hyperparameter settings.
| Hyperparameters | Optimal value |
| Word embedding dimension | 300 |
| BiLSTMa hidden units | 200 |
| Learning rate | 0.2 |
| Dropout rate | 0.5 |
| L2 regularization weight | 10–5 |
aBiLSTM: bidirectional long short-term memory.
Example posts from the SuicideWatch subreddit.
| Post | Risk | Existence | Urgency |
| A nihilist teetering on edge. Things were good before I came into being | a | Not exist | Not urgent |
| Has anyone attempted suicide and failed and then felt guilty for being incompetent? | b | Exist | Not urgent |
| Just sitting on a bench, waiting and thinking. I don’t want to, but it feels like the best option. | c | Exist | Urgent |
| Tell me how to commit suicide painlessly. | d | Exist | Urgent |
Experimental results of classification models.
| Models | Risk-F1 | Existence- F1 | Urgency -F1 |
| SVMa | 0.296 | 0.793 | 0.716 |
| CNNb | 0.336 | 0.834 | 0.742 |
| LSTMc | 0.397 | 0.862 | 0.766 |
| BiLSTMd | 0.404 | 0.863 | 0.774 |
| BiLSTM+CNN | 0.423 | 0.872 | 0.789 |
| BiLSTM+Attention (proposed model) | 0.448 | 0.887 | 0.796 |
aSVM: support vector machine.
bCNN: convolutional neural network.
cLSTM: long short-term memory.
dBiLSTM: bidirectional long short-term memory.
Experimental results of different features for support vector machine models.
| Input | Risk-F1 | Existence- F1 | Urgency -F1 |
| TF-IDFa | 0.257 | 0.783 | 0.691 |
| Bigram+TF-IDF | 0.271 | 0.802 | 0.712 |
| Trigram+TF-IDF | 0.276 | 0.798 | 0.709 |
| Lexicon+TF-IDF | 0.282 | 0.826 | 0.721 |
| Symbolic+TF-IDF | 0.254 | 0.784 | 0.684 |
| n-gram+lexicon+symbolic+TF-IDF | 0.284 | 0.835 | 0.724 |
aTF-IDF: term frequency-inverse document frequency.
Experimental results of deep learning–based models.
| Models and input | Risk-F1 | Existence- F1 | Urgency -F1 | |||
| BERTa | 0.467 | 0.889 | 0.861 | |||
|
|
|
|
| |||
|
| Word2vec | 0.404 | 0.863 | 0.774 | ||
|
| Glove | 0.412 | 0.861 | 0.793 | ||
|
| BERT | 0.474 | 0.914 | 0.857 | ||
|
| BERT+Features | 0.481 | 0.923 | 0.863 | ||
|
|
|
|
| |||
|
| Word2ve | 0.448 | 0.887 | 0.796 | ||
|
| Glove | 0.456 | 0.891 | 0.787 | ||
|
| BERT | 0.507 | 0.915 | 0.863 | ||
|
| BERT+Features | 0.514 | 0.931 | 0.876 | ||
aBERT: bidirectional encoder representations from transformers.
bBiLSTM: bidirectional long short-term memory.
Experimental results of existing methods.
| Models | Risk- | Existence- | Urgency- |
| Mohammadi et al [ | 0.481 | 0.922 | 0.776 |
| Matero et al [ | 0.459 | 0.842 | 0.839 |
| Bitew et al [ | 0.445 | 0.852 | 0.789 |
| Iserman et al [ | 0.402 | 0.902 | 0.844 |
| Allen et al [ | 0.373 | 0.876 | 0.773 |
| González Hevia et al [ | 0.312 | 0.897 | 0.821 |
| Multifeature fusion recurrent attention (this study) | 0.514 (+0.033) | 0.931 (+0.009) | 0.876 (+0.037) |
Figure 2Examples of attention visualization.