| Literature DB >> 33028921 |
Yaakov Ophir1,2, Refael Tikochinski3,4, Christa S C Asterhan3, Itay Sisso3, Roi Reichart4.
Abstract
Detection of suicide risk is a highly prioritized, yet complicated task. Five decades of research have produced predictions slightly better than chance (AUCs = 0.56-0.58). In this study, Artificial Neural Network (ANN) models were constructed to predict suicide risk from everyday language of social media users. The dataset included 83,292 postings authored by 1002 authenticated Facebook users, alongside valid psychosocial information about the users. Using Deep Contextualized Word Embeddings for text representation, two models were constructed: A Single Task Model (STM), to predict suicide risk from Facebook postings directly (Facebook texts → suicide) and a Multi-Task Model (MTM), which included hierarchical, multilayered sets of theory-driven risk factors (Facebook texts → personality traits → psychosocial risks → psychiatric disorders → suicide). Compared with the STM predictions (0.621 ≤ AUC ≤ 0.629), the MTM produced significantly improved prediction accuracy (0.697 ≤ AUC ≤ 0.746), with substantially larger effect sizes (0.729 ≤ d ≤ 0.936). Subsequent content analyses suggested that predictions did not rely on explicit suicide-related themes, but on a range of text features. The findings suggest that machine learning based analyses of everyday social media activity can improve suicide risk predictions and contribute to the development of practical detection tools.Entities:
Mesh:
Year: 2020 PMID: 33028921 PMCID: PMC7542168 DOI: 10.1038/s41598-020-73917-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Descriptive statistics and correlations of the psycho-diagnostic measures (N = 1650).
| Suicide | Depression | Anxiety | Brooding | Worry | SWL | Lonely | Open | Conscientious | Extravert | Agreeable | Neurotic | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Means (SD) | 0.87 (1.41) | 7.44 (5.94) | 14.25 (5.7) | 10.81 (3.53) | 50.76 (15.59) | 20.66 (8.14) | 23.89 (6.73) | 7.69 (2.03) | 7.52 (1.88) | 5.52 (2.40) | 6.92 (2.01) | 6.64 (2.40) |
| Depression | 0.436** | |||||||||||
| Anxiety | 0.377** | 0.754** | ||||||||||
| Brooding | 0.382** | 0.610** | 0.656** | |||||||||
| Worry | 0.346** | 0.552** | 0.711** | 0.656** | ||||||||
| SWL | − 0.360** | − 0.534** | − 0.449** | − 0.458** | − 0.423** | |||||||
| Lonely | 0.350** | 0.572** | 0.479** | 0.539** | 0.485** | − 0.607** | ||||||
| Open | 0.070* | − 0.019 | 0.033 | 0.047 | 0.033 | − 0.012 | − 0.044 | |||||
| Conscientious | − 0.184** | − 0.303** | − 0.187** | − 0.254** | − 0.192** | 0.269** | − 0.271** | 0.100** | ||||
| Extravert | − 0.191** | − 0.242** | − 0.210** | − 0.187** | − 0.249** | 0.273** | − 0.385** | 0.143** | 0.120** | |||
| Agreeable | − 0.179** | − 0.262** | − 0.282** | − 0.207** | − 0.278** | 0.262** | − 0.344** | 0.053 | 0.113** | 0.196** | ||
| Neurotic | 0.300** | 0.486** | 0.617** | 0.564** | 0.762** | − 0.393** | 0.461** | − 0.025 | − 0.272** | − 0.295** | − 0.281** |
Notice that the current research addressed low satisfaction with life whereas the SWL is formulated in a positive manner (i.e., high satisfaction with life). This positive formulation explains the negative correlation between SWL and depression.
Suicide the total score of the CSSRS, SWL satisfaction with life scale.
**p < 0.01.
Socio-demographic characteristics of users at general risk and of non-suicidal users (N = 1002).
| No risk ( | General risk ( | Statistics | Effect size Cohen's | |
|---|---|---|---|---|
| Number of participants | 641 (63.97%) | 361 (36.03%) | ||
| Mean number of posts | 74.1 (97.9) | 97.0 (119.9) | 0.207 [0.083,0.332] | |
| Mean age | 38.3 (11.0) | 34.6 (10.6) | − 0.322 [− 0.448, − 0.197] | |
| Gender (%male) | 23.4% | 23.0% | ||
| Annual income in US dollars | 58,563 (36,627) | 48,389 (35,526) | − 0.270 [− 0.396, − 0.145] |
No risk users who did not report of any suicidal ideation, General risk all the users who reported of at least passive ideation in the suicide scale.
**p < 0.01, ***p < 0.001.
Figure 1The single task model (STM). FC layers fully connected layers.
Figure 2The multi task model (MTM). FC layers fully connected layers; The sign ⊕ symbolizes the vector concatenation operator.
Detection performance (average AUC scores) of the STM and MTM (N = 1002).
| Task | General suicide risk | High suicide risk | ||
|---|---|---|---|---|
| Model | STM | MTM | STM | MTM |
| Average AUC scores | 0.621 | 0.746 | 0.629 | 0.697 |
| 95% confidence interval | 576, 0.657 | 0.727, 0.765 | 0.606, 0.660 | 0.690, 0.707 |
STM single task model, MTM multiple tasks model, AUC area under the receiver operating characteristic curve, Average AUC scores the average scores of the five AUC scores that were obtained in the cross-validation analyses.