| Literature DB >> 35455874 |
Evandro J S Diniz1,2, José E Fontenele2, Adonias C de Oliveira2, Victor H Bastos2, Silmar Teixeira2, Ricardo L Rabêlo3, Dario B Calçada4, Renato M Dos Santos2,3, Ana K de Oliveira3, Ariel S Teles1,2.
Abstract
People at risk of suicide tend to be isolated and cannot share their thoughts. For this reason, suicidal ideation monitoring becomes a hard task. Therefore, people at risk of suicide need to be monitored in a manner capable of identifying if and when they have a suicidal ideation, enabling professionals to perform timely interventions. This study aimed to develop the Boamente tool, a solution that collects textual data from users' smartphones and identifies the existence of suicidal ideation. The solution has a virtual keyboard mobile application that passively collects user texts and sends them to a web platform to be processed. The platform classifies texts using natural language processing and a deep learning model to recognize suicidal ideation, and the results are presented to mental health professionals in dashboards. Text classification for sentiment analysis was implemented with different machine/deep learning algorithms. A validation study was conducted to identify the model with the best performance results. The BERTimbau Large model performed better, reaching a recall of 0.953 (accuracy: 0.955; precision: 0.961; F-score: 0.954; AUC: 0.954). The proposed tool demonstrated an ability to identify suicidal ideation from user texts, which enabled it to be experimented with in studies with professionals and their patients.Entities:
Keywords: artificial intelligence; deep learning; eHealth; mental health; mobile application; natural language processing; suicide
Year: 2022 PMID: 35455874 PMCID: PMC9029735 DOI: 10.3390/healthcare10040698
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Figure 1Boamente system overview.
Suicide-related terms and expressions.
| Portuguese Language (PT-BR) | English Language |
|---|---|
| suicida | suicidal |
| suicídio | suicide |
| me matar | kill myself |
| meu bilhete suicida | my suicide note |
| minha carta suicida | my suicide letter |
| acabar com a minha vida | end my life |
| nunca acordar | never wake up |
| não consigo continuar | can’t go on |
| não vale a pena viver | not worth living |
| pronto para pular | ready to jump |
| dormir pra sempre | sleep forever |
| quero morrer | want to die |
| estar morto | be dead |
| melhor sem mim | better off without me |
| melhor morto | better of dead |
| plano de suicídio | suicide plan |
| pacto de suicídio | suicide pact |
| cansado de viver | tired of living |
| não quero estar aqui | don’t want to be here |
| morrer sozinho | die alone |
| ir dormir pra sempre | go to sleep forever |
Figure 2Methodology to find the best ML/DL model to be deployed in the inference engine.
Examples of tweets labeled by psychologists.
| Class | Tweet (PT-BR) | Tweet (English) |
|---|---|---|
| Negative | meu sonho é dormir pra sempre mas quem dorme pra sempre eh quem morre mas eu não quero morrer só quero dormir pra sempre msm. | my dream is to sleep forever, but the one who sleeps forever is the one who dies, but I don’t want to die, I just want to sleep much. |
| Positive | daí você mistura um monte de remédios esperando sei lá dormir pra sempre e acorda já no dia seguinte só com uma dor no estômago absurda acordada triste com dor no estômago mais azarada que eu. | then you mix a bunch of meds hoping, I don’t know, to sleep forever and wake up the next day only with an absurd stomachache, I’m awake sad, and my stomach hurts, more unlucky than me. |
Figure 3Word clouds containing terms after text cleaning and stop words removal for (a) positive class and (b) negative class.
Figure 4Number of instances labeled as negative and positive: (a) without data balancing (original dataset); (b) with 80% used for training and validation; (c) with 20% used for testing; and (d) with 80% used for training after applying SMOTE.
Figure 5Cross-validation applied for evaluating the performance of all models.
Confusion matrix.
| Actual Values | ||||
|---|---|---|---|---|
| Positive | Negative | Total | ||
| Predicted Values | Positive |
|
|
|
| Negative |
|
|
| |
| Total |
|
| ||
Figure 6Screenshots of the Boamente virtual keyboard.
Figure 7Dashboards on the Boamente web application home screen.
Figure 8Patient-specific dashboards on the Boamente web application.
Performance of the machine learning models.
| Algorithms/Metrics | Accuracy | Precision | Recall | F-Score |
|---|---|---|---|---|
| SVC | 0.902 | 0.825 | 0.840 | 0.832 |
| Extra trees classifier | 0.935 | 0.897 |
| 0.876 |
| Random forest classifier | 0.931 | 0.871 |
| 0.883 |
| Gradient boosting classifier | 0.866 | 0.723 | 0.870 | 0.789 |
| MLP classifier | 0.873 | 0.745 | 0.855 | 0.796 |
Figure 9Best results for accuracy and precision metrics.
Figure 10Best results for recall and F-score metrics.
Figure 11ROC curve with confidence interval of the BERTimbau Large model.
Figure 12Comparison of our best model with the best one developed in Carvalho et al. [38].