| Literature DB >> 36267123 |
Akshma Chadha1, Baijnath Kaushik1.
Abstract
Suicide deaths due to depression and mental stress are growing rapidly at an alarming rate. People freely express their feelings and emotions on social network sites while they feel hesitant to express such feelings during face-to-face interactions with their dear ones. In this study, a dataset comprising 20,000 posts was taken from Reddit and preprocessed into tokens using a variety of effective word2vec techniques. A new hybrid approach is proposed by combining the attention model in a convolutional neural network and long-short-term- memory. The objective of this research is to develop an effective learning model to evaluate the data on social media for the efficient and accurate identification of people with suicidal ideation. The proposed attention convolution long short-term memory (ACL) model uses hyperparameter tuning using a grid search to select optimized hyperparameters. From the experimental evaluation, it is shown that the proposed model, that is, ACL with Glove embedding after hyperparameter tuning gives the highest Accuracy of 88.48%, Precision of 87.36%, F1 score of 90.82% and specificity of 79.23% and ACL with Random embedding gives the highest Recall of 94.94% when compared to the state-of-the-art algorithms. © Ohmsha, Ltd. and Springer Japan KK, part of Springer Nature 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.Entities:
Keywords: Attention convolutional long short term memory; Deep learning; Glove embedding; Hyperparameters; Random embedding; Social networking sites; Suicide
Year: 2022 PMID: 36267123 PMCID: PMC9573777 DOI: 10.1007/s00354-022-00191-1
Source DB: PubMed Journal: New Gener Comput ISSN: 0288-3635 Impact factor: 1.180
Fig. 1Learning algorithms used
Fig. 2Summarization of workflow
Previous work limitations
| Author | Publication year | Limitation |
|---|---|---|
| Han-Chin Shing et al. [ | 2018 | Data is limited to Reddit that fails to generate some specific characteristics. There were no clinicians’ assessments |
| Akshma Chadha et al. [ | 2019 | Only online data is taken into consideration most of the time, offline data should also be considered for better results |
| Mark E. Larsen et al. [ | 2015 | Mostly the data collection period taken is very less but this period should be increased in the future |
| Chandler McClellan et al. [ | 2017 | Better filtering techniques should be applied to the data so that the results can be more accurate |
| Kate H. Bentley et al. [ | 2016 | Usually the individuals who are considered are very less in number, the number of individuals should also be increased for a better research outcome |
| Helen Christensen et al. [ | 2014 | The work is mainly performed on publically available data i.e. on Twitter, Reddit, etc. but work needs to be performed on private social sites such as Facebook, Instagram |
| Stephen P. Lewis et al. [ | 2012 | Usually the individuals who are considered are young adults, but we cannot just neglect the middle age and old age people as well |
Fig. 3Steps for data preprocessing, embedding and classification
Fig. 4Unprocessed data
Fig. 5Steps of data preprocessing
Fig. 6Preprocessed data
Fig. 7Proposed hybrid learning algorithm
Fig. 8Hyperparameter tuning using grid search
Optimized hyperparameters obtained using grid search
| Model | Variation | Grid of parameters | Finest parameters | |
|---|---|---|---|---|
| 1 | CNN | Random embedding | Dropout_rate = [0.1, 0.2, 0.3, 0.4, 0.5] Optimizer = [Adam, RMSprop, SGD] Weight_constraint = [1–5] | Dropout_rate = '0.1' Optimizer = 'Adam' Weight_constraint = 4 |
| Glove embedding | Neurons = [128, 256] Optimizer = [Adam, RMSprop, SGD] Batch_size = [32, 64, 128, 256] weight_constraint = [1–5] kernel_size = [3, 5] nb_filters = [128, 256] | Neurons = '128' Optimizer = 'RMSprop' Batch_size = 32 Weight_constraint = 5 Kernel_size = 3 nb_filters = 256 | ||
| 2 | CNN + bidirectional LSTM | Random embedding | Dropout_rate = [0.1,0.2,0.3,0.4,0.5] Optimizer = [Adam, RMSprop, SGD] Units = [10, 20, 29, 39] Kernel_size = [3, 5] nb_filters = [128, 256] Batch_size = [32, 64, 128, 256] | Dropout_rate = ‘0.1’ Optimizer = ‘Adam’ Units = ‘10’ Kernel_size = ‘3’ nb_filters = ‘256’ Batch_size = ‘64’ |
| Glove embedding | Optimizer = [Adam, RMSprop, SGD] Units = [10, 20, 64, 128] Kernel_size = [3, 5] Batch_size = [32,64,128,256] Dropout_rate = [0.1,0.2,0.3,0.4,0.5] | Optimizer = ‘Adam’ Units = ‘128’ Kernel_size = ‘3’ Batch_size = ‘128’ Dropout_rate = ‘0.5’ | ||
| 3 | CNN + bidirectional LSTM with attention | Random embedding | nb_filters = [128, 256] units = [10,20,64,128] Kernel_size = [3, 5] Batch_size = [32, 64, 128, 256] Neurons = [10,20,64,128] Pool_size = [2, 4] Dropout_rate = [0.1, 0.2, 0.3, 0.4, 0.5] | nb_filters = '128' Units = '128' Kernel_size = 3 Batch_size = '128' Neurons = '64' Pool_size = 4 Dropout_rate = '0.4' |
| Glove embedding | Pool size = [2, 4] Kernel size = [3, 5] Dropout rate = [0.1, 0.2, 0.3, 0.4, 0.5] units = [10, 20, 64, 128] nb_filters = [128, 256] Neurons = [10, 20, 64, 128] | Pool_size = '4' Kernel_size = '3' Dropout_rate = 0.2 Units = '128’ nb_filters = '128' Neurons = '128' | ||
Results of the different learning algorithms used before tuning hyperparameters.
| S no. | Model | Variations techniques used | Accuracy | Precision | Recall | Specificity | F1 score |
|---|---|---|---|---|---|---|---|
| 1 | Convolutional neural network | Random embedding | 81.55 | 82.95 | 87.35 | 72.75 | 85.09 |
| Glove embedding | 81.23 | 82.73 | 87.02 | 72.44 | 84.82 | ||
| 2 | Convolutional neural network + long short term memory | Random embedding | 85.18 | 85.56 | 90.71 | 76.78 | 88.06 |
| Glove embedding | 85.03 | 85.78 | 90.09 | 77.34 | 87.88 | ||
| 3 | Convolutional neural network + long short term memory with attention | Random embedding | 87.50 | 86.76 | 93.53 | 78.35 | 90.02 |
| Glove embedding | 87.25 | 86.15 | 93.94 | 77.09 | 89.88 |
Results of the different learning algorithms used after tuning hyperparameters
| S no. | Model | Variations techniques used | Accuracy | Precision | Recall | Specificity | F1 score |
|---|---|---|---|---|---|---|---|
| 1 | Convolutional neural network | Random embedding | 81.73 | 83.07 | 87.52 | 72.94 | 85.24 |
| Glove embedding | 81.80 | 83.12 | 87.60 | 73.00 | 85.27 | ||
| 2 | Convolutional neural network + long short term memory | Random embedding | 86.38 | 85.97 | 92.49 | 77.09 | 89.11 |
| Glove embedding | 86.40 | 86.17 | 92.24 | 77.53 | 89.10 | ||
| 3 | Convolutional neural network + long short term memory with attention (ACL) (proposed model) | Random embedding | 88.33 | 86.90 | 94.94 | 78.29 | 90.74 |
| Glove embedding | 88.48 | 87.36 | 94.57 | 79.23 | 90.82 |
Fig. 9Confusion matrices of glove embedding and random embedding for CNN model
Fig. 10Confusion matrices of glove embedding and random embedding for CNN model with hyperparameter tuning
Fig. 11Confusion matrices of glove embedding and random embedding for CNN + LSTM model
Fig. 12Confusion matrices of glove embedding and random embedding for CNN + LSTM model with hyperparameter tuning
Fig. 13Confusion matrices of glove embedding and random embedding for CNN + LSTM model with attention model
Fig. 14Confusion matrices of glove embedding and random embedding for CNN + LSTM model using attention model with hyperparameter tuning
Tenfold cross validation for model evaluation
| Model | Techniques | Folds | Mean | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | |||
| Convolutional neural network | Randomly initialized embedding | 96.54 | 97.55 | 91.33 | 92.58 | 95.66 | 97.25 | 94.88 | 96.99 | 93.33 | 92.56 | 95.06 |
| Glove embedding | 96.52 | 97.55 | 91.22 | 89.52 | 93.74 | 98.99 | 92.54 | 93.88 | 98.45 | 94.12 | 94.65 | |
| Convolutional neural network (optimized parameters) | Randomly initialized embedding | 96.44 | 91.78 | 92.73 | 95.22 | 97.44 | 91.54 | 92.88 | 97.52 | 94.69 | 91.49 | 94.77 |
| Glove embedding | 97.49 | 98.22 | 99.45 | 91.57 | 97.29 | 95.47 | 97.33 | 96.54 | 98.99 | 94.78 | 96.71 | |
| Convolutional neural network + long short term memory | Randomly initialized embedding | 98.22 | 99.82 | 94.52 | 97.45 | 98.69 | 91.56 | 92.44 | 97.48 | 97.56 | 98.36 | 96.61 |
| Glove embedding | 91.22 | 97.56 | 96.52 | 89.25 | 98.22 | 96.74 | 98.99 | 96.58 | 99.99 | 97.79 | 96.28 | |
| Convolutional neural network + long short term memory (optimized parameters) | Randomly initialized embedding | 96.58 | 97.88 | 97.58 | 99.52 | 93.54 | 98.11 | 97.52 | 96.78 | 96.58 | 97.99 | 97.20 |
| Glove embedding | 97.52 | 98.69 | 97.22 | 99.41 | 94.52 | 97.69 | 98.22 | 98.65 | 97.33 | 94.25 | 97.25 | |
| ACL (proposed model) | Randomly initialized embedding | 98.56 | 99.77 | 99.99 | 1.0 | 99.99 | 1.0 | 99.98 | 99.99 | 1.0 | 1.0 | 99.82 |
| Glove embedding | 98.90 | 1.0 | 1.0 | 1.0 | 99.99 | 1.0 | 1.0 | 99.99 | 1.0 | 1.0 | 99.88 | |
| ACL (proposed model) (optimized parameters) | Randomly initialized embedding | 98.58 | 1.0 | 1.0 | 1.0 | 99.99 | 99.99 | 99.96 | 99.99 | 1.0 | 1.0 | 99.85 |
| Glove embedding | 98.89 | 1.0 | 1.0 | 99.99 | 99.99 | 1.0 | 99.99 | 99.99 | 1.0 | 1.0 | 99.88 | |
Fig. 15Evaluation for CNN, fusion of CNN-LSTM and fusion along ATTENTION using two embedding technique based on testing accuracy before and after hyperparameter tuning
Fig. 16Evaluation based on specificity, recall, precision, and F1 score for CNN, fusion of CNN-LSTM and CNN-LSTM fusion along attention using two embedding technique before and after parameter tuning