| Literature DB >> 36105318 |
Arfan Ahmed1, Sarah Aziz1, Carla T Toro2, Mahmood Alzubaidi3, Sara Irshaidat4, Hashem Abu Serhan4, Alaa A Abd-Alrazaq1, Mowafa Househ3.
Abstract
Despite improvement in detection rates, the prevalence of mental health disorders such as anxiety and depression are on the rise especially since the outbreak of the COVID-19 pandemic. Symptoms of mental health disorders have been noted and observed on social media forums such Facebook. We explored machine learning models used to detect anxiety and depression through social media. Six bibliographic databases were searched for conducting the review following PRISMA-ScR protocol. We included 54 of 2219 retrieved studies. Users suffering from anxiety or depression were identified in the reviewed studies by screening their online presence and their sharing of diagnosis by patterns in their language and online activity. Majority of the studies (70%, 38/54) were conducted at the peak of the COVID-19 pandemic (2019-2020). The studies made use of social media data from a variety of different platforms to develop predictive models for the detection of depression or anxiety. These included Twitter, Facebook, Instagram, Reddit, Sina Weibo, and a combination of different social sites posts. We report the most common Machine Learning models identified. Identification of those suffering from anxiety and depression disorders may be achieved using prediction models to detect user's language on social media and has the potential to complimenting traditional screening. Such analysis could also provide insights into the mental health of the public especially so when access to health professionals can be restricted due to lockdowns and temporary closure of services such as we saw during the peak of the COVID-19 pandemic.Entities:
Keywords: Anxiety; Artificial intelligence; COVID-19; Depression; Machine learning; Social media; Social networking
Year: 2022 PMID: 36105318 PMCID: PMC9461333 DOI: 10.1016/j.cmpbup.2022.100066
Source DB: PubMed Journal: Comput Methods Programs Biomed Update ISSN: 2666-9900
Predictive models and their primary metrics observed in reviewed studies.
| Predictive model used | ||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Data type | No. | Ada-Boost | CNN | GRU | KNN | LR | LSTM | MLP | NB | Proposed Algo | RF | DT | SVM | XGBOOST | Others | Primary Performance metrices | Performance range(for data type) | Language of Data Samples Analyzed | Range of participants | References |
| 18 | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | BRR, NLP,SVR | F1 score: 7 Accuracy: 8 RMSE: 1 Not used:1 | 0.47 to 0.99 | English - 14 Bangla-1 Indonesian- 1 Spanish- 1 | 55 - 20,000 | S2,S9,S12,S13, S17,S21,S22,S31,S34,S35,S36,S42,S45,S46,S47, S48,S50,S52 | |||
| 6 | Y | Y | Y | Y | Y | GPR | F1 score: 3 Accuracy: 3 | 0.61 to 0.90 | English - 6 | 90 - 5,947 | S11,S16,S20,S27,S42,S43 | |||||||||
| 2 | Y | lr | Accuracy: 1 AUC:1 | 0.60 to 0.71 | English - 2 | 749 - 1,908 | S14,S28 | |||||||||||||
| Reditt | 13 | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | Y | F1 score:10 Accuracy: 2 ERDE: 1 | 0.043 to 0.96 | English -13 | 365 - 48,537 | S1,S3,S7,S8,S19,S23, S24,S25,S33,S41,S44,S49,S54 | |||
| Sina Weibo | 6 | Y | Y | Y | Bert,MFFN, built models based on linguistic and behavioral features | F1 score: 3 Accuracy: 2 Precision-:1 | 0.5-0.97 | English - 3 Chinese- 3 | 1000 - 30,000 | S4,S6,S10,S30,S37, S39 | ||||||||||
| facebook + twitter | 4 | Y | Y | Y | Y | Y | VGG-Net | F1 score: 1 Accuracy: 2 Pearson correlation: 1 | 0.56 to 0.77 | English - 4 Bangla- 1 | 150 - 3,498 | S15,S32,S38,S53 | ||||||||
| others | 5 | Y | Y | Y | Y | Y | GBM,Multi-class tree models | F1 score: 3 Accuracy: 1 Recall: 1 | 0.86 to 0.96 | English - 5 | 619 - 1,000,500 | S5,S18,S26,S31,S51 | ||||||||
Fig. 1PRISMA chart.
Inclusion and exclusion criteria.
| Criteria | Specified criteria |
|---|---|
| Focus on predicting anxiety and depression through analysis of social media data | |
| Analyzed the correlation between social network data and symptoms of mental illness |
general characteristics of studies.
| Characteristics | Number of studies | Studies reference (refer to Appendix A) |
|---|---|---|
| Year of publication | 2013: 2 | S40,S48 |
| 2014: 1 | S51 | |
| 2015: 2 | S47 | |
| 2016: 1 | S17,S35,S38 | |
| 2017: 3 | S11,S16,S20,S28,S33,S36,S54 | |
| 2018: 7 | S5,S8-10,S12,S13,S15,S19,S21,S23,S25,S29,S34,S41,S42,S44,S45,S50,S53 | |
| 2019: 19 | S1-S4,S6,S7,S14,S18,S22,S24,S26,S27,S30-S32,S37,S43,S49,S52 | |
| 2020: 19 | ||
| Country | Australia: 1 | S49 |
| Bangladesh: 7 | S12,S13,S15,S16,S20,S24,S32 | |
| Canada:1 | S35 | |
| China: 12 | S4,S6,S10,S18,S23,S26,S30,S37,S39,S45,S51,S54 | |
| France:1 | S48 | |
| Germany: 1 | S52 | |
| India: 8 | S5,S7,S9,S14,S17,S22,S34,S43 | |
| Indonesia: 2 | S36,S50 | |
| Ireland:1 | S29 | |
| Japan:1 | S46 | |
| Kazakhstan: 1 | S31 | |
| Korea:2 | S3,S47 | |
| New Zealand:1 | S2 | |
| Pakistan: 1 | S8 | |
| Saudi Arabia: 2 | S38,S44 | |
| Spain:3 | S1,S21,S25 | |
| UK: 3 | S11,S27,S42 | |
| USA: 6 | S19,S28,S33,S40,S41,S53 | |
| Type of publication | Conference: 30 | S6,S7,S10-S15,S19,S20,S22,S24,S27,S30-S33,S36,S38-S40,S42,S43,S46-S48,S51-S54 |
| Journal article: 24 | S1-S5,S8,S9,S16-S18,S21,S23,S25,S26,S28,S29,S34,S35,S37,S41,S44,S45, S49,S50 | |
| Disorder( | Anxiety Disorders (including social anxiety, PTSD, OCD): 7 | S3,S8,S9,S26,S41,S45,S53 |
| Depression: 47 | S1,S3-25,S27,S28,S31-S37,S39-S46,S48-S50,S52-S54 | |
| Mental illness criteria | CES-D: 7 | S11,S36,S39,S40,S42,S46,S48 |
| PHQ (any version): 2 | S5,S28 | |
| Self-declared: 10 | S1,S2,S18,S21,S23,S24,S33,S50,S52,S54 | |
| Mental health subreddits or groups: 6 | S3,S7,S8,S25,S41,S44 | |
| Manually annotated: 7 | S12,S13,S19,S20,S29,S32,S35 | |
| Automated annotation (mental health keywords or mental hashtags):11 | S4,S9,S14,S31,S37,S38,S43,S45,S47,S49,S51 | |
| Predefined depression dataset: 1 | S6 | |
| Random selected: 6 | S15,S16,S17,S27,S30,S34 | |
| Other or Unspecified: 4 | S10,S22,S26,S53 |
the numbers don't add up some of the studies addressed both disorders.
Models used within reviewed studies.
| Models Name | Abbreviation | number of studies | Study Reference |
|---|---|---|---|
| Adaptive Boosting | AdaBoost | 1 | S23 |
| Bayesian Ridge Regression | BRR | 1 | S29 |
| Bidirectional Encoder Representations from Transformers | Bert | 1 | S10 |
| built classification and regression models based on linguistic and behavioral features | 1 | S39 | |
| Convulational Neural Network | CNN | 7 | S2,S3,S10,S11,S19,S51,S52 |
| linear regression (elastic-net regularized) | LR | 1 | S28 |
| Gradient Boosting Machine | GBM | 2 | S9,S31 |
| Gaussian Process Regression. | GPR | 1 | S27 |
| Gated Recurrent Neural Network | GRU | 4 | S11,S13, S33,S42 |
| K- Nearest Neighbour | KNN | 3 | S16,S20,S32 |
| Logistic Regression (including L1,L2 regulized) | LR | 6 | S1,S5,S14,S23,S32,S41 |
| Long Short Term Memory (including Attention-Based Bidirectional Long Short-Term Memory (Attention-BiLSTM)) | LSTM | 8 | S7,S10,S12,S24,S30,S37,S42,S54 |
| Tree based (including Multi-Class Trees) | 2 | S25,S26 | |
| Multilayer Perceptrons | MLP | 2 | S11,S23 |
| Multimodal Feature Fusion Network | MFFN | 1 | S6 |
| Natural Language Processing techniques (with lexical approach) | NLP | 3 | S21,S36 |
| Navie bayes (including Multinomial) | NB | 8 | S8, S9,S15,S17,S32,S34,S38, S47 |
| Propose algorithms (including count the occurance of depressive words) | 2 | S43,S45 | |
| Random Forest | RF | 6 | S8,S9,S22,S23,S31,S32 |
| Decicison Tree (including Rule based) | DT | 3 | S5,S16, S50 |
| Support Vector Regression | SVR | 1 | S34 |
| Support Vector Machine (including deep integrated support vector machine (DISVM)) | SVM | 15 | S4,S8,S10,S15,S16,S17,S23,S32,S33,S35,S38, S40,S46,S48,S49 |
| Visual Geometry Group Network | VGG-Net | 1 | S53 |
| eXtreme Gradient Boosting | XGBOOST | 4 | S3,S5,S18,S54 |
| Reference | Study Title |
| S1 | Martínez-Castaño, R., J.C. Pichel, and D.E. Losada, |
| S2 | Ismail, N.H., et al., |
| S3 | Kim, J., et al., |
| S4 | Ding, Y., et al., |
| S5 | Jain, S., et al. |
| S6 | Wang, Y., et al. |
| S7 | Mahapatra, A., S.R. Naik, and M. Mishra. |
| S8 | Tariq, S., et al., |
| S9 | Kumar, A., A. Sharma, and A. Arora. |
| S10 | Wang, X., et al. |
| S11 | Wongkoblap, A., M.A. Vadillo, and V. Curcin. |
| S12 | Depression Analysis from Social Media Data in Bangla Language using Long Short Term Memory (LSTM) Recurrent Neural Network Technique |
| S13 | Uddin, A.H., D. Bapery, and A.S.M. Arif. |
| S14 | Jain, V., et al. |
| S15 | Al Asad, N., et al. |
| S16 | Islam, M.R., et al., |
| S17 | Deshpande, M. and V. Rao. |
| S18 | Li, Y., et al., |
| S19 | Shrestha, A. and F. Spezzano. |
| S20 | Islam, M.R., et al. |
| S21 | Leis, A., et al., |
| S22 | Kamite, S.R. and V. Kamble. |
| S23 | Tadesse, M.M., et al., |
| S24 | Shah, F.M., et al. |
| S25 | Cacheda, F., et al., |
| S26 | Ta, N., et al., |
| S27 | Lushi Chen, L., et al., |
| S28 | Ricard, B.J., et al., |
| S29 | Gruda, D. and S. Hasan, |
| S30 | Wang, X., et al. |
| S31 | Narynov, S., et al. |
| S32 | Victor, D.B., et al. |
| S33 | Sadeque, F., D. Xu, and S. Bethard. |
| S34 | Arora, P. and P. Arora. |
| S35 | Jamil, Z., |
| S36 | Oyong, I., E. Utami, and E.T. Luthfi. |
| S37 | Yao, X., et al., |
| S38 | Aldarwish, M. M., & Ahmad, H. F. (2017, March). Predicting depression levels using social media posts. In |
| S39 | Hu, Q., et al., |
| S40 | De Choudhury, M., et al. |
| S41 | Thorstad, R. and P. Wolff, |
| S42 | Wongkoblap, A., M.A. Vadillo, and V. Curcin. |
| S43 | Vanlalawmpuia, R. and M. Lalhmingliana. |
| S44 | De Choudhury, M., S. Counts, and E. Horvitz. |
| S45 | Zhou, T.H., G.L. Hu, and L. Wang, |
| S46 | Tsugawa, S., et al. |
| S47 | Lee, J.H., J.M. Kim, and Y.S. Choi. |
| S48 | De Choudhury, M., S. Counts, and E. Horvitz. |
| S49 | Shatte, A.B., et al., |
| S50 | Syarif, I., N. Ningtias, and T. Badriyah. |
| S51 | Lin, H., et al. |
| S52 | Trotzek, M., S. Koitka, and C.M. Friedrich, |
| S53 | Guntuku, S.C., et al. |
| S54 | Cong, Q., et al. |
| Author | The first author of the study. |
| Year Submission | The year in which the study was submitted. |
| Country of publication | The country where the study was published. |
| Publication type | The paper type (i.e., peer-reviewed, conference or preprint). |
| AI models/ algorithms | The specific AI models or algorithms that were used (e.g., Decision tree, Random forest, Convolutional neural network). |
| Data source trained upon | what data the predictive models are trained upon I.e. which social media platform facebook,twitter, any social forum,or survey forum |
| Metric used for results measurement | what metrics used to evaluate the detection process i.e. accuracy, recall, sensitivity or AUC |
| Data types | what data the predictive models are trained upon I.e. which social media platform facebook,twitter, any social forum,or survey forum |
| Language of Data samples | what language data analysis was performed upon i.e. English tweets, Chinese's tweets or any other language |
| Dataset size | The total number of data that were used (I.e., in case of tweets how many tweets are being analysed) |
| Number of Users | The total number of users identified in each study to analyze their social media accounts and extract data from. |
| Mental illness identification criteria | How the users regarded as having depression or anxiety were identified,i.e. any depression test (PHQ,CES-D etc.), self-declared or randomly selected etc. |
| Mental disorders identified | what type of mental order patient detected to tweet or use social media most to express themself |