| Literature DB >> 36159708 |
Junyeop Cha1, Seoyun Kim1, Eunil Park1,2.
Abstract
Globally, the number of people who suffer from depression is consistently increasing. Because both detecting and addressing the early stage of depression is one of the strongest factors for effective treatment, a number of scholars have attempted to examine how to detect and address early-stage depression. Recent studies have been focusing on the use of social media for depression detection where users express their thoughts and emotions freely. With this trend, we examine two-step approaches for early-stage depression detection. First, we propose a depression post-classification model using multiple languages Twitter datasets (Korean, English, and Japanese) to improve the applicability of the proposed model. Moreover, we built a depression lexicon for each language, which mental health experts verified. Then, we applied the proposed model to a more specific user group dataset, a community of university students (Everytime), to examine whether the model can be employed to address depression posts in more specific user groups. The classification results present that the proposed model and approach can effectively detect depression posts of a general user group (Twitter), as well as specific user group datasets. Moreover, the implemented models and datasets are publicly available.Entities:
Keywords: Cultural and media studies; Health humanities
Year: 2022 PMID: 36159708 PMCID: PMC9491270 DOI: 10.1057/s41599-022-01313-2
Source DB: PubMed Journal: Humanit Soc Sci Commun ISSN: 2662-9992
Fig. 1An workflow of data collection and architecture of the proposed CNN, BiLSTM, and BERT-based classification model.
Fig. 2The main language distribution of 10,000 randomly collected users on Twitter.
Depression lexicon for Korean, English, and Japanese.
Data description for each language.
| Language | Word count of posts (standard deviation) |
|---|---|
| Korean | 8.02 (7.93) |
| English | 13.22 (9.59) |
| Japanese | 3.45 (3.09) |
Results of the binary classification task on Twitter with normal sampling.
| Sampling | Model | Language | Label | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|---|---|
| Base | CNN | Korean | non-depression | 0.9990 | 0.9994 | 0.9996 | 0.9995 |
| depression | 0.9830 | 0.9763 | 0.9796 | ||||
| English | non-depression | 0.9984 | 0.9987 | 0.9997 | 0.9992 | ||
| depression | 0.9448 | 0.7897 | 0.8603 | ||||
| Japanese | non-depression | 0.9992 | 0.9995 | 0.9997 | 0.9996 | ||
| depression | 0.7213 | 0.6132 | 0.6628 | ||||
| BiLSTM | Korean | non-depression | 0.9991 | 0.9995 | 0.9995 | 0.9995 | |
| depression | 0.9815 | 0.9804 | 0.9809 | ||||
| English | non-depression | 0.9983 | 0.9988 | 0.9996 | 0.9992 | ||
| depression | 0.9221 | 0.8058 | 0.8600 | ||||
| Japanese | non-depression | 0.9993 | 0.9996 | 0.9997 | 0.9997 | ||
| depression | 0.7618 | 0.6676 | 0.7116 | ||||
| BERT | Korean | non-depression | 0.9787 | 0.9794 | 0.9992 | 0.9892 | |
| depression | 0.8145 | 0.1447 | 0.2457 | ||||
| English | non-depression | 0.9991 | 0.9995 | 0.9995 | 0.9892 | ||
| depression | 0.9264 | 0.9262 | 0.9263 | ||||
| Japanese | non-depression | 0.9993 | 0.9996 | 0.9997 | 0.9997 | ||
| depression | 0.7499 | 0.6623 | 0.7032 |
Results of the binary classification task on Twitter with under-sampling.
| Sampling | Model | Language | Label | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|---|---|
| Under-sampling | CNN | Korean | non-depression | 0.9893 | 0.9966 | 0.9894 | 0.9900 |
| depression | 0.9966 | 0.9819 | 0.9892 | ||||
| English | non-depression | 0.9928 | 0.9948 | 0.9908 | 0.9928 | ||
| depression | 0.9908 | 0.9948 | 0.9928 | ||||
| Japanese | non-depression | 0.9925 | 0.9874 | 0.9977 | 0.9925 | ||
| depression | 0.9976 | 0.9873 | 0.9924 | ||||
| Under-sampling | BiLSTM | Korean | non-depression | 0.9900 | 0.9835 | 0.9966 | 0.9900 |
| depression | 0.9966 | 0.9833 | 0.9899 | ||||
| English | non-depression | 0.9984 | 0.9975 | 0.9922 | 0.9948 | ||
| depression | 0.9922 | 0.9975 | 0.9949 | ||||
| Japanese | non-depression | 0.9925 | 0.9967 | 0.9984 | 0.9925 | ||
| depression | 0.9984 | 0.9865 | 0.9924 | ||||
| BERT | Korean | non-depression | 0.9966 | 0.9986 | 0.9946 | 0.9966 | |
| depression | 0.9946 | 0.9986 | |||||
| English | non-depression | 0.9965 | 0.9999 | 0.9931 | 0.9965 | ||
| depression | 0.9932 | 0.9999 | |||||
| Japanese | non-depression | 0.9939 | 0.9922 | 0.9956 | 0.9939 | ||
| depression | 0.9956 | 0.9922 |
Bold values represent the greatest levels.
Results of the binary classification task on Twitter with over-sampling.
| Sampling | Model | Language | Label | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|---|---|
| Over-sampling | CNN | Korean | non-depression | 0.9356 | 0.9254 | 0.9475 | 0.9363 |
| depression | 0.9462 | 0.9236 | 0.9348 | ||||
| English | non-depression | 0.9572 | 0.9459 | 0.9699 | 0.9577 | ||
| depression | 0.9691 | 0.9446 | 0.9567 | ||||
| Japanese | non-depression | 0.9643 | 0.9640 | 0.9647 | 0.9643 | ||
| depression | 0.9647 | 0.9639 | 0.9643 | ||||
| BiLSTM | Korean | non-depression | 0.9465 | 0.9310 | 0.9645 | 0.9475 | |
| depression | 0.9632 | 0.9285 | 0.9455 | ||||
| English | non-depression | 0.9607 | 0.9438 | 0.9798 | 0.9614 | ||
| depression | 0.9790 | 0.9417 | 0.9599 | ||||
| Japanese | non-depression | 0.9645 | 0.9560 | 0.9737 | 0.9648 | ||
| depression | 0.9732 | 0.9552 | 0.9641 | ||||
| BERT | Korean | non-depression | 0.5000 | 0.0000 | 0.0000 | 0.0000 | |
| depression | 0.5000 | 1.0000 | 0.6667 | ||||
| English | non-depression | 0.9896 | 0.9812 | 0.9983 | 0.9897 | ||
| depression | 0.9983 | 0.9809 | 0.9895 | ||||
| Japanese | non-depression | 0.5000 | 0.0000 | 0.0000 | 0.0000 | ||
| depression | 0.5000 | 1.0000 | 0.6667 |
Data description of Everytime dataset.
| Language | Word count of posts (standard deviation) |
|---|---|
| Korean | 24.23 (54.52) |
Fig. 3Summary of the results.
a Korean Twitter dataset (Study 1). b Everytime dataset (Study 2). c Trained by Korean Twitter dataset and tested by Everytime dataset.
Results of the binary classification task on Everytime with normal-sampling.
| Sampling | Model | Label | Accuracy | Precision | Recall | F1-score |
|---|---|---|---|---|---|---|
| Based | CNN | non-depression | 0.3732 | 0.3698 | 0.9926 | 0.5388 |
| depression | 0.7195 | 0.0112 | 0.0220 | |||
| BiLSTM | non-depression | 0.4956 | 0.4957 | 0.9956 | 0.6619 | |
| depression | 0.4651 | 0.0038 | 0.0075 | |||
| BERT | non-depression | 0.5423 | 0.5202 | 0.9900 | 0.6820 | |
| depression | 0.9120 | 0.1019 | 0.1833 | |||
| Under-sampling | CNN | non-depression | 0.5149 | 0.5079 | 0.9576 | 0.6638 |
| depression | 0.6299 | 0.0721 | 0.1294 | |||
| BiLSTM | non-depression | 0.5091 | 0.5079 | 0.9521 | 0.6579 | |
| depression | 0.6091 | 0.0733 | 0.1309 | |||
| BERT | non-depression | 0.7252 | 0.6514 | 0.9587 | 0.7757 | |
| depression | 0.9242 | 0.4955 | ||||
| Over-sampling | CNN | non-depression | 0.5831 | 0.5565 | 0.7832 | 0.6507 |
| depression | 0.6443 | 0.3862 | 0.4829 | |||
| BiLSTM | non-depression | 0.5638 | 0.6070 | 0.3581 | 0.4504 | |
| depression | 0.5438 | 0.7865 | ||||
| BERT | non-depression | 0.5000 | 0.0000 | 0.0000 | 0.0000 | |
| depression | 0.5000 | 1.0000 | 0.6667 |
Bold values represent the greatest levels.