| Literature DB >> 35498179 |
Akhilesh Kumar1, Anuradha Thakare2, Manisha Bhende3, Amit Kumar Sinha4, Arnold C Alguno5, Yekula Prasanna Kumar6.
Abstract
In researching social network data and depression, it is often necessary to manually label depressed and non-depressed users, which is time-consuming and labor-intensive. The aim of this study is that it explores the relationship between social network data and depression. It can also contribute to detecting and identifying depression. Through collecting and analyzing college students' microblog social data, a preliminary screening algorithm for college students' suspected depression microblogs based on depression keywords, and semantic expansion is researched; a comprehensive lexical grammar was proposed. This research provided has a preliminary screening method based on depression keywords and semantic expansion for college students' suspected depression microblogs, with a screening accuracy. This method forms a depression keyword table by constructing the basic keyword table and the semantic expansion based on the word embedding learning model Word2Vec. Finally, the word table is used to calculate the semantic similarity of the tested microblogs and then identify whether it is a suspected depression microblog. The experimental results on the microblog dataset of college students show that the comprehensive lexical method is better than the SDS questionnaire segmentation method and the expert lexical method in terms of screening accuracy; the comprehensive lexical approach can quickly and automatically screen out a tiny proportion of suspected doubts from a large number of college students' microblogs. Depression Weibo can reduce the workload of experts' annotation, improve annotation efficiency, and provide a suitable data processing basis for the subsequent accurate identification (classification problem) of patients with depression.Entities:
Mesh:
Year: 2022 PMID: 35498179 PMCID: PMC9050301 DOI: 10.1155/2022/8755922
Source DB: PubMed Journal: Comput Intell Neurosci
Microblog data of 8 colleges and universities in the capital.
| School | Number of posts | User number |
|---|---|---|
| 1 | 117 844 | 9858 |
| 2 | 231 004 | 16855 |
| 3 | 64822 | 5656 |
| 4 | 61802 | 5418 |
| 5 | 120281 | 5506 |
| 6 | 56987 | 627 |
| 7 | 13484 | 2618 |
| 8 | 19893 | 7302 |
Figure 1Identify algorithm steps.
5 topics and key words of selected depression groups.
| Topic ID | Topic heading |
|---|---|
| Topic 1 | No friends, lonely, no one to chat, need a place to go out, there are people around |
| Topic 2 | Fear of death, despair, relief, accident, face the terrible future, dare not now |
| Topic 3 | You and others are disgusting, useless, garbage, you only want to disgust |
| Topic 4 | Today I cannot sleep till tomorrow, I cannot eat at night |
| Topic 5 | Live hard, hope, come on, persist in pain, continue to live, see, live |
Expert evaluation results of screening algorithm.
| Evaluation item | SDS questionnaire segmentation method | Synthetic lexical |
|---|---|---|
| The total number of Weibo identified as suspected depression | 90771 | 20731 |
| Randomly select 2% of the number of Weibo | 1900 | 415 |
| Experts determine the number of Weibo with obvious negative emotions | 98 | 275 |
| Proportion (%) | 5.088 | 68.70 |
Figure 2Impact coverage rates of depression analysis.
Impact coverage rates of depression analysis.
| Serial number | SMS rank | Page rank | Fans rank | Repost rank |
|---|---|---|---|---|
| 1 | 0.2 | 0 | 0.1 | 0.1 |
| 2 | 0.2566 | 0.003 | 0.1349 | 0.1562 |
| 3 | 0.31 | 0.0268 | 0.1457 | 0.1789 |
| 4 | 0.3698 | 0.0298 | 0.1562 | 0.2145 |
| 5 | 0.3897 | 0.0311 | 0.1889 | 0.2598 |
| 6 | 0.4059 | 0.0388 | 0.2245 | 0.2897 |
| 7 | 0.4256 | 0.065 | 0.2598 | 0.3157 |
| 8 | 0.4455 | 0.0989 | 0.2997 | 0.3489 |
| 9 | 0.4892 | 0.1115 | 0.3257 | 0.3645 |
| 10 | 0.51 | 0.1256 | 0.3489 | 0.3985 |
| 11 | 0.5213 | 0.2596 | 0.3545 | 0.4258 |
| 12 | 0.5805 | 0.2999 | 0.3985 | 0.4356 |
| 13 | 0.6156 | 0.3986 | 0.435 | 0.4459 |
| 14 | 0.6325 | 0.4515 | 0.4356 | 0.5126 |
| 15 | 0.6589 | 0.4698 | 0.4459 | 0.5569 |
| 16 | 0.6685 | 0.4817 | 0.5126 | 0.6599 |
| 17 | 0.6899 | 0.4978 | 0.5526 | 0.6899 |
Figure 3Daily depressions of the top 5 users in “gene editing.”
Figure 4Daily depressions of the top 5 users in “food safety.”