| Literature DB >> 35340901 |
Donghun Kim1, Woojin Jung1, Seojin Nam1, Hongjin Jeon2,3, Jihyun Baek2, Yongjun Zhu4.
Abstract
Objective: Although there were few studies on how suicidal users behave on Twitter, they only investigated partial aspects such as tweeting frequency and tweet length. Therefore, we aim to understand the various information behavior of suicidal users in South Korea.Entities:
Keywords: Data analysis; disease; health communication; informatics; media; public health; public health informatics; social media
Year: 2022 PMID: 35340901 PMCID: PMC8943454 DOI: 10.1177/20552076221086339
Source DB: PubMed Journal: Digit Health ISSN: 2055-2076
Figure 1.The pipeline of data collection, data preprocessing, and data annotation.
Data collection and preprocessing.
| Step | Deleted | Results | Details | |
|---|---|---|---|---|
| 1 | Initial set of tweets | 457,947 | ||
| 2 | Removal of tweets written by non-suicidal users | 30,210 | 427,737 | Bots: 27,201 Universities: 1373 Entertainment: 607 Campaigns: 519 News:510 |
| 3 | Removal of tweets with hyperlinks | 39,015 | 388,722 | Tweets with hyperlinks |
| 4 | Removal of tweets with non-suicidal hashtags | 24,112 | 364,610 | Entertainment: 20,640 Marketing: 2493 Bots: 582 Campaigns: 123 Others: 274 |
Figure 2.Categories of suicidal users’ user IDs and usernames.
Figure 3.Categories of suicidal users’ descriptions.
Figure 4.Categories of pinned tweets posted by suicidal users.
Figure 5.Difference in tweet length between the experimental and control groups.
Figure 6.Difference in tweet frequency between the experimental and control groups, measured by the users’ average tweet count per month.
Figure 7.Difference in number of tweets by week between the experimental and control groups.
Figure 8.Tweet creation time in the experimental and control groups.
Number of suicide-related keywords in the top 500 frequent keywords generated using different vocabulary size.
| Vocabulary size | 2K | 4K | 6K | 8K | 10K | 12K | 14K | 16K |
|---|---|---|---|---|---|---|---|---|
| Number of suicide-related keywords | 20 | 63 | 94 | 94 | 91 | 56 | 74 | 68 |
Main suicide-related keywords extracted with the vocabulary size setting of 6K.
| Tokens | |
|---|---|
| Suicide | _want to die, _suicide attempt, _feel suicide impulse, _want to commit suicide, _even attempt suicide, _have to die, _suicidal thought, _how to suicide, _suicide plan, _don’t want to live, _strangled myself, _want to cut, _like cutter knife |
| Self-harm | _self-harmer, _want to self-harm, _self-harming, _arm warmer, _self-harm scar, _do self-harm terribly, _self-poisoning/harm by drugs |
Top 30 suicide-related hashtags.
| Rank | hashtag | Count | Rank | hashtag | Count |
|---|---|---|---|---|---|
| 1 | #self-harm | 92 | 16 | #dissociative_disorders | 15 |
| 2 | #introduction_depressioner_account | 63 | 17 | #self-harm_picture | 13 |
| 3 | #introduction_self-harm_account | 44 | 18 | #a_three_o'clock_ball | 13 |
| 4 | #introduction_ self-harmer | 41 | 19 | #obsessive-compulsive_disorder | 11 |
| 5 | #self-harm_account | 30 | 20 | #ATB (a_three_o'clock_ball) | 9 |
| 6 | #depressed | 29 | 21 | #introduction_queer | 9 |
| 7 | #introduction_depressioner | 28 | 22 | #self-harmer | 8 |
| 8 | #suicide | 25 | 23 | #gay | 8 |
| 9 | #depression_account | 25 | 24 | #introductionDepressedAccount | 7 |
| 10 | #IQ (introduction_queer) | 23 | 25 | #rt_reason_for_living_and_dying | 7 |
| 11 | #bipolar_disorder | 19 | 26 | #express_death_without_ words | 6 |
| 12 | #introduction_mental_patienter | 18 | 27 | #mental_disorder | 6 |
| 13 | #depression | 17 | 28 | #self_introduction_of_depressed_account | 5 |
| 14 | #panic_disorder | 16 | 29 | #alpram | 5 |
| 15 | #anxiety_disorder | 15 | 30 | #seroquel | 5 |