| Literature DB >> 30296299 |
Cynthia Van Hee1, Gilles Jacobs1, Chris Emmery2, Bart Desmet1, Els Lefever1, Ben Verhoeven2, Guy De Pauw2, Walter Daelemans2, Véronique Hoste1.
Abstract
While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online. Recent studies report that cyberbullying constitutes a growing problem among youngsters. Successful prevention depends on the adequate detection of potentially harmful messages and the information overload on the Web requires intelligent systems to identify potential risks automatically. The focus of this paper is on automatic cyberbullying detection in social media text by modelling posts written by bullies, victims, and bystanders of online bullying. We describe the collection and fine-grained annotation of a cyberbullying corpus for English and Dutch and perform a series of binary classification experiments to determine the feasibility of automatic cyberbullying detection. We make use of linear support vector machines exploiting a rich feature set and investigate which information sources contribute the most for the task. Experiments on a hold-out test set reveal promising results for the detection of cyberbullying-related posts. After optimisation of the hyperparameters, the classifier yields an F1 score of 64% and 61% for English and Dutch respectively, and considerably outperforms baseline systems.Entities:
Mesh:
Year: 2018 PMID: 30296299 PMCID: PMC6175271 DOI: 10.1371/journal.pone.0203794
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Definitions and brat annotation examples of more fine-grained text categories related to cyberbullying.
| Annotation category | Annotation example |
|---|---|
| Threat/blackmail | [I am going to find out who you are & I swear you are going to regret it.] |
| Insult | [Kill yourself] |
| Curse/Exclusion | [Fuck you.] |
| Defamation | [She slept with her ex behind his girlfiends back and she and him had broken up.] |
| Sexual Talk | [Naked pic of you now.] |
| Defense | [I would appreciate if you dindn’t talk shit about my bestfriend.] |
| Encour. to har. | [She is a massive slut] |
Inter-annotator agreement on the fine-grained categories related to cyberbullying.
| Threat | Insult | Defense | Sexual talk | Curse/exclusion | Defamation | Encouragements to the harasser | |
|---|---|---|---|---|---|---|---|
| 0.65 | 0.63 | 0.45 | 0.38 | 0.58 | 0.15 | N/A | |
| 0.52 | 0.66 | 0.63 | 0.53 | 0.19 | 0.00 | 0.21 |
Statistics of the English and Dutch cyberbullying corpus.
| Corpus size | Number(ratio) of bullying posts | |
|---|---|---|
| 113,698 | 5,375(4.73%) | |
| 78,387 | 5,106(6.97%) |
Hyperparameters in grid-search model selection.
| Hyperparameter | Values |
|---|---|
| Penalty of error term | 1 |
| Loss function | Hinge, squared hinge |
| Penalty: norm used in penalisation | ‘l1’ (‘least absolute deviations’) or ‘l2’ (‘least squares’) |
| Class weight (sets penalty | None or ‘balanced’, i.e. weight inversely proportional to class frequencies |
Cross-validated and hold-out scores (%) according to different metrics (F1, precision, recall, accuracy and area under the curve) for the English and Dutch three best and worst combined feature type systems.
| Feature combination | Cross-validation scores | Hold-out scores | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| F1 | P | R | Acc | AUROC | F1 | P | R | Acc | AUROC | ||
| Best three | B + C + D + E | 73.32 | 57.19 | 96.97 | 78.07 | 63.69 | 74.13 | 55.82 | 97.21 | 77.47 | |
| A + B + C | 64.24 | 73.22 | 57.23 | 96.96 | 78.09 | 74.08 | 56.83 | 97.24 | 77.96 | ||
| A + C + E | 63.84 | 73.21 | 56.59 | 96.94 | 77.78 | 62.94 | 72.82 | 55.42 | 97.14 | 77.24 | |
| Worst three | D | 40.48 | 38.98 | 42.12 | 94.10 | 69.41 | 39.56 | 39.56 | 39.56 | 94.71 | 68.39 |
| A + D + E | 38.95 | 31.47 | 51.10 | 92.37 | 72.76 | 40.71 | 33.87 | 51.00 | 93.49 | 73.22 | |
| E | 17.35 | 9.73 | 79.91 | 63.72 | 71.41 | 15.70 | 8.72 | 78.51 | 63.07 | 70.44 | |
| Baseline | word | 58.17 | 67.55 | 51.07 | 96.54 | 74.93 | 59.63 | 69.57 | 52.17 | 96.57 | 75.50 |
| profanity | 17.17 | 9.61 | 80.14 | 63.73 | 71.53 | 17.61 | 9.90 | 78.51 | 63.79 | 71.34 | |
| Best three | A + B + C + E | 56.76 | 66.40 | 94.47 | 81.42 | 58.13 | 54.03 | 62.90 | 94.58 | 79.75 | |
| A + B + C + D + E | 61.03 | 71.55 | 53.20 | 95.53 | 75.86 | 67.40 | 52.03 | 95.62 | 75.21 | ||
| A + C + E | 60.82 | 71.66 | 52.84 | 95.53 | 75.68 | 58.15 | 67.71 | 50.96 | 95.61 | 74.71 | |
| Worst three | D + B | 32.90 | 29.23 | 37.63 | 89.91 | 65.61 | 30.16 | 34.72 | 26.65 | 92.61 | 61.73 |
| D | 28.65 | 19.36 | 55.10 | 81.97 | 69.48 | 25.13 | 16.73 | 50.53 | 81.99 | 67.26 | |
| B | 24.74 | 21.24 | 29.61 | 88.16 | 60.94 | 17.99 | 23.15 | 14.71 | 91.98 | 55.80 | |
| Baseline | word | 50.39 | 67.80 | 40.09 | 94.81 | 69.38 | 49.54 | 64.29 | 40.30 | 95.09 | 69.44 |
| profanity | 28.46 | 19.24 | 54.66 | 81.99 | 69.28 | 25.13 | 16.73 | 50.53 | 81.99 | 67.26 | |
Feature group mapping (Table 5).
| A | word |
| B | subjectivity lexicons |
| C | character |
| D | term lists |
| E | topic models |
Cross-validated and hold-out scores (%) according to different metrics (F1, precision, recall, accuracy and area under the ROC curve) for English and Dutch single feature type systems.
| Feature type | Cross-validation scores | hold-out scores | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| F1 | P | R | Acc | AUROC | F1 | P | R | Acc | AUROC | ||
| word | 60.49 | 59.69 | 96.22 | 78.87 | 57.12 | 59.64 | 96.27 | 78.79 | |||
| subjectivity lexicons | 56.82 | 73.32 | 46.38 | 96.64 | 72.77 | 56.16 | 72.61 | 45.78 | 96.87 | 72.50 | |
| character | 52.69 | 58.70 | 47.80 | 95.91 | 73.06 | 53.33 | 62.37 | 46.59 | 96.43 | 72.65 | |
| term lists | 40.48 | 38.98 | 42.12 | 94.10 | 69.41 | 39.56 | 39.56 | 39.56 | 94.71 | 68.39 | |
| topic models | 17.35 | 9.73 | 79.91 | 63.72 | 71.41 | 15.70 | 8.72 | 78.51 | 63.07 | 70.44 | |
| word | 72.64 | 44.94 | 95.27 | 71.88 | 70.20 | 45.20 | 95.57 | 71.99 | |||
| subjectivity lexicons | 54.34 | 54.12 | 54.56 | 93.97 | 75.65 | 51.82 | 50.61 | 53.09 | 94.09 | 74.90 | |
| character | 51.70 | 67.58 | 41.86 | 94.86 | 70.22 | 50.46 | 65.20 | 41.15 | 95.17 | 69.88 | |
| term lists | 28.65 | 19.36 | 55.10 | 81.97 | 69.48 | 25.13 | 16.73 | 50.53 | 81.99 | 67.26 | |
| topic models | 24.74 | 21.24 | 29.61 | 88.16 | 60.94 | 17.99 | 23.15 | 14.71 | 91.98 | 55.80 | |
Error rates (%) per cyberbullying subcategory on hold-out for English and Dutch systems.
| Category | Nr. occurrences in hold-out | Profanity baseline | Word | Best system | |
|---|---|---|---|---|---|
| Curse | 14.68 | 30.28 | 24.77 | ||
| Defamation | 23.81 | 47.62 | 38.10 | ||
| Defense | 22.42 | 52.12 | 43.64 | ||
| Encouragement | 0.00 | 100.00 | 100.00 | ||
| Insult | 26.67 | 41.74 | 35.94 | ||
| Sexual | 63.80 | 21.47 | 21,47 | ||
| Threat | 8.33 | 41.67 | 25.00 | ||
| Not cyberbullying | 36.94 | 1.10 | 0.76 | ||
| Curse | 39.58 | 50.00 | 22.92 | ||
| Defamation | 100.00 | 66.67 | 33.33 | ||
| Defense | 52.50 | 63.50 | 46.00 | ||
| Encouragement | 40.00 | 60.00 | 40.00 | ||
| Insult | 43.38 | 47.89 | 28.17 | ||
| Sexual | 37.84 | 21.62 | 27.03 | ||
| Threat | 33.33 | 46.67 | 20.00 | ||
| Not cyberbullying | 15.63 | 1.23 | 3.07 | ||
Error rates (%) per cyberbullying participant role on hold-out for English and Dutch systems.
| Participant role | Nr. occurrences in hold-out | Profanity baseline | Word | Best system | |
|---|---|---|---|---|---|
| Harasser | 20.43 | 48.48 | 43.60 | ||
| Bystander assistant | 50.00 | 100.00 | 100.00 | ||
| Bystander defender | 7.69 | 38.46 | 25.64 | ||
| Victim | 27.91 | 57.36 | 50.39 | ||
| Not cyberbullying | 37.64 | 1.24 | 0.89 | ||
| Harasser | 47.13 | 56.70 | 29.89 | ||
| Bystander assistant | 50.00 | 66.67 | 50.00 | ||
| Bystander defender | 25.00 | 38.46 | 23.08 | ||
| Victim | 62.00 | 72.00 | 54.00 | ||
| Not cyberbullying | 16.01 | 1.42 | 3.41 | ||
Overview of the most related cyberbullying detection approaches.
| Reference | Classifier | Corpus | Bully rate | F1 score |
|---|---|---|---|---|
| [ | SVM | 1,762 tweets | 39% | 77% |
| [ | wvec+SVM | 1,762 tweets | 39% | 78% |
| [ | smSDA+SVM | 7,321 tweets | 29% | 72% |
| [ | smSDA+SVM | 1,539 MySpace posts | 26% | 78% |