| Literature DB >> 34883861 |
Fernando H Calderón1,2, Namrita Balani1, Jherez Taylor1, Melvyn Peignon1, Yen-Hao Huang1, Yi-Shin Chen1.
Abstract
The permanent transition to online activity has brought with it a surge in hate speech discourse. This has prompted increased calls for automatic detection methods, most of which currently rely on a dictionary of hate speech words, and supervised classification. This approach often falls short when dealing with newer words and phrases produced by online extremist communities. These code words are used with the aim of evading automatic detection by systems. Code words are frequently used and have benign meanings in regular discourse, for instance, "skypes, googles, bing, yahoos" are all examples of words that have a hidden hate speech meaning. Such overlap presents a challenge to the traditional keyword approach of collecting data that is specific to hate speech. In this work, we first introduced a word embedding model that learns the hidden hate speech meaning of words. With this insight on code words, we developed a classifier that leverages linguistic patterns to reduce the impact of individual words. The proposed method was evaluated across three different datasets to test its generalizability. The empirical results show that the linguistic patterns approach outperforms the baselines and enables further analysis on hate speech expressions.Entities:
Keywords: hate speech; linguistic patterns; social media
Mesh:
Year: 2021 PMID: 34883861 PMCID: PMC8659976 DOI: 10.3390/s21237859
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Notations.
| Notation | Description |
|---|---|
|
| A contextual graph built with output from |
|
| A |
|
| A |
|
| A learned embedding model of type |
|
| A stored vocabulary for a given embedding model |
|
| A word embedding model trained on TwitterClean |
|
| A word embedding model trained on HateComm |
Figure 2Surfaced keywords.
Experiment selection.
| Code Words | |
|---|---|
| niggers (positive control) | water (negative control) |
| snake | googles |
| cuckservatives | skypes |
| creatures | moslems |
| cockroaches | primitives |
Experiment Sample.
| another cop killed and set on fire by googles |
| @user i’m sick of these worthless googles >>#backtoafrica |
| strange mixed-breed creatures jailed for killing white woman |
| germany must disinfect her land. one cockroach at a time if |
| necessary |
Aggregate annotator classification.
| Hate Speech | Not Hate Speech | ||
|---|---|---|---|
| Precision | 0.88 | 1.00 | |
| HateCommunity | Recall | 1.00 | 0.67 |
| F1 | 0.93 | 0.80 | |
| Precision | 1.00 | 0.86 | |
| TwitterClean | Recall | 0.75 | 1.00 |
| F1 | 0.86 | 0.92 | |
| Precision | 0.75 | 0.83 | |
| TwitterHate | Recall | 0.75 | 0.83 |
| F1 | 0.75 | 0.83 |
HateCommunity Word: Ranking Distribution.
| HateCommunity Results | ||||
|---|---|---|---|---|
| Ground Truth | Annotators | |||
| Words | Label | Percent | Label | Percent |
| niggers | Very likely | 0.8 | Very likely | 0.68 |
| snakes | Unlikely | 0.4 | Neutral | 0.26 |
| googles | Very likely | 1.0 | Very likely | 0.41 |
| cuckservatives | Unlikely | 1.0 | Likely | 0.36 |
| skypes | Likely | 0.8 | Likely | 0.3 |
| creatures | Very likely | 0.6 | Very likely | 0.4 |
| moslems | Likely | 0.8 | Very likely | 0.39 |
| cockroaches | Very likely | 1.0 | Very likely | 0.40 |
| water | Very unlikely | 1.0 | Very unlikely | 0.65 |
| primatives | Very likely | 0.6 | Very likely | 0.37 |
Examples of patterns and templates extracted through the basic pattern extraction mechanism. The asterisk (*) refers to a wildcard token which can be replaced by other subject words.
| Templates | Pattern Examples |
|---|---|
|
| stupid *, like *, am * |
|
| love you *, shut up * |
|
| * for * |
|
| * on the |
|
| * <hashtag> |
Results obtained for Twitter datasets when using different approaches. Top performance highlighted in bold.
| HbT | HOL | W&H | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Model | Features | Acc. % | Prec.% | Rec. % | F1 % | Acc. % | Prec.% | Rec. % | F1 % | Acc. % | Prec. % | Rec. % | F1 % |
| NB | TF-IDF | 68.4 | 63.1 | 72.1 | 67.9 | 68.4 | 63.1 | 72.1 | 67.9 | 95.5 | 46.6 | 61.9 | 51.1 |
| NB | BOW | 86.0 | 39.7 | 58.7 | 42.1 | 86.0 | 51.2 | 77.8 | 54.2 | 61.8 | 37.2 | 72.4 | 57.1 |
| LR | BOW | 73.1 | 68.4 | 78.9 | 73.5 | 73.1 | 68.4 | 78.9 | 73.5 | 82.8 | 53.8 | 69.1 | 68.6 |
| FT | BOW | 74.0 | 66.7 | 79.1 | 73.3 | 74.0 | 66.7 | 79.1 | 73.3 | 84.7 | 71.7 | 62.0 | 72.8 |
| SOTA | 90.0 | 77.0 | 86.0 | 84.3 | 82.0 | 77.0 | 84.0 | 81.0 |
| 52.0 |
| 77.7 | |
| LP1 |
| 87.9 | 90.0 | 86.8 | 88.2 | 89.0 | 90.5 | 88.5 | 89.3 | 79.8 | 79.0 | 82.7 | 80.5 |
| LP2 |
|
|
|
|
|
|
|
|
| 82.1 |
| 86.7 |
|
Top 20 common patterns that were generated in all the datasets. “.+” represents the wildcard token.
| Without Derogatory Term | With Derogatory Term |
|---|---|
| another man .+ | they ass .+ |
| .+ her man | not fucking .+ |
| mad that .+ | ass niggas .+ |
| .+ know nothing | bitch no .+ |
| some girls .+ | faggot if .+ |
| getting money .+ | them niggas .+ |
| makes no .+ | bitch when .+ |
| .+ has nothing | fucking with .+ |
| funny how .+ | .+ bitches be |
| .+ my mouth | fuck my .+ |
| .+ come from | bitch niggas .+ |
| .+ going down | .+ a gay |
| trash that .+ | .+ some fucking |
| .+ the biggest | real nigger .+ |
| .+ you ugly | bitch .+ URLTOK |
| .+ you thought | .+ faggot & |
| come from .+ | .+ yo nigga |
| .+ could never | hoes .+ i |
| .+ stop making | hate .+ bitch |
| you .+ talking | .+ white bitches |