| Literature DB >> 33755932 |
Isabelle van der Vegt1, Maximilian Mozes2,3,4, Bennett Kleinberg2,3,5, Paul Gill2.
Abstract
This paper introduces the Grievance Dictionary, a psycholinguistic dictionary that can be used to automatically understand language use in the context of grievance-fueled violence threat assessment. We describe the development of the dictionary, which was informed by suggestions from experienced threat assessment practitioners. These suggestions and subsequent human and computational word list generation resulted in a dictionary of 20,502 words annotated by 2318 participants. The dictionary was validated by applying it to texts written by violent and non-violent individuals, showing strong evidence for a difference between populations in several dictionary categories. Further classification tasks showed promising performance, but future improvements are still needed. Finally, we provide instructions and suggestions for the use of the Grievance Dictionary by security professionals and (violence) researchers.Entities:
Keywords: Grievances; LIWC; Language; Psycholinguistic dictionary; Threat assessment; Violence
Mesh:
Year: 2021 PMID: 33755932 PMCID: PMC8516761 DOI: 10.3758/s13428-021-01536-2
Source DB: PubMed Journal: Behav Res Methods ISSN: 1554-351X
Dictionary categories with example words (defined in later steps)
| Category | Examples |
|---|---|
| Planning | long-term, tactic, organize |
| Violence | bloodshed, fight, bullet |
| Weaponry | AK-47, ammo, fire arm |
| Help seeking | support, SOS, save |
| Hate | enemy, loathe, hatred |
| Frustration | annoyed, problem, powerless |
| Suicide | die, overdose, last resort |
| Threat | warn, danger, unsafe |
| Grievance | wrong, disappointed, injustice |
| Fixation | obsess, possess, watch |
| Desperation | sorrow, last chance, urgent |
| Deadline | time run out, due date, upcoming |
| Murder | kill, stab, fatal |
| Relationship | marry, romantic, love |
| Loneliness | disconnected, nobody, abandon |
| Surveillance | spy, CCTV, monitor |
| Soldier | fighter, battle, patriot |
| Honor | integrity, hero, brave |
| Impostor | impersonate, fraudulent, undercover |
| Jealousy | cheat, resent, bitter |
| God | pray, holy, almighty |
| Paranoia | suspicious, conspiracy, suspect |
Corpora used for internal consistency computation
| Corpus | Number of documents (number of tokens) |
|---|---|
| Deception detection experimentsa | 2547 (454,217) |
| Novels (Lahiri, | 3036 (247,142,420) |
| IMDB movie reviews (Maas et al., | 50,000 (13,934,687) |
| Reddit posts (Demszky et al., | 70,000 (1,081,539) |
Note. aHotel reviews (Ott, Choi, Cardie, & Hancock, 2011; Ott, Cardie, & Hancock, 2013), descriptions of past and planned activities (Kleinberg et al., 2019)
Internal consistency scores
| Category | Cronbach’s alpha |
|---|---|
| Deadline | 0.27 |
| Desperation | 0.21 |
| Fixation | 0.12 |
| Frustration | 0.22 |
| God | 0.35 |
| Grievance | 0.16 |
| Hate | 0.30 |
| Help | 0.19 |
| Honor | 0.26 |
| Impostor | 0.19 |
| Jealousy | 0.21 |
| Loneliness | 0.18 |
| Murder | 0.35 |
| Paranoia | 0.23 |
| Planning | 0.31 |
| Relationship | 0.33 |
| Soldier | 0.37 |
| Suicide | 0.26 |
| Surveillance | 0.25 |
| Threat | 0.30 |
| Violence | 0.36 |
| Weaponry | 0.34 |
Correlations (with confidence interval) Grievance Dictionary and LIWC
| Category | Strongest correlating LIWC categories | ||
|---|---|---|---|
| Deadline | cause: 0.10 [0.06–0.13] | drives: 0.06 [0.03–0.09] | work: 0.11 [0.06–0.16] |
| Desperation | discrep: 0.27 [0.15–0.40] | sad: 0.16 [0.08–0.25] | verb: 0.13 [0.09–0.16] |
| Fixation | insight: 0.24 [0.15–0.33] | pronoun: 0.18 [0.08–0.29] | verb: 0.20 [0.12–0.27] |
| Frustration | feel: 0.17 [0.07–0.26] | negemo: 0.13 [0.07–0.19] | sad: 0.09 [0.05–0.14] |
| God | affiliation: 0.21 [0.10–0.31] | posemo: 0.14 [0.11–0.18] | relig: 0.32 [0.12–0.52] |
| Grievance | affect: 0.08 [0.07–0.09] | negemo: 0.16 [0.06–0.26] | sad: 0.12 [0.05–0.18] |
| Hate | affect: 0.09 [0.06–0.12] | anger: 0.23 [0.12–0.34] | negemo: 0.15 [0.09–0.21] |
| Help | affect: 0.17 [0.10–0.25] | posemo: 0.20 [0.14–0.26] | reward: 0.23 [0.12–0.35] |
| Honor | affect: 0.18 [0.09–0.27] | drives: 0.16 [0.07–0.26] | posemo: 0.22 [0.12–0.32] |
| Impostor | power: – 0.03 [– 0.04–0.02] | relativ: – 0.05 [–0.09--0.02] | space: – 0.04 [–0.07–0.02] |
| Jealousy | cogproc: 0.11 [0.06–0.16] | discrep: 0.07 [0.05–0.10] | insight: 0.15 [0.07–0.23] |
| Loneliness | discrep: 0.06 [0.03–0.10] | sad: 0.08 [0.03–0.13] | time: 0.06 [0.04–0.08] |
| Murder | affect: 0.09 [0.04–0.13] | anger: 0.20 [0.10–0.31] | negemo: 0.17 [0.07–0.27] |
| Paranoia | anx: 0.11 [0.05–0.17] | cogproc: 0.08 [0.04–0.13] | negemo: 0.11 [0.06–0.16] |
| Planning | Authentic: 0.13 [0.05–0.21] | focuspresent: 0.14 [0.08–0.19] | insight: 0.15 [0.07–0.23] |
| Relation. | affiliation: 0.28 [0.12–0.43] | family: 0.23 [0.13–0.33] | social: 0.28 [0.10–0.46] |
| Soldier | achieve: 0.12 [0.10–0.15] | drives: 0.15 [0.12–0.18] | power: 0.17 [0.09–0.25] |
| Suicide | death: 0.16 [0.09–0.23] | health: 0.17 [0.07–0.28] | sad: 0.14 [0.08–0.21] |
| Surveillance | affect: – 0.05 [–0.07–0.02] | anger: – 0.04 [–0.06–0.02] | negemo: – 0.04 [– 0.06–0.02] |
| Threat | anger: 0.23 [0.13–0.33] | negemo: 0.17 [0.10–0.25] | Tone: – 0.14 [– 0.20–0.07] |
| Violence | anger: 0.21 [0.10–0.32] | death: 0.20 [0.09–0.32] | negemo: 0.28 [0.10–0.45] |
| Weaponry | negemo: 0.10 [0.05–0.15] | posemo: – 0.07 [–0.11–0.04] | Tone: – 0.11 [–0.16–0.05] |
Note. All correlations were statistically significant at the p < 0.0023 (0.05/22 categories) level.
Corpora used for statistical tests
| Corpus | No. of documents | Mean word count (SD) |
|---|---|---|
| Lone-actor terrorist manifestos | 4572 | 100 (4) |
| Neutral texts from blogs and forums | 680,792 | 243 (503) |
| Stormfront posts | 461,950 | 95 (229) |
| Stream-of-consciousness (SOC) essays | 789 | 121 (35) |
| Abusive writing directed at politicians | 789 | 121 (38) |
Mean dictionary matches (percentage) per dataset
| Category | Lone-actormanifestos | Neutraltexts | Stormfrontposts | SOC | Abusivewriting |
|---|---|---|---|---|---|
| Deadline | 2.58 | 1.47 | 1.23 | 2.38 | 1.57 |
| Desperation | 0.83 | 0.76 | 0.67 | 2.32 | 0.95 |
| Fixation | 0.38 | 0.71 | 0.58 | 2.04 | 1.02 |
| Frustration | 0.52 | 0.38 | 0.29 | 1.91 | 0.55 |
| God | 2.81 | 0.67 | 0.72 | 0.63 | 0.75 |
| Grievance | 0.56 | 0.38 | 0.40 | 1.71 | 0.56 |
| Hate | 2.04 | 0.50 | 0.84 | 1.83 | 1.29 |
| Help | 1.75 | 1.22 | 1.26 | 1.25 | 1.28 |
| Honor | 1.75 | 0.55 | 0.73 | 0.61 | 1.27 |
| Impostor | 0.34 | 0.15 | 0.20 | 0.09 | 0.44 |
| Jealousy | 0.55 | 0.29 | 0.29 | 1.35 | 0.46 |
| Loneliness | 0.79 | 0.96 | 0.84 | 1.87 | 1.03 |
| Murder | 3.22 | 0.87 | 1.30 | 0.96 | 1.45 |
| Paranoia | 0.82 | 0.49 | 0.45 | 1.93 | 0.56 |
| Planning | 3.68 | 1.81 | 1.72 | 2.89 | 2.04 |
| Relationship | 3.56 | 2.26 | 2.29 | 2.97 | 2.49 |
| Soldier | 4.17 | 0.70 | 1.22 | 1.05 | 1.01 |
| Suicide | 1.85 | 0.83 | 0.80 | 1.27 | 0.95 |
| Surveillance | 2.41 | 1.17 | 1.40 | 0.89 | 1.11 |
| Threat | 2.52 | 0.38 | 0.77 | 0.67 | 0.86 |
| Violence | 3.74 | 0.67 | 1.25 | 0.77 | 1.30 |
| Weaponry | 3.00 | 0.37 | 0.87 | 0.18 | 0.55 |
| No match | 56.14 | 82.44 | 79.89 | 68.43 | 76.52 |
Statistical test results (effect size d with confidence interval and Bayes factor)
| Manifestos vs. neutral | Manifestos vs. Stormfront | Abuse vs. SOC | ||||
|---|---|---|---|---|---|---|
| BF | BF | BF | ||||
| Deadline | 0.71 [0.70;0.71] | 0.85 [0.85;0.86] | – 0.43 [– 0.52;– 0.31] | |||
| Desperation | 0.07 [0.06;0.07] | 1.69 | 0.16 [0.15;0.16] | – 0.88 [– 1.03;– 0.78] | ||
| Fixation | – 0.47 [– 0.48;– 0.47] | – 0.27 [– 0.28;– 0.27] | – 0.68 [– 0.82;– 0.57] | |||
| Frustration | 0.21 [0.2;0.21] | 0.34 [0.33;0.34] | – 0.87 [– 1.00;– 0.74] | |||
| God | 0.87 [0.86;0.87] | 0.84 [0.84;0.84] | 0.10 [– 0.00;0.23] | 0.85 | ||
| Grievance | 0.26 [0.26;0.26] | 0.22 [0.21;0.22] | – 0.84 [– 0.97;– 0.73] | |||
| Hate | 1.16 [1.16;1.17] | 0.84 [0.84;0.84] | – 0.32 [– 0.43;– 0.20] | |||
| Help | 0.41 [0.41;0.42] | 0.36 [0.36;0.36] | 0.03 [– 0.09;0.15] | -2.95 | ||
| Honor | 0.85 [0.85;0.86] | 0.69 [0.69;0.70] | 0.53 [0.41;0.65] | |||
| Impostor | 0.36 [0.35;0.36] | 0.23 [0.22;0.23] | 0.45 [0.38;0.55] | |||
| Jealousy | 0.38 [0.37;0.38] | 0.36 [0.36;0.36] | – 0.72 [– 0.83;– 0.61] | |||
| Loneliness | – 0.17 [– 0.17;– 0.16] | – 0.05 [– 0.05;– 0.05] | – 0.29 | – 0.57 [– 0.70;– 0.47] | ||
| Murder | 1.27 [1.27;1.27] | 0.96 [0.95;0.96] | 0.33 [0.22;0.43] | |||
| Paranoia | 0.39 [0.38;0.39] | 0.42 [0.42;0.43] | – 0.99 [-1.11;– 0.88] | |||
| Planning | 0.94 [0.94;0.95] | 0.97 [0.97;0.98] | – 0.41 [– 0.53;– 0.29] | |||
| Relation. | 0.55 [0.55;0.56] | 0.52 [0.52;0.53] | – 0.21 [– 0.32;– 0.09] | |||
| Soldier | 1.57 [1.57;1.57] | 1.27 [1.26;1.27] | – 0.03 [– 0.14;0.07] | -2.82 | ||
| Suicide | 0.74 [0.74;0.75] | 0.74 [0.74;0.75] | – 0.22 [– 0.34;– 0.12] | |||
| Surveillance | 0.71 [0.70;0.71] | 0.54 [0.54;0.55] | 0.17 [0.05;0.27] | 7.84 | ||
| Threat | 1.46 [1.46;1.46] | 1.11 [1.10;1.11] | 0.16 [0.05;0.27] | 7.15 | ||
| Violence | 1.55 [1.55;1.55] | 1.16 [1.16;1.16] | 0.36 [0.26;0.49] | |||
| Weaponry | 1.39 [1.39;1.40] | 1.05 [1.05;1.06] | 0.39 [0.30;0.48] | |||
Notes. A positive d denotes a higher score on the category for the lone-actor terrorist manifestos (test 1 and 2) and abusive texts (test 3). A BF above 10 (in bold) constitutes strong evidence for the alternative hypothesis
Classification results
| Task | Feature set | Accuracy | Kappa | Specificity | Precision | Recall |
|---|---|---|---|---|---|---|
| 1. LA vs. neutral | a. Grievance | 0.96 | 0.92 | 0.97 | 0.97 | 0.96 |
| b. LIWC | 0.99 | 0.98 | 0.99 | 0.99 | 0.99 | |
| c. Grievance + LIWC | 0.99 | 0.98 | 0.99 | 0.99 | 0.99 | |
| 2. LA vs. Stormfront | a. Grievance | 0.94 | 0.87 | 0.94 | 0.94 | 0.94 |
| b. LIWC | 0.99 | 0.98 | 0.99 | 0.99 | 0.99 | |
| c. Grievance + LIWCa | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | |
| 3. Abuse vs. neutral | a. Grievance | 0.83 | 0.67 | 0.86 | 0.86 | 0.82 |
| b. LIWCa | 0.96 | 0.92 | 0.98 | 0.98 | 0.94 | |
| c. Grievance + LIWCa | 0.96 | 0.92 | 0.98 | 0.98 | 0.94 |
aThe best performing model for these tasks was a linear SVM, rather than a random forest model (best performing in all other tasks)
Feature importance per task (top five, full list of features on OSF)
| Task | Feature set | Important features |
|---|---|---|
| 1. LA vs. neutral | a. Grievance | soldier, weaponry, violence, impostor, threat |
| b. LIWC | analytic language, present focus, power, differentiation, work | |
| c. Grievance + LIWC | analytic language, differentiation, present focus, soldier, violence | |
| 2. LA vs. Stormfront | a. Grievance | soldier, relationship, impostor, threat, hate |
| b. LIWC | differentiation, analytic language, present focus, tentative, discrepancies | |
| c. Grievance + LIWC | differentiation, analytic language, present focus, tentative, discrepancies | |
| 3. Abuse vs. neutral | a. Grievance | paranoia, grievance, frustration, fixation, desperation |
| b. LIWC | authentic language, social words, clout, feel, male | |
| c. Grievance + LIWC | authentic language, social words, clout, feel, male |