| Literature DB >> 35039755 |
Javier Torregrosa1, Gema Bello-Orgaz1, Eugenio Martínez-Cámara2, Javier Del Ser3, David Camacho1.
Abstract
Extremism has grown as a global problem for society in recent years, especially after the apparition of movements such as jihadism. This and other extremist groups have taken advantage of different approaches, such as the use of Social Media, to spread their ideology, promote their acts and recruit followers. The extremist discourse, therefore, is reflected on the language used by these groups. Natural language processing (NLP) provides a way of detecting this type of content, and several authors make use of it to describe and discriminate the discourse held by these groups, with the final objective of detecting and preventing its spread. Following this approach, this survey aims to review the contributions of NLP to the field of extremism research, providing the reader with a comprehensive picture of the state of the art of this research area. The content includes a first conceptualization of the term extremism, the elements that compose an extremist discourse and the differences with other terms. After that, a review description and comparison of the frequently used NLP techniques is presented, including how they were applied, the insights they provided, the most frequently used NLP software tools, descriptive and classification applications, and the availability of datasets and data sources for research. Finally, research questions are approached and answered with highlights from the review, while future trends, challenges and directions derived from these highlights are suggested towards stimulating further research in this exciting research area.Entities:
Keywords: Deep learning; Extremism; Machine learning; Natural language processing; Radicalization
Year: 2022 PMID: 35039755 PMCID: PMC8754364 DOI: 10.1007/s12652-021-03658-z
Source DB: PubMed Journal: J Ambient Intell Humaniz Comput
Fig. 1Overall structure of the review. Blue color: theoretical conceptualization; Yellow color: literature analysis; green color: tools; Pink: prospective analysis
Fig. 2Graphic representation of the overlap between extremism and concepts usually mentioned in the same context. A deeper analysis can be found on Table 1
Concepts, definitions and distinction from extremism
| Concept | Definition | Distinction from extremism | Example of the concept |
|---|---|---|---|
| Supremacism | The ideology that one group is naturally superior to another one, due to their race, sex, economic status, nation, etc. (Schaefer | Supremacism would be a subtype of extremism, as supremacist groups are contrary to the existence of equal rights | White supremacist movement (Kantrowitz |
| Sectarianism | Form of discrimination between groups based on a specific factor. For years, it was limited to religion, but nowadays this concept is technically similar to supremacism (Phillips | As with supremacism, sectarianism would be a sub-type of extremism, as it is contrary to the existence of equal rights | There are several examples of sectarian groups, such as the Islamic State (Roy |
| Terrorism | Systematic use of violence, propaganda and fear towards and specific population to achieve ideological objectives (López et al. | Terrorism always implies violence, while extremism does not necessarily use it. However, both are against one or more fundamental values of a society | There are many examples of terrorism, both in national and international contexts, such as IRA in Ireland/North Ireland (Pruitt |
| Polarization | Ideological movement towards a more extreme point of view in whatever direction is indicated by the member’s predeliberation tendency (Sunstein | As occurs with radicalization, polarization is not necessarely violent or against fundamental values of a society | Political or “partisan” polarization (Prior |
| Fundamentalism | Tendency to follow literally certain dogmas or ideologies from the “fundamental” and unchangeable practices of the past. As sectarianism, it has a religious connotation (Hunsberger | Fundamentalism is not necessarely violent or against democratic values | The “Amish” are an example of christian fundamentalist group (Hill and Williamson |
| Nationalism | Ideology based on the nodal point “nation”, on which a community is tied to a certain space, and that is structured through the opposition between the nation and different outgroups (De Cleen | Nationalism does not necessarily imply a negative connotation. When it turns extremist, it would convert to supremacism | Catalonia, Scotland and Canada have some renowned political movements related to nationalism (Keating |
| Hate speech | Language that incites violence or hate against groups, based on specific characteristics, and that can be used with different linguistic styles, such as humour (Fortuna and Nunes | While extremist discourses frequently include hate speech, they both target different audiences (general public vs minorities) and show different objectives (activation vs discrimination). Also, extremism discourse includes more topics than hate speech, such as recruitment or persuasion (McNamee et al. | Anti-semitism or anti-homosexual speech (Leets |
Articles extracted from the different databases that apply NLP to extremism research
| Data source | No. articles |
|---|---|
| ScienceDirect | 95 |
| Scopus | 573 |
| Web of Science | 41 |
| IEEE Xplore | 20 |
| Total | 729 |
Fig. 3Type of extremism addressed by the articles included in the survey
Fig. 4Word cloud of keywords extracted from the analyzed articles
Summary of NLP techniques for feature generation used in the reviewed literature
| Approach | NLP technique | Percentage use | Articles |
|---|---|---|---|
| Lexical or vectorial | N-grams | 28.12% |
de Pablo et al. ( |
| Dictionaries | 37.5% |
Scrivens et al. ( | |
| TF | 50% |
Abdelzaher ( | |
| TF-IDF | 23.43% |
Alghamdi and Selamat ( | |
| Dichotomous appearance | 1.56% |
Wadhwa and Bhatia ( | |
| Log-likelihood | 3.12% |
Stankov et al. ( | |
| Neural language models | Word2Vec | 9.37% |
Abd-Elaal et al. ( |
| FastText | 4.68% |
Ahmad et al. ( | |
| GloVe | 3.12% |
Araque and Iglesias ( | |
| Sintantic and semantic | Part-of-speech | 25% |
Devyatkin et al. ( |
| NER | 7.81% |
Bisgin et al. ( | |
| LSF | 4.68% |
Kim et al. ( | |
| Parse trees | 1.56% |
Sikos et al. ( | |
| LDA | 15.62% |
Bisgin et al. ( | |
| NMF | 4.68% |
Heidarysafa et al. ( | |
| Sentiment scoring | 37.49% |
Wignell et al. ( | |
| Semantic tagging | 12.50% |
Wignell et al. ( | |
| Word/sentence length | 7.81% |
Stankov et al. ( | |
| Use of emoticons | 3.12% |
Agarwal and Sureka ( | |
| Use of punctuation | 3.12% |
Sikos et al. ( |
Type of n-gram model used in the reviewed articles
| N-gram type | Percentage use | Articles using it |
|---|---|---|
| Bi-gram | 15.62% |
de Pablo et al. ( |
| Bi-gram + Tri-gram | 6.25% |
Rekik et al. ( |
| Bi-gram + Tri-gram + Skip-gram | 4.68% |
Abd-Elaal et al. ( |
| Tri-gram + Tetra-gram + Penta-gram | 1.56% |
Hall et al. ( |
Comparison of vector space model based techniques to generate features in the reviewed articles
| Technique | Advantages | Disadvantages |
|---|---|---|
| N-grams | Able to keep semantic information | Captures basic semantic information |
| High versatility, due to its independence from the text (useful for multi-language texts) | The tokens detected may not have interest for the researcher | |
| Dictionaries | Useful to conduct psycho-linguistic meaningful analysis | Low versatility (vulnerable to changes on the language and word structure) |
| Useful to detect and classify specific slang and terminology | Highly dependent on the lexicons included | |
| TF/TF-IDF | Simple and widely used | Not capture semantic context information |
| TF needs a previous stop-words filtering | ||
| Dichotomous appearance | The simplest technique | Does not capture semantic context information |
| Log likelihood | Captures information of association among terms | Few applied in the area information |
Comparison of neural techniques to generate features used in the reviewed articles
| Technique | Advantages | Disadvantages |
|---|---|---|
| Word2Vec | Allows predicting words depending on the context | Does not recognize words not included in the trained lexicon (problematic in multilingual approaches) |
| FastText | Allows incorporating words not contained on trained lexicon | Few applied in the area |
| GloVe | High amount of trained models to work with | Scarcely applied in the area |
Type of sentiment analysis approaches using in the reviewed articles on extremism
| Sentiment analysis approach | Percentage use | Articles using it |
|---|---|---|
| Sentiment scoring (dimensional) | 32.81% |
Wignell et al. ( |
| Emotion scoring (categorical) | 9.37% |
Wignell et al. ( |
Comparison of syntactic and semantic based techniques to generate features for text representation
| Technique | Advantages | Disadvantages |
|---|---|---|
| POS | Allows to detect the grammatical type of tokens | Regarding nouns, not as informative as NER |
| Widely used in the area with different applications (term disambiguation or classification) | ||
| NER | Detects entities, categorizing them. Useful to identify the main actors in an extremist discourse | Not as extended as POS, limited to nouns and to a trained lexicon |
| LSF | Provides a meaningful relationship among tokens. | Does not perform better in the applications within the area than more simple features |
| PT | Finds sentences with a grammatically similar structure | Does not inform about the tokens itself. Not commonly used on extremism literature |
| LDA | Widely used on extremism research | Performs poorly in short texts, such as tweets (very used to conduct extremism analysis) |
| Performs closer to a human topic classifier than other techniques | Tends to over-generalize topics | |
| NMF | Alternative for LDA showing a good performance over short texts. | Not commonly used by authors, who tend to use LDA |
| SS (Dim.) | Simple way of measuring a sentence emotional value | Does not provide elaborate information about emotions in the sentence |
| Useful to detect opinions, specially useful when combined with the detection of entities in the radical discourse | ||
| SC. (Cat.) | Provides information about emotions in the sentence, tagging tokens and sentences with emotional categories (Happiness, sadness, anger...) | Not so useful to detect opinions or tone towards a token |
| ST | As an evolution of NER, this approach “tags” nouns with their entity, concept and category | Useful to discriminate a word thanks to its context, very useful on extremism research |
| Text formatting | Captures more information than those provided by the text itself | Has to be used as a complement to other text features |
Fig. 5Frequency of articles using classification techniques versus those not using them
Fig. 6Type of ML model used in the literature related to extremism research
Type of features input to the ML models employed in the reviewed articles
| ML method | Features | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N-grams | Dic. | TF-IDF | TF | POS | NER | LSF | PT | SS | LDA | Emb. | ST | Others | |
| SVM |
Hartung et al. ( |
Figea et al. ( |
Yang et al. ( |
Hartung et al. ( |
Figea et al. ( |
Hartung et al. ( |
Hartung et al. ( |
Sikos et al. ( |
Figea et al. ( |
Saif et al. ( |
Araque and Iglesias ( |
Saif et al. ( |
Sikos et al. ( |
| KNN |
Sharif et al. ( |
Agarwal and Sureka ( |
Sharif et al. ( |
Ahmad et al. ( |
Wei et al. ( | ||||||||
| NB |
Masood ( |
Yang et al. ( |
Yang et al. ( |
Masood ( |
Yang et al. ( |
Yang et al. ( |
Masood ( |
Wei et al. ( |
Scanlon and Gerber ( |
Masood ( |
Saif et al. ( |
Yang et al. ( | |
| Boosting |
Scanlon and Gerber ( |
Scanlon and Gerber ( |
Devyatkin et al. ( |
Devyatkin et al. ( | |||||||||
| J48 |
Sharif et al. ( |
Fernandez et al. ( |
Sharif et al. ( |
Sharif et al. ( |
Owoeye and Weir ( |
Owoeye and Weir ( |
Abd-Elaal et al. ( |
Fernandez and Alani ( |
Weir et al. ( | ||||
| RF |
Masood ( |
Figea et al. ( |
Ahmad et al. ( |
Masood ( |
Figea et al. ( |
Masood ( |
Figea et al. ( |
Masood ( |
Devyatkin et al. ( |
Weir et al. ( | |||
| Adaboost |
Figea et al. ( |
Yang et al. ( |
Figea et al. ( |
Figea et al. ( |
Yang et al. ( |
Figea et al. ( |
Yang et al. ( | ||||||
| Log R |
Masood |
Smith et al. ( |
Sharif et al. ( |
Masood ( |
Devyatkin et al. ( |
Masood ( |
Wei et al. ( |
Araque and Iglesias ( |
Devyatkin et al. ( | ||||
| LMM |
Smith et al. ( |
Smith et al. ( | |||||||||||
| XGBoost |
Kim et al. ( |
Kim et al. ( |
Kim et al. ( |
Kim et al. ( | |||||||||
| Maximum entropy |
Mirani and Sasi ( |
Mirani and Sasi ( | |||||||||||
| Bagging |
Mirani and Sasi ( |
Mirani and Sasi ( | |||||||||||
| RNN |
Mariconti et al. ( |
Ahmad et al. ( |
Johnston and Marku ( | ||||||||||
| CNN |
Ahmad et al. ( |
Ahmad et al. ( | |||||||||||
| FCNN |
Johnston and Weiss ( | ||||||||||||
| Extra random trees |
Mariconti et al. ( |
Mariconti et al. ( | |||||||||||
| Ensemble methods |
Sharif et al. ( |
Sharif et al. ( | |||||||||||
| SGD |
Sharif et al. ( |
Sharif et al. ( |
Sharif et al. ( | ||||||||||
SVM support vector machine, KNN K-nearest neighbors, NB Naïve Bayes, RF random forest, Log R logistic regression, LMM linear mixed models, RNN recurrent neural networks, CNN convolutional neural networks, FCNN fully-connected neural networks, SGD stochastic gradient descent
Descriptive linguistic approach used by the reviewed articles
| Descriptive linguistic approach | Percentage use | Articles using it |
|---|---|---|
| Terms | 67.85% |
Heidarysafa et al. ( |
| Topics | 46,42% |
Heidarysafa et al. ( |
| Sentiment | 39.28% |
Heidarysafa et al. ( |
| Semantic | 17.85% |
Wignell et al. ( |
| Punctuation | 3.57% |
Stankov et al. ( |
Publicly available datasets for extremism research
| Dataset | Size | Language | Source | Articles using this source |
|---|---|---|---|---|
| Al-Firdaws (Artificial-Intelligence-Lab | 39.715 posts—2.187 users | Arabic | Dark web forum |
Chen ( |
| Montada (Artificial-Intelligence-Lab | 1.865.807 posts—52.546 users | Arabic | Dark web forum |
Chen ( |
| Ansar1 (Artificial-Intelligence-Lab | 29.492 posts—382 users | English | Dark web forum |
Scanlon and Gerber ( |
| How ISIS uses Twitter (Kaggle) (Fifth-Tribe | 17.410 tweets—112 users | English |
Araque and Iglesias ( | |
| Automated Hate Speech Detection and the Problem of Offensive Language Davidson et al. ( | 24.802 tweets—N/A users | English |
Johnston and Marku ( | |
| Crisis Lex Dataset (not specified) (Olteanu et al. | Not specified | English |
Zahra et al. ( | |
| UDI-TwitterCrawl-Aug2012 (Li et al. | 50.000.000 tweets—147.909 users | English |
Agarwal and Sureka ( | |
| Dataset-ATM-TwitterCrawl-Aug2013 (Li et al. | 5.000.000 tweets—N/A users | English |
Agarwal and Sureka ( | |
| Religious Texts Used By ISIS (Fifth-Tribe | 2685 religious texts | English | Religious texts |
Rehman et al. ( |
| Tweets targeting ISIS (ActiveGalaXy | 122.000 tweets—95.725 users | English |
Rehman et al. ( | |
| Gawaher (Artificial-Intelligence-Lab | 372.499 posts—9.629 users | English | Dark web forum |
Scrivens et al. ( |
| Turn to Islam (Artificial-Intelligence-Lab | 335.338 posts—10.858 users | English | Dark web forum |
Scrivens et al. ( |
Publicly available extremist data sources
| Data source | Type of source | Articles using this source |
|---|---|---|
| Dabiq (Global-Terorrism-Research-Project | Extremist magazine |
Macnair and Frank ( |
| Rumiyah (Global-Terorrism-Research-Project | Extremist magazine |
Macnair and Frank ( |
| Inspire (Global-Terorrism-Research-Project | Extremist magazine |
Sikos et al. ( |
| Azan (Mujahid-Azhar | Extremist magazine |
Skillicorn ( |
Fig. 7NLP tools used by the articles reviewed
Fig. 8Diagram showing the main items of the replies to the posed research questions
Fig. 9Future trends and challenges of NLP approaches applied to the extremism research area