| Literature DB >> 36090694 |
Xin Pei1, Deval Mehta2.
Abstract
Transcending the binary categorization of racist texts, our study takes cues from social science theories to develop a multidimensional model for racism detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of BERT and topic modelling, this categorical detection enables insights into the underlying subtlety of racist discussion on digital platforms during COVID-19. Our study contributes to enriching the scholarly discussion on deviant racist behaviours on social media. First, a stage-wise analysis is applied to capture the dynamics of the topic changes across the early stages of COVID-19 which transformed from a domestic epidemic to an international public health emergency and later to a global pandemic. Furthermore, mapping this trend enables a more accurate prediction of public opinion evolvement concerning racism in the offline world, and meanwhile, the enactment of specified intervention strategies to combat the upsurge of racism during the global public health crisis like COVID-19. In addition, this interdisciplinary research also points out a direction for future studies on social network analysis and mining. Integration of social science perspectives into the development of computational methods provides insights into more accurate data detection and analytics.Entities:
Keywords: COVID-19; Deviant behaviours; Racism; Social media
Year: 2022 PMID: 36090694 PMCID: PMC9451118 DOI: 10.1007/s13278-022-00967-9
Source DB: PubMed Journal: Soc Netw Anal Min
Definition and example of categorization of racist and xenophobic behaviours
| Category | Definition | Example |
|---|---|---|
| Stigmatization | Confirming negative stereotypes for conveying a devalued social identity within a particular context (Miller and Kaiser | For all the #ChinaVirus jumped from a bat at the wet market |
| Offensiveness | Attacking a particular social group through aggressive and abusive language (Jeshion | Real misogyny in communist China. #chinazi #China_is_terrorist #China_is_terrorists #FuckTheCCP |
| Blame | Attributing the responsibility for the negative consequences of the crisis to one social group (Coombs and Schmidt | These Chinese are absolutely disgusting. They spread the #ChineseVirus. Their lies created a pandemic #ChinaMustPay |
| Exclusion | The process of othering to draw a clear boundary between in-group and out-group members (Bailey and Harindranath | China deserves to be isolated by all means forever. SARS was also initiated in China, 2003 by eating anything & everything #BoycottChina |
Fig. 1Analysis of the number of tweets returned daily by our custom wrapper (based on Tweepy API) from 1 Jan to 30 Apr 2020
Fig. 2Density distribution of token lengths of the tweets in our dataset
Performance of different models on the manually annotated test dataset.
| Technique | Accuracy(%) | F1-score |
|---|---|---|
| SVM | 69.04 | 0.66 |
| SVM + TF-IDF features | 75.8 | 0.74 |
| SVM + BOW features | 71.6 | 0.70 |
| SVM + Word2Vec features | 71.4 | 0.70 |
| LSTM | 74.01 | 0.72 |
| BERT |
Mean accuracy and f1-score for the fivefolds
Fig. 3Confusion matrix (averaged for the fivefolds of validation data) of our trained BERT model for racism classification
Extracted topics and their corresponding keywords for the category of stigmatization spread across the three stages S1, S2, and S3
| S1 | T1.Virus | Virus | Spread | Country | Travel | Year | Control |
| Ban | Corona | Show |
| T2.China/Chinese |
| Virus | Deadly |
| Situation | Mask | Stop | Animal | Source | Eat | |
| T3.Infection | People | Case | Health | Infect | Confirm | Death | Sar | Number | Report | Market | |
| T4.Outbreak |
| Coronavirus | Wuhan | Outbreak | City | Hospital | News | Patient | Put | State | |
| T5.Travel | World |
| Government | Make | People | Time | Day | Bad | Flight | Start | |
| S2 | T1.Emergency | Virus | Spread | Day | Year | Corona | Show | Emergency | Food | Kit | Supply |
| T2.Globe |
| World | Time | Country | Report | Death | Global | Health | Travel | Confirm | |
| T3.Infection | People | Case | Call | ncov | Infect | Kill | Pack | State | Flu | Number | |
| T4.China |
| Coronavirus | Wuhan | Outbreak | Quarantine | Stop | Find | Man | Dead | Thing | |
| T5.Chinese |
| Make | Mask | Government | News | Good | Work | Citizen | Start | Respirator | |
| S3 | T1.Government |
| World | Spread | Country | Lie | Pay | Communist | Government | ccp | Make |
| T2.? | Time | Make |
| Good | Give | Work | Day | Back | Fight | Buy | |
| T3.China |
| Coronavirus | Case | Death | Covid | Country | Economy | War | Number | Wuhan | |
| T4.Chinese |
| Virus | People | Call | Stop | Racist | Start | Die | Blame | Corona | |
| T5.US |
| Trump |
| Medium | President |
| News | Great | Propaganda | Show |
Extracted topics and their corresponding keywords for the category of exclusion spread across the three stages S1, S2, and S3
| S1 | T1.Government | support | gov | join | people | evil | time | stand | sanction | Government | money |
| T2.Human right | product | world | stop | human_right | freedom | tag | good | challenge | ppl | economic_infiltration | |
| T3.Boycott |
|
| fight | regime | boycott | show | international | control | trust | communist | |
| T4.Trade | make | buy | ccp | day | thing | friend |
|
| hope | today | |
| T5.Virus | Country |
| people | spread | Year | human | animal | protect | virus | eat | |
| S2 | T1.Nation | people |
| animal | happen | Government | initiative | nation | show | economy | law |
| T2.Virus | virus | control | truth | support | live | kill | boycott | start | stand | cover | |
| T3.Threat |
| time | lie | threat | company | trust | big | entire | spy |
| |
| T4.Human right | world | Country | freedom | spread | human_right | economic | thing | evil | steal | raise | |
| T5.Trade | make | product | stop | buy | day |
| good | ccp | challenge | coronavirus | |
| S3 | T1.Virus |
| virus | world | pay | spread | ccp | covid | corona | market | call |
| T2.Pandemic | world |
| company | communist | coronavirus | pandemic | global | nation | trust | war | |
| T3.Trade |
| make | product | buy | boycott | stop | good |
| economy |
| |
| T4.Human right | people | lie | Government | human | life | back | animal | kill | eat | bring | |
| T5.China |
| Country | time | start | business | give | thing | app | sell | money |
Distribution of tweets amongst the four categories across the three stages
| Category | Total | S1 | S2 | S3 |
|---|---|---|---|---|
| Stigmatization | 116,584 | 3723 | 5687 | 107,174 |
| Offensiveness | 10,503 | 1722 | 1808 | 6973 |
| Blame | 39,765 | 31 | 777 | 38,957 |
| Exclusion | 10,293 | 872 | 1341 | 8080 |
Extracted topics and their corresponding keywords for the category of offensiveness spread across the three stages S1, S2, and S3
| S1 | T1.? | Country | ccp | Citizen | Virus | Arrest | Live | System | Security | Foreign | Understand |
| T2.Government | People | Government | Democracy | Support | Life | Year | Regime |
| Camp | Give | |
| T3.? |
| World | Spread | Stop | Communist | Happen |
| Wuhan | Govt | Ban | |
| T4.Muslim |
| Make | Muslim | Good | Kill | Police | Terrorist | Bad | Party | Lie | |
| T5.Human right | World | Freedom |
| Human | Human_right | Time | Free | Stand |
| Fight | |
| S2 | T1.Freedom | World | Stop | Freedom | Truth | Spread | Good | Free |
| Speech | Life |
| T2.Ccp |
|
| ccp | Virus | Happen |
| Evil | Communist | Time |
| |
| T3.People | People | Make | Kill | Lie | ppl | Trust | Camp | Police | Thing | Man | |
| T4.China |
| Country | Regime | Pay | Money | Outbreak | Start | Work | Force | Control | |
| T5.Human right | Government | Citizen | Human | Fight | Support | Hong_Kong | Taiwan | Give | Democracy | Death | |
| S3 | T1.Death | World | People | Pay | Lie | Kill | Truth | Fight | Life | Die | Humanity |
| T2.Government | Time | Call | Government |
| Communist | Pandemic | Give | Global | Send | Real | |
| T3.Virus |
| Virus | Spread |
| Corona | Product | Buy | Control | Big | Day | |
| T4.China |
| Country | Make | ccp | Stop | Good | Coronavirus | Human | Trust | Support | |
| T5.World |
| World | War | Case | Start | Covid | Economy | Death | State |
|
Extracted topics and their corresponding keywords for the category of blame spread across the three stages S1, S2, and S3
| S1 | T1.Lie | Lie | Spread | Virus | Autocracy | Deceit | Imagine | True | Horrible | Infect | Country |
| T2.Death |
| Dead | Die | Day | Order | Monstrosity | True | Thing | Kong | High | |
| T3.Safety | Coronavirus | Move | Lot | Cvirus | Epicentre | Safety | March | Careful | Knowingly | Health | |
| T4.Time |
| Lunar_new | Sick | Year | Time | Absolutely | Medium | Mutate | Emperor | Truth | |
| T5.Infection | People |
| Make | Online | Pandemic | Catch | Number | Infect | Community | Official | |
| S2 | T1.Government | Lie |
| Coronavirus | Government |
| Cover | Day | Body | Thing | Care |
| T2.Spread | World | Country | Spread | Happen | Trust | Kill | Threat | Steal | Dead | Face | |
| T3.China |
| Truth | Bad | Free | Money | Communist | Case | Find | Start | Move | |
| T4.Virus | Virus | Stop | Make | Control | Good |
| Fight | Live | Report | Human | |
| T5.Death | People | Time | Number | Die | Real | life | Entire | Back | Citizen | Death | |
| S3 | T1.World | World |
| Country | Pay | Pandemic | Kill | Global | Economy | War | |
| T2.? | People | Stop | Human |
| Eat | Put | President | Market | Happen | Live | |
| T3.Lie |
| Lie | Coronavirus |
| blame | Die | Case | Cover | Truth | Number | |
| T4.? | Make | Time |
| Good | Start | Buy | Trust | Back | Thing | Country | |
| T5.Government |
| Virus |
| Government | Call | Communist | ccp | Covid | Spread | Hold |