| Literature DB >> 34831927 |
Abdelghani Ghanem1, Chaimae Asaad1,2, Hakim Hafidi1, Youness Moukafih1, Bassma Guermah1, Nada Sbihi1, Mehdi Zakroum1, Mounir Ghogho1, Meriem Dairi3, Mariam Cherqaoui4, Karim Baina2.
Abstract
The impact of COVID-19 on socio-economic fronts, public health related aspects and human interactions is undeniable. Amidst the social distancing protocols and the stay-at-home regulations imposed in several countries, citizens took to social media to cope with the emotional turmoil of the pandemic and respond to government issued regulations. In order to uncover the collective emotional response of Moroccan citizens to this pandemic and its effects, we use topic modeling to identify the most dominant COVID-19 related topics of interest amongst Moroccan social media users and sentiment/emotion analysis to gain insights into their reactions to various impactful events. The collected data consists of COVID-19 related comments posted on Twitter, Facebook and Youtube and on the websites of two popular online news outlets in Morocco (Hespress and Hibapress) throughout the year 2020. The comments are expressed in Moroccan Dialect (MD) or Modern Standard Arabic (MSA). To perform topic modeling and sentiment classification, we built a first Universal Language Model for the Moroccan Dialect (MD-ULM) using available corpora, which we have fine-tuned using our COVID-19 dataset. We show that our method significantly outperforms classical machine learning classification methods in Topic Modeling, Emotion Recognition and Polar Sentiment Analysis. To provide real-time infoveillance of these sentiments, we developed an online platform to automate the execution of the different processes, and in particular regular data collection. This platform is meant to be a decision-making assistance tool for COVID-19 mitigation and management in Morocco.Entities:
Keywords: COVID-19; emotion analysis; machine learning; polar sentiment analysis; topic modeling; universal language model for Moroccan dialect
Mesh:
Year: 2021 PMID: 34831927 PMCID: PMC8624830 DOI: 10.3390/ijerph182212172
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1General methodology for COVID-19 Real-Time Infoveillance in Morocco.
Figure 2An excerpt of the timeline of COVID-19 related events in Morocco.
Figure 3Three-step process for creating MD-ULM. (a) General Domain Language Model Pre-training (MD-ULM Pre-training). (b) Target Task Language Model Fine-tuning (MD-ULM fine-tuning). (c) Target Task Classification.
Performance results for Emotion, Topic and Polarity classification.
| Emotion | Topic | Polarity | |
|---|---|---|---|
| Model | Accuracy | ||
| MultinomialNB | 0.31 | 0.51 | 0.64 |
| Logistic Regression | 0.33 | 0.53 | 0.61 |
| SVM | 0.33 | 0.59 | 0.65 |
| MD-ULM |
|
|
|
Figure 4Temporal evolution of COVID-19 related social media communications in Morocco.
Figure 5Summary of evolution of emotions in time.
Figure 6Evolution of emotions in time (Number of comments/Time). (a) Evolution of ‘Anger’ in time. (b) Evolution of ‘Fear’ in time. (c) Evolution of ‘Sadness’ in time. (d) Evolution of ‘Mistrust’ in time. (e) Evolution of ‘Optimism’ in time. (f) Evolution of ‘Approval’ in time.
Figure 7Distribution of topics over documents.
Figure 8Evolution of topics in time (Number of comments/Time). (a) Evolution of ‘Economy’ in time. (b) Evolution of ‘Education’ in time. (c) Evolution of ‘Health’ in time. (d) Evolution of ‘Government’ in time. (e) Evolution of ‘Sanitary Measures’ in time. (f) Evolution of ‘Social Life’ in time. (g) Evolution of ‘Statistics’ in time.
Figure 9Connections between the most dominant words in COVID-related communications in Moroccan social media. Thicker lines correspond to higher co-occurrences. (a) Connections between the most dominant words for ‘Economy’ related communications. (b) Connections between the most dominant words for ‘Education’ related communications. (c) Connections between the most dominant words for ‘Health’ related communications. (d) Connections between the most dominant words for ‘Government’ related communications. (e) Connections between the most dominant words for ‘Sanitary Measures’ related communications. (f) Connections between the most dominant words for ‘Social Life’ related communications. (g) Connections between the most dominant words for ‘Statistics’ related communications.
Figure 10Distribution of emotions over the identified topics. (a) Distribution of emotions over the topic ‘Economy’. (b) Distribution of emotions over the topic ‘Education’. (c) Distribution of emotions over the topic ‘Health’. (d) Distribution of emotions over the topic ‘Government’. (e) Distribution of emotions over the topic ‘Social Life’. (f) Distribution of emotions over the topic ‘Statistics’. (g) Distribution of emotions over the topic ‘Sanitary Measures’.
Figure 11Moroccan platform for a better management of COVID-19 pandemic.