| Literature DB >> 33139966 |
A H Alamoodi1, B B Zaidan1,2, A A Zaidan1, O S Albahri1, K I Mohammed1, R Q Malik3, E M Almahdi1, M A Chyad1, Z Tareq4, A S Albahri5, Hamsa Hameed6, Musaab Alaa7.
Abstract
The COVID-19 pandemic caused by the novel coronavirus SARS-CoV-2 occurred unexpectedly in China in December 2019. Tens of millions of confirmed cases and more than hundreds of thousands of confirmed deaths are reported worldwide according to the World Health Organisation. News about the virus is spreading all over social media websites. Consequently, these social media outlets are experiencing and presenting different views, opinions and emotions during various outbreak-related incidents. For computer scientists and researchers, big data are valuable assets for understanding people's sentiments regarding current events, especially those related to the pandemic. Therefore, analysing these sentiments will yield remarkable findings. To the best of our knowledge, previous related studies have focused on one kind of infectious disease. No previous study has examined multiple diseases via sentiment analysis. Accordingly, this research aimed to review and analyse articles about the occurrence of different types of infectious diseases, such as epidemics, pandemics, viruses or outbreaks, during the last 10 years, understand the application of sentiment analysis and obtain the most important literature findings. Articles on related topics were systematically searched in five major databases, namely, ScienceDirect, PubMed, Web of Science, IEEE Xplore and Scopus, from 1 January 2010 to 30 June 2020. These indices were considered sufficiently extensive and reliable to cover our scope of the literature. Articles were selected based on our inclusion and exclusion criteria for the systematic review, with a total of n = 28 articles selected. All these articles were formed into a coherent taxonomy to describe the corresponding current standpoints in the literature in accordance with four main categories: lexicon-based models, machine learning-based models, hybrid-based models and individuals. The obtained articles were categorised into motivations related to disease mitigation, data analysis and challenges faced by researchers with respect to data, social media platforms and community. Other aspects, such as the protocol being followed by the systematic review and demographic statistics of the literature distribution, were included in the review. Interesting patterns were observed in the literature, and the identified articles were grouped accordingly. This study emphasised the current standpoint and opportunities for research in this area and promoted additional efforts towards the understanding of this research field.Entities:
Keywords: COVID-19; Disease mitigation; Epidemic; Infectious disease; Opinion mining; Pandemic; Sentiment analysis
Year: 2020 PMID: 33139966 PMCID: PMC7591875 DOI: 10.1016/j.eswa.2020.114155
Source DB: PubMed Journal: Expert Syst Appl ISSN: 0957-4174 Impact factor: 6.954
Fig. 1Sentiment analysis flow.
Fig. 2SLR protocol.
Data extraction elements.
| Data Item | Description |
|---|---|
| Title | (Title of the paper) |
| Year | (Publication Year) |
| Database | Name of the database where an article was downloaded ( |
| Publication Type | |
| Outbreak Discussed | Name of ( |
| Goal | Primary goal of the study ( |
| Source Data | Where the data used in the sentiment analysis were collected from ( |
| Volume of Data | Size of data (How many |
| Duration of Collection | How long the data collection lasted for sentiment analysis ( |
| Challenges | Issues or concerns raised in the publication |
| Motivations | Significance or benefits identified in the publication |
Inclusion and exclusion criteria.
| Inclusion Criteria | Exclusion Criteria | |||
|---|---|---|---|---|
| Search | English Articles | Non-English Articles | ||
| Selected Databases | ||||
| Article, Review, Conference or Book Section | Older than 2010 | |||
| Published between 2010 and 2020 | ||||
| Topic | Infectious Disease | AND | Sentiment Analysis OR Opinion Mining | Other diseases ( |
| Pandemic | ||||
| Epidemic | Without Sentiment Analysis Or Opinion Mining | |||
| Outbreak | ||||
| Virus |
Fig. 3Demographic statistics.
Fig. 4Taxonomy for related literature.
Taxonomy analysis details.
| Ref | Source Data | Volume of Data | Duration of Collection | Outbreak Discussed | Work Applied | Taxonomy Category |
|---|---|---|---|---|---|---|
| E. H.-J. | Twitter News Publications | 16,189 news articles 7,106,297 tweets | Between 1 June 2014 and 31 August 2014 | Ebola | Sentiment Dynamics of the Hot Health Issue of Ebola | Lexicon Based Models |
| 255,118 tweets | January 2015 | Ebola | Ebola Outbreak Discussions on Twitter | Lexicon Based Models | ||
| Sina | 140,000 tweets | June to November 2014 | Ebola | Distributed Mining System for Online Opinion Data Collecting and Mining. | Lexicon Based Models | |
| 1,500,000 tweets | 3 months | MERS-CoV | Addressing New Challenges for Big Data Platform with an Opinion Mining Approach | Lexicon Based Models | ||
| K. | Twitter | 525 Reviews | 2 Months | Outbreaks | Locations Monitoring for Disease Outbreak | Lexicon Based Models |
| N/A | N/A | Outbreak | Outbreak Notification Framework Using Twitter Mining for Smart Home Dashboards | Lexicon Based Models | ||
| N/A | N/A | COVID-19 | Sentiment analysis of Social Media Response on the COVID-19 Outbreak | Lexicon Based Models | ||
| 20,325,929 tweets | 28 January 2020 to 9 April 2020 | COVID-19 | Examining Worldwide Trends of Different Emotions During the COVID-19 Pandemic. | Lexicon Based Models | ||
| Raamkumar, Tan, and Wee (2020) | 3,185,460 posts | 1 January 2019 to 18 March 2020 | COVID-19 | Understand the Communication Strategies by Public Health Authorities to Examine Public Sentiment and Responses in Social Media | Lexicon Based Models | |
| N/A | 3 Weeks | COVID-19 | Sentiment analysis of Filipinos and Effects of Extreme Community Quarantine due to the Coronavirus (COVID-19) Pandemic | Lexicon Based Models | ||
| Lim, Tucker, and Kumara (2017) | 37,599 Tweets | From August 2012 to May 2013 | Infectious Diseases | Identify Real-World Latent Infectious Diseases by Mining Social Media Data | ML Based Models | |
| 54,065 Tweets | N/A | Epidemic | Detection Of Epidemic Disease Model for Discovering Latent Infectious Diseases | ML Based Models | ||
| 574,643 Tweets | 10 weeks | N1H1 | Classification Methods for the Detection of Influenza-Related Messages (N1H1) from Twitter | ML Based Models | ||
| A. | 20,000 Tweets | 2 weeks of May 2013 | MERS-Cov | Tweets Monitoring Platforms with ML Approach to Automatically Detect and Classify Healthcare Tweets Associated with MERS-COV | ML Based Models | |
| N/A | 12 Study | 2010–2017 | Epidemic | Discussing Sentiment Analysis ML based with Pandemics | ML Based Models | |
| 10,000 Tweets | September 2016 To November 2016 | Outbreak | Tracking of Real-Time Public Sentiments for Outbreak | Hybrid Based | ||
| Around 15 Million Tweets. | From 13 March | Infectious Diseases | Measuring Public Concern Using a Two-Step Sentiment Classification Approach | Hybrid Based Models | ||
| 3,718 Publicly Available Tweets | 14–20 August 2015 | Epidemic | Sentiment Spreading Based Approach For Epidemic | Hybrid Based Models | ||
| News Media | 86 Million Words | 70 Days | MERS-Cov | Monitoring and Understanding Responses | Hybrid Based Models | |
| 10,657 sentences | June 2001 to May 2002 | Epidemic | Epidemic Sentiment Monitoring System | Hybrid Based Models | ||
| 126,833 tweets | 26–28 September 2011 9–18 October 2011 | Epidemic | Epidemic Sentiment Monitoring System | Hybrid Based Models | ||
| S. | posts from 17,865 users | 13–26 January 2020 | COVID-19 | Exploring the Impacts of COVID-19 on People’s Mental Health to Assist Policy and Provide Timely Services to Infected Populations | Hybrid Based Models | |
| 91,495 tweets | 1 February 2015 to 31 March 2015 | N1H1 | Method for Tracking the N1H1 Using the Contents of Twitter | Hybrid Based Models | ||
| L. | 367,462 posts | 30 December 2019 to 1 February 2020 | COVID-19 | Characterising the Propagation of Situational Information in Social Media During COVID-19 Epidemic: | Hybrid Based Models | |
| Facebook | N/A | 1 April 2017 to 30 June 2017 | Epidemics | Discussion of the Importance of Utilising Big Data Means Like Sentiment Analysis | Individuals | |
| Social Media | N/A | N/A | Outbreaks | Discussion for Social Media SM Predictive Power | Individuals | |
| 141,161 posts | May–August 2016 | Zika Virus | Analysing Image Sentiment from Instagram for the Spread of Zika Virus and Information Dissemination | Individuals | ||
| Al-garadi, Khan, Varathan, Mujtaba, and Al-Kabsi (2016) | Digital Databases | 20 Study | 2005–2014 | Pandemics | A Systematic Literature Search for Studies with the Primary Aim of Using OSN to Detect and Track a Pandemic | Individuals |
Application languages of sentiment analysis.
| Language | Number of Publications | References | |
|---|---|---|---|
| Application | Author | ||
| USA | 7 | ||
| India | 3 | ||
| Australia | 1 | K. | |
| Republic of Korea | 1 | E. H.-J. | |
| Italy | 1 | ||
| Greece | 1 | ||
| Singapore | 2 | ||
| Malaysia | 1 | ||
| Qatar | 1 | A. | |
| Jordan | 1 | ||
| Saudi Arabia | 2 | ||
| China | 3 | L. | |
| Korea | 1 | ||
| India | 2 | ||
| Philippines | 1 | ||