Literature DB >> 36092862

Covid-19 vaccine hesitancy: Text mining, sentiment analysis and machine learning on COVID-19 vaccination Twitter dataset.

Miftahul Qorib1, Timothy Oladunni2, Max Denis3, Esther Ososanya4, Paul Cotae4.   

Abstract

In 2019 there was an outbreak of coronavirus pandemic also known as COVID-19. Many scientists believe that the pandemic originated from Wuhan, China, before spreading to other parts of the globe. To reduce the spread of the disease, decision makers encouraged measures such as hand washing, face masking, and social distancing. In early 2021, some countries including the United States began administering COVID-19 vaccines. Vaccination brought a relief to the public; it also generated a lot of debates from anti-vaccine and pro-vaccine groups. The controversy and debate surrounding COVID-19 vaccine influenced the decision of several people in either to accept or reject vaccination. Because of data limitations, social media data, collected through live streaming public tweets using an Application Programming Interface (API) search, is considered a viable and reliable resource to study the opinion of the public on Covid-19 vaccine hesitancy. Thus, this study examines 3 sentiment computation methods (Azure Machine Learning, VADER, and TextBlob) to analyze COVID-19 vaccine hesitancy. Five learning algorithms (Random Forest, Logistics Regression, Decision Tree, LinearSVC, and Naïve Bayes) with different combination of three vectorization methods (Doc2Vec, CountVectorizer, and TF-IDF) were deployed. Vocabulary normalization was threefold; potter stemming, lemmatization, and potter stemming with lemmatization. For each vocabulary normalization strategy, we designed, developed, and evaluated 42 models. The study shows that Covid-19 vaccine hesitancy slowly decreases over time; suggesting that the public gradually feels warm and optimistic about COVID-19 vaccination. Moreover, combining potter stemming and lemmatization increased model performances. Finally, the result of our experiment shows that TextBlob + TF-IDF + LinearSVC has the best performance in classifying public sentiment into positive, neutral, or negative with an accuracy, precision, recall and F1 score of 0.96752, 0.96921, 0.92807 and 0.94702 respectively. It means that the best performance was achieved when using TextBlob sentiment score, with TF-IDF vectorization and LinearSVC classification model. We also found out that combining two vectorizations (CountVectorizer and TF-IDF) decreases model accuracy.
© 2022 The Authors.

Entities:  

Keywords:  Covid-19; Machine Learning; Sentiment Analysis; Twitter; Vaccine Hesitancy

Year:  2022        PMID: 36092862      PMCID: PMC9443617          DOI: 10.1016/j.eswa.2022.118715

Source DB:  PubMed          Journal:  Expert Syst Appl        ISSN: 0957-4174            Impact factor:   8.665


  23 in total

Review 1.  Utilization of text mining as a big data analysis tool for food science and nutrition.

Authors:  Dandan Tao; Pengkun Yang; Hao Feng
Journal:  Compr Rev Food Sci Food Saf       Date:  2020-02-16       Impact factor: 12.811

2.  COVID-19 vaccine mandate for healthcare workers in the United States: a social justice policy.

Authors:  K Hagan; R Forman; Elias Mossialos; Paul Ndebele; Adnan A Hyder; Khurram Nasir
Journal:  Expert Rev Vaccines       Date:  2021-11-16       Impact factor: 5.217

3.  Vaccine Hesitancy on Social Media: Sentiment Analysis from June 2011 to April 2019.

Authors:  Hilary Piedrahita-Valdés; Diego Piedrahita-Castillo; Javier Bermejo-Higuera; Patricia Guillem-Saiz; Juan Ramón Bermejo-Higuera; Javier Guillem-Saiz; Juan Antonio Sicilia-Montalvo; Francisco Machío-Regidor
Journal:  Vaccines (Basel)       Date:  2021-01-07

4.  The Advisory Committee on Immunization Practices' Interim Recommendation for Use of Pfizer-BioNTech COVID-19 Vaccine - United States, December 2020.

Authors:  Sara E Oliver; Julia W Gargano; Mona Marin; Megan Wallace; Kathryn G Curran; Mary Chamberland; Nancy McClung; Doug Campos-Outcalt; Rebecca L Morgan; Sarah Mbaeyi; José R Romero; H Keipp Talbot; Grace M Lee; Beth P Bell; Kathleen Dooling
Journal:  MMWR Morb Mortal Wkly Rep       Date:  2020-12-18       Impact factor: 17.586

5.  COVID-19 Vaccine Hesitancy on Social Media: Building a Public Twitter Data Set of Antivaccine Content, Vaccine Misinformation, and Conspiracies.

Authors:  Goran Muric; Yusong Wu; Emilio Ferrara
Journal:  JMIR Public Health Surveill       Date:  2021-11-17

6.  The effects of anti-vaccine conspiracy theories on vaccination intentions.

Authors:  Daniel Jolley; Karen M Douglas
Journal:  PLoS One       Date:  2014-02-20       Impact factor: 3.240

7.  Association of social distancing and face mask use with risk of COVID-19.

Authors:  Sohee Kwon; Amit D Joshi; Chun-Han Lo; David A Drew; Long H Nguyen; Chuan-Guo Guo; Wenjie Ma; Raaj S Mehta; Fatma Mohamed Shebl; Erica T Warner; Christina M Astley; Jordi Merino; Benjamin Murray; Jonathan Wolf; Sebastien Ourselin; Claire J Steves; Tim D Spector; Jaime E Hart; Mingyang Song; Trang VoPham; Andrew T Chan
Journal:  Nat Commun       Date:  2021-06-18       Impact factor: 14.919

8.  An exploration of how fake news is taking over social media and putting public health at risk.

Authors:  Salman Bin Naeem; Rubina Bhatti; Aqsa Khan
Journal:  Health Info Libr J       Date:  2020-07-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.