| Literature DB >> 33897274 |
Harleen Kaur1, Shafqat Ul Ahsaan1, Bhavya Alankar1, Victor Chang2.
Abstract
With the rise in cases of COVID-19, a bizarre situation of pressure was mounted on each country to make arrangements to control the population and utilize the available resources appropriately. The swiftly rising of positive cases globally created panic, anxiety and depression among people. The effect of this deadly disease was found to be directly proportional to the physical and mental health of the population. As of 28 October 2020, more than 40 million people are tested positive and more than 1 million deaths have been recorded. The most dominant tool that disturbed human life during this time is social media. The tweets regarding COVID-19, whether it was a number of positive cases or deaths, induced a wave of fear and anxiety among people living in different parts of the world. Nobody can deny the truth that social media is everywhere and everybody is connected with it directly or indirectly. This offers an opportunity for researchers and data scientists to access the data for academic and research use. The social media data contains many data that relate to real-life events like COVID-19. In this paper, an analysis of Twitter data has been done through the R programming language. We have collected the Twitter data based on hashtag keywords, including COVID-19, coronavirus, deaths, new case, recovered. In this study, we have designed an algorithm called Hybrid Heterogeneous Support Vector Machine (H-SVM) and performed the sentiment classification and classified them positive, negative and neutral sentiment scores. We have also compared the performance of the proposed algorithm on certain parameters like precision, recall, F1 score and accuracy with Recurrent Neural Network (RNN) and Support Vector Machine (SVM).Entities:
Keywords: COVID-19; Heterogeneous Euclidean overlap metric (H-EOM); Hybrid heterogeneous support vector machine (H-SVM); Recurrent neural network (RCN); Sentiment analysis; Twitter
Year: 2021 PMID: 33897274 PMCID: PMC8057010 DOI: 10.1007/s10796-021-10135-7
Source DB: PubMed Journal: Inf Syst Front ISSN: 1387-3326 Impact factor: 5.261
Fig. 1Global Internet users in the World 2020 distribution by world regions (Source: https://www.broadbandsearch.net/blog/internet-statistics)
Fig. 3Flowchart of the sentiment analysis process
Fig. 4Precision
Fig. 5Recall
Fig. 6F1 Score
Fig. 7Accuracy
Fig. 8Analysis of a sample of 20 tweets a SVM result b RNN result
Fig. 9Analysis of a sample of 50 tweets using a SVM b RNN
Fig. 10Analysis of a sample of 250 tweets a SVM result b RNN result
Fig. 11Analysis of a sample of 600 tweets a SVM b RNN
Fig. 12Sentiment analysis
Fig. 13Most frequently used words over Twitter handle during COVID-19