| Literature DB >> 33242762 |
Marichi Gupta1, Aditya Bansal2, Bhav Jain3, Jillian Rochelle4, Atharv Oak3, Mohammad S Jalali5.
Abstract
OBJECTIVE: The potential ability for weather to affect SARS-CoV-2 transmission has been an area of controversial discussion during the COVID-19 pandemic. Individuals' perceptions of the impact of weather can inform their adherence to public health guidelines; however, there is no measure of their perceptions. We quantified Twitter users' perceptions of the effect of weather and analyzed how they evolved with respect to real-world events and time.Entities:
Keywords: Individuals’ perceptions; Machine learning; Opinion mining; SARS-CoV-2 transmission; Topic modeling
Mesh:
Year: 2020 PMID: 33242762 PMCID: PMC7654388 DOI: 10.1016/j.ijmedinf.2020.104340
Source DB: PubMed Journal: Int J Med Inform ISSN: 1386-5056 Impact factor: 4.046
Fig. 1Flow diagram of filtering and machine learning processes.
Manual Annotation Scheme for Effect and Class Proportions.
| Class | Proportion (out of 2442) |
|---|---|
| Uncertain | 40.4 % (987) |
| No Effect | 33.5 % (817) |
| Effect | 26.1 % (638) |
| Improve Warmer Weather | 585 |
| Worsen Warmer Weather | 33 |
| Improve Cooler Weather | 4 |
| Worsen Cooler Weather | 16 |
Number of Tweets from Countries Across Full Dataset and Relevant Tweets Set.
| Full Dataset (166,005 tweets) | Relevant Tweets (28,555 tweets) | ||
|---|---|---|---|
| United States | 60,749 | United States | 9740 |
| United Kingdom | 13,579 | United Kingdom | 992 |
| Canada | 7159 | India | 912 |
| India | 4738 | Canada | 836 |
| Australia | 1739 | Nigeria | 446 |
| Nigeria | 1385 | Pakistan | 337 |
| South Africa | 1079 | Australia | 198 |
| Ireland | 1065 | South Africa | 142 |
| Pakistan | 866 | Philippines | 105 |
| France | 613 | Kenya | 100 |
| Philippines | 557 | Germany | 96 |
| Germany | 536 | Spain | 93 |
| Kenya | 529 | Ireland | 74 |
| Other (<500 tweets) | 10,143 | Other (<73 tweets) | 2068 |
| No Data | 61,268 | No Data* | 12,416 |
“No Data” represents tweets where neither the tweet nor tweet author had location data available.
Fig. 2Relevant original tweet volumes over time, with most frequent headlines and reporting organizations on four key peaks identified.
Fig. 3Class proportion over time for annotated Tweets. Tweets are smoothed by 7 days, binned in 14-day windows, and weighted according to the individual tweet’s number of retweets.
Machine learning classification results.
| Class | Proportion (out of 28,555) |
|---|---|
| No Effect/Uncertain | 83.5 % (23,836) |
| Effect | 16.5 % (4719) |
Model: Gradient Descent Support Vector Machine, TD-IDF.
AUC-PR (95 % CI): 0.561 (0.542, 0.58).
AUC-ROC (95 % CI): 0.768 (0.749,0.787).
Fig. 4Cluster Frequencies over Time by Week, color coding presents the frequency of discussion, where darker blue is the highest frequency. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article).