| Literature DB >> 33937179 |
Samira Yousefinaghani1, Rozita Dara1, Samira Mubareka2, Shayan Sharif3.
Abstract
The ongoing COVID-19 pandemic has posed a severe threat to public health worldwide. In this study, we aimed to evaluate several digital data streams as early warning signals of COVID-19 outbreaks in Canada, the US and their provinces and states. Two types of terms including symptoms and preventive measures were used to filter Twitter and Google Trends data. We visualized and correlated the trends for each source of data against confirmed cases for all provinces and states. Subsequently, we attempted to find anomalies in indicator time-series to understand the lag between the warning signals and real-word outbreak waves. For Canada, we were able to detect a maximum of 83% of initial waves 1 week earlier using Google searches on symptoms. We divided states in the US into two categories: category I if they experienced an initial wave and category II if the states have not experienced the initial wave of the outbreak. For the first category, we found that tweets related to symptoms showed the best prediction performance by predicting 100% of first waves about 2-6 days earlier than other data streams. We were able to only detect up to 6% of second waves in category I. On the other hand, 78% of second waves in states of category II were predictable 1-2 weeks in advance. In addition, we discovered that the most important symptoms in providing early warnings are fever and cough in the US. As the COVID-19 pandemic continues to spread around the world, the work presented here is an initial effort for future COVID-19 outbreaks.Entities:
Keywords: COVID-19; Google Trends; Twitter; digital data stream; early warning
Year: 2021 PMID: 33937179 PMCID: PMC8085269 DOI: 10.3389/fpubh.2021.656635
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Overall approach.
Twitter query input.
| Included symptom keywords | Shortness of breath, cough, fever, sore throat, loss of smell, loss of taste |
| Included precaution keywords | Face mask, quarantine, wearing mask, wash hand, ovid-19 vaccine, covid-19 vaccine, covid vaccine, corona vaccine, coronavirus vaccine, physical distancing, social distancing |
| Excluded symptom keywords | Flu, influenza, cold, diabetes, jungle fever, Saturday night fever, fever swamp, baby fever, fever pitch, fever dream, fever 333, dog fever, cat scratch fever, blackouts coastal fever, tattoo fever, Kennel cough, smoke, smoking, allergy, allergies |
| Excluded precaution keywords | Handle, handling, body wash, hand cream, cold, flu, yogurt, honey, watermelon, cucumber, hair mask |
Figure 2Weekly comparison of online activities and actual number of cases in Canada. (A) Symptom-related tweets vs. cases, (B) Symptom-related searches vs. cases, (C) Precaution-related tweets vs. cases, (D) Precaution-related searches vs. cases.
Figure 3Weekly comparison of symptom-related tweets and the actual number of cases in provinces with first wave of disease (category I). (A) Virginia, (B) Connecticut, (C) New York, (D) New Jersey.
Figure 4Weekly comparison of symptom-related tweets and the actual number of cases in states with only the second wave of disease (category II). (A) Alabama, (B) Tennessee, (C) Utah, (D) Texas.
The average prediction value of Canadian provinces (with an early wave).
| Symptoms (week lags) | 1.19 | 4.3 |
| Symptoms (detection percentage) | 50% | 50% |
| Precautions (week lags) | 0.4 | 2.8 |
| Precautions (detection percentage) | 83% | 83% |
| Symptoms (week lags) | 1.2 | 3.1 |
| Symptoms (detection percentage) | 83% | 83% |
| Precautions (week lags) | 1.2 | 3.2 |
| Precautions (detection percentage) | 75% | 75% |
The average prediction value of the US states.
| Category I | Symptoms (week lags) | 5 | 6 | – | |
| Symptoms (detection percentage) | 81% | 3.2% | 0% | ||
| Precautions (week lags) | 0.94 | 4.39 | 2 | 3.42 | |
| Precautions (detection percentage) | 97% | 89% | 6% | 22% | |
| Category II | Symptoms (week lags) | – | – | 1.86 | – |
| Symptoms (detection percentage) | – | – | 0% | ||
| Precautions (week lags) | – | – | 1.14 | 2.5 | |
| Precautions (detection percentage) | – | – | 44% | ||
| Category I | Symptoms (week lags) | 1.54 | 4.75 | 7 | 3.87 |
| Symptoms (detection percentage) | 86% | 3% | 26% | ||
| Precautions (week lags) | 1.4 | 4.75 | 6 | 1.75 | |
| Precautions (detection percentage) | 86% | 3% | 13% | ||
| Category II | Symptoms (week lags) | – | – | 1 | 1.75 |
| Symptoms (detection percentage) | – | – | 44% | ||
| Precautions (week lags) | – | – | 1.14 | 4 | |
| Precautions (detection percentage) | – | – | 11% | ||
The bold values are the best obtained results.
Correlation coefficients (r) of weekly online activities and COVID-19 cases.
| Canada | 0.85 (lag = 5) | 0.93 (lag = 3) | 0.75 (lag = 5) | 0.84 (lag = 3) |
| Massachusetts | 0.94 (lag = 5) | 0.9 (lag = 3) | 0.66 (lag = 5) | 0.86 (lag = 4) |
| Michigan | 0.7 (lag = 3) | 0.81 (lag = 2) | 0.38 (lag = 4) | 0.87 (lag = 3) |
| New Jersey | 0.95 (lag = 4) | 0.87 (lag = 2) | 0.7 (lag = 5) | 0.9 (lag = 3) |
| New York | 0.97 (lag = 3) | 0.86 (lag = 2) | 0.72 (lag = 4) | 0.91 (lag = 3) |
| Vermont | 0.9 (lag = 2) | 0.83 (lag = 1) | 0.63 (lag = 3) | 0.88 (lag = 2) |
Figure 5Symptom reporting in Twitter (United States).
The average prediction values of the US states (detection of early waves).
| Fever (week lags) | 4.45 | 4.03 |
| Fever (detection percentage) | 53% | 58% |
| Cough (week lags) | 5.2 | 4.2 |
| Cough (detection percentage) | 55% | 44% |
| Tiredness (week lags) | 5.2 | – |
| Tiredness (detection percentage) | 20% | 0% |
| Shortness of breath (week lags) | 4.38 | 4.27 |
| Shortness of breath (detection percentage) | 29% | 24% |
| Loss of smell (week lags) | 3.33 | – |
| Loss of smell (detection percentage) | 7% | 0% |
| Sore throat (week lags) | 4.29 | 4.24 |
| Sore throat (detection percentage) | 38% | 55% |