| Literature DB >> 36231693 |
Tobias Saegner1, Donatas Austys1.
Abstract
The probability of future Coronavirus Disease (COVID)-19 waves remains high, thus COVID-19 surveillance and forecasting remains important. Online search engines harvest vast amounts of data from the general population in real time and make these data publicly accessible via such tools as Google Trends (GT). Therefore, the aim of this study was to review the literature about possible use of GT for COVID-19 surveillance and prediction of its outbreaks. We collected and reviewed articles about the possible use of GT for COVID-19 surveillance published in the first 2 years of the pandemic. We resulted in 54 publications that were used in this review. The majority of the studies (83.3%) included in this review showed positive results of the possible use of GT for forecasting COVID-19 outbreaks. Most of the studies were performed in English-speaking countries (61.1%). The most frequently used keyword was "coronavirus" (53.7%), followed by "COVID-19" (31.5%) and "COVID" (20.4%). Many authors have made analyses in multiple countries (46.3%) and obtained the same results for the majority of them, thus showing the robustness of the chosen methods. Various methods including long short-term memory (3.7%), random forest regression (3.7%), Adaboost algorithm (1.9%), autoregressive integrated moving average, neural network autoregression (1.9%), and vector error correction modeling (1.9%) were used for the analysis. It was seen that most of the publications with positive results (72.2%) were using data from the first wave of the COVID-19 pandemic. Later, the search volumes reduced even though the incidence peaked. In most countries, the use of GT data showed to be beneficial for forecasting and surveillance of COVID-19 spread.Entities:
Keywords: COVID-19; Google Trends; forecasting; surveillance
Mesh:
Year: 2022 PMID: 36231693 PMCID: PMC9566212 DOI: 10.3390/ijerph191912394
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Selection process of the articles to review.
Publications with positive results of GT use for COVID-19 prediction and surveillance.
| Author and Year | The Main Findings about Google Trends | Country | Period | Keywords |
|---|---|---|---|---|
| Husnayain, Fuad, Su (2020) [ | GT can be used for public restlessness monitoring towards COVID-19 pandemic 1–3 days before the increase in confirmed cases. | TW | 12 2019–02 2020 | Coronavirus, hand wash, face masks |
| Walker, Hopkins, Surda (2020) [ | Strong correlation between smell-related information search frequency and onset of COVID-19 infection. | IT, ES, UK, US, DE, FR, NL, IR | 12 2019–03 2020 | Smell, loss of smell, anosmia, hyposmia, olfaction, taste, loss of taste, dysgeusia. The keywords were automatically translated to national languages of study countries. |
| Mavragani (2020) [ | Significant correlations between online interest of coronavirus and COVID-19 cases and deaths. | IT, ES, FR, DE, UK | 01 2020–03 2020 | Coronavirus |
| Venkatesh and Gandhi (2020) [ | Google Web, together with other internet-based tools might be useful in predicting COVID-19 outbreaks 2–3 weeks earlier than conventional disease surveillance. | IN | 01 2020–04 2020 | Coronavirus, COVID, COVID-19, corona, virus |
| Kurian, Bhatti, Alvi, Ting, Storlie, Wilson, Shah, Liu, Bydon (2020) [ | The information obtained from GT precedes COVID-19 outbreaks. This information could allow better preparation and planning of health care systems. | US | 01 2020–04 2020 | COVID symptoms, coronavirus symptoms, sore throat + shortness of breath + fatigue + cough, coronavirus testing center, loss of smell, Lysol (sanitizer), antibody, face mask, coronavirus vaccine, COVID stimulus check |
| Panuganti, Jafari, MacDonald, DeConde (2020) [ | Google search of fever and shortness of breath are better indicators of COVID-19 incidence than anosmia. | US | 01 2020–04 2020 (excluding a short timeframe (March 22 to March 24)) | COVID, coronavirus, COVID-19, SARS-CoV-2, and COVID19, nonsmell symptoms of COVID-19 (shortness of breath, fatigue, cough, and fever) loss of smell, anosmia, lose smell, sense of smell, cannot smell, can’t smell and hyposmia, nasal irrigation and sinus rinse, (dysgeusia, taste change and taste loss, COVID, coronavirus, COVID-19, SARS-CoV-2, and COVID19), (shortness of breath, fatigue, cough, and fever), and smell loss anosmia, loss of smell, reduced smell, decreased smell, lose your sense of smell, lost sense of smell, decreased sense of smell, decrease your sense of smell, decreased my sense of smell, reduce your sense of smell, reduced my sense of smell, reduced sense of smell, loss of sense of smell, loss of smell, hyposmia |
| Mavragani and Gkillas (2020) [ | Significant correlation found between GT search queries and COVID-19 incidence. | US | 03 2020–04 2020 | coronavirus (virus) and coronavirus (search term) |
| Higgins, Wu, Sharma, Illing, Rubel, Ting, Alliance (2020) [ | Many search terms showed significant correlations with COVID-19 cases and mortality rate. | CN, US, IT, ES | 01 2020–04 2020 | Real world deaths, Coronavirus, COVID-19, Fever, SOB, Cough, Sputum, Anosmia, Dys/ageusia, Nasal congestion, Rhinorrhea, Sneezing, Sore throat, Headache, Myalgia, Chest pain, Eye pain, Diarrhea |
| Ahmad, Flanagan, Staller (2020) [ | Google searches for gastrointestinal symptoms preceded the increase in COVID-19 cases in a predictable manner. | US | 01 2020–04 2020 | ageusia, abdominal pain, loss of appetite, anorexia, diarrhea, and vomiting |
| Cherry, Rocke, Chu, Liu, Lechner, Lund, Kumar (2020) [ | GT data containing searches related to loss of smell could potentially identify COVID-19 outbreaks. | IT, ES, FR, BR, US | 02 2020–05 2020 | loss of sense of smell, loss of sense of taste, sense of smell, sense of taste |
| Cousins, Cousins, Harris, Pasquale (2020) [ | Identifiable patterns in internet searches could predict COVID-19 outbreaks, although stochastic changes in search intensity can alter these predictions. | US | 01 2020–04 2020 | 463 unique search queries. |
| Sharma and Sharma (2020) [ | A positive correlation between COVID-19 cases and GT values has been recorded. | US, ES, IT, FR, UK, CN, IR, IN | 03 2020–04 2020 | COVID-19 |
| Schnoell, Besser, Jank, Bartosik, Parzefall, Riss, Mueller, Liu (2021) [ | Clear correlation found between GT data and COVID-19 incidence. GT data might be useful in selecting the best timing for web-based COVID-19-specific information and prevention measures. | AU, BR, CA, DE, IT, ZA, ROK, ES, UK, US | 01 2020–06 2020 | Coronavirus, corona |
| Jimenez, Estevez-Rebored, Santed, Ramos (2020) [ | Significant correlation found between the rise of COVID-19 incidences and GT search queries with a lag of 11 days. | ES | 02 2020–05 2020 | cansancio, which translates as fatigue; coronavirus, COVID 19, covid 19, and COVID19; diarrea, which translates as diarrhea; dolor de garganta, which translates as sore throat; fiebre, which translates as fever; neumonia, which translates as pneumonia and was searched without an accent due to being more relevant; perdida de olfato, which translates as lost sense of smell and was also searched without an accent; tos, which translates as cough |
| Lippi, Mattiuzzi, Cervellin (2020) [ | Significant correlations found between GT search data and newly diagnosed COVID-19 cases with a 3-week lag. | IT | 02 2020–05 2020 | tosse (i.e., cough), febbre (i.e., fever), and dispnea (i.e., dyspnea) |
| Strzelecki, Azevedo, Albuquerque (2020) [ | There was a correlation between COVID-19 spread and GT search data for personal protective gear and hand hygiene. | PL, PT | 01 2020–06 2020 | máscara cirúrgica (face mask), desinfetante (sanitizer), and álcool (alcohol) |
| Badell-Grau, Cuff, Kelly, Waller-Evans, Lloyd-Evans (2020) [ | Strong correlations found between COVID-19-related search terms and cases and mortality rates from COVID-19. | AU, DE, IT, ES, UK, US | 11 2019–04 2020 | keywords used in three categories and four languages: Government Policy, Medical Interventions, and Misinformation |
| Rajan, Sharaf, Brown, Sharaiha, Lebwohl, Mahadev (2020) [ | GT data could be used to identify active disease transmission areas in the beginning of new outbreaks. | US | 10 2019–05 2020 | diarrhea, nausea, vomiting, and abdominal pain. The terms fever and cough were included as positive controls. The term constipation was included as a negative control. |
| Xie, Tan, Li (2020) [ | Monitoring internet search activity could prevent and control the epidemic and rumors around it. | CN | 01 2020–02 2020 | Coronavirus |
| Hartwell, Greiner, Kilburn, Ottwell (2020) [ | GT data relating to the public interest of COVID-19 preventative measures correlated with stay-at-home expiration dates and decreased new COVID-19 cases after that expiration. In addition, states with higher interest in preventative measures had higher COVID-19-related deaths per capita and higher case-fatality rates. | US | 05 2020 | hand sanitizer, social distancing, COVID testing, contact tracing |
| Effenberger, Kronbichler, Shin, Mayer, Tilg, Perco (2020) [ | Significant correlations were found between GT data relating to coronavirus and new COVID-19 cases across studied countries. The time lag was 11.5 days. | KR, JP, IR, IT, AT, DE, UK, US, EG, AU, BR, CN | 12 2019–04 2020 | Coronavirus (virus) |
| Lin, Liu, Chiu (2020) [ | Google searches for “wash hands” from January to February correlated with lower COVID-19 spread from February to March in 21 countries. | IT, IR, KR, FR, ES, DE, US, CH, NL, SE, NO, AT, AU, CA, JP, UK, BE, SG, HK, TW, TH | 01 2020–02 2020 | wash hands, face mask |
| Brunori and Resce (2020) [ | Significant positive correlation found between google search queries of COVID-19 symptoms and reported COVID-19 deaths. | IT | 02 2020–03 2020 | ‘fever’, ‘dry cough’, ‘cough’, ‘sore throat’, ‘loss of sense of smell’, and ‘loss of sense of taste’ |
| Sulyok, Ferenci, Walker (2021) [ | Strong positive correlation found between Google search queries for coronavirus and COVID-19 cases in Europe. | BE, FE, DE, HU, IE, IT, NL, NO, ES, SE, CH, UK | 01 2020–03 2020 | Coronavirus |
| Abbas, Morland, Hall, El-Manzalawy (2021) [ | The dynamics of the correlations found between GT data COVID-19 cases and deaths suggest that it would be possible to make predictions of COVID-19 cases and mortality rates up to 3 weeks in advance. | US | Dataset released 09 2020, accessed 11 2020 | 422 symptoms and conditions dataset. |
| Pellegrini, Ferrucci, Guaraldi, Bernabei, Scorcia, Giannaccare (2021) [ | GT data on conjunctivitis reveals significant correlations with COVID-19 new cases with a lag of 14–18 days. | IT, FR, UK, US | 01 2020–04 2020 | “conjunctivitis” and the translation in Italian (“congiuntivite”) and French (“conjonctivite”) |
| Yousefinaghani, Dara, Mubareka, Sharif (2021) [ | GT data allowed to identify starts and peaks of COVID-19 waves 1 and 3 weeks earlier, respectively. Strong correlation was found between Twitter/GT data and the number of COVID-19 cases in Canada with 3–5-week lags. | CA, US | 01 2020–09 2020 | Shortness of breath, cough, fever, sore throat, loss of smell, loss of taste, face mask, quarantine, wearing mask, wash hand, COVID-19 vaccine, COVID-19 vaccine, covid vaccine, corona vaccine, coronavirus vaccine, physical distancing, social distancing |
| Cinarka, Uysal, Cifter, Niksarlioglu, Çarkoğlu (2021) [ | Online interest shown in COVID-19 pulmonary symptoms can reliably predict later reported cases of the first COVID-19 wave. | TR, IT, ES, FR, UK | 01 2020–08 2020 | fever, cough, dyspnea |
| Husnayain, Chuang, Fuad, Su (2021) [ | Significant correlations between COVID-19 and GT data reached their highest point in June and decreased as the outbreak progressed. | US | 01 2020–12 2020 | Data retrieved for COVID-19-related terms, topics, and disease; the top related queries; most-searched COVID-19 terms in 2020 with a lag of 7 days |
| Kristensen, Lorenz, May, Strauss, (2021) [ | Significant correlations found between term “RKI” and increase in COVID-19 cases (2–12-day lag). Similar pattern was observed for the term “corona”. Searches for “protective mask” peaked 6–12 days after the peak of COVID-19 cases. | DE | 02 2020–04 2020 | ‘RKI’ (Robert Koch Institut), ‘Mundschutz’ (protective mask), and ‘corona’ |
| Hu, Lou, Xu, Meng, Xie, Zhang, Zou, Liu, Sun, Wang (2020) [ | Slightly positive significant correlation found between GT data regarding COVID-19 and daily confirmed COVID-19 cases. | US, UK, CA, IE, AU, NZ | 12 2019–02 2020 | 2019-nCoV + SARS-CoV-2 + novel coronavirus + new coronavirus + COVID-19 + Corona Virus Disease 2019 |
| Schuster, Tizek, Schielein, Ziehfreund, Rothe, Spinner, Biedermann, Zink (2021) [ | Moderate correlation found between GT data and confirmed new COVID-19 cases over the study period. | DE | 01 2020–07 2020 | coronavirus |
| Li, Chen, Chen, Zhang, Pang, Chen (2020) [ | Internet search terms had high correlation with daily COVID-19 cases. | CN | 01 2020–02 2020 | coronavirus, pneumonia |
| Walker, Sulyok (2020) [ | Search terms related to coronavirus had a significant correlation with confirmed COVID-19 cases. | UK | 01 2020–04 2020 | Coronavirus (virus), hand washing (search term), and face mask (search term) |
| Samadbeik, Garavand, Aslani, Ebrahimzadeh, Fatehi (2022) [ | Terms related to COVID, COVID-19, and coronavirus had a significant correlation with confirmed weekly COVID-19 cases. | IR | 02 2020–01 2021 | corona [Persian], Covid [Persian], COVID-19, corona, and coronavirus |
| Ahmed, Abid, de Oliveira, Ahmed, Siddiqui, Siddiqui, Jafri, Lippi (2021) [ | ‘Loss of smell’ was the best predictor for positive weekly COVID-19 cases. | PK | 03 2021–06 2021 | Fever, cough, headache, shortness of breath, taste loss, and hearing loss, COVID-19, coronavirus, virus, COVID |
| Yuan, Xu, Hussain, Wang, Gao, Zhang (2020) [ | COVID-19 search terms had a strong correlation with confirmed COVID-19 cases and deaths in the USA. | US | 03 2020–04 2020 | COVID-19, COVID, coronavirus, SARS-CoV-2, pneumonia, high temperature, cough, COVID heart, COVID pneumonia, and COVID diabetes |
| Aragón-Ayala, Copa-Uscamayta, Herrera, Zela-Coila, Cender Udai Quispe-Juli (2021) [ | Most countries showed a moderate to strong significant correlation between COVID-19 searches and daily new cases. | AR, BO, BR, CL, CO, CR, CU, EC, SV, GT, HN, MX NI, PA, PY, PE, PR, DO, UY, VE | 12 2019–04 2020 | “coronavirus + COVID-19 + SARS-CoV2 + nuevo coronavirus + 2019-nCoV”, “coronavirus + coronavírus + COVID-19 + SARS-CoV2 + novo coronavirus + novo coronavírus + 2019-nCoV” |
TW—Taiwan, IT—Italy, ES—Spain, UK—United Kingdom, US—United States, DE—Germany, FR—France, NL—Netherlands, IR—Iran, IN—India, CN—China, BR—Brazil, AU—Australia, CA—Canada, ZA—South Africa, PL—Poland, PT—Portugal, KR—Republic of Korea, JP—Japan, AT—Austria, EG—Egypt, CH—Switzerland, SE—Sweden, NO—Norway, BE—Belgium, SG—Singapore, HK—Hong Kong, TH—Thailand, HU—Hungary, TR—Turkey, IE—Ireland, AR—Argentina, BO—Bolivia, CL—Chile, CO—Columbia, CR—Costa Rica, CU—Cuba, EC—Ecuador, SV—El Salvador, GT—Guatemala, HN—Honduras, MX—Mexico, NI—Nicaragua, PA—Panama, PY—Paraguay, PE—Peru, PR—Puerto Rico, DO—Dominican Republic, UY—Uruguay, VE—Venezuela, and PK—Pakistan.
Publications where GT data were analyzed using more complex methods.
| Author and Year | The Main Findings about Google Trends | Country | Period | Keywords |
|---|---|---|---|---|
| Ayyoubzadeh, Zahedi, Ahmadi, Niakan Kalhori (2020) [ | Data mining algorithms (linear regression and long short-term memory) can predict COVID-19 outbreak trends. | IR | 02 2020–03 2020 | Corona, COVID-19, Coronavirus, Antiseptic selling, Antiseptic buying, Hand washing, Hand sanitizer, Ethanol, Antiseptic |
| Prasanth, Singh, Kumar, Tikkiwal, Chong (2021) [ | Data obtained from GT significantly improved deep learning model (long short-term memory optimized with Grey Wolf optimization) for forecasting COVID-19 numbers. | IN, US, UK | 02 2020–05 2020 | Coronavirus symptoms, Coronavirus, Covid, Hand wash, Healthcenter, Mask, Positive cases, Sanitizer, Coronavirus vaccine |
| Niu, Liang, Zhang, Zhang, Qu, Su, Zheng, Chen et al. (2021) [ | GT data combined with Adaboost algorithm had strong predictive ability of COVID-19 infection with hopes to further enhance the online prediction system. | IT | 02 2020–03 2020 | 40 keywords. |
| Peng, Li, Rong, Chen, Chen (2020) [ | A model with GT data and Random Forest Classification, developed from 20 countries worldwide, can be used for epidemic alert level prediction. | 202 countries. | 01 2020–04 2020 | Coronavirus, Pneumonia, Cough, Diarrhea, Fatigue, Fever, Nasal congestion and Rhinorrhea |
| Rabiolo, Alladio, Morales, McNaught, Bandello, Afifi, Marchese et al. (2021) [ | GT data could improve statistical models (ERS, ARIMA, and NNA models fitted on the first two principal components) of nowcasting and forecasting COVID-19 incidence with a 15-day time lag and could be used as one of surveillance systems for this disease. | AU, BR, FR, IN, IR, ZA, UK, US | 01 2015–07 2020 (weekly data) and 01 2020–12 2020 (daily data) | 20 |
| Turk, Tran, Rose, McWilliams (2021) [ | GT data were incorporated in a vector error correction model, which showed very good results in forecasting regional COVID-19 hospital census. | US | 02 2020–08 2020 | Coronavirus, covid testing + covid test + covid19 Testing + covid19 test + covid 19 Testing + covid 19 test, headache, pneumonia, “shortness of breath” + “trouble breathing” + “difficulty breathing”, CDC |
| Peng, Li, Rong, Pang, Chen, Chen (2021) [ | Random forest regression algorithm with integrated previous incidence and GT data was able to accurately predict increase in COVID-19 cases in most countries 7 days in advance. | 215 countries. | 01 2020–07 2020 | Fourteen terms, including coronavirus, pneumonia, and COVID-19; six symptom-related terms (cough, diarrhea, fatigue, fever, nasal congestion, and rhinorrhea); five prevention-related terms (hand washing, hand sanitizer, mask, social distance, and social isolation) |
IR—Iran, IN—India, US—United States, UK—United Kingdom, IT—Italy, AU—Australia, BR—Brazil, FR—France, and ZA—South Africa.
Publications with negative results of GT use for COVID-19 prediction and surveillance.
| Author and Year | The Main Findings about Google Trends | Country | Period | Keywords |
|---|---|---|---|---|
| Szmuda, Ali, Hetzger, Rosvall, Słoniewski (2020) [ | GT data did not correlate with COVID-19 incidence and mortality; however, they had a strong correlation with international WHO announcements. | 40 European countries. | 12 2019–04 2020 | Coronavirus |
| Asseo, Fierro, Slavutsky, Frasnelli, Niv (2020) [ | The correlation between internet searches for symptoms and new COVID-19 cases varied significantly over time. High fluctuations show that relying only on GT data to monitor the spread of COVID-19 is not a viable strategy. | IT, US | 03 2020–04 2020 | taste loss, smell loss, sight loss (control), hearing loss (control), COVID symptoms (and the same in Italian) |
| Muselli, Cofini, Desideri, Necozione (2021) [ | The volume of Google searches did not reflect the actual epidemiological situation. It has been seen that official communications and government activity has more impact on public interest in the disease. | IT | 12 2019–03 2020 | coronavirus, coronavirus symptoms (in Italian), coronavirus news (in Italian), and coronavirus Italy (in Italian) |
| Rovetta (2021) [ | Big number of anomalies seen in multiple cities’ relative search volumes (RSVs) made these data unusable for statistical inference. Furthermore, correlations varied greatly depending on the day RSVs were collected. | IT | 02 2020–12 2020 and 02 2020–05 2020 | coronavirus + covid |
| Satpathy, Kumar, Prasad (2021) [ | Correlations found between GT queries and COVID-19 cases maybe either because of media-coverage-induced curiosity or health-seeking curiosity. | IN | 01 2020–05 2020 | 88 terms in Hindi and English. |
| Sato, Mano, Iwata, Toda (2021) [ | Results suggest that search keywords, previously identified as candidates for COVID-19 prediction, might be unreliable. | JP, AU, CA, UK, IE, IN, SG, US, ZA | 10 2017–10 2020 | 54 English keywords and the corresponding 60 Japanese keywords. |
| Dagher, Lamé, Hubiche, Ezzedine, Duong (2021) [ | Google searches for chilblain were influenced by media coverage and government policies during the COVID-19 pandemic, showing that GT, as a monitoring tool for emerging infectious diseases, should be used with caution. | US, UK, FR, IT, ES, DE | 01 2020–05 2020 | (1) toe or chilblains and (2) coronavirus, |
| Madden, Feldman (2021) [ | Search terms do not give any evidences suggesting earlier COVID-19 spread. | US | 09 2015–03 2020 | Can’t smell OR can’t taste or smell OR why can’t i smell or taste OR why can’t i taste or smell anything |
| Sousa-Pinto, Anto, Czarlewski, Anto, Fonseca, Bousquet (2020) [ | COVID-19-related searches are more closely related to media coverage than to ongoing COVID-19 epidemic. | RA, AU, BE, BR, CA, CL, FR, DE, IT, PT, RU, ES, SE, CH, NL, UK, US | 2015 04–2020 05 | coronavirus, cough, anosmia, ageusia |
IT—Italy, US—United States, IN—India, JP—Japan, AU—Australia, CA—Canada, UK—United Kingdom, IE—Ireland, SG—Singapore, ZA—South Africa, FR—France, ES—Spain, DE—Germany, RA—Argentina, BE—Belgium, BR—Brazil, CL—Chile, PT—Portugal, RU—Russia, SE—Sweden, CH—Switzerland, and NL—The Netherlands.