Tomasz Szmuda1, Shan Ali2, Tarjei Vevang Hetzger3, Philip Rosvall3, Paweł Słoniewski1. 1. Neurosurgery Department, Medical University of Gdansk, Dębinki 7, Gdansk, 80-952, Poland. 2. Scientific Circle of Neurology and Neurosurgery, Medical University of Gdansk, Dębinki 7, Gdansk, 80-952, Poland. Electronic address: shanali@gumed.edu.pl. 3. Scientific Circle of Neurology and Neurosurgery, Medical University of Gdansk, Dębinki 7, Gdansk, 80-952, Poland.
Abstract
BACKGROUND: Previous studies on the novel Coronavirus (COVID-19) have found strong correlations between online searches and the epidemiology of the disease. AIM: Our aim was to determine if online searches for COVID-19 related to international media announcements or national epidemiology. METHODS: Searches for "coronavirus" were made on Google Trends from December 31, 2019 to April 13, 2020 for 40 European countries. The online COVID-19 searches for all countries were correlated with each other. COVID-10 epidemiology (i.e. incidence and mortality) was correlated with the national online searches. Major announcements by the World Health Organization (WHO) were taken into consideration with peaks in online searches. Correlations were made using Spearman's rank correlation coefficient. RESULTS: Overall, the online searches for COVID-19 were not correlated with the actual incidence and mortality of COVID-19. The mean Spearman correlation for incidence was 0.20 (range -0.66 to 0.76) and for mortality was 0.35 (range -0.75 to 0.85). Online searches in Europe were all strongly synchronized with each other; a mean Spearman correlation of 0.93 (range 0.62 to 0.99). CONCLUSIONS: Online searches for COVID-19 in Europe are not correlated with epidemiology but strongly correlated with international WHO announcements. Our study challenges previous Google Trends studies and emphasizes the role of the WHO in raising awareness of a new disease.
BACKGROUND: Previous studies on the novel Coronavirus (COVID-19) have found strong correlations between online searches and the epidemiology of the disease. AIM: Our aim was to determine if online searches for COVID-19 related to international media announcements or national epidemiology. METHODS: Searches for "coronavirus" were made on Google Trends from December 31, 2019 to April 13, 2020 for 40 European countries. The online COVID-19 searches for all countries were correlated with each other. COVID-10 epidemiology (i.e. incidence and mortality) was correlated with the national online searches. Major announcements by the World Health Organization (WHO) were taken into consideration with peaks in online searches. Correlations were made using Spearman's rank correlation coefficient. RESULTS: Overall, the online searches for COVID-19 were not correlated with the actual incidence and mortality of COVID-19. The mean Spearman correlation for incidence was 0.20 (range -0.66 to 0.76) and for mortality was 0.35 (range -0.75 to 0.85). Online searches in Europe were all strongly synchronized with each other; a mean Spearman correlation of 0.93 (range 0.62 to 0.99). CONCLUSIONS: Online searches for COVID-19 in Europe are not correlated with epidemiology but strongly correlated with international WHO announcements. Our study challenges previous Google Trends studies and emphasizes the role of the WHO in raising awareness of a new disease.
On December 31, 2019, the first cases of an unknown cause of pneumonia, in Wuhan, People's Republic of China, were reported to the World Health Organization (WHO). It was later dubbed the novel coronavirus, SARS-CoV-2, which caused Coronavirus disease (COVID-19). COVID-19 rapidly spread, being declared a health emergency of national concern by WHO on January 30, 2020, and finally being declared a pandemic on March 11, 2020 (World Health Organization).After the outbreak of COVID-19, much online research was done on the spread of the virus; some was conducted with Google Trends (among other modern analytical tools). Google trends allows one to view the relative search frequency of various search terms worldwide. It gathers its data from the Google search engine, which is the most common search engine in the world, with a worldwide market share of over 90% (Statcounter Global Stats). Several studies have used Google Trends and/or other search engine data to track the outbreak of COVID-19 in Taiwan, China, Europe and elsewhere (Husnayain et al., 2020, Li et al., 2020, Lin et al., 2020, Mavragani, 2020, Walker et al., 2020). All of these studies have compared the online searches for COVID-19 (or related terms) with the national epidemiology of COVID-19. Significant positive correlations between the epidemiology and online search frequency were found in all these studies; the conclusion was that search engines could be used as robust tool for online surveillance. However, these studies may have been subject to bias as they did not comprehensively cover all countries on a continent and only selectively picked a handful of countries for analysis. Moreover, previous studies showed that Google Trends is influenced more by “media clamor than by true epidemiological burden (Cervellin et al., 2017).” With the heavy worldwide attention that COVID-19 is receiving, it is possible that these correlations mentioned before mistakenly analyzed the increasing media clamor for COVID-19 instead of the increasing incidence of COVID-19.Our aim was to comprehensively assess if online searches for COVID-19 are dependent on the epidemiological burden of the disease or major media announcements. To do this, we evaluated all the countries in Europe that had sufficient data. In this way, our study may confirm to deny the use of Google Trends as a tool for online surveillance in the case of COVID-19.
Methods and materials
Search strategy
Searches for “coronavirus” were made as “topic searches” referring to the virus instead of an exact search term. This way, various synonyms, misspellings and translations from different languages were included in the analysis. Data was collected for analysis from December 31, 2019 to April 13, 2020 under the “health” category as shown in Figure 1
. All data were collected on April 16, 2020.
Figure 1
A display of settings for obtaining information about online searches concerning Coronavirus. In this case the trends are shown for Germany. Information for all 40 European countries were gathered in this way.
A display of settings for obtaining information about online searches concerning Coronavirus. In this case the trends are shown for Germany. Information for all 40 European countries were gathered in this way.From December 30, 2019 to April 13, 2020, Google made up 92% of the search engine market share worldwide and 93% of the market share in Europe. Thus, the online data gathered are robust (Statcounter Global Stats).
European countries included
The 44 countries deemed to be in Europe, according to the United Nations, were taken into analysis. These countries were: Albania, Andorra, Austria, Belarus, Belgium, Bosnia and Herzegovina, Bulgaria, Croatia, Czechia, Denmark, Estonia, Finland, France, Germany, Greece, Holy See, Hungary, Iceland, Ireland, Italy, Latvia, Liechtenstein, Lithuania, Luxembourg, Malta, Moldova, Monaco, Montenegro, Netherlands, North Macedonia, Norway, Poland, Portugal, Romania, Russia, San Marino, Serbia, Slovakia, Slovenia, Spain, Sweden, Switzerland, Ukraine and the United Kingdom. However, some of these countries that tended to be geographically smaller (Holy See, Liechtenstein, Monaco and San Marino) had insufficient data from Google Trends and were not analyzed. Thus, altogether, 40 countries were taken into the analysis.
WHO announcements
To find WHO announcements that were considered significant to be analyzed with online health patterns we looked on the official WHO page listing the updates. All dated announcements highlighted on the “summary” sidebar of WHO's official updates page (on April 16, 2020) were considered significant. There were 3 of these announcements, namely: when a pneumonia of unknown cause was first reported to WHO (on 31 December, 2019), when WHO declared COVID-19 a “Public Health Emergency of International Concern” (on January 30, 2020) and when WHO announced “COVID-19” as the name for the new coronavirus disease (February 11, 2020). Next, any declarations on the epidemiological status of the virus were taken into account. This yielded one unique announcement when WHO declared COVID-19 a pandemic (on March 11). Next, any announcement(s) addressing a particular European country or Europe at-large were considered (this was done since our analysis focuses on Europe); this yielded 2 announcements: when a WHO-led team of experts travelled to Italy (on February 24, 2020) and when Europe was declared the epicenter of the pandemic (on 13 March 2020). Together this yielded 6 announcements.
Method of analysis
First, the online COVID-19 searches for all countries were graphed and a line of best fit was drawn. Major announcements by the WHO organization were taken into consideration with peaks in online searches. Second, the online COVID-19 searches for all 40 countries were correlated with each other. This was to determine if countries all had their own unique patterns of searches or if they were the same throughout Europe. Third, the COVID-10 epidemiology (i.e. incidence and mortality) was correlated with the online searches in each specific country. This was to determine if online searches correlated more to the epidemiology of COVID-19 in a particular country or to major WHO media announcements. Epidemiological data was provided from publicly available data files from the European Centre for Disease Prevention and Control (European Centre for Disease Prevention and Control, 2020).
Statistical analysis
Spearman's rank correlation coefficient was used to ascertain the strength and direction of a relationship between 2 variables. P < 0.05 was deemed significant. Google Sheets (Google LLC) and Past (Hammer and Harper, Oyvind Hammer, Natural History Museum, University of Oslo) was used for statistical analysis and illustrations.
Results
Figure 2
shows that the online search trends for Coronavirus from December 31, 2019 to April 13, 2020 are similar for all 40 European countries. Most countries have a peak in searches on January 30, February 11, February 24 and March 11. On these dates the WHO made major announcements.
Figure 2
A scatter plot of the relative online search volumes for Coronavirus of 40 European countries from December 31, 2019 to April 13, 2020. A line of best fit is drawn to show the similarities of the trends. Points marked A, B, C, D, E and F signify announcements from the World Health organization (WHO). (A) On December 31 a pneumonia of unknown cause was first reported to WHO. (B) On January 30, WHO declared COVID-19 a “Public Health Emergency of International Concern”. (C) On February 11, WHO announced “COVID-19” as the name for the new coronavirus disease. (D) On February 24, a WHO-led team of experts travelled to Italy. (E) On March 11, WHO declared COVID-19 a pandemic. (F) on 13 March, Europe was declared the epicenter of the pandemic.
A scatter plot of the relative online search volumes for Coronavirus of 40 European countries from December 31, 2019 to April 13, 2020. A line of best fit is drawn to show the similarities of the trends. Points marked A, B, C, D, E and F signify announcements from the World Health organization (WHO). (A) On December 31 a pneumonia of unknown cause was first reported to WHO. (B) On January 30, WHO declared COVID-19 a “Public Health Emergency of International Concern”. (C) On February 11, WHO announced “COVID-19” as the name for the new coronavirus disease. (D) On February 24, a WHO-led team of experts travelled to Italy. (E) On March 11, WHO declared COVID-19 a pandemic. (F) on 13 March, Europe was declared the epicenter of the pandemic.Figure 3
shows a Spearman rank correlation analyzing the online search trends of Coronavirus in 40 European countries. The figure shows that European online searches concerning COVID-19 are synchronized with each other since all the correlations are greater than 0.60 with a mean of 0.93 (range 0.62 to 0.99). Belarus, Russia and Italy have slightly weaker correlations (around 0.06) with the rest of the European countries.
Figure 3
Spearman's rank correlation coefficient values are represented on a spectrum of blue or red dots; blue values are closer to 1.0 (a perfect positive correlation) and red values are closer to 0.6 (a slightly weaker positive correlation). Note that all countries indicate a positive correlation in search trends for Coronavirus (since they are all above 0.6) from December 31, 2019 to April 13, 2020 in selected European countries. Abbreviations: LU, Luxembourg; UK, United Kingdom; NL, Netherlands; CH, Switzerland; BiH, Bosnia and Herzegovina; MNE, Montenegro; MKD, North Macedonia.
Spearman's rank correlation coefficient values are represented on a spectrum of blue or red dots; blue values are closer to 1.0 (a perfect positive correlation) and red values are closer to 0.6 (a slightly weaker positive correlation). Note that all countries indicate a positive correlation in search trends for Coronavirus (since they are all above 0.6) from December 31, 2019 to April 13, 2020 in selected European countries. Abbreviations: LU, Luxembourg; UK, United Kingdom; NL, Netherlands; CH, Switzerland; BiH, Bosnia and Herzegovina; MNE, Montenegro; MKD, North Macedonia.The online searches for COVID-19 were correlated with the actual incidence and mortality of COVID-19 in all 40 European countries analyzed. This is shown in Table 1
. The mean Spearman correlation for incidence was 0.20 (range −0.66 to 0.76) and for mortality was 0.35 (range −0.75 to 0.85).
Table 1
Spearman's rank correlation coefficient values for correlations between the online searches for coronavirus and the mortality and the incidence of coronavirus in 40 European countries from December 31, 2019 to April 13, 2020. Abbreviations: UK, United Kingdom; BiH, Bosnia and Herzegovina; MKD, North Macedonia.
Country
Spearman's Rank correlation coefficient
Searches and mortality
Searches and incidence
Albania
−0.49
−0.57
Andorra
−0.11
−0.61
Austria
0.46
0.77
Belarus
0.38
0.68
Belgium
0.61
0.81
BiH
−0.54
−0.53
Bulgaria
−0.19
−0.14
Croatia
0.34
0.74
Czechia
0.26
0.69
Denmark
0.52
0.82
Estonia
0.24
0.72
Finland
0.25
0.63
France
0.73
0.83
Germany
0.51
0.80
Greece
0.58
0.79
Hungary
−0.3
−0.44
Iceland
0.1
0.73
Ireland
0.5
0.77
Italy
0.67
0.73
Latvia
−0.1
−0.16
Lithuania
0.39
0.71
Luxembourg
0.4
0.69
Malta
−0.21
−0.55
Moldova
0.28
−0.08
Montenegro
−0.33
−0.30
Netherlands
0.55
0.72
MKD
0.71
0.41
Norway
0.36
0.74
Poland
−0.54
−0.45
Portugal
−0.66
−0.53
Romania
0.45
0.78
Russia
0.59
0.79
Serbia
−0.65
−0.75
Slovakia
−0.17
−0.3
Slovenia
−0.45
0.22
Spain
0.76
0.85
Sweden
0.53
0.78
Switzerland
0.59
0.76
Ukraine
−0.32
−0.25
UK
0.74
0.84
Spearman's rank correlation coefficient values for correlations between the online searches for coronavirus and the mortality and the incidence of coronavirus in 40 European countries from December 31, 2019 to April 13, 2020. Abbreviations: UK, United Kingdom; BiH, Bosnia and Herzegovina; MKD, North Macedonia.In summary, online trends are more correlated with major WHO announcements (as seen in Figure 2, Figure 3) than epidemiology of COVID-19 (as seen in Table 1).
Discussion
Key findings
We found that online searches for COVID-19 in Europe are weakly correlated with the epidemiology (i.e. incidence and mortality) but strongly correlated with WHO announcements. In other words, the WHO greatly determines the search trends across Europe. This finding emphasizes the role of the WHO in raising awareness of a new disease. While several Google Trends studies have already been done on COVID-19, our findings are novel as they highlight that search trends seem to follow international major media announcements rather than national epidemiology. This challenges the current scientific evidence concerning COVID-19 (Husnayain et al., 2020, Li et al., 2020, Lin et al., 2020, Mavragani, 2020, Walker et al., 2020). Moreover, this study analyzed 40 European countries using a comprehensive selection criterion; to date, this is the largest sample pool ever taken into analysis for an online COVID-19 study.
Context
We understand that Google trends may be used for real-time surveillance to some extent as many studies have been done in the past suggesting this (Carneiro and Mylonakis, 2009, Seifter et al., 2010, Pervaiz et al., 2012, Alicino et al., 2015, Al-garadi et al., 2016, Hassid et al., 2017, Husnayain et al., 2019, Santangelo et al., 2019). However, Google Trends is most effective in tracking a new disease outbreak in the absence of related media coverage. In our study, we showed how online search spikes for COVID-19 were likely in reaction to WHO media reports. Past studies have shown that media influences online search trends more than epidemiology (Cervellin et al., 2017). We encourage future scientists to analyze online searches with epidemiology, but also to take into consideration media coverage in their analysis to gain a more wholesome picture of the situation.A 5 country analysis (of Italy, Spain, France, Germany, and the United Kingdom) found that online searches of COVID-19 strongly correlated with the epidemiology (Mavragani, 2020). Our study confirms this relatively strong positive correlation (as seen in Table 1), however, most other European countries had a much lower correlation as the mean correlation among the 40 countries were 0.33 for incidence and 0.17 for mortality. Therefore, we believe this to be a major selection bias as negative results were not reported. Our study has a more robust method as we indiscriminately chose all the countries in Europe that had sufficient data.Several studies have been conducted on infectious disease outbreaks. Lin et al. found that searches for “wash hands” strongly correlated (Pearson's correlation coefficient of −0.70) with a lower spreading speed of COVID-19 (Lin et al., 2020). A study found that Middle East respiratory syndrome and COVID-19 spread correlated with Google searches (Shin et al., 2016, Husnayain et al., 2020, Li et al., 2020). Walker et al. found a strong correlation between online searches related to the loss of smell (a symptom of COVID-19) with COVID-19 epidemiology (Walker et al., 2020). However, all these studies were prone to selection bias as their sample pool of analyzed countries was small, as explained before. Moreover, we suggest that online search trends simply overlapped with epidemiology since disease incidence increases at the same time when major media announcements are made. In other words, Google trends is better at analyzing media clamor than actual disease incidence.In our study, we found that the online search patterns for Europe were all very similar. However, Belarus, Russia, and Italy had slightly different online search patterns (as seen in Figure 3). Notably, Google has a market share of 93% in Europe, however, it only has a market share of 55% in Russia and 78% in Belarus (Statcounter Global Stats). Thus, Russia's and Belarus's poor correlation with the rest of the European countries analyzed may have resulted from this difference of the search engine used. Italy was the first European country with a major outbreak of COVID-19; this may explain why Italy had a slightly different online search trend than the rest of western Europe.
Limitations of analysis
Testing for COVID-19 varies among each European country; thus, the reported incidence among each country may be far lower than the actual incidence of COVID-19. Therefore, for this study we also took into account the daily mortality rate, which may be more consistent with the actual number of fatalities.The demographics (i.e. age, gender, location or education level) of online searches and COVID-19 cases were not taken into analysis. This information is not currently available; however, future studies may analyze this.We focused our media analysis only on WHO announcements since the WHO is responsible for international public health, because it has established a prominent authority on the international stage, and because national policy in a greatly influenced by WHO suggestions. However, other national and international announcements (that may have influenced public online search patterns) were not analyzed. This is a limitation of the study. While it is beyond the scope of this paper, we encourage future researchers to develop a computer bot (that crawls the web and automatically records relevant information) to go through hundreds of announcements (and in various languages) from local governments, news outlets, public figures, prominent social media accounts, etc. to conduct a more complete analysis.
Future directions
Future studies may choose to focus on analyzing Asian countries. This analysis would require the use of additional search engines (i.e. Baidu, Yandex RU and Sogu), since they are more popular than Google in parts of Asia. Additionally, the online analysis may be expanded not only to various search engines but also to public health behavior on various other social media platforms (i.e. Facebook, Twitter, Instagram).
Public health implications
Each country in Europe acted independently of each other in regards to quarantine, closing borders with other countries and closing public areas (e.g. schools, movie theaters, shops), canceling mass events and testing for COVID-19 among the population. However, the online search trends among all the 40 European countries analyzed were strong. This shows that although the WHO only plays an advisory role, it still exerts major influence on the public perception of a disease across Europe.Google trends has classically been used in other studies to track the spread of diseases. However, our study showed that it may be a better tool for tracking the dissemination of information. Global and national institutions may use Google Trends to see if their announcements created a spike in public searches online. By tracking the public response to an announcement, institutions may evaluate the level of public unrest. This method may also help identify mis-information circulating throughout the Internet. During the COVID-19 pandemic, online telecommunications are emphasized and utilized more often to limit the spread of disease (Szmuda et al., 2020b). This allows online surveillance, such as with Google Trends, to be even more powerful during this time as populations rely more heavily on online sources of information. To ensure patient safety and help reduce the spread of mis-information, we encourage physicians to share understandable, high-quality and reliable health information online using trusted sources (Basch et al., 2020, Szmuda et al., 2020a, Szmuda et al., 2020e, Szmuda et al., 2020d, Szmuda et al., 2020c).
Conclusion
In our paper, we challenged recent literature which found that Google Trends was able to surveil the epidemiology of COVID-19. Though a comprehensive methodology, our study shows that these findings in previous publications are most likely incidental. Instead, Google trends, in the case of COVID-19, may be better suited to surveil the public response to major news announcements such as those from the WHO.
Authors: Adam P Tobias; Jonathan Berg; Roseanne Cetnarskyj; Zosia Miedzybrodzka; Mary E Porteous; Edward S Tobias Journal: Front Genet Date: 2021-07-08 Impact factor: 4.599