Literature DB >> 27869135

Heat stroke internet searches can be a new heatwave health warning surveillance indicator.

Tiantian Li1, Fan Ding2, Qinghua Sun1, Yi Zhang1, Patrick L Kinney3.   

Abstract

The impact of major heatwave shocks on population morbidity and mortality has become an urgent public health concern. However, Current heatwave warning systems suffer from a lack of validation and an inability to provide accurate health risk warnings in a timely way. Here we conducted a correlation and linear regression analysis to test the relationship between heat stroke internet searches and heat stroke health outcomes in Shanghai, China, during the summer of 2013. We show that the resulting heatstroke index captures much of the variation in heat stroke cases and deaths. The correlation between heat stroke deaths, the search index and the incidence of heat stroke is higher than the correlation with maximum temperature. This study highlights a fast and effective heatwave health warning indicator with potential to be used throughout the world.

Entities:  

Year:  2016        PMID: 27869135      PMCID: PMC5116664          DOI: 10.1038/srep37294

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Over the last few decades, the impact of major heatwave shocks on population morbidity and mortality has become an urgent public health concern12. In the future, the frequency, intensity and duration of heatwaves is likely to increase due to climate change34 and there is growing recognition of the risks associated with the warming climate and the need for effective measures to anticipate and avoid its most severe consequences. Reducing the risk to health has been highlighted as a priority issue by the World Health Organization56. However, a lack of knowledge and preparation within the medical and healthcare systems for dealing with the effects of heatwaves is evident in most countries around the world7. Current heatwave warning systems suffer from a lack of validation and an inability to provide accurate health risk warnings in a timely way. Surveillance during heatwave conditions is of critical public health concern, in order to determine whether the heatwave is associated with an increase in mortality or morbidity so that appropriate public health actions can be taken. However gathering epidemiological surveillance data using traditional methods is a labor-intensive process involving large surveys, chart reviews, prospective studies as well as data extraction and processing from databases. Current public health surveillance methods of heatwaves do not provide the information quickly enough to detect the increased number of adverse health outcomes in time to respond7 so its use is normally limited to retrospective analyses in order to better understand the relationship between heatwaves and health. As a result, effective surveillance systems for adverse health effects during heatwaves are very rare8 despite the fact that prompt surveillance information during a heatwave may be of great benefit to public health. Mortality data is widely used in surveillance for heatwave health analysis, however there is a considerable time lag between hot conditions and the reporting of deaths, which diminishes its usefulness in affecting health outcomes7. Furthermore, the mortality data represents the worst health outcome, which is not necessarily appropriate for use in characterizing more general public health concerns during the heatwave period9. Recently, syndromic surveillance data have been used in heatwave health surveillance, such as by monitoring emergency department visits and calls to emergency or health services101112. Even though syndromic surveillance is near real-time and forms a better representation of public health, it still does not reflect the heatwave related public health risk rapidly enough for health services to take well-timed proactive measures such as implementing a prevention plan and disseminating health warning information to ameliorate health risks13. There are still several reasons (such as delays in reporting from doctors, operational errors, old computer technology, etc.) that delay the reporting of health surveillance information. Making improvements in any of these areas would involve the expenditure of a large amount of human capital, administration costs and investment funds. In current heatwave health warning systems, the prediction of possible health effects is done by modeling the relationship between temperature and health. Although this method is widely used in epidemiological studies, it has some weaknesses. Controlling for multiple confounding factors is very challenging when predicting health risks using temperature as an indicator14. From previous epidemiological studies, heatwaves are associated with a forward displacement of mortality known as the harvest effect. So far there is no method for accurately modeling the harvest effect during a heatwave and it is very difficult to predict health outcomes in the presence of the harvest effect15. Furthermore, the availability of surveillance data on health outcomes is very rare in most countries. Most of the temperature and health outcomes are studied using mortality data while few use morbidity data, but the former is not well suited for representing general population health and it still leaves a high level of uncertainty in health risk prediction during heatwaves. Nowadays, the increasing availability of datasets from sources such as social media posts, search engine queries and other internet data have shown the potential for analyzing patterns, trends and social phenomena in a variety of domains including finance1617, science18, tourism1920 and health21222324. In this study, we conducted a correlation and linear regression analysis to test the relationship between heat stroke internet searches and heat stroke health outcomes in Shanghai, China, during the summer of 2013. We also developed and tested a new preliminary surveillance proxy during the heatwave period aimed at addressing the shortcomings of current heatwave health warning surveillance.

Results

Our analysis is based upon 92 days of data from June 1 to August 31, 2013. The mean daily maximum temperature during this period was 32.9 °C, with a maximum temperature of 39.9 °C. In China, a heatwave is defined as three consecutive days in which the maximum temperature is at or above 35 °C25. There were 23 days in which the temperature reached 35 °C, which led to four heatwaves, two of which lasted more than 10 days. The longest heatwave continued for 15 days. There were on average 8.0 heat stroke cases and 0.8 heat stroke deaths per day over the summer period. The maximum number of heat stroke cases in one day was 93 and the most deaths in one day was 12. Table 1 shows that the mean daily heat stroke internet search index, as published by the Baidu search engine was 450.4 and the maximum index was 1165.0. Figure 1 plots the temperature, internet search index, number of cases of heat stroke and number of deaths from heat stroke for the summer period. It can be seen that the heat stroke index captures the variation of heat stroke cases and deaths very well. Whereas, the maximum temperature time series plots do not appear to correlate as well with heat stroke cases and deaths. Table S1 shows this in quantitative terms in that the correlation between the number of heat stroke cases and the search index was higher than the correlation between maximum temperature and heat stroke cases. Figure 2(a and b) show that the search index, with an R2 of 0.80, is a better predictor of heat stroke cases than the maximum temperature. Table S3 shows that the correlation between heat stroke cases and deaths with the search index was higher than its correlation with maximum temperature during the overall study period.
Table 1

Descriptive statistical analysis of data from June 1 to Aug 31, 2013, Shanghai.

 NMeanSDMinMax
Maximum Temperature9232.95.222.039.9
Searching Index92450.4329.3116.01165.0
Heat Stroke Cases928.017.0093.0
Heat Stroke Deaths920.81.9012.0
Figure 1

Time series plots of maximum temperature, heat stroke internet searching index, heat stroke surveillance data in the summer of 2013, Shanghai.

Figure 2

plots of variables for the whole summer period and linear regression model for temperatures above 35 °C.

(a) Heat stroke cases ~ Maximum Temperature (same day); (b) Heat stroke cases ~ Searching index (same day); (c) Heat stroke deaths ~ Maximum Temperature (lag 1 day); (d) Heat stroke deaths ~ Search index (lag 1 day); (e) Heat stroke deaths ~ Heat stroke cases (lag 1 day). Red dots represent occasions when the maximum temperature was ≥35 °C, and the red lines represent the linear regression line for those occasions.

The same day correlation was the highest for both the search index and maximum temperature. When only using the data for which the maximum temperature is at or above 35 °C, we observed higher values for the same day correlation, in particular, the same day correlation between heat stroke cases and search index had a value of 0.89 (p < 0.01) (Table S2). The lag 1 day correlation was highest among all the lag day variations. When only taking data where there was a maximum temperature at or above 35 °C, the lag 1 day correlation values were larger, with a correlation coefficient of 0.81 (p < 0.01) for the heat stroke search index and 0.86 (p < 0.01) for heat stroke cases (Table S4). Figure 2(c–e) show that the 1 day lagged search index and 1 day lagged heat stroke cases were both good indicators for predicting the number of heat stroke deaths with an R2 of 0.66 and 0.73 respectively. Figure 3 shows the number of days for the delay of heat stroke case registration. The surveillance data we used in Shanghai for this study is nearly real time, collecting the heat stroke cases directly from hospital and transferred by website. However, only 5% of cases reported on the same day. Hence, it is very difficult to identify the heat wave health epidemic rapidly using present surveillance data.
Figure 3

Reporting lag days of heat stroke case registration.

Discussion

Our study showed that the relationship between heat stroke internet searches and health outcomes is much higher than that between temperature and health outcomes. The heat stroke internet searches had better predictive ability for health risk than temperature during the heatwave. The temperature and health epidemiological studies show the lag effect of mortality, is in agreement with previous studies1526. Our study also shows a significant correlation between heat stroke cases and heat stroke deaths. This indicates that morbidity surveillance data could be a good indicator for mortality during a heatwave. Hence, the identification and prevention of morbidity during a heatwave can be expected to decrease the mortality rate. A new method of syndromic surveillance has emerged8222728 which is known as web-based keyword searching. This has been shown to be a feasible surveillance method for influenza, kidney stone disease, dengue and other conditions. Heat stroke is a widely known effect of heatwaves and is associated with easily recognizable heatwave related syndromic keywords in search engines. The consistency of both factors in our study is much higher than in previous studies of other diseases and internet keyword searching8282930. Therefore, the heat stroke internet search is well suited for use as a new heatwave surveillance health outcome proxy. In 2013, the internet search based Google Flu Trends was predicting more than double the proportion of doctor visits for influenza like illnesses than the Center for Disease Control and Prevention (CDC)31. Lazer et al. explored the “big data hubris” and algorithm dynamics that contributed to Google Flu Trends’ mistakes31. The problem was mainly due to a mismatch between number of searches, about 50 million, and the small volume of surveillance data of 1152 cases31. In our study, the number of searches is about 450 per day and the number of heat stroke cases and deaths is about 8 and 1 per day respectively. The search index is at most several hundred times the number of health outcomes, which is much smaller than that of Google Flu Trends. Google Flu Trends also used an algorithm to capture the dynamics of the cases31. However, in our analysis, the search index value was directly compared with heat stroke health outcomes, which capture the variation in cases and deaths very well. Therefore, heat stroke searches can be expected to largely avoid the two issues that led to Google Flu Trends prediction errors. A recent study of Google Flu Trends showed that using the aggregate frequency of selected queries as the only predictor could lead the prediction errors32. We also aggregated all queries about “heat stroke” in a single predictor, hence our indicator may also be prone to similar errors. However, the purpose of this study is not to build a predictive model for heat wave related health outcomes. We identify this potential weakness for further studies to address. Recently the use of web based searching for disease identification has been gaining interest and mining the web is a valuable new direction to quickly identify diseases and epidemics8242933. Our study is the first to show that web based searching could be useful for predicting risks to health during heatwaves and provides a new heatwave health warning system. This offers the prospect that stakeholders could recognize the heatwave health epidemic in a timely and cost effective way, which could translate into a practical and rapid health protection response. It is however also important to note the limitations of this new syndromic surveillance tool. Demographic information on users is not available. Therefore, it is not possible to identify the most vulnerable population during the heatwave. In addition, the precise reason for users searching for terms is not clear. The keyword “heat stroke” is not the only one that could be used and the effectiveness of alternative keywords will be investigated in further research. There are also many confounding factors that could not be modeled. It is noteworthy that publicity due to a health awareness campaign or items from the media can affect internet searches23. This could be addressed using a web-based database to monitor news items, which could be used to adjust the model. In Shanghai about 30% of people do not have access to the internet in 201334 and it is possible that this group includes a disproportionate number of older people who are especially vulnerable to the effects of heatwaves. This may be another source of uncertainty in using internet searches for heatwave surveillance data. As a result of data limitations, we have only been able to test the internet searches during heatwaves in one location during one summer period. Further testing of the relationship between “heat stroke” internet searches and health outcomes during heatwaves in other locations in different heatwave periods is anticipated in the future. Internet searches are easily accessible and economical, which could be of benefit for the early warning of health risks and taking preventive measures during the heat wave period. A unique strength of internet searches is the immediacy of access to the data, which provides the basis for an alternative real-time health surveillance system. Compared with traditional surveillance data, internet searches could be used for recognizing the onset of epidemics more quickly and producing real-time health warnings during heatwaves. Heatwaves are a global public health concern and will be more frequent and severe in a changing climate. However, health surveillance during heatwaves and heatwave health warning systems are very rare around the world. Our study shows that heat stroke internet searching could form a new tool for confronting the challenge of heatwaves worldwide. Nowadays, with the rapid development of the internet, web searching will become more accurate and representative of the whole population. It could be used in different areas, especially in regions with no health surveillance records.

Methods

The study period was June 1 to Aug 30, 2013. The data on heat stroke cases and deaths in Shanghai, China were obtained from the heat stroke surveillance register of the Chinese Center for Disease Control and Prevention. Temperature data for the Shanghai Xujiahui station was obtained from the Chinese Meteorological Data Sharing Service System. Daily index values in web searches containing the key word “heat stroke” within Shanghai were collected from the Baidu index website. “Heat stroke” corresponds to an exclusive technical term in Chinese, which is “”. We used the Chinese characters “” as the key word to collect the corresponding Baidu index values. The Baidu index reflects keyword search volume, which was developed by Baidu Inc. It uses the search queries on Baidu web searches to calculate the Baidu index, and represents the search frequency of certain keywords on Baidu web searches based on the actual searching behavior of its users35. In 2013, Baidu occupied more than 80% of the Chinese search engine market36, making it better suited than Google trends to model the search behavior of the Chinese Population. In recent years, the Baidu index has been used to forecast infectious diseases, tourist volumes, economic indicators etc.16,19,37. Shanghai is one of the fastest developing cities in China, the number of internet users was 16.83 million with an internet penetration rate of 70.7% in 201334. Therefore, the Baidu index provides a good representation of searching behavior in Shanghai. Correlation analysis and linear regression analysis were performed in this study using R 3.2.2.

Additional Information

How to cite this article: Li, T. et al. Heat stroke internet searches can be a new heatwave health warning surveillance indicator. Sci. Rep. 6, 37294; doi: 10.1038/srep37294 (2016). Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
  28 in total

1.  More intense, more frequent, and longer lasting heat waves in the 21st century.

Authors:  Gerald A Meehl; Claudia Tebaldi
Journal:  Science       Date:  2004-08-13       Impact factor: 47.728

2.  Surveillance of Australian suicidal behaviour using the internet?

Authors:  Andrew Page; Shu-Sen Chang; David Gunnell
Journal:  Aust N Z J Psychiatry       Date:  2011-10-29       Impact factor: 5.744

Review 3.  A stitch in time: improving public health early warning systems for extreme weather events.

Authors:  Kristie L Ebi; Jordana K Schmier
Journal:  Epidemiol Rev       Date:  2005       Impact factor: 6.222

Review 4.  Early detection of disease outbreaks using the Internet.

Authors:  Kumanan Wilson; John S Brownstein
Journal:  CMAJ       Date:  2009-04-14       Impact factor: 8.262

5.  The effect of temperature on mortality in Stockholm 1998--2003: a study of lag structures and heatwave effects.

Authors:  Joacim Rocklöv; Bertil Forsberg
Journal:  Scand J Public Health       Date:  2008-06-20       Impact factor: 3.021

6.  Internet search trends analysis tools can provide real-time data on kidney stone disease in the United States.

Authors:  Scott D Willard; Mike M Nguyen
Journal:  Urology       Date:  2011-06-15       Impact factor: 2.649

7.  Assessment of a syndromic surveillance system based on morbidity data: results from the Oscour network during a heat wave.

Authors:  Loïc Josseran; Anne Fouillet; Nadège Caillère; Dominique Brun-Ney; Danièle Ilef; Gilles Brucker; Helena Medeiros; Pascal Astagneau
Journal:  PLoS One       Date:  2010-08-09       Impact factor: 3.240

8.  Heat waves in the United States: mortality risk during heat waves and effect modification by heat wave characteristics in 43 U.S. communities.

Authors:  G Brooke Anderson; Michelle L Bell
Journal:  Environ Health Perspect       Date:  2010-10-07       Impact factor: 9.031

Review 9.  Heatwave early warning systems and adaptation advice to reduce human health consequences of heatwaves.

Authors:  Dianne Lowe; Kristie L Ebi; Bertil Forsberg
Journal:  Int J Environ Res Public Health       Date:  2011-12-12       Impact factor: 3.390

10.  Correlation between national influenza surveillance data and google trends in South Korea.

Authors:  Sungjin Cho; Chang Hwan Sohn; Min Woo Jo; Soo-Yong Shin; Jae Ho Lee; Seoung Mok Ryoo; Won Young Kim; Dong-Woo Seo
Journal:  PLoS One       Date:  2013-12-05       Impact factor: 3.240

View more
  6 in total

1.  Internet searches and heat-related emergency department visits in the United States.

Authors:  Quinn H Adams; Yuantong Sun; Shengzhi Sun; Gregory A Wellenius
Journal:  Sci Rep       Date:  2022-05-31       Impact factor: 4.996

2.  An evaluation of Internet searches as a marker of trends in population mental health in the US.

Authors:  Evgeniy Gabrilovich; Gregory A Wellenius; Uma Vaidyanathan; Yuantong Sun; Tomer Shekel; Katherine Chou; Sandro Galea
Journal:  Sci Rep       Date:  2022-05-27       Impact factor: 4.996

3.  Using web data to improve surveillance for heat sensitive health outcomes.

Authors:  Jihoon Jung; Christopher K Uejio; Chris Duclos; Melissa Jordan
Journal:  Environ Health       Date:  2019-07-09       Impact factor: 5.984

4.  Association between active cooling and lower mortality among patients with heat stroke and heat exhaustion.

Authors:  Jun Kanda; Shinji Nakahara; Shunsuke Nakamura; Yasufumi Miyake; Keiki Shimizu; Shoji Yokobori; Arino Yaguchi; Tetsuya Sakamoto
Journal:  PLoS One       Date:  2021-11-17       Impact factor: 3.240

5.  Web-Based Data to Quantify Meteorological and Geographical Effects on Heat Stroke: Case Study in China.

Authors:  Qinmei Han; Zhao Liu; Junwen Jia; Bruce T Anderson; Wei Xu; Peijun Shi
Journal:  Geohealth       Date:  2022-08-01

6.  Determining the correlation between outdoor heatstroke incidence and climate elements in Daegu metropolitan city.

Authors:  Jung Ho Kim; Hyun Wook Ryoo; Sungbae Moon; Tae Chang Jang; Sang Chan Jin; You Ho Mun; Byung Soo Do; Sam Beom Lee; Jong-Yeon Kim
Journal:  Yeungnam Univ J Med       Date:  2019-07-02
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.