Literature DB >> 32362698

Modeling respiratory illnesses with change point: A lesson from the SARS epidemic in Hong Kong.

Heung Wong1, Quanxi Shao2, Wai-Cheung Ip1.   

Abstract

It is generally agreed that respiratory disease is closely related to ambient air quality and weather conditions. Besides, hygiene related factors such as the public health measures by the government and possible personal awareness in the community can also affect the spread of infectious respiratory diseases. However, there is no quantitative support for this conclusion, because of lack of quality data. The severe acute respiratory syndrome (or SARS) outbreak in 2003 triggered strict public health measures and personal awareness in the prevention of infectious respiratory diseases, providing us an opportunity to quantify the impact of hygiene related factors in the spread of the disease. In this paper, we model the number of the respiratory illnesses by a semiparametric model which models the environmental and weather impacts using a multiple index model and the impact of other public health measures and possible personal awareness using a growth curve with jump. Using data from Hong Kong, we found that public health measures contributed to about 39% of reduction in the number of respiratory illnesses during the SARS period. However, the impact of hygienically related factors eventually fades as time passes. The results provide indirect quantitative support to the usefulness of governmental campaigns to arouse the awareness of the public in staying away from transmission of respiratory diseases during the full outbreak of the disease. The results also show the fast fading of alertness of Hong Kong people towards the epidemic. Furthermore, our model also offers a way to model the impacts of environmental factors on respiratory diseases, when the data contains the effect of human intervention, by introducing the change point and growth curve to remove such an effect.
Copyright © 2012 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Air pollution; Change point problem; Multiple index model; Non-parametric regression; Public health; Respiratory illness; Severe acute respiratory syndrome (SARS)

Year:  2012        PMID: 32362698      PMCID: PMC7185837          DOI: 10.1016/j.csda.2012.07.029

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


Introduction

It is widely agreed that the exposure to ambient air pollution may cause serious respiratory illnesses (Katsouyanni et al., 1996, Navrud, 2001, Aunan and Pan, 2004, Neuberger et al., 2004, Scoggins et al., 2004, Jalaludin et al., 2004, Maynard, 2004, Lee et al., 2009) and that weather conditions may also contribute to the seriousness (Smoyer et al., 2000). Several statistical models have been used to assess the impacts of air quality and weather factors on respiratory illnesses, including simple statistical correlation analysis, regression method, meta-analysis, spatial analysis and nonparametric statistical models such as the generalized additive model (GAM). However, quantifying the effects of various pollutants as well as weather conditions is always a difficult task due to the high unknown nonlinearities on the impact of these environmental and weather factors on the onset of the respiratory illnesses and possible interactions amongst these factors. Nonparametric regression is a powerful tool to model unknown functional relationships. With the consideration of a possible incubative period and/or cumulative effects, we need to consider the lags of potential factors that result in a large number of covariates in the regression. In order to avoid the so-called ‘curse of dimensions’ referring to the inaccuracy when the multivariate function is estimated nonparametrically, some statistical tools have been developed so as to let the model determine the best combinations in the sense of their contribution to the regression. Representative models include a varying/functional coefficient model (Chen and Tsay, 1993, Hastie and Tibshirani, 1993), regression graphics (Cook, 1998), sliced inverse regression (Li, 1991), single index model (Härdle and Stoker, 1989) and multiple index model (Xia et al., 2002). In this paper, we will adopt the multiple index model for our data, following Xia et al. (2002) and Shao et al. (2010). Further complexity in modeling the number of respiratory illnesses is made by the effect due to hygienic related factors such as the public health measures by government and possible personal awareness in the community because they can also affect the spread of infectious respiratory diseases. Unfortunately, the effect by hygiene related factors has not been considered in literature, partly because of the lack of quality information which in turn makes it difficult to quantitatively measure and support the effect of hygiene related factors on respiratory illnesses. From the data collected in Hong Kong, we observed an unusual change in the number of respiratory illnesses in early 2003, while all other air quality and weather data were relatively stable. The change coincides with the outbreak of severe acute respiratory syndrome (or SARS). The background and severity of this disease is briefed below. SARS is a respiratory disease in humans and is caused by the SARS-associated coronavirus (SARS–CoV). It was first reported in Asia in February 2003. Over the following few months, the disease spread to more than two dozen countries in North America, South America, Europe and Asia before the SARS global outbreak of 2003 was contained. According to the World Health Organization’s (WHO) concluding report on 21 April 2004 (see http://www.who.int/csr/sars/country/table2004_04_21/en/index.html), this major outbreak consisted of 8096 known cases with 774 deaths (a mortality rate of 9.6%). Hong Kong was the most seriously affected area in terms of population size (6.8 million) and land area (1100 km2). There were 1755 cases in total (1 case per 4000 population with 977 females and 778 males) with 299 deaths (a mortality rate of 17%). The source of infection was traced back to an infected Guangzhou professor who arrived in Hong Kong in late February 2003. During his stay in a hotel, the infection was spread to other guests and visitors, subsequently triggering off a chain of outbreaks in Hong Kong, Singapore, Canada and Vietnam. A detailed history and public health experiences can be found in Hong Kong’s SARS Expert Committee Report (2003) and Cheng (2003). The official announcement of the SARS epidemic was on 8 March 2003. The last case was reported on 2 June 2003. After an observation period of 20 days, Hong Kong was removed from the WHO’s list of areas with recent local transmission of SARS on 23 June 2003. The WHO’s research concluded that SARS was spread by close person-to-person contact. SARS–CoV is thought to be transmitted most readily by respiratory droplets of infectious persons when droplets from the cough or sneeze of the infectious persons are propelled a short distance (generally up to 3 ft) through the air and deposited on the mucous membranes of the mouth, nose, or eyes of persons who are nearby. The virus also can be transmitted when a person touches a surface or object contaminated with infectious droplets and then touches his/her mouth, nose, or eye(s). In addition, it is possible that the SARS–CoV might spread more broadly through the air (airborne spread) or by other ways that are not known. For infectious diseases, efficient prevention before major outbreak and treatment during outbreak are primary ways of disease control. However, community- or population-based health protection should be another important way to stop the spread of such infectious diseases but has not been addressed seriously before. During the SARS epidemic, Hong Kong’s Department of Health introduced a large-scale health awareness program to publicize the importance of personal and environmental hygiene. The actions included wearing face masks in public by everyone, washing hands regularly (especially before touching mouth and eyes and having food). Official actions included cleaning surfaces in public areas such as handles and buttons with disinfectants regularly, and temperature checks at airports and other immigration ports. An educational campaign for personal and environmental hygiene was widely promoted by TV advertisements and posters in public areas (Leisure Cultural Services Department, 2003, Housing Authority, 2003, Leung et al., 2003). Such public health measures showed great success in controlling the disease. Some measures in public areas have been kept in force until now. Therefore, the data provide us an opportunity to model the effect of strict hygiene related factors on respiratory illnesses. For the purpose of public health, it is very important and interesting to see how effective these public health measures are. By developing a method to handle data with sudden change(s) in modeling, we can also assess the change of the number of illness before and after the SARS epidemic. The results can be used as an indirect support of the effectiveness of such efforts by governmental and community campaigns. The paper is organized as follows. Section 2 is a brief description of the data set. Model and estimation procedures are given in Section 3, followed by a statistical test to quantify the efficacy of community-based hygiene campaign. The results are given in Section 4, and Section 5 concludes the paper.

Data set

Hong Kong became a Special Administrative Region of the People’s Republic of China on July 1, 1997, after a century and a half of British administration. It is located in the south of Mainland China with a population of 6.8 million and an area of 1103 km2 covering Hong Kong Island and the more rural New Territories. Its climate is sub-tropical with temperatures between 10 °C in winter and 33 °C in summer. Air pollution data were obtained from the Environmental Protection Department of HKSAR (www.epd.gov.hk). These include the daily average level of sulfur dioxide (SO2, μgm−3), respirable suspended particulates (RSP μgm−3), nitrogen oxides (NOx, μgm−3), nitrogen dioxide (NO2, μgm−3), ozone (OZ, μgm−3). Weather data were obtained from Hong Kong Observatory, including temperature (Temp, degree Celsius) and relative humidity (Humi). The same variables were used by Xia et al. (2002) and Shao et al. (2010). Daily admission figures to public hospitals in Hong Kong due to respiratory illnesses were obtained from the Hospital Authority of HKSAR. The cases included all respiratory illnesses. These figures can be regarded as the daily number of illnesses due to respiratory diseases. Although the Air Pollution Index is usually reported to the public, we do not use it in this study as it is secondary data derived from the highest indices of several key pollutants. Daily data over six years were collected for this study, from 01/01/2000 to 31/12/2005. The numbers of respiratory illnesses are plotted in Fig. 1, Fig. 2 , from which a sudden drop in the number of respiratory illnesses can be seen in early 2003 during the outbreak of SARS. The data was divided into two parts: Pre-SARS period refers to the data before 8 March 2003 and post-SARS after 23 June 2003 (inclusive). We do not use the data during the SARS epidemic as it is a transition period for the change. The summary statistics for individual variables are given in Table 1 and box plots are given in Fig. 3 . We can see that the temperature and humidity are very similar during the pre- and post-SARS period. However, SO2, RSP and Ozone increase for all regions while NOs and NO2 are relatively stable. In contrast, the number of illnesses decreases.
Fig. 1

Map of Hong Kong.

Fig. 2

Time series plot of the number of respiratory illnesses for Hong Kong. The trend lines (thick line) are provided by a smoothing spline. The two dotted vertical lines indicate the start and end of the SARS period.

Table 1

Summary statistics for individual variables.

CasesSO2NOxNO2RSPOzoneTempHumi
Before SARS
Minimum1222.8423.0415.6513.983.107.2541.25
1st quartile1939.7678.4744.7931.4819.1919.1273.38
Median21814.29101.4055.2345.2628.2523.8279.25
Mean22016.95115.8057.6351.0431.8022.9678.01
3rd quartile24620.93139.2067.7165.6941.9427.3284.50
Maximum350106.10441.90162.10172.6099.6330.4897.25
STD38.9111.1453.4919.3224.8915.705.039.72

After SARS
Minimum22.8826.9517.6613.425.568.6019.00
1st quartile16612.6874.3042.0131.3218.6919.4671.88
Median20618.3598.7055.6352.8631.5625.0878.50
Mean20722.60107.8058.2358.0735.4523.5476.92
3rd quartile23826.22127.0071.8078.7148.3727.9484.00
Maximum413140.50419.00160.90207.30123.5031.90100.00
STD64.1116.4347.6322.1131.3219.895.2311.21
Fig. 3

Box-plot to compare daily values of variables before and after the SARS epidemic. Note: For each item on the horizontal axis, the two box-plots refer to values before and after SARS.

Map of Hong Kong. Time series plot of the number of respiratory illnesses for Hong Kong. The trend lines (thick line) are provided by a smoothing spline. The two dotted vertical lines indicate the start and end of the SARS period. Summary statistics for individual variables. Box-plot to compare daily values of variables before and after the SARS epidemic. Note: For each item on the horizontal axis, the two box-plots refer to values before and after SARS. Both Xia et al. (2002) and Shao et al. (2010) considered the weekly pattern of the respiratory illnesses. It is possible that the weekly pattern in the number of the respiratory illnesses confounds with similar patterns in air pollution. To check if it is the case, the box-plots of all the variables for each day of the week is given in Fig. 4 , from which it can be seen that only the number of admitted patients shows some weekly pattern, while all the air pollutants/weather variables show no weekly pattern (in fact, they remain quite constant over the day of the week). Therefore, the weekly pattern in the number of the respiratory illnesses could be modeled together with the pollutants/weather variables.
Fig. 4

Box-plot to compare daily values of variables for each day of the week.

Box-plot to compare daily values of variables for each day of the week.

Methods

Let be our response variable, which is the number of daily admission to a regional hospital due to respiratory illnesses, and be the row-vector of covariates (with possible lags) affecting the response variable. The potential covariates in our study include (SO2, NOx, RSP, OZ, Temp, Humi). Let ( March 2003) denote the full outbreak of the SARS epidemic and (23 June 2003) the end of the SARS epidemic. In order to evaluate the effectiveness of public health measures and air quality on the reduction of respiratory illnesses, with consideration of weekly effect, we consider the following candidate models: with The individual terms in the model are considered as follows. (i) is a zero-mean, constant variance error term. (ii) measures the impact of public health measures with and being real values representing the lower and upper asymptotes, respectively, being a positive value representing the growth rate and being a real value representing the time of maximum growth. That is, the effect of public health measures as measured by the shift in the total number of illnesses. By noting the magnitude of impact decreases eventually over time due to many factors including behavioral and memory dynamics, the growth curves are widely used in psychological research to model the changes over time (Mcardle and Nesselroade, 2002). The logistic growth curve is a special case of the generalized Richard growth curve (Richard, 1959). A negative value of indicates a reduction of the number of illnesses, meaning an effective impact of public health measures. A positive value of means that the impact of public health measures will fade and vanish over time. We do not use the full Richard curve because it involves complicated computing due to a limit or embedded cases (Shao, 2002) and the special case here is good enough to fit the data. Bear in mind that we do not use the data during the SARS epidemic as it is highly unstable. (iii) Due to the hospital system in Hong Kong where only emergency departments are in normal service during weekends, the effect of the day-of-the-week is indicated in previous studies (Xia et al., 2002, Xia and Tong, 2006). Here the dummy variables are defined as with day 1 to 6 referring to Saturday to Thursday respectively and Friday as the reference day. The second term measures the effect of the day-of-the-week which is also important (Dominici et al., 2000, Forster De and Solomon, 2003, Bruckmann and Wichmann-Fiebig, 1997, Xia and Tong, 2006). (iv) The effect of environmental and weather factors on the illnesses is characterized by a Multiple Index Model defined as with and almost surely. Here is an unknown smooth function and is the coefficient matrix with being the th direction or components, denotes the transpose of a vector or matrix. Without loss of generality, the standardized version of is used in the model, i.e. and . To ensure the uniqueness of the coefficient matrix, one should impose some identification condition on . Usually it is assumed that , that is, is an orthogonal matrix. Model estimation Xia et al. (2002) used the minimum average variance estimation technique (MAVE) to estimate in the multiple index model and then cross validation (CV) to determine . A brief outline of the method is given below. For a fixed , the minimum average variance estimation is to find by with the constraint that . The name MAVE is given in light of the following equation where the last term is the conditional variance with a given . There are two conditional expectations we need to deal with. Assume that there are observations with corresponding covariates . For any given , the inner expectation at can be approximated by a local linear expansion of . That is where and evaluated at . Note that the outer conditional expectation is in fact the conditional variance with a given . Following the idea of local linear smooth estimation, we can estimate by exploiting the approximation where are some weights with . Usually kernel smoothing is employed by defining the weights as where with being the dimension of kernel function and the bandwidth. On the basis of the above approximation of conditional expectations, the MAVE estimate of the coefficient matrix is obtained by solving the minimization where is a row vector. Note that the constraint makes the above minimization more difficult. Furthermore, the weights depend on the choice of coefficient matrix . We can use an iteration procedure to find the solution. A suggested stop rule for is and , where is a predetermined tolerance value. To determine the number of efficient dimensions, cross validation is used by Xia et al. (2002) as where with . The with the smallest value is the estimated dimension. Computing issues Note that the conditional expectation in (7), (8) does not involve the time of observation. We can apply the algorithm developed by Xia et al. (2002) for the multiple index model but with more steps for the estimation of other parameters. The algorithm has the following steps. Step 1. Initialize the estimate of the coefficient in the multiple Index part as . Step 2. Initialize the parameter estimates of and as and . Good estimates can be the least squares estimates of the model . Step 3. In the th iteration , compute and update the coefficient of the multiple index part by MAVE with model . The updated estimate is denoted as . Step 4. Compute and update the parameter estimates of and by the least squares method. The updated estimate is denoted as and . Step 5. Check the differences between the current estimates and previous estimates. If the maximum difference is less than a predetermined tolerance value (which is 0.001 in this study), then stop the iteration. Otherwise, return to Step 3. Based on the above five steps, estimation for the parameters and the unknown function in model (1) (denoted as and respectively) can be obtained. To construct the confidence bounds of each parameter in the growth curve and weekly pattern, we employ the bootstrap method (see, for example Efron and Tibshirani, 1993) as below. Step I. Randomly resample the model residuals , where with replacement to form a new residual series . Note that the superscript ‘*’ denotes something calculated from bootstrap resampling. Step II. Add to the fitted model output to form the new values . Because the new residual series is generated with replacement, any model residual can be sampled more than one time or not sampled at all. Step III. Calibrate the bootstrap sample to obtain a bootstrap estimator of parameter vector . Step IV. Repeat the bootstrap sampling for times. In this study the resampling is repeated 500 times. Since the bootstrapping produces a large number of estimates for each parameter (say for example), the bootstrap confidence interval (or percentile interval) for can be estimated by simply taking quantiles from empirical marginal distribution . To do this, we first obtain the ordered bootstrap estimates derived from the bootstrap resampling method. The two-sided confidence interval at level is then given by . Note that unlike standard asymptotic confidence intervals, percentile intervals obtained from bootstrapping will not generally be symmetric around the parameter estimate.

Results

The cumulative effects of pollutants and weather factors on the respiratory illnesses have been recognized by many researchers (Schwartz, 2000, Zeger et al., 1999, Dominici et al., 2003, Xia et al., 2002, Xia and Tong, 2006, Shao et al., 2010). Relevant investigations on the situation were conducted by (Xia et al., 2002, Xia and Tong, 2006, Shao et al., 2010). In our data analysis, the ‘weekend-effect’ and the change of hygiene habits, together with the environmental effect, are modeled by (3). Log-transformation is applied to the number of respiratory cases before model fitting. The cumulative effects are incorporated into the model by using lagged variables. For ease of implementation and interpretation, we use the same lags for all variables in our model (Xia et al., 2002, Shao et al., 2010). That is, for each time point is the collection of all the environmental and weather factors with lag , where with being observed value of the th variable at time and is an integer to be estimated. The long-term effect of changing hygiene habits is measured by the expected amount of change (jump) on the number of admissions after SARS epidemic. For a given lag , the efficient dimension and the optimal bandwidth are determined by the cross validation criterion given in Eq. (10). We did all computations in Matlab (see http://www.mathworks.com/products/matlab/). The optimal results have lag and , which are the same as those given before (Xia et al., 2002, Shao et al., 2010). The parameter estimates for the SARS effect (modeled by a growth curve) and the weekly effect are given in Table 2 . The optimal bandwidth is . It can be seen from Table 2 that all the parameters in the growth curve part are statistically significant at the level of 0.05. The expected difference of the number of respiratory cases (after log-transformation) due to the change of public health measures is estimated as (the second term in the growth curve is very small in the beginning of the public health measures), which means that the number of total respiratory cases after the SARS epidemic reduces 39.21% in comparison with the expected number of respiratory cases (original data) before the SARS epidemic. The shape and 95% confidence limits of the growth curve which model the effect of SARS over time is given in Fig. 5 which clearly shows the significance of the public health measure due to the SARS outbreak on the number of administration. However, the impact eventually fades over time as modeled by the logistic growth curve. The impact is obvious as nearly all the parameter estimates are significantly different from zero at 95% of the confidence level (Table 2).
Table 2

Parameter estimates and 95% confidence bounds.

Parameterξ1ξ2ξ3ξ4α1α2α3α4α5α6
Estimate−0.4980.0270.565252.25−0.109−0.1210.0740.0180.0070.046
2.5% bound−0.5240.0210.524203.25−0.159−0.1670.023−0.031−0.0460.002
97.5% bound−0.4600.0340.593327.08−0.055−0.0610.1320.0750.0630.106
Fig. 5

The shape (solid line) and 95% confidence limits (dotted lines) of the growth curve which model the effect of SARS over time.

Parameter estimates and 95% confidence bounds. The shape (solid line) and 95% confidence limits (dotted lines) of the growth curve which model the effect of SARS over time. It can also be seen from Table 2 that the weekly effect is statistically significant, with an obvious lower number of administration during Saturday and Sunday but higher on Monday. The cross-validation . The root mean squared error between observed and fitted values is , which translates to the root mean squared error for the original number of the cases. Therefore, the model works reasonably well to capture the change of the number of hospital administrations. The residuals are plotted in Fig. 6 . This reduction in the number of respiratory diseases is not an accidental event. It has come from a combination of better public awareness and the efforts of the HKSAR government.
Fig. 6

Time series plots of the fitted errors (difference between observed and fitted number of cases) for models with (line) and without change (dots). The results are separated by before (top panel) and after (bottom panel) SARS.

Time series plots of the fitted errors (difference between observed and fitted number of cases) for models with (line) and without change (dots). The results are separated by before (top panel) and after (bottom panel) SARS. A by-product of our model is the assessment of the weekend-effect and the impact of different factors on the number of admitted patients. The change of ratios of the total respiratory cases for each day of the week to that for Friday is given in Table 2. We can see that the number of reported cases on Saturday is less than Friday and decreases a little further on Sunday, before a big increase on Monday. Tuesday and Wednesday are only slightly more than Friday before more cases on Thursday. Friday has the smallest reported cases in comparison with other weekdays, perhaps again due to the weekend effect. We do not tend to study the detailed holiday effect any further here. The impact of different factors on the number of admitted patients can be assessed by the coefficients of the index, which are given in Table 3 . We pay special attention to the large coefficients associated with the factors. Following (Xia et al., 2002, Shao et al., 2010), we regard coefficients with absolute value greater than 0.1 as significant coefficients. With this rationale, a simple way to interpret the results is to look for coefficients larger than 0.1. Let us call these coefficients as large coefficients. Bear in mind that the first dimension accounts the most contribution to the model. We can see that SO2, temperature and humidity are important factors for diseases, compared with NO2 and OZ which have moderate contributions. The RSP and NOx are less important to the respiratory diseases in Hong Kong. The second dimension is set to model the variance which is not included in the first dimension. We can see that for the second dimension, SO2, NOx and Humidity are the major factors and RSP still does not show its importance. For the third dimension, all the factors show similar importance except humidity which has no coefficient larger than 0.1.
Table 3

Coefficients of the first efficient dimension reduction direction with and . The coefficients of greater than 0.1 are highlighted by bold font.

Lag01234567
Dimension one
SO20.181980.135210.070440.086800.103180.146470.081440.21273
RSP−0.12792−0.04405−0.074150.168260.05507−0.016550.06469−0.02638
NOx−0.02260−0.077160.006830.00786−0.076360.13606−0.03518−0.03128
NO2−0.09867−0.03139−0.075280.025170.10034−0.084790.158330.24851
Ozone0.168840.107480.063740.15246−0.02362−0.058290.06656−0.05566
Temp0.43964−0.013960.101790.147530.256980.03630−0.082130.32950
Humi0.216260.077750.133710.055450.102760.125520.177450.13737

Dimension two
SO20.03920−0.012340.140830.162700.141570.107550.13330−0.09973
RSP0.01641−0.002040.057540.041970.125980.077260.065600.02284
NOx0.195830.138060.278090.265010.197210.163920.216500.14070
NO20.22760−0.080430.22063−0.066290.244120.146170.16537−0.06376
Ozone0.04157−0.02062−0.011930.021080.169670.122760.00378−0.01231
Temp0.26097−0.010780.071240.263210.05637−0.055660.055630.04100
Humi−0.087680.139890.131020.114940.120310.10616−0.040130.11437

Dimension three
SO20.018690.091620.145650.143090.120040.07965−0.012650.12548
RSP0.090270.158870.062330.228670.099410.081860.01825−0.00017
CO−0.016850.06540−0.08793−0.012260.113790.182840.101500.13091
NO20.202830.25812−0.066010.330880.173580.214850.102280.14089
Ozone−0.013320.058160.02803−0.015510.140010.153300.034930.19790
Temp0.063530.128610.040300.20063−0.034440.352660.171070.27707
Humi0.015860.043630.019290.032010.044320.02327−0.033520.05263
Coefficients of the first efficient dimension reduction direction with and . The coefficients of greater than 0.1 are highlighted by bold font. As a comparison, we report here the best model for (i.e., the change of hygiene habits does not affect the respiratory cases), with and . The root mean squared error between observed and fitted values is , which translates to the root mean squared error for the original number of the cases. To see the difference between the models with and without change, we provide the fitted errors (differences between the observed and fitted numbers of cases) in time series plots in Fig. 6 and summary statistics in Table 3, Table 4 . We can see visually from Fig. 6 that the model without change tends to under-fit the observed numbers for the data before SARS and over-fit the observed numbers for the data after SARS. This can be confirmed by the summary statistics in Table 3. The model with change has smaller residuals with smaller standard deviation for the overall data, the data before SARS and the data after SARS. For the model with change, the summary statistics do not show any trend when the data are splitinto two part (before and after SARS). However, for the model without change, both the mean and median prediction errors are positive for the data before SARS but are negative for the data after SARS, reassuring the importance of incorporating change in the model in order to capture the change after the SARS outbreak.
Table 4

Summary statistics of the fitted errors (difference between observed and fitted number of cases) for models with and without change. The model is fitted using log-transformed data.

Overall
Before SARS
After SARS
WithWithoutWithWithoutWithWithout
Minimum−0.950−1.034−0.489−0.447−0.950−1.034
1st quartile−0.101−0.098−0.083−0.054−0.113−0.164
Median0.0010.004−0.0030.0240.001−0.022
Mean0.0000.0000.0000.0290.000−0.036
3rd quartile0.0990.1040.0880.1110.1200.092
Maximum0.6260.5730.4300.5520.6260.573
STD0.1680.1800.1400.1420.1990.213
Summary statistics of the fitted errors (difference between observed and fitted number of cases) for models with and without change. The model is fitted using log-transformed data.

Conclusion and discussion

In this paper, we develop a model to assess the effect of ambient air quality and weather conditions on respiratory diseases with consideration of the impact of the change of hygiene related factors by using the multiple index model with change, and model the change after the introduction of hygiene related measures (after the SARS outbreak in the case of Hong Kong data) by a growth curve. The model consists of three additive parts: the nonparametric multiple index model part takes account of the impact of pollutants and climatic factors, a growth curve after the SARS outbreak models the impact due to public health measures and a categorical variable models the weekly effect. The data from Hong Kong are used for this study as the 2003 SARS epidemic initiated a change of public health measures in the community due to government effort and public awareness. In addition, another possible driving force for maintaining individual hygiene habits during the epidemic was the power of the media. During the time of the SARS epidemic, all media (including newspapers, radio and TV) reported daily death cases due to SARS. This was probably a most effective way to alert people of the dreadful disease and the importance of having good hygienic habits. It was well observed that during the SARS period the people of Hong Kong followed the advice of the government on hygienic measures closely, such as wearing face masks in the public, washing hands frequently, avoiding going to the public in case of having a flu, etc. The effects of pollutants and climatic factors on respiratory diseases are complicated due to the latent effect, nonlinearity and interactions between factors. The multiple index model is used to model such complicated relationship due to it flexibility and ability of handling a large number of dependent variables. The results provide indirect quantitative support to the governmental awareness program during the outbreak of the epidemic, and also the effectiveness of the media in disease control. Another important observation, however, is that the growth curve starts to increase sharply near the end of 2003 (see Fig. 5), and has flattened before mid-2004. This means that the number of illnesses had reverted to pre-SARS levels in about 6 months’ time. This in turn reflects the forgetfulness of Hong Kong people, or people in many large cities. Large city people are occupied with many attractions, when the immediate threat of an epidemic fades, their alertness will wane fast. The media’s interest in the epidemic is also short-lived if it does not continue to cause casualties. This suggests strongly that public health organizations and officers have to frequently remind people to keep up with their hygienic habits, so as to minimize the damage of infectious diseases. In terms of modeling practice, it is noted that the growth curve function becomes positive after the summer 2004 due to a short upward trend in the first half of year 2005. Although it is possible to restrict the growth curve function to the maximum of zero by letting , it is not a good exercise to implement such a restriction before clearer understanding of the short upward trend. Furthermore, our model setting has no problem to include more parametric terms such as a linear function of time for long-term trends.
  12 in total

1.  Harvesting and long term exposure effects in the relation between air pollution and mortality.

Authors:  J Schwartz
Journal:  Am J Epidemiol       Date:  2000-03-01       Impact factor: 4.897

2.  Harvesting-resistant estimates of air pollution effects on mortality.

Authors:  S L Zeger; F Dominici; J Samet
Journal:  Epidemiology       Date:  1999-03       Impact factor: 4.822

3.  Airborne particulate matter and mortality: timescale effects in four US cities.

Authors:  Francesca Dominici; Aidan McDermott; Scott L Zeger; Jonathan M Samet
Journal:  Am J Epidemiol       Date:  2003-06-15       Impact factor: 4.897

4.  Observations of a "weekend effect" in diurnal temperature range.

Authors:  Piers M de F Forster; Susan Solomon
Journal:  Proc Natl Acad Sci U S A       Date:  2003-09-18       Impact factor: 11.205

Review 5.  Key airborne pollutants--the impact on health.

Authors:  Robert Maynard
Journal:  Sci Total Environ       Date:  2004-12-01       Impact factor: 7.963

6.  Cumulative effects of air pollution on public health.

Authors:  Yingcun Xia; Howell Tong
Journal:  Stat Med       Date:  2006-10-30       Impact factor: 2.373

7.  Air pollution and health in Scotland: a multicity study.

Authors:  Duncan Lee; Claire Ferguson; Richard Mitchell
Journal:  Biostatistics       Date:  2009-04-17       Impact factor: 5.899

8.  Short term effects of air pollution on health: a European approach using epidemiologic time series data: the APHEA protocol.

Authors:  K Katsouyanni; J Schwartz; C Spix; G Touloumi; D Zmirou; A Zanobetti; B Wojtyniak; J M Vonk; A Tobias; A Pönkä; S Medina; L Bachárová; H R Anderson
Journal:  J Epidemiol Community Health       Date:  1996-04       Impact factor: 3.710

9.  Exposure-response functions for health effects of ambient air pollution applicable for China -- a meta-analysis.

Authors:  Kristin Aunan; Xiao-Chuan Pan
Journal:  Sci Total Environ       Date:  2004-08-15       Impact factor: 7.963

10.  The impact of community psychological responses on outbreak control for severe acute respiratory syndrome in Hong Kong.

Authors:  G M Leung; T-H Lam; L-M Ho; S-Y Ho; B H Y Chan; I O L Wong; A J Hedley
Journal:  J Epidemiol Community Health       Date:  2003-11       Impact factor: 3.710

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.