| Literature DB >> 23760528 |
Krishnan Bhaskaran1, Antonio Gasparrini, Shakoor Hajat, Liam Smeeth, Ben Armstrong.
Abstract
Time series regression studies have been widely used in environmental epidemiology, notably in investigating the short-term associations between exposures such as air pollution, weather variables or pollen, and health outcomes such as mortality, myocardial infarction or disease-specific hospital admissions. Typically, for both exposure and outcome, data are available at regular time intervals (e.g. daily pollution levels and daily mortality counts) and the aim is to explore short-term associations between them. In this article, we describe the general features of time series data, and we outline the analysis process, beginning with descriptive analysis, then focusing on issues in time series regression that differ from other regression methods: modelling short-term fluctuations in the presence of seasonal and long-term patterns, dealing with time varying confounding factors and modelling delayed ('lagged') associations between exposure and outcome. We finish with advice on model checking and sensitivity analysis, and some common extensions to the basic model.Entities:
Keywords: Time series; air pollution; environmental epidemiology
Mesh:
Substances:
Year: 2013 PMID: 23760528 PMCID: PMC3780998 DOI: 10.1093/ije/dyt092
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Example rows of time series data from the London dataset showing daily levels of environmental variables and daily number of deaths
| Date | Ozone (µg/m3) | Temperature (°C) | Relative humidity (%) | |
|---|---|---|---|---|
| 1 Jan 2002 | 4.59 | −0.2 | 75.7 | 199 |
| 2 Jan 2002 | 4.88 | 0.1 | 77.5 | 231 |
| 3 Jan 2002 | 4.71 | 0.9 | 81.3 | 210 |
| 4 Jan 2002 | 4.14 | 0.5 | 85.4 | 203 |
| 5 Jan 2002 | 2.01 | 4.3 | 93.5 | 224 |
| 6 Jan 2002 | 2.4 | 7.1 | 96.4 | 198 |
| 7 Jan 2002 | 4.08 | 5.2 | 93.5 | 180 |
| 8 Jan 2002 | 3.13 | 3.5 | 81.5 | 188 |
| 9 Jan 2002 | 2.05 | 3.2 | 88.3 | 168 |
| 10 Jan 2002 | 5.19 | 5.3 | 85.4 | 194 |
| 11 Jan 2002 | 3.59 | 3.0 | 92.6 | 223 |
| 12 Jan 2002 | 12.87 | 4.8 | 94.2 | 201 |
Figure 1Raw plots showing outcome (deaths) and exposure (ozone) data over time (London data)
Figure 2Three alternative ways of modelling long-term patterns in the data (seasonality and trends)
Figure 3Residual variation in daily deaths after ‘removing’ (i.e. modelling) season and long-term trend. Fitted values were from a spline model for season and long-term trend only (as illustrated in Fig 2c)
Figure 4Modelling lagged (delayed) associations between exposure and outcome. Asterisk indicates that the constraint applied was that the lagged associations for days 1 and 2 were the same, and for days 3–7 were the same