Literature DB >> 35639562

Assessing seasonality and the role of its potential drivers in environmental epidemiology: a tutorial.

Lina Madaniyazi^1,2, Aurelio Tobias^1,3, Yoonhee Kim⁴, Yeonseung Chung⁵, Ben Armstrong⁶, Masahiro Hashizume^2,7.

Abstract

Several methods have been used to assess the seasonality of health outcomes in epidemiological studies. However, little information is available on the methods to study the changes in seasonality before and after adjusting for environmental or other known seasonally varying factors. Such investigations will help us understand the role of these factors in seasonal variation in health outcomes and further identify currently unknown or unmeasured risk factors. This tutorial illustrates a statistical procedure for examining the seasonality of health outcomes and their changes, after adjusting for potential environmental drivers by assessing and comparing shape, timings and size. We recommend a three-step procedure, each carried out and compared before and after adjustment: (i) inspecting the fitted seasonal curve to determine the broad shape of seasonality; (ii) identifying the peak and trough of seasonality to determine the timings of seasonality; and (iii) estimating the peak-to-trough ratio and attributable fraction to measure the size of seasonality. Reporting changes in these features on adjusting for potential drivers allows readers to understand their role in seasonality and the nature of any residual seasonal pattern. Furthermore, the proposed approach can be extended to other health outcomes and environmental drivers.

Entities: Chemical

Keywords: Seasonality; attributable fraction; mortality; peak to trough; temperature; time-series

Mesh：

Year: 2022 PMID： 35639562 PMCID： PMC9557844 DOI： 10.1093/ije/dyac115

Source DB: PubMed Journal: Int J Epidemiol ISSN： 0300-5771 Impact factor: 9.685

Examining the seasonality of a health outcome before and after adjusting for the potential environmental drivers may provide an insight into the aetiology of the disease. We propose a set of statistical procedures to summarize and compare the seasonality of a health outcome in three features: shape, timings and size, before and after the adjustment. We recommend two indicators to measure the size of seasonality: peak-to-trough ratio and attributable fraction to estimate the amplitude and impact of seasonality, respectively. Considering the extent and the peak of residual seasonality can also help identify further environmental or behavioural risk factors.

Introduction

Seasonal variation in health outcomes (hereafter, seasonality) has been largely recognized. Seasonality is widely caused by various environmental factors, such as weather and air pollution, and others such as holidays. Several methods have been used to describe and assess the seasonality in epidemiological studies, including statistical tests and graphical methods. In recent environmental epidemiology studies, seasonality is usually considered as one of the main confounders when examining the health impact of environmental factors. However few studies have focused on seasonality, and little attention has been paid to studying seasonality changes in the health outcome after adjusting for environmental or other known seasonally varying factors. Such investigation is of interest because it provides an insight into the aetiology of the disease. The changes in seasonality after adjustment will help us understand these factors' contributions to seasonality. In addition, the periodic and generally regular patterns in the residual time series may provide clues as to the presence and importance of other currently unknown or unmeasured causes, such as human behaviour. We aim to introduce a set of statistical procedures to assess the seasonal variation of a health outcome and its changes after accounting for the potential environmental drivers. Throughout, concepts and methods will be illustrated through an example dataset for investigating the change in seasonality of all-cause mortality before and after adjusting for the short-term and direct effect of temperature in London. We used temperature as an example for the environmental driver because of its well-documented associations with mortality and the easy accessibility of the data.

Example dataset

We collected daily counts of all-cause mortality and daily mean temperature in London between 1993 and 2006. Here, the number of all-cause deaths shows a repeating seasonal pattern that appears approximately sinusoidal, with an increase in cold seasons followed by a decrease in warm seasons (Figure 1 and Table 1). This dataset has been previously used as an example elsewhere.,

Figure 1

Daily time-series of all-cause mortality and mean temperature in London from 1993 to 2006

Table 1

Descriptive summary of all-cause mortality and ambient temperature by season, London 1993–2006 [mean (Standard Deviation, SD)]

Variable (daily)	Whole year	Season
		Winter	Spring	Summer	Autumn
		(Dec–Feb)	(Mar–May)	(Jun–Aug)	(Sep–Nov)
All-cause mortality (cases)	165.3 (29.2)	190.9 (34.3)	163.3 (20.0)	149.8 (19.5)	157.7 (22.6)
Mean temperature (ºC)	11.7 (5.5)	6.0 (3.1)	10.5 (3.7)	18.0 (3.0)	12.2 (4.0)

Daily time-series of all-cause mortality and mean temperature in London from 1993 to 2006 Descriptive summary of all-cause mortality and ambient temperature by season, London 1993–2006 [mean (Standard Deviation, SD)] All the statistical analyses were conducted using R, version 3.6.3. The example dataset, along with the R code to reproduce the analyses, are available as Supplementary Material at IJE online.

Assessing seasonality

Time series regression with a cyclic spline

A wide variety of methods can be applied to assess seasonality in a health outcome, such as regression models with indicator variables for the month, cosinor models (a single sine+cosine pair with annual periodicity) and extending this by adding harmonics, often called Fourier functions. Recently, several studies have applied a cyclic spline function to model seasonality on a daily basis., A cyclic spline function is a smoothing method to estimate periodic variation such as daily or annual pattern of time-series observations. Basically, it is a periodic piece-wise cubic function with continuity up to the second derivative, so the function of day-of-year changes continuously at the end of the year. Here, we illustrate the modelling approach using a time-series regression with a cyclic spline function to assess seasonality by following our previous work: without temperature adjustment: with temperature adjustment: where: is mortality on day t assumed to follow a Poisson distribution with overdispersion (i.e. quasi-Poisson). is the day of year on day t ranging from 1 to 366 to model seasonality. We used a cyclic spline (cs) with 4 degrees of freedom (df) for the day of year. is the strata defined by year, day of week and their interaction to control long-term trend and the effect of the day of week. is a vector obtained using a cross-basis function of daily mean temperature; l is the lag days. For the cross-basis function, a natural cubic B-spline basis with three internal knots at the 25th, 50th and 75th percentiles of temperature distribution is used for the exposure-response association, and another natural cubic B-spline basis with 3 df with extended lag up to 21 days is used for the lag-response association. We assessed the seasonality before and after temperature adjustment separately.

Summarizing and comparing seasonality

The key features of seasonality include its shape, timings (peak and trough) and size (amplitude and impact) (Figure 2, panel a). We can summarize and compare the seasonality by each key feature before and after the adjustment (Figure 2, panel b).

Figure 2

Key features for summarizing and comparing seasonality. PTR, peak-to-trough ratio; AF, attributable fraction; RR (r. △PTR (change in PTR after adjustment) =; △AF (change in AF after adjustment) =

Key features for summarizing and comparing seasonality. PTR, peak-to-trough ratio; AF, attributable fraction; RR (r. △PTR (change in PTR after adjustment) =; △AF (change in AF after adjustment) = The empirical confidence intervals (eCIs) for timings and impact can be obtained through Monte Carlo simulations., In brief, random samples are taken from the original parameters of the cyclic spline function, which are assumed to follow a multivariate normal distribution with their point estimates and variance matrix derived from the regression model. The timings and impact are computed from these samples, which empirically reconstruct their distributions. The related 2.5th and 97.5th percentiles of these distributions are interpreted as 95% eCIs. The results from our example are presented in Table 2 and Figure 3 and discussed in detail below.

Table 2

Seasonality assessment of all-cause mortality before and after temperature adjustment

Temperature adjustment	Shape	Timings (day-of-year)		Size
		(95% empirical confidence interval)		Size
		Peak	Trough	Peak-to-trough ratio	Attributable fraction (%)
		Peak	Trough	(95% confidence interval)	(95% empirical confidence interval)
Unadjusted	High mortality in cold seasons and low mortality in warm seasons	9 (8, 10)	250 (244, 255)	1.34 (1.32, 1.35)	10.6 (10.1, 11.1)
Adjusted	High mortality in cold seasons and low mortality in warm seasons; a smaller amplitude	1 (362, 3)	249 (101, 257)	1.14 (1.10, 1.17)	4.1 (3.8, 5.9)

Figure 3

The day of year with maximum and minimum mortality estimates is identified as the peak (triangle) and trough (circle) day, respectively, of the seasonality of mortality. Monte Carlo simulation was used to obtain empirical confidence intervals for peak and trough days

Seasonality of all-cause mortality, and its peak and trough days before (solid) and after (dashed) temperature adjustment. The seasonality is assessed using a time-series regression model with a cyclic spline function with 4 degrees of freedom. The relative risk (RR) is the ratio of mortality estimates on the day of year x to daily minimum mortality estimates at the trough day with 95% confidence intervals (95% CI): The day of year with maximum and minimum mortality estimates is identified as the peak (triangle) and trough (circle) day, respectively, of the seasonality of mortality. Monte Carlo simulation was used to obtain empirical confidence intervals for peak and trough days Seasonality assessment of all-cause mortality before and after temperature adjustment

Shape: the fitted seasonal curve

The shape of seasonality can be estimated through the fitted seasonal curve from the regression model described above, and compared by visual inspection before and after adjusting for the environmental driver. Figure 3 shows the seasonal pattern for all-cause mortality in London with a unimodal shape: a sharp peak in winter and a more extended trough in warmer months. After adjusting for temperature, the seasonal pattern remained similar with reduced amplitude.

Timings: peak and trough

The day-of-year maximum and minimum mortality estimates were identified from the fitted seasonal curve as the peak and trough, respectively. The eCIs for peak and trough were obtained through Monte Carlo simulations., In our example (Table 2), the peak and trough estimates were observed at Days 9 (95% eCI = 8, 10) and 250 (95% eCI = 244, 255), respectively. After adjusting for temperature, the peak and trough days moved forward to Day 1 (95% eCI = 362, 3) and Day 249 (95% eCI = 101, 257). It should be noted that the uncertainty of the trough is higher after temperature adjustment.

Size: amplitude and impact

We propose to summarize the size of seasonality by measuring the amplitude and estimating its impact on the health outcome. The amplitude of seasonality can be measured as the ratio of the maximum mortality estimate at peak day to the minimum mortality estimate at trough day (i.e. peak-to-trough ratio, PTR), and its 95% CI can be obtained from the variance matrix of the estimated coefficients from the cyclic spline function. However, the PTR is not sensitive to seasonality shape and, more importantly, offers limited information on the impact of seasonality on mortality. In this context, the attributable fraction (AF) may help understand the public health burden of seasonality. A general definition of for a given exposure can be provided by , where refers to the association of the outcome with a specific exposure intensity compared with a reference value . Here, we obtained the overall AF as, where represents the day of year, ranging from 1 to 366, is the log relative risk of mortality on the day of year compared with the minimum mortality estimated at the trough, and is the percentage of cases on the day of year . The empirical CIs for were estimated through Monte Carlo simulations. Thus, the AF estimates the fraction by which mortality would be reduced in a counterfactual scenario where mortality risk never rose above its seasonal trough. Table 2 reports an estimated PTR of 1.34 (95% CI = 1.32, 1.35), which is substantially reduced after adjusting for temperature to 1.14 (95% CI = 1.10, 1.17). The estimated indicates that 10.6% of deaths (95% eCI = 10.1, 11.1) are attributable to seasonality within the study period. After adjusting for temperature, the decreases to 4.1% (95% eCI = 3.8, 5.9). The difference in the PTR and AF before and after the adjustment can be interpreted as the contribution of temperature to the size of the seasonality of mortality and can be calculated as and , where are the estimates of the size before and after the adjustment, and are their respective standard errors. Here, we assume and are independent, which may overestimate . Consequently, in our example, we observed an absolute reduction in the log(PTR) of 0.16 (95% CI = 0.14, 0.17), and an absolute reduction in the AF of 6.5% (95% eCI = 4.6, 8.5) We recommend reporting the changes in PTR and AF jointly since we may find a particular set of situations in which, for example, we do not observe a change in the size of seasonality through the PTR, but with different impact through the AF, before and after the adjustment (Figure 4, panel a). Likewise, when the timings of seasonality are displaced notably before and after the adjustment (Figure 4, panels b, c and d), the estimation of PTR and AF after adjustment will be based on different shapes and timings of seasonality. Therefore, the changes would require additional descriptions and careful interpretations. The peak and/or trough displacement must be reported to make the readers understand how seasonality changed after adjustment.

Figure 4

Example s of particular issues when assessing changes in the seasonal pattern before (solid) and after (dashed) adjusting for an environmental driver

Modelling choice

Alternative functions for seasonality

In our example, we have used a cyclic spline function to assess seasonality in daily mortality. Alternative models with different specifications can also be used: for example, a stationary cosinor, non-stationary cosinor, loess smoothing, Fourier function and conditional autoregression with the month as a random effect. Barnett and Dobson offer a thorough overview of these methods. The readers should select the appropriate model based on their data, research question and model fit. For instance, a cosinor is a single cosine/sine couple and can be considered as a special case of the Fourier function. A stationary cosinor is preferred for data covering a short period. A non-stationary cosinor is more appropriate for irregularly spaced data and will enable us to investigate the temporal changes of seasonality over a long period. This method can also be updated further to estimate the temporal changes. However, these two methods assume a sinusoidal seasonal pattern. Whereas loess smoothing is useful for fitting a non-sinusoidal seasonal pattern, it only estimates the mean but not confidence intervals. On the other hand, spline functions provide a flexible and efficient way to fit irregular and/or complex seasonal patterns, even for those irregularly spaced data with few parameters. Fourier functions have similar properties but may need more degrees of freedom (basis variables) than a cyclic spline to capture typical seasonal patterns. Model choice may be based on model fit criteria such as deviance, Akaike's information criterion (AIC) or tests for white noise in residuals. In addition to the function for seasonality, it is also desirable to consider model choices for the adjustment of environmental drivers, long-term trend and other specific factors.

Model checking

It is important to examine the models' residuals to check key assumptions of the regression models, including a scatter plot of residuals against the independent variable, the independence of residuals over time (e.g. autocorrelation) and the distribution of residuals.,

Sensitivity analysis

Since the modelling process involves many decisions, multiple sensitivity analyses are recommended to check the robustness of the main conclusions.,, A series of sensitivity analyses on df of the cyclic spline and temperature adjustment has been conducted in our previous study where the example dataset was included. In this tutorial, we compared the cyclic spline and a cosinor function (Supplementary Figure S1, available as Supplementary data at IJE online). Our sensitivity analysis showed a lower quasi-AIC for the model with a cyclic spline (Supplementary Table S1, available as Supplementary data at IJE online), indicating that the cyclic spline fits the data better than a cosinor function, each with the same degrees of freedom.

Discussion

This article outlines a set of statistical procedures for examining the seasonality of health outcomes and their changes after adjusting potential environmental drivers (Figure 5). We illustrate the procedure by modelling the seasonal curve using a time-series regression with cyclic splines and summarizing the seasonality in different aspects from the fitted curve. In particular, we recommend summarizing and comparing seasonality by shape, timings of peaks and troughs, and size (PTR and AF) before and after the adjustment.

Figure 5

Summary of key steps in quantifying seasonality changes. PTR, peak-to-trough ratio; AF, attributable fraction

Summary of key steps in quantifying seasonality changes. PTR, peak-to-trough ratio; AF, attributable fraction We believe that these procedures are applicable to a wide range of contexts, though some work would be needed for such generalizations, and there are some limitations. In particular, our example is tailored to a unimodal seasonal pattern of all-cause mortality and temperature. However, it can be extended to multimodal seasonality and other outcomes and environmental drivers, sometimes applying alternative modelling choices. For example, a recent study applied a quasi-Poisson regression model with a cyclic spline function of 3 df to investigate the impact of temperature on the bimodal seasonality of sports injuries in Madrid, Spain. Also, a previous study illustrated the contribution of climatic factors to the bimodal seasonal pattern of cholera incidence in Dhaka, Bangladesh, using a Poisson regression model with Fourier functions. As this cholera study suggests, the procedure described can more generally be applied to infectious diseases with some model variations., In addition, the proposed approach here can also be used to analyse data from multiple locations, and the differences in location-specific seasonality estimates can be explored further through a meta-analytical technique., An important issue that should be addressed carefully is the optimal adjustment of the environmental drivers. This can be particularly critical for infectious diseases, as they usually exhibit a complex association with environmental factors., In addition, although the cyclic spline function in our example is flexible for seasonality assessment, it is more mathematically complex and difficult to interpret than some alternatives, especially when the fitted curve is very wiggly. Therefore, the readers should critically assess the potential modelling alternatives and adapt the statistical procedure to their investigations.

Conclusion

In conclusion, the proposed framework covers key steps and important issues involved in seasonality assessment, and provides an opportunity to advance through general methodological steps for a further examination of the underlying drivers of seasonality.

Ethics approval

Not applicable.

Data availability

Example dataset and the R code for the analysis are available as Supplementary data at IJE online.

Supplementary data

Supplementary data are available at IJE online.

Author contributions

L.M. analysed the data and drafted the manuscript. A.T. designed the study and directed the study’s implementation. Y.K. helped to design the analytical strategy and to interpret the findings. Y.C. helped with data analysis and the revision of the manuscript. B.A. helped to analyse and interpret the data and revise the article critically. M.H. made a substantial contribution to the concept of the study and interpretation of data.

Funding

This work was primarily supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI (grant number 19K19461). A.T. was supported by the Japanese Society for the Promotion of Science (JSPS) Invitational Fellowships for Research in Japan (S18149). Y.C. was supported by a Senior Research grant (2019R1A2C1086194) from the National Research Foundation of Korea (NRF), funded by the Ministry of Science, ICT (Information and Communication Technologies). M.H. was supported by the Japan Science and Technology Agency (JST) as part of SICORP, grant number JPMJSC20E4.

Conflict of interest

None declared. Click here for additional data file.

20 in total

Review 1. Interaction revisited: the difference between two estimates.

Authors: Douglas G Altman; J Martin Bland
Journal: BMJ Date: 2003-01-25

2. Cholera in Bangladesh: climatic components of seasonal variation.

Authors: Masahiro Hashizume; Abu S G Faruque; Yukiko Wagatsuma; Taiichi Hayashi; Ben Armstrong
Journal: Epidemiology Date: 2010-09 Impact factor: 4.822

3. Time series regression model for infectious disease and weather.

Authors: Chisato Imai; Ben Armstrong; Zaid Chalabi; Punam Mangtani; Masahiro Hashizume
Journal: Environ Res Date: 2015-07-16 Impact factor: 6.498

4. Methods to assess seasonal effects in epidemiological studies of infectious diseases--exemplified by application to the occurrence of meningococcal disease.

Authors: C F Christiansen; L Pedersen; H T Sørensen; K J Rothman
Journal: Clin Microbiol Infect Date: 2012-07-20 Impact factor: 8.067

5. Seasonal variation in mortality and the role of temperature: a multi-country multi-city study.

Authors: Lina Madaniyazi; Ben Armstrong; Yeonseung Chung; Chris Fook Sheng Ng; Xerxes Seposo; Yoonhee Kim; Aurelio Tobias; Yuming Guo; Francesco Sera; Yasushi Honda; Antonio Gasparrini; Masahiro Hashizume
Journal: Int J Epidemiol Date: 2022-02-18 Impact factor: 7.196

6. Attributable risk from distributed lag models.

Authors: Antonio Gasparrini; Michela Leone
Journal: BMC Med Res Methodol Date: 2014-04-23 Impact factor: 4.615

7. Seasonality of suicide: a multi-country multi-community observational study.

Authors: J Yu; D Yang; Y Kim; M Hashizume; A Gasparrini; B Armstrong; Y Honda; A Tobias; F Sera; A M Vicedo-Cabrera; H Kim; C Íñiguez; E Lavigne; M S Ragettli; N Scovronick; F Acquaotta; B Chen; Y L Guo; M de Sousa Zanotti Stagliori Coelho; P Saldiva; A Zanobetti; J Schwartz; M L Bell; M Diaz; C De la Cruz Valencia; I Holobâcă; S Fratianni; Y Chung
Journal: Epidemiol Psychiatr Sci Date: 2020-08-24 Impact factor: 6.892

8. Seasonal dynamics of bacterial meningitis: a time-series analysis.

Authors: Juliette Paireau; Angelica Chen; Helene Broutin; Bryan Grenfell; Nicole E Basta
Journal: Lancet Glob Health Date: 2016-06 Impact factor: 26.763

9. Brief Report: Investigating Uncertainty in the Minimum Mortality Temperature: Methods and Application to 52 Spanish Cities.

Authors: Aurelio Tobías; Ben Armstrong; Antonio Gasparrini
Journal: Epidemiology Date: 2017-01 Impact factor: 4.822

10. Time series regression studies in environmental epidemiology.

Authors: Krishnan Bhaskaran; Antonio Gasparrini; Shakoor Hajat; Liam Smeeth; Ben Armstrong
Journal: Int J Epidemiol Date: 2013-06-12 Impact factor: 7.196