| Literature DB >> 35249590 |
Ryan B Simpson1, Alexandra V Kulinkina1,2,3, Elena N Naumova1.
Abstract
Foodborne and waterborne gastrointestinal infections and their associated outbreaks are preventable, yet still result in significant morbidity, mortality and revenue loss. Many enteric infections demonstrate seasonality, or annual systematic periodic fluctuations in incidence, associated with climatic and environmental factors. Public health professionals use statistical methods and time series models to describe, compare, explain and predict seasonal patterns. However, descriptions and estimates of seasonal features, such as peak timing, depend on how researchers define seasonality for research purposes and how they apply time series methods. In this review, we outline the advantages and limitations of common methods for estimating seasonal peak timing. We provide recommendations improving reporting requirements for disease surveillance systems. Greater attention to how seasonality is defined, modelled, interpreted and reported is necessary to promote reproducible research and strengthen proactive and targeted public health policies, intervention strategies and preparedness plans to dampen the intensity and impacts of seasonal illnesses.Entities:
Keywords: Foodborne infection; gastrointestinal infection; reproducibility; season; seasonality; time series analysis; waterborne infection
Mesh:
Year: 2022 PMID: 35249590 PMCID: PMC8915194 DOI: 10.1017/S0950268822000243
Source DB: PubMed Journal: Epidemiol Infect ISSN: 0950-2688 Impact factor: 2.451
Fig. 1.PRISMA flow diagram detailing the identification, screening, eligibility and inclusion of articles for our systematic review. Included studies (n = 220) were original research articles that detected and estimated the seasonality of human gastrointestinal infections using local, regional and national surveillance systems or hospital health records.
A summary of time series methods for describing, comparing and explaining the seasonality of the 14 most cited gastrointestinal infections from our review
| Pathogen | Total citations | Discrete seasons | Seasonal curves as continuous processes | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Two seasons | Four seasons | Monthly records | Average smoothers | Cubic splines | STL models | SARIMA models | Harmonic regressions | Spectral analyses | ||
| 66 | 5 | 25 | 22 | 1 | 2 | 12 | ||||
| 45 | 2 | 20 | 9 | 1 | 1 | 1 | 9 | 2 | ||
| Gastroenteritis | 38 | 1 | 15 | 15 | 1 | 2 | 4 | |||
| 20 | 2 | 7 | 4 | 1 | 2 | 4 | ||||
| 19 | 3 | 9 | 6 | |||||||
| 18 | 14 | 1 | 1 | 2 | ||||||
| 17 | 2 | 1 | 10 | 1 | 3 | |||||
| 11 | 7 | 3 | 1 | |||||||
| 11 | 7 | 2 | 2 | |||||||
| 10 | 1 | 3 | 3 | 1 | 2 | |||||
| 9 | 6 | 2 | 1 | |||||||
| 6 | 1 | 1 | 4 | 1 | ||||||
| Rotavirus | 6 | 2 | 1 | 3 | 1 | |||||
| 2 | 2 | |||||||||
| Total | 215 | 17 | 76 | 74 | 2 | 4 | 2 | 7 | 31 | 2 |
We ranked pathogens in descending order by total citations. We divided methods by comparisons of discrete seasons and the construction of seasonal curves. Discrete seasons methods included comparisons by two seasons, four seasons or calendar months. Seasonal curve methods included average smoothers, cubic splines, seasonal trend decomposition (STL), seasonal autoregressive integrated moving average (SARIMA) models, harmonic regression models and spectral analyses. Column and row totals are less than the sum of all rows and columns, respectively, as many publications investigated multiple pathogens and used multiple methodologies to describe, compare and explain seasonality features.
Fig. 2.An illustration of detecting seasonality and estimating seasonal peak timing using two discrete seasons. Scenarios include (a) when peak and nadir timing align with the centre of each season, as expected for incidence-based definitions of seasons; (b) when peak and nadir timing is shifted from the centre of each season; and (c) when peak timing aligns with the boundary between seasons and results in substantial misclassification bias.
Fig. 3.An illustration of detecting seasonality and estimating seasonal peak timing using four discrete seasons. Scenarios include (a) when peak timing is well aligned with the centre of a season; (b) when peak timing is shifted from the centre of an a priori assigned season; and (c) when peak timing aligns with the boundary between 2 seasons. Scenario (a) offers higher precision and accuracy as compared to scenarios (b) and (c).
Overview of the advantages (✓) and limitations (✗) of time series methods described in this systematic review
| 2 and 4 Seasons | Calendar months | Smoothers, splines, STL | SARIMA, harmonics | Spectral analyses | ||
|---|---|---|---|---|---|---|
| Minimal statistical training is required to calculate peak timing estimates | ✓ | ✓ | ✗ | ✗ | ✗ | ✗ |
| Definitions of seasons and peak timing estimates are easily comprehended by general audiences | ✓ | ✓ | ✗ | ✗ | ✗ | ✗ |
| Researchers do not need to aggregate surveillance data to perform time series analyses | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Statistical results are generalisable using standardised Gregorian calendar time units | ✗ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Researchers do not define seasons' lengths or reference seasons (i.e. data-driven analyses) | ✗ | ✗ | ✓ | ✓ | ✓ | ✓ |
| Time series methods adjust for irregularities and temporality of surveillance data | ✗ | ✗ | ✓ | ✓ | ✓ | ✓ |
| Time series methods do not require long time series to achieve sufficient sample size | ✗ | ✗ | ✓ | ✓ | ✓ | ✓ |
| Modelling techniques are flexible to adjust for other time-varying risk factors | ✗ | ✗ | ✓ | ✓ | ✓ | ✓ |
| Modelling techniques are flexible to adjust for dual peak or multi-peak outbreak behaviours | ✗ | ✗ | ✗ | ✓ | ✓ | ✓ |
| Modelling techniques adjust for harmonic seasonal curves using trigonometric functions | ✗ | ✗ | ✗ | ✓ | ✓ | ✓ |
| Modelling techniques are reliable at detecting seasonality using regression coefficients | ✗ | ✗ | ✗ | ✓ | ✓ | ✓ |
| Modelling techniques offer robust peak estimate in the presence of multiple seasonal peaks | ✗ | ✗ | ✗ | ✗ | ✓ | ✓ |
| Modelling techniques calculate seasonality features with measures of uncertainty | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ |
| Peak timing estimates can be easily compared across subpopulations, pathogens, locations, etc. | ✗ | ✗ | ✗ | ✗ | ✗ | ✓ |
Methods detect seasonality and estimate peak timing using discrete comparisons between seasons (two seasons or four seasons; calendar months) and constructed seasonal curves (smoothing, spline and STL methods; SARIMA, Fourier series transformations and harmonic models without applying the δ-methods; harmonic models with applying the δ-methods; and spectral analyses). We differentiate the advantages and limitations of harmonic models that do and do not apply the δ-methods to emphasise the importance of these methods in infectious disease seasonality research.
A summary of terminology for describing time series analyses conducted in infectious disease epidemiology research
| Term | Definition |
|---|---|
| Time series data | A set or a sample of time-referenced observations or records with an identified time period, cycle and unit (e.g. day, week, month extracted from a timestamp as YYYY:MM:DD:HH:mm) often illustrated by dot, line or needle plots with axes reflecting time and an outcome of interest. |
| Distribution of time series data | A general summary of frequencies in time-referenced data – i.e. how often an outcome of interest reaches a certain level with respect to time units (often illustrated with histograms and density plots). |
| Time series analyses | A collection of methods to describe, explain and predict temporal processes with time-referenced data for an outcome of interest. |
| Trend | General temporal behaviour in an outcome of interest that can exhibit steady incremental changes (linear) or varying incremental changes (non-linear) over time. |
| Season | An interval of time within one time cycle (typically 1 calendar year) defined by a specific biological, environmental, physical, physiological, or other property or feature in a biological or non-biological system [ |
| Seasonal pattern | A recurrence of periods in an outcome of interest with alternating values (e.g. high and low) over the course of a time cycle. |
| Seasonality | A systematic periodic fluctuation in an outcome of interest over the course of one cycle (typically 1 calendar year) as an observable property of a biological or non-biological system. |
| Seasonal curve | An analytical representation of seasonal periodic fluctuations in an outcome of interest within one time cycle (typically 1 calendar year). |
| Seasonality features | A set of measurable characteristics to describe seasonality and a seasonal curve within 1 year, including seasonal peak, nadir, intensity, duration, speed at which a seasonal curve reaches its peak and speed at which a seasonal curve declines to its nadir [ |
| Peak or nadir timing | A seasonality feature that represents times when a seasonal curve of an outcome reaches its maximum or minimum [ |
| Amplitude or intensity | A seasonality feature that represents the difference between seasonal peaks and nadirs [ |
| Duration | A seasonality feature that represents the time interval when incidence rises above a specified threshold [ |
Terms specify differences between time series data, distributions and analyses, as well as trends, season, seasonal patterns, seasonality and seasonality features.