| Literature DB >> 30679458 |
Craig J McGowan1, Matthew Biggerstaff2, Michael Johansson3, Karyn M Apfeldorf4, Michal Ben-Nun5, Logan Brooks6, Matteo Convertino7,8, Madhav Erraguntla9, David C Farrow10, John Freeze9, Saurav Ghosh11, Sangwon Hyun12, Sasikiran Kandula13, Joceline Lega14, Yang Liu8, Nicholas Michaud15, Haruka Morita13, Jarad Niemi16, Naren Ramakrishnan11, Evan L Ray17, Nicholas G Reich18, Pete Riley5, Jeffrey Shaman13, Ryan Tibshirani19, Alessandro Vespignani20, Qian Zhang20, Carrie Reed1.
Abstract
Since 2013, the Centers for Disease Control and Prevention (CDC) has hosted an annual influenza season forecasting challenge. The 2015-2016 challenge consisted of weekly probabilistic forecasts of multiple targets, including fourteen models submitted by eleven teams. Forecast skill was evaluated using a modified logarithmic score. We averaged submitted forecasts into a mean ensemble model and compared them against predictions based on historical trends. Forecast skill was highest for seasonal peak intensity and short-term forecasts, while forecast skill for timing of season onset and peak week was generally low. Higher forecast skill was associated with team participation in previous influenza forecasting challenges and utilization of ensemble forecasting techniques. The mean ensemble consistently performed well and outperformed historical trend predictions. CDC and contributing teams will continue to advance influenza forecasting and work to improve the accuracy and reliability of forecasts to facilitate increased incorporation into public health response efforts.Entities:
Mesh:
Year: 2019 PMID: 30679458 PMCID: PMC6346105 DOI: 10.1038/s41598-018-36361-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Percentage of visits for ILI reported by ILINet – 2009–2010 season to 2015–2016 season.
2015–2016 seasonal target values and boundaries of evaluation periods for the United States as a whole and each HHS Region, based on ILINet values published MMWR week 28 (July 22, 2016).
| Seasonal Targets | Evaluation Periods (MMWR Week) | ||||||
|---|---|---|---|---|---|---|---|
| Baseline value | Onset Week | Peak Week | Peak Intensity | Onset Week | Peak Week/ Intensity | 1–4 week ahead | |
| US National | 2.1% | 3 | 10 | 3.6% | 42 − 9 | 42 − 14 | 51 − 17 |
| HHS Region 1 | 1.3% | 51 | 10 | 2.5% | 42 − 5 | 42 − 17 | 47 − 18 |
| HHS Region 2 | 2.3% | 4 | 11 | 4.1% | 42 − 10 | 42 − 14 | 52 − 17 |
| HHS Region 3 | 1.8% | 47 | 10 | 4.0% | 42 − 1 | 42 − 18 | 43 − 18 |
| HHS Region 4 | 1.6% | 3 | 10 | 3.6% | 42 − 9 | 42 − 18 | 51 − 18 |
| HHS Region 5 | 1.6% | 7 | 10 | 3.3% | 42 − 13 | 42 − 14 | 3 − 17 |
| HHS Region 6 | 3.6% | 47 | 7 | 5.6% | 42 − 1 | 42 − 13 | 49 − 16 |
| HHS Region 7 | 1.7% | 7 | 10 | 2.5% | 42 − 13 | 42 − 14 | 3 − 17 |
| HHS Region 8 | 1.4% | 5 | 8, 11 | 2.2% | 42 − 11 | 42 − 15 | 1 − 18 |
| HHS Region 9 | 2.6% | 3 | 7 | 4.4% | 42 − 9 | 42 − 14 | 51 − 17 |
| HHS Region 10 | 1.1% | 2 | 7 | 2.4% | 42 − 8 | 42 − 15 | 50 − 18 |
Participating model descriptions.
| Model | Data source | Regional forecasta | Model type | Returning Team | Ensemble Forecast | Brief description |
|---|---|---|---|---|---|---|
| A | ILINet, weather attributes | Yes | Mechanisticb | Yes | Yes | SIRS model with ensemble Kalman filter to assimilate observed data sources. |
| B | ILINet | Yes | Statisticalc | No | Yes | Historical predictions for part of season, followed by extra trees random forest predictive model. |
| C | ILINet, specific humidity | Yes | Mechanistic | Yes | Yes | SIR, SIRS, SEIR, SEIRS models combined using three ensemble filter algorithms w/ fixed scale and real-time ILI measures.[ |
| D | ILINet, specific humidity | Yes | Mechanistic | Yes | Yes | SIR, SIRS, SEIR, SEIRS models combined using three ensemble filter algorithms w/ variable scale and inferred ILI measures[ |
| E | ILINet, Twitter, Wikipedia | Yes | Statistical | Yes | No | Kalman filter using archetypal ILI trajectory as a process model and digital surveillance as measurements[ |
| F | ILINet, crowd-sourced forecasts | Yes | Statistical | Yes | Yes | Aggregate forecast from many individual crowd-sourced forecasts[ |
| G | ILINet | Yes | Statistical | Yes | Yes | Weighted ensemble of ten statistical models including empirical Bayes, smooth splines, empirical distribution. |
| H | ILINet, weather attributes | Yes | Statistical | No | No | Use maximum mutual information to explore dependencies between factors and determine the optimal predictive model; variables included are ILI, temperature, rain/snowfall, leading to a maximum entropy generalized non-linear model. |
| Id | ILINet, Twitter | Yes | Statistical | No | No | Bayesian hierarchical model that borrows information from previous flu seasons to inform about the current flu season. |
| J | ILINet | Yes | Mechanistic | No | No | Fit optimal parabola to incidence curve for current season ILI data, incorporating noise estimated from past seasons[ |
| Kd | ILINet | Yes | Statistical | No | No | Use k-nearest neighbours approach to select past season most similar to current season. Historical variance with normality assumption used to generate probabilities. |
| Ld | ILINet | No | Statistical | No | No | Use kernel conditional density estimation to estimate each future week, combine using copulas to create joint distribution[ |
| M | ILINet, Twitter | Yes | Mechanistic | Yes | No | Uses Twitter and ILINet data to set initial conditions for stochastic generative epidemic model, calibrated to historical ILI surveillance[ |
| Nd | ILINet, school vacation schedules, specific humidity | Yes | Mechanistic | No | No | An MCMC procedure with an SIR model using climate and school vacation schedule to determine the reproduction number. National forecasts are a weighted average of coupled regional forecasts. |
a“Yes” denotes forecast for ≥1 HHS region (for all weeks).
bIncludes models that incorporate compartmental modelling like Susceptible-Exposed-Infected-Recovered [SEIR] models.
cIncludes models like time series analysis and generalized linear models.
dFirst forecast received on MMWR week 45 (Model K), 49 (Model L), 50 (Model I), and week 4 (Model N).
Average forecast skill for US national targets by forecast team during the 2015–2016 influenza season. Bold denotes the highest scoring team for that target.
| Onset week | Peak week | Peak intensity | Seasonal averagea | 1 week ahead | 2 week ahead | 3 week ahead | 4 week ahead | Short-term averageb | |
|---|---|---|---|---|---|---|---|---|---|
| Model A | 0.004 | 0.003 | 0.523 | 0.021 | 0.107 | 0.122 | 0.114 | 0.115 | 0.114 |
| Model B |
|
| 0.515 |
| 0.612 | 0.513 | 0.451 | 0.398 | 0.492 |
| Model C | 0.038 | 0.015 | 0.255 | 0.054 | 0.578 | 0.293 | 0.164 | 0.098 | 0.238 |
| Model D | 0.037 | 0.031 | 0.279 | 0.072 | 0.876 | 0.668 | 0.443 | 0.297 | 0.540 |
| Model E | 0.045 | 0.072 |
| 0.139 | 0.707 | 0.658 | 0.601 | 0.535 | 0.626 |
| Model F | 0.038 | 0.072 | 0.647 | 0.131 |
| 0.727 |
| 0.514 |
|
| Model G | 0.047 | 0.110 | 0.581 | 0.157 | 0.847 | 0.715 | 0.638 |
| 0.693 |
| Model H | 0.014 | 0.000 | 0.055 | 0.007 | 0.014 | 0.067 | 0.011 | 0.008 | 0.017 |
| Model Ic | 0.004 | 0.008 | 0.013 | 0.008 | 0.162 | 0.209 | 0.257 | 0.317 | 0.225 |
| Model J | 0.000 | 0.155 | 0.383 | 0.036 | 0.711 | 0.399 | 0.303 | 0.207 | 0.376 |
| Model Kc | 0.037 | 0.030 | 0.076 | 0.044 | 0.358 | 0.343 | 0.320 | 0.283 | 0.326 |
| Model Lc | 0.105 | 0.167 | 0.323 | 0.185 | 0.747 |
| 0.566 | 0.352 | 0.590 |
| Model M | 0.004 | 0.021 | 0.278 | 0.033 | 0.698 | 0.426 | 0.284 | 0.169 | 0.357 |
| Model Nc | 0.001 | 0.002 | 0.003 | 0.002 | 0.061 | 0.043 | 0.014 | 0.009 | 0.025 |
| Median Team Skill | 0.037 | 0.030 | 0.301 | 0.049 | 0.655 | 0.413 | 0.311 | 0.290 | 0.366 |
| FluSight Ensemble | 0.115 | 0.134 | 0.505 | 0.206 | 0.719 | 0.620 | 0.542 | 0.466 | 0.585 |
| Hist. Avg. Forecast | 0.108 | 0.054 | 0.268 | 0.117 | 0.406 | 0.408 | 0.404 | 0.400 | 0.404 |
aAverage of submissions for onset week, peak week, and peak intensity.
bAverage of submissions for 1, 2, 3, and 4 weeks ahead.
cFirst forecast received on MMWR week 45 (Model K), 49 (Model L), 50 (Model I), and week 4 (Model N); Missing forecasts are assigned a log score of −10 for scoring purposes.
Figure 2Weekly forecast skill for national onset week, season peak intensity, and season peak week during the 2015–2016 influenza season. Each grey line represents a separate forecast model, the solid black line represents the FluSight Ensemble, and vertical dashed lines indicate the date when the forecasted target occurred.
Figure 3Weekly forecast skill for one to four week ahead forecasts of the national ILINet percentage for individual team forecasts shown in grey and for the FluSight Ensemble shown in black during the 2015–2016 season, by week, with the observed ILINet percent (wILI%) overlaid in red. The x-axis represents the MMWR week that each forecast is predicting.
Median team average forecast skill by target for forecast locations during the 2015-2016 influenza forecasting challenge. Bold denotes the location with the highest median team forecast skill.
| Location | Onset week | Peak week | Peak intensity | 1 week ahead | 2 week ahead | 3 week ahead | 4 week ahead |
|---|---|---|---|---|---|---|---|
| US National | 0.037 | 0.030 | 0.301 | 0.655 | 0.413 | 0.311 | 0.290 |
| HHS Region 1 |
| 0.013 | 0.382 | 0.497 | 0.553 | 0.525 | 0.404 |
| HHS Region 2 | 0.003 | 0.011 | 0.209 | 0.379 | 0.231 | 0.223 | 0.184 |
| HHS Region 3 | 0.384 | 0.006 | 0.174 | 0.570 | 0.269 | 0.184 | 0.163 |
| HHS Region 4 | 0.004 | 0.027 | 0.229 | 0.386 | 0.302 | 0.191 | 0.204 |
| HHS Region 5 | 0.039 | 0.019 | 0.184 | 0.537 | 0.256 | 0.286 | 0.200 |
| HHS Region 6 | 0.259 | 0.017 | 0.060 | 0.256 | 0.161 | 0.122 | 0.077 |
| HHS Region 7 | 0.032 | 0.022 | 0.153 | 0.627 | 0.514 | 0.438 | 0.276 |
| HHS Region 8 | 0.054 | 0.021 | 0.498 |
|
|
| 0.436 |
| HHS Region 9 | 0.043 | 0.063 |
| 0.248 | 0.231 | 0.224 | 0.192 |
| HHS Region 10 | 0.092 |
| 0.482 | 0.625 | 0.565 | 0.537 |
|
Figure 4Log scores by characteristics of the forecasting approach. Each small, transparent point represents the log score for a specific target (colours), location, and forecast week. Seasonal targets are shown in the top panel and short-term targets in the bottom panel. Each sub-panel is divided by forecast characteristics including whether the team had participated in previous seasons, whether the model was mechanistic or statistical, whether data sources other than ILINet were used, and whether an ensemble was used to create the forecast. Bold diamonds represent the average log score across models for each target in each category. Solid lines indicate statistically significant differences determined by multivariable gamma regression controlling for location and forecast week.
Figure 5Log scores by characteristics of the influenza season. Each small, transparent point represents the log score for a specific target (colours), location, and forecast week. Seasonal targets are shown in the top panel and short-term targets in the bottom panel. Each sub-panel is divided by seasonal characteristics including observed timing of onset week, observed timing of peak week, relative intensity of the peak wILI% value to the baseline value, the number of weeks ILINet remained above baseline, and the absolute difference between the initial published wILI% value for the week a forecast is based on and the week’s final wILI% value. Bold diamonds represent the average log score across models for each target in each category. Solid lines indicate statistically significant differences determined by multivariable gamma regression controlling for forecast week.
Median and top team forecast skill for national targets from 2014–2015 and 2015–2016 influenza challenge, using scoring rules from 2014–2015 challengea.
| Onset week | Peak week | Peak intensity | Seasonal averageb | 1 week ahead | 2 week ahead | 3 week ahead | 4 week ahead | Short-term averagec | |
|---|---|---|---|---|---|---|---|---|---|
| 2015/2016 Median Team Skill | 0.01 | 0.02 | 0.22 | 0.02 | 0.34 | 0.23 | 0.19 | 0.16 | 0.23 |
| 2014/2015 Median Team Skill | 0.04 | 0.25 | 0.02 | 0.11 | 0.14 | 0.11 | 0.08 | 0.10 | 0.13 |
| 2015/2016 Top Team Skill | 0.04 | 0.06 | 0.35 | 0.07 | 0.63 | 0.58 | 0.45 | 0.39 | 0.46 |
| 2014/2015 Top Team Skill | 0.41 | 0.49 | 0.35 | 0.39 | 0.43 | 0.30 | 0.34 | 0.34 | 0.34 |
aFor 2014-2015, forecasts for peak intensity and short-term forecasts were binned as semi-open 1% bins up to 10%, with a final bin for all values greater than or equal to 10%. For all targets, only the probability assigned to the correct bin was considered correct for scoring[9].
bAverage of submissions for onset week, peak week, and peak intensity.
cAverage of submissions for 1, 2, 3, and 4 weeks ahead.