| Literature DB >> 29317382 |
Fred Sun Lu1, Suqin Hou2, Kristin Baltrusaitis3, Manan Shah4, Jure Leskovec4,5, Rok Sosic4, Jared Hawkins1,6, John Brownstein1,6, Giuseppe Conidi7, Julia Gunn7, Josh Gray8, Anna Zink8, Mauricio Santillana1,6.
Abstract
BACKGROUND: Influenza outbreaks pose major challenges to public health around the world, leading to thousands of deaths a year in the United States alone. Accurate systems that track influenza activity at the city level are necessary to provide actionable information that can be used for clinical, hospital, and community outbreak preparation.Entities:
Keywords: communicable diseases; epidemiology; influenza, human; machine learning; patient generated data; public health; regression analysis; statistics
Year: 2018 PMID: 29317382 PMCID: PMC5780615 DOI: 10.2196/publichealth.8950
Source DB: PubMed Journal: JMIR Public Health Surveill ISSN: 2369-2960
Comparison of single data source models for nowcasting Boston Public Health Commission’s percentage of influenza-like illness over the assessment period (September 2, 2012, to May 7, 2017). Each flu season starts on week 40 and ends on week 20 of the next year.
| Modela | |||||||
| 2012-16 | 2012-13 | 2013-14 | 2014-15 | 2015-16 | 2016-17 | ||
| AR52 | 0.303 | 0.577 | 0.199 | 0.305 | 0.217 | 0.229 | |
| ARGO(athena)b | 0.229 | ||||||
| ARGO(Google) | 0.206 | 0.312 | 0.247 | 0.161 | 0.188 | ||
| ARGO(FNY)c | 0.299 | 0.552 | 0.204 | 0.343 | 0.172 | 0.280 | |
| — | — | — | 0.162 | 0.427 | 0.351 | ||
| GFTd | — | 0.352 | 0.271 | 0.284 | — | — | |
| Naive | 0.266 | 0.481 | 0.208 | 0.280 | 0.202 | 0.219 | |
| AR52 | 0.180 | 0.345 | 0.218 | 0.176 | 0.176 | ||
| ARGO(athena) | 0.189 | 0.154 | |||||
| ARGO(Google) | 0.150 | 0.206 | 0.155 | 0.213 | 0.131 | ||
| ARGO(FNY) | 0.182 | 0.362 | 0.153 | 0.221 | 0.140 | 0.213 | |
| — | — | — | 0.136 | 0.338 | 0.251 | ||
| GFT | — | 0.294 | 0.225 | 0.227 | — | — | |
| Naive | 0.167 | 0.290 | 0.165 | 0.213 | 0.158 | 0.168 | |
| AR52 | 0.184 | 0.188 | 0.185 | 0.188 | 0.130 | ||
| ARGO(athena) | 0.193 | 0.129 | |||||
| ARGO(Google) | 0.179 | 0.134 | 0.152 | 0.209 | 0.146 | ||
| ARGO(FNY) | 0.192 | 0.210 | 0.139 | 0.186 | 0.143 | 0.157 | |
| — | — | — | 0.212 | 0.308 | 0.189 | ||
| GFT | — | 0.308 | 0.221 | 0.228 | — | — | |
| Naive | 0.172 | 0.169 | 0.152 | 0.188 | 0.162 | 0.130 | |
| AR52 | 0.898 | 0.882 | 0.846 | 0.834 | 0.806 | 0.898 | |
| ARGO(athena) | 0.964 | 0.843 | |||||
| ARGO(Google) | 0.956 | 0.968 | 0.910 | 0.896 | 0.930 | ||
| ARGO(FNY) | 0.901 | 0.909 | 0.845 | 0.824 | 0.886 | 0.879 | |
| — | — | — | 0.888 | 0.416 | 0.759 | ||
| GFT | — | 0.785 | 0.921 | — | — | ||
| Naive | 0.922 | 0.912 | 0.846 | 0.868 | 0.848 | 0.906 | |
| AR52 | 0.222 | 0.359 | −0.105 | 0.115 | −0.048 | 0.222 | |
| ARGO(athena) | 0.657 | 0.220 | 0.483 | ||||
| ARGO(Google) | 0.546 | 0.730 | 0.284 | 0.267 | 0.417 | ||
| ARGO(FNY) | 0.252 | 0.387 | −0.056 | −0.025 | 0.122 | 0.253 | |
| — | — | — | −0.481 | −0.291 | 0.095 | ||
| GFT | — | 0.281 | — | — | |||
| Naive | 0.291 | 0.480 | -0.100 | 0.280 | −0.070 | 0.193 | |
aThe best performance within each season and metric is italicized. Results for each model are shown where available.
bARGO: autoregression with general online information.
cFNY: Flu Near You.
dGFT: Google Flu Trends.
Figure 1Retrospective nowcasts from single data source models are shown, compared with Boston Public Health Commission’s official percentage of influenza-like illness (BPHC official %ILI) (black), over the entire study period (September 2, 2012, to May 7, 2017). The gold section indicates the holdout period from May 22, 2016, to May 7, 2017. The bottom panel shows the corresponding errors of each model compared with the official %ILI (ARGO: autoregression with general online information; FNY: Flu Near You).
Comparison of models using multiple data sources for nowcasting Boston Public Health Commission’s percentage of influenza-like illness over the study period (September 2, 2012, to May 7, 2017). ARGO(athena) and the naive model are included as benchmarks for comparison.
| Modela | |||||||
| 2012-16 | 2012-13 | 2013-14 | 2014-15 | 2015-16 | 2016-17 | ||
| ARGO(athena)b | 0.195 | 0.306 | 0.229 | 0.192 | 0.182 | ||
| ARGO(athena+Google+FNY)c | 0.165 | 0.199 | 0.192 | 0.189 | 0.168 | 0.156 | |
| Ensemble | 0.139 | ||||||
| Naive | 0.266 | 0.481 | 0.208 | 0.280 | 0.202 | 0.219 | |
| ARGO(athena) | 0.137 | 0.205 | 0.189 | 0.136 | 0.154 | ||
| ARGO(athena+Google+FNY) | 0.124 | 0.146 | 0.144 | 0.154 | 0.128 | 0.131 | |
| Ensemble | 0.106 | ||||||
| Naive | 0.167 | 0.290 | 0.165 | 0.213 | 0.158 | 0.168 | |
| ARGO(athena) | 0.163 | 0.128 | 0.193 | 0.129 | |||
| ARGO(athena+Google+FNY) | 0.154 | 0.112 | 0.136 | 0.153 | 0.142 | 0.104 | |
| Ensemble | 0.132 | 0.118 | |||||
| Naive | 0.172 | 0.169 | 0.152 | 0.188 | 0.162 | 0.130 | |
| ARGO(athena) | 0.959 | 0.964 | 0.843 | 0.950 | 0.949 | ||
| ARGO(athena+Google+FNY) | 0.972 | 0.985 | 0.861 | 0.916 | 0.957 | ||
| Ensemble | 0.928 | ||||||
| Naive | 0.922 | 0.912 | 0.846 | 0.868 | 0.848 | 0.906 | |
| ARGO(athena) | 0.547 | 0.657 | 0.220 | 0.483 | 0.515 | ||
| ARGO(athena+Google+FNY) | 0.656 | 0.807 | 0.419 | 0.312 | |||
| Ensemble | 0.633 | 0.357 | 0.565 | ||||
| Naive | 0.291 | 0.480 | −0.100 | 0.280 | −0.070 | 0.193 | |
aThe best performance within each season and metric is italicized.
bARGO: autoregression with general online information.
cFNY: Flu Near You.
Comparison of models using multiple data sources for forecasting Boston Public Health Commission’s percentage of influenza-like illness, over the study period (September 2, 2012, to May 7, 2017). ARGO(athena) and the naive model are included as benchmarks for comparison.
| Modela | |||||||
| 2012-16 | 2012-13 | 2013-14 | 2014-15 | 2015-16 | 2016-17 | ||
| ARGO(athena)b | 0.325 | 0.647 | 0.249 | 0.260 | 0.188 | 0.261 | |
| ARGO(athena+Google+FNY)c | 0.245 | 0.367 | 0.314 | 0.190 | |||
| Ensemble | 0.237 | 0.251 | |||||
| Naive | 0.428 | 0.827 | 0.276 | 0.447 | 0.271 | 0.340 | |
| ARGO(athena) | 0.203 | 0.432 | 0.200 | 0.156 | 0.221 | ||
| ARGO(athena+Google+FNY) | 0.169 | 0.247 | 0.225 | 0.151 | |||
| Ensemble | 0.171 | 0.202 | 0.198 | ||||
| Naive | 0.252 | 0.528 | 0.203 | 0.329 | 0.201 | 0.251 | |
| ARGO(athena) | 0.217 | 0.254 | 0.186 | 0.163 | 0.175 | ||
| ARGO(athena+Google+FNY) | 0.192 | 0.167 | 0.214 | 0.147 | 0.149 | ||
| Ensemble | 0.155 | 0.198 | |||||
| Naive | 0.238 | 0.308 | 0.169 | 0.272 | 0.195 | 0.204 | |
| ARGO(athena) | 0.887 | 0.842 | 0.785 | 0.898 | 0.891 | ||
| ARGO(athena+Google+FNY) | 0.938 | 0.949 | 0.826 | 0.922 | 0.903 | 0.910 | |
| Ensemble | 0.916 | ||||||
| Naive | 0.799 | 0.739 | 0.744 | 0.663 | 0.728 | 0.775 | |
| ARGO(athena) | 0.432 | 0.452 | 0.259 | 0.472 | |||
| ARGO(athena+Google+FNY) | 0.533 | 0.621 | 0.508 | 0.285 | 0.477 | ||
| Ensemble | 0.441 | 0.515 | 0.286 | ||||
| naive | 0.102 | 0.212 | 0.174 | −0.181 | 0.043 | −0.065 | |
aThe best performance within each season and metric is italicized.
bARGO: autoregression with general online information.
cFNY: Flu Near You.
Figure 2Estimations with multiple data source models over the entire study period (September 2, 2012, to May 7, 2017), with corresponding errors for each model compared with Boston Public Health Commission percentage of influenza-like illness (BPHC official %ILI). The gold section indicates the holdout period from May 22, 2016, to May 7, 2017. Predictions are shown separately for the nowcast horizon (top) and the forecast horizon (bottom) (ARGO: autoregression with general online information; FNY: Flu Near You).
Figure 3Heatmap of input variable coefficients for ARGO(athena+Google+FNY) from September 2, 2012, to May 15, 2016, for the nowcast horizon (ARGO: autoregression with general online information).
Figure 4Heatmap of input variable coefficients for ARGO(athena+Google+FNY) from September 2, 2012, to May 15, 2016, for the 1-week forecast horizon (ARGO: autoregression with general online information).
Efficiency improvement of ensemble method with 95% confidence intervals over the period of September 2, 2012, to May 15, 2016, using the stationary block bootstrap with mean length 52 weeks.
| Meana | |||
| AR52 | 4.03 | 2.11-6.83 | |
| athena | 1.91 | 1.14-3.01 | |
| 2.16 | 1.46-2.78 | ||
| athena+Google | 1.51 | 1.25-1.77 | |
| ARGO(athena)b | 1.67 | 1.31-2.18 | |
| ARGO(Google) | 1.87 | 1.57-2.30 | |
| ARGO(athena+Google+FNY)c | 1.20 | 1.10-1.29 | |
| AR52 | 4.55 | 1.98-7.16 | |
| athena | 2.37 | 1.54-3.14 | |
| 2.78 | 1.98-3.63 | ||
| athena+Google | 2.21 | 1.20-3.24 | |
| ARGO(athena) | 2.14 | 1.20-3.08 | |
| ARGO(Google) | 6.09 | 2.12-10.27 | |
| ARGO(athena+Google+FNY) | 1.21 | 1.03-1.35 | |
aMean values of the error for each methodology are displayed as multiples of the error associated to the best ensemble approach (for which the efficiency is assigned to be 1).
bARGO: autoregression with general online information.
cFNY: Flu Near You.