| Literature DB >> 28482810 |
Shihao Yang1, Mauricio Santillana2,3, John S Brownstein4,5, Josh Gray6, Stewart Richardson6, S C Kou7.
Abstract
BACKGROUND: Accurate influenza activity forecasting helps public health officials prepare and allocate resources for unusual influenza activity. Traditional flu surveillance systems, such as the Centers for Disease Control and Prevention's (CDC) influenza-like illnesses reports, lag behind real-time by one to 2 weeks, whereas information contained in cloud-based electronic health records (EHR) and in Internet users' search activity is typically available in near real-time. We present a method that combines the information from these two data sources with historical flu activity to produce national flu forecasts for the United States up to 4 weeks ahead of the publication of CDC's flu reports.Entities:
Keywords: Autoregression; Digital disease detection; Dynamic error reduction; Influenza-like illnesses reports; Validation test
Mesh:
Year: 2017 PMID: 28482810 PMCID: PMC5423019 DOI: 10.1186/s12879-017-2424-7
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
ARGO performance compared to alternative methods for the time period of July 6, 2013 to February 21, 2015
| real-time | forecast 1 week | forecast 2 week | forecast 3 week | |
|---|---|---|---|---|
| RMSE | ||||
| ARGO |
|
|
|
|
| Ref. [ | 0.469 | 0.544 | 0.590 | 0.578 |
| ar4 | 0.944 | 0.954 | 0.935 | 0.902 |
| naive | 1 (0.374) | 1 (0.613) | 1 (0.756) | 1 (0.869) |
| MAE | ||||
| ARGO |
|
|
|
|
| Ref. [ | 0.497 | 0.614 | 0.603 | 0.593 |
| ar4 | 0.895 | 0.880 | 0.872 | 0.867 |
| naive | 1 (0.221) | 1 (0.363) | 1 (0.480) | 1 (0.575) |
| RMSPE | ||||
| ARGO |
|
|
|
|
| Ref. [ | 0.655 | 0.677 | 0.657 | 0.691 |
| ar4 | 1.001 | 1.018 | 1.032 | 1.044 |
| naive | 1 (0.126) | 1 (0.194) | 1 (0.246) | 1 (0.293) |
| MAPE | ||||
| ARGO |
|
|
|
|
| Ref. [ | 0.625 | 0.704 | 0.662 | 0.676 |
| ar4 | 0.956 | 0.965 | 0.977 | 0.988 |
| naive | 1 (0.101) | 1 (0.156) | 1 (0.205) | 1 (0.251) |
| Correlation | ||||
| ARGO |
|
|
|
|
| Ref. [ | 0.989 | 0.960 | 0.928 | 0.904 |
| ar4 | 0.954 | 0.871 | 0.804 | 0.748 |
| naive | 0.951 | 0.867 | 0.796 | 0.727 |
| Error reduction of ARGO over the best available alternative (in %) | ||||
| RMSE | 32.90 | 20.07 | 17.40 | 20.53 |
| MAE | 18.79 | 27.44 | 24.41 | 28.13 |
| RMSPE | 31.50 | 29.90 | 23.26 | 33.32 |
| MAPE | 22.92 | 34.95 | 31.42 | 38.02 |
The evaluation metrics between the prediction and the target include RMSE , and Pearson correlation. The benchmark models include the ensemble method by Santillana et al. [11], an autoregression model with 4 lags, and a naive model, which uses prior week’s ILI level as the prediction for the current week as well as the next 3 weeks. Boldface highlights the best method for each metric in each forecasting time horizon. RMSE, MAE, RMSPE, MAPE are relative to the error of the naive method, i.e., the numbers are the ratio of the error of a given method over that of the naive method; the absolute error of the naive method is given in the round bracket. Table S3 in the Additional file 1 gives the absolute error of all methods. For each forecasting time horizon and each evaluation metrics, the error reduction of ARGO over the best alternative method is given in the second half of the table, together with 95% confidence intervals (in the square bracket) constructed using stationary bootstrap [33] with mean block size of 52 weeks.
Fig. 1Forecasting results. The four panels show the forecasted ILI activity levels for real-time and 1 to 3 weeks into the future from ARGO (thick red), the method of Santillana et al. [11](blue), Healthmap Flu Trends system (green), and the autoregression model with 4 lags (grey), compared to the true CDC’s ILI activity level (thick black), which became available weeks later. The plot at the bottom of each panel shows the estimation error, namely the estimated value minus the true CDC’s ILI activity level
ARGO performance compared to alternative methods for the validation period of February 28, 2015 to July 2, 2016
| real-time | forecast 1 week | forecast 2 week | forecast 3 week | |
|---|---|---|---|---|
| RMSE | ||||
| ARGO |
|
|
|
|
| healthmap | 0.530 | 0.590 | 0.932 | 0.949 |
| ar4 | 0.902 | 0.909 | 0.838 | 0.780 |
| naive | 1 (0.206) | 1 (0.330) | 1 (0.439) | 1 (0.552) |
| MAE | ||||
| ARGO |
|
|
|
|
| healthmap | 0.527 | 0.564 | 0.697 | 0.700 |
| ar4 | 0.994 | 0.952 | 0.852 | 0.766 |
| naive | 1 (0.146) | 1 (0.248) | 1 (0.341) | 1 (0.435) |
| RMSPE | ||||
| ARGO |
|
|
|
|
| healthmap | 0.622 | 0.613 | 0.868 | 0.871 |
| ar4 | 0.959 | 1.006 | 0.958 | 0.920 |
| naive | 1 (0.108) | 1 (0.173) | 1 (0.232) | 1 (0.293) |
| MAPE | ||||
| ARGO |
|
|
|
|
| healthmap | 0.592 | 0.593 | 0.666 | 0.654 |
| ar4 | 1.034 | 1.018 | 0.935 | 0.860 |
| naive | 1 (0.083) | 1 (0.139) | 1 (0.194) | 1 (0.250) |
| Correlation | ||||
| ARGO |
|
|
|
|
| healthmap | 0.987 | 0.956 | 0.843 | 0.774 |
| ar4 | 0.961 | 0.896 | 0.842 | 0.776 |
| naive | 0.963 | 0.900 | 0.829 | 0.745 |
| Error reduction of ARGO over the best alternative (in %) | ||||
| RMSE | 35.63 | 8.38 | 27.94 | 9.77 |
| MAE | 26.75 | 11.07 | 24.16 | 19.49 |
| RMSPE | 31.63 | 22.94 | 39.59 | 31.93 |
| MAPE | 24.29 | 21.42 | 26.58 | 24.42 |
The evaluation metrics are defined in Table 1. The benchmark methods are the same as Table 1 except that the ensemble method of Santillana et al. [11] is replaced by a refined version broadcasted by the Healthmap Flu Trends system. Boldface highlights the best method for each metric in each forecasting time horizon. RMSE, MAE, RMSPE, MAPE are relative to the error of the naive method, i.e., the numbers are the ratio of the error of a given method over that of the naive method; the absolute error of the naive method is given in the round bracket. Table S4 in the Additional file 1 gives absolute error of all methods. For each forecasting time horizon and each evaluation metrics, the error reduction of ARGO over the best alternative method is given in the second half of the table.