| Literature DB >> 27154479 |
Abstract
OBJECTIVES: Reliable monitoring of influenza seasons and pandemic outbreaks is essential for response planning, but compilations of reports on detection and prediction algorithm performance in influenza control practice are largely missing. The aim of this study is to perform a metanarrative review of prospective evaluations of influenza outbreak detection and prediction algorithms restricted settings where authentic surveillance data have been used.Entities:
Keywords: detection algorithms; evaluation; influenza; meta-narrative review; prediction algorithms
Mesh:
Year: 2016 PMID: 27154479 PMCID: PMC4861093 DOI: 10.1136/bmjopen-2015-010683
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Summary of semantic system used to interpret algorithm performance
| Performance | |||
|---|---|---|---|
| Measurement | Outstanding | Excellent | Acceptable |
| Outbreak detection and prediction | |||
| AUC, AUWROC, VUTROC | 0.90 | 0.80 | 0.70 |
| Sensitivity, specificity, PPV (weekly) | 0.95 | 0.90 | 0.85 |
| Sensitivity, specificity, PPV (daily) | 0.90 | 0.85 | 0.80 |
| Only outbreak prediction | |||
| Pearson's correlation (weekly) | 0.90 | 0.80 | 0.70 |
| Pearson's correlation (daily) | 0.85 | 0.75 | 0.65 |
AUC, area under the curve; AUWROC, area under the weighted receiver operating characteristic curve; PPV, positive predictive value; VUTROC, volume under the time-ROC surface.
Figure 1Flow chart of the paper selection process. Additional reasons for exclusion (*) included that the case definition did not comprise at least a clinical diagnosis of influenza or influenza-like illness.
Evaluation algorithms include in the metanarrative review and their absolute and relative performance
| Study | Algorithm | Modification | Temporal | Absolute performance | Relative performance |
|---|---|---|---|---|---|
| Outbreak detection | |||||
| Closas | Kolmogorov-Smirnov test | Weekly | Acceptable (sensitivity 1.00; specificity 0.88) | No comparisons | |
| Martínez-Beneito | Markov model (hidden) V.1 | Weekly | Outstanding (AUWROC 0.97–0.98) | Markov model (switching)>Markov model (hidden)>regression (Serfling)>CUSUM>regression (simple) | |
| Regression (Serfling) | Outstanding (AUWROC 0.93) | Markov model (switching)>Markov model (hidden)>regression (Serfling)>CUSUM>regression (simple) | |||
| Markov model (hidden) V.2 | Outstanding (AUWROC 0.93–0.95) | Markov model (switching)>Markov model (hidden)>regression (Serfling)>CUSUM>regression (simple) | |||
| Regression (simple) | Poor (AUWROC 0.57) | Markov model (switching)>Markov model (hidden)>regression (Serfling)>CUSUM>regression (simple) | |||
| SPC (CUSUM) | Poor (AUWROC 0.65–0.70) | Markov model (switching)>Markov model (hidden)>regression (Serfling)>CUSUM>regression (simple) | |||
| Cowling | Time series, dynamic linear model | Different parameter combinations tested. W represents the assumed smoothness of the underlying system. Range: 0.025, 0.050, 0.075 or 0.100 | Weekly | Hong Kong: acceptable (VUTROC 0.77, sensitivity 1.00, timeliness 1.40 weeks), with fixed specificity=0.95 | Hong Kong data: time series (dynamic linear model)>regression (simple)>CUSUM US data: time series (dynamic linear model)>CUSUM>regression (simple) |
| Regression (simple) | Different parameter combinations tested. m represents the number of prior weeks used to calculate the running mean and variance. Range: 3, 5, 7 or 9 | Hong Kong: acceptable (VUTROC 0.75, sensitivity 1.00, timeliness 1.72 weeks), with fixed specificity=0.95 | Hong Kong data: time series (dynamic linear model)>regression (simple)>CUSUMUS data: time series (dynamic linear model)>CUSUM>regression (simple) | ||
| SPC (CUSUM) | Different parameter combinations tested. d represents the number of weeks t separating the baseline and the index day of the outbreak. Range: 2 or 3. k represents the minimum standardised difference. Range: 1 or 2 | Hong Kong: poor (VUTROC 0.56, sensitivity 0.86, timeliness 2.00 weeks), with fixed specificity=0.95 | Hong Kong data: time series (dynamic linear model)>regression (simple)>CUSUMUS data: time series (dynamic linear model)>CUSUM>regression (simple) | ||
| Outbreak prediction | |||||
| Timpka | Shewhart type | Daily and weekly | Pandemic outbreak: poor (AUC 0.84; PPV 0.58) on a daily basis and poor (at most acceptable) (AUC 0.78; PPV 0.79) on a weekly basis | No comparisons | |
| Yuan | Multiple linear regression | Monthly | NA. Limits not defined for the adjusted metrics of residuals used (APE) | No comparisons | |
| Jiang | Bayesian network | Daily | Outstanding (r=0.97, prediction on day 13; r=0.94, prediction on day 22) | No comparisons | |
| Burkom | Regression (log-linear, non-adaptive) | Non-adaptive | Daily | NA. Limits not defined for the adjusted metrics of residuals used (MAD, MedAPE) | Ten series of case count data: Holt-Winters>regression (log-linear, adaptive)>regression (log-linear, non-adaptive) |
| Regression (log-linear, adaptive) | Adaptive | Ten series of case count data: Holt-Winters>regression (log-linear, adaptive)>regression (log-linear, non-adaptive) | |||
| Holt-Winters (generalised exponential smoothing) | Ten series of case count data: Holt-Winters>regression (log-linear, adaptive)>regression (log-linear, non-adaptive) | ||||
| Viboud | Method of analogues (non-parametric time-series forecasting method) | Weekly | From poor (r=0.66, for 10-week-ahead prediction) to excellent (r=0.81, for 1-week-ahead prediction) | Method of analogues>autoregressive model (linear)>Stone's naive method | |
| Autoregressive model (linear) | From poor (r=–0.07, for 10-week-ahead prediction) to acceptable (r=0.73, for 1-week-ahead prediction) | Method of analogues>autoregressive model (linear)>Stone's naive method | |||
| The naive method | Poor (r=–0.09, for 10-week-ahead prediction; r=0.65, for 1-week-ahead prediction) | Method of analogues>autoregressive model (linear)>Stone’s naive method | |||
APE, absolute percentage error; AUC, area under the curve; AUWROC, area under the weighted receiver operating characteristic curve; CUSUM, cumulative sum; MAD, median absolute residual; MedAPE, median absolute percentage error; NA, not applicable; PPV, positive predictive value; SPC, statistical process control; VUTROC, volume under the time-ROC surface.
Summary of narrative characteristics
| Narrative | Storyline | Intended audience* | Learning period dilemma | Theoretical proofs | Population descriptions | End point measures |
|---|---|---|---|---|---|---|
| Biodefence | System verification | Engineers and modellers | Irregular attention | Included in argument | Summary | Various statistical |
| Health policy | System validation | Policymakers | Binding attention | Excluded | Extensive | Standard epidemiological |
*In addition to researchers.