| Literature DB >> 32251464 |
Sarah F McGough1, Michael A Johansson2, Marc Lipsitch3,4, Nicolas A Menzies1.
Abstract
Achieving accurate, real-time estimates of disease activity is challenged by delays in case reporting. "Nowcast" approaches attempt to estimate the complete case counts for a given reporting date, using a time series of case reports that is known to be incomplete due to reporting delays. Modeling the reporting delay distribution is a common feature of nowcast approaches. However, many nowcast approaches ignore a crucial feature of infectious disease transmission-that future cases are intrinsically linked to past reported cases-and are optimized to one or two applications, which may limit generalizability. Here, we present a Bayesian approach, NobBS (Nowcasting by Bayesian Smoothing) capable of producing smooth and accurate nowcasts in multiple disease settings. We test NobBS on dengue in Puerto Rico and influenza-like illness (ILI) in the United States to examine performance and robustness across settings exhibiting a range of common reporting delay characteristics (from stable to time-varying), and compare this approach with a published nowcasting software package while investigating the features of each approach that contribute to good or poor performance. We show that introducing a temporal relationship between cases considerably improves performance when the reporting delay distribution is time-varying, and we identify trade-offs in the role of moving windows to accurately capture changes in the delay. We present software implementing this new approach (R package "NobBS") for widespread application and provide practical guidance on implementation.Entities:
Year: 2020 PMID: 32251464 PMCID: PMC7162546 DOI: 10.1371/journal.pcbi.1007735
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Weekly dengue fever nowcasts for December 23, 1991 through November 29, 2010 using a 2-year moving window.
(A) Weekly NobBS nowcasts along with (B) point estimate and uncertainty accuracy, as measured by the score and the prediction error, are compared to (C) weekly nowcasts by the HH approach with (D) corresponding scores and prediction errors. For nowcasting, the number of newly-reported cases each week (blue line) are the only data available in real-time for that week, and help inform the estimate of the total number of cases that will be eventually reported (red line), shown with 95% prediction intervals (pink bands). The true number of cases eventually reported (black line) is known only in hindsight and is the nowcast target. Historical information on reporting is available within a 104-week moving window (grey shade) and used to make nowcasts. The score (brown line) and the difference between the true and mean estimated number of cases (grey line) are shown as a function of time.
Fig 2Weekly ILI nowcasts for June 30, 2014 through September 25, 2017 using a 6-month moving window.
(A) Weekly NobBS nowcasts along with (B) point estimate and uncertainty accuracy, as measured by the score and the prediction error, are compared to (C) weekly nowcasts by the HH approach with (D) corresponding scores and prediction errors. For nowcasting, the number of newly-reported cases each week (blue line) are the only data available in real-time for that week, and help inform the estimate of the total number of cases that will be eventually reported (red line), shown with 95% prediction intervals (pink bands). For the HH approach, the 95% prediction intervals are very narrow and are thus difficult to see. The true number of cases eventually reported (black line) is known only in hindsight and is the nowcast target. Historical information on reporting is available within a 27-week moving window (grey shade) and used to make nowcasts. The score (brown line) and the difference between the true and mean estimated number of cases (grey line) are shown as a function of time.
Performance measures for each nowcast approach by disease (mean % reported with no delay, computed across the complete time period* of reports).
| Disease | Model | Period | % of weeks predicted | Average Score | MAE | RMSE | rRMSE | 95% PI coverage |
|---|---|---|---|---|---|---|---|---|
| Dengue | NobBS | Full time period | 100% | 0.349 | 16 | 37.6 | 0.600 | 0.87 |
| (4%) | Weeks in which at least 1 case was reported in the first week | -- | 0.274 | 21 | 46.6 | 0.464 | 0.85 | |
| HH (ref. 9) | Full time period | 55% | -- | 32 | 57.4 | 1.14 | -- | |
| Weeks in which at least 1 case was reported in the first week | -- | 0.161 | 37 | 68.1 | 1.24 | 0.90 | ||
| Influenza | NobBS | Full time period | 100% | 0.218 | 693 | 987.8 | 0.074 | 1.00 |
| (82%) | ||||||||
| HH (ref. 9) | Full time period | 100% | 0.017 | 609 | 916.2 | 0.062 | 0.00 |
*Full time period for: dengue fever (12/23/1991-11/29/2010) and ILI (6/30/2014-9/25/2017)
Performance measures for estimates of the change in disease incidence from the previous week.
| Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Period | MAEΔ | RMSEΔ | ρa | ρc | MAEΔ | RMSEΔ | ρa | ρc | |
| NobBS | Full time period | 17 | 35.8 | 0.876 | 0.958 | 669 | 1027.1 | 0.973 | 0.972 |
| Weeks in which at least 1 case was reported in the first week | 23 | 45.2 | -- | -- | -- | -- | -- | ||
| HH (ref. 9) | Full time period | 34 | 64.6 | 0.631 | 0.958 | 612 | 1004.2 | 0.970 | 0.972 |
| Weeks in which at least 1 case was reported in the first week | 50 | 88.2 | -- | -- | -- | -- | -- | ||
*Full time period for: dengue fever (12/23/1991-11/29/2010) and ILI (6/30/2014-9/25/2017)
Annual performance measures for each nowcast model, by disease.
All predicted weeks for each model are compared.
| Disease | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Year | Cases | MAE | rRMSE | RMSE | Average Score | MAE | rRMSE | RMSE | Average Score | |
| Dengue | 1992 | 3,570 | 15 | 0.271 | 19.7 | 0.262 | 27 | 0.473 | 33.2 | 0.154 |
| 1993 | 2,044 | 10 | 0.325 | 13.0 | 0.436 | 20 | 0.559 | 23.5 | 0.237 | |
| 1994 | 5,455 | 29 | 0.356 | 45.9 | 0.171 | 50 | 0.690 | 63.3 | 0.108 | |
| 1995 | 2,075 | 13 | 0.450 | 16.2 | 0.330 | 28 | 1.035 | 38.4 | 0.178 | |
| 1996 | 1,856 | 8 | 0.520 | 11.0 | 0.472 | 17 | 0.617 | 21.9 | 0.270 | |
| 1997 | 2,413 | 12 | 0.375 | 16.2 | 0.402 | 20 | 0.625 | 26.7 | 0.228 | |
| 1998 | 5,334 | 33 | 0.448 | 47.8 | 0.129 | 65 | 0.801 | 89.9 | 0.072 | |
| 1999 | 1,823 | 9 | 0.389 | 11.9 | 0.493 | 18 | 0.897 | 23.5 | 0.250 | |
| 2000 | 766 | 4 | 0.359 | 6.1 | 0.720 | 17 | 2.225 | 20.2 | 0.304 | |
| 2001 | 2,274 | 11 | 0.487 | 16.6 | 0.437 | 26 | 0.492 | 37.7 | 0.189 | |
| 2002 | 821 | 5 | 0.522 | 5.7 | 0.834 | 16 | 1.101 | 23.0 | 0.352 | |
| 2003 | 1,422 | 6 | 0.471 | 9.5 | 0.590 | 32 | 1.412 | 47.5 | 0.193 | |
| 2004 | 911 | 6 | 0.599 | 7.2 | 0.610 | 13 | 2.088 | 17.0 | 0.368 | |
| 2005 | 2,543 | 14 | 0.998 | 21.4 | 0.407 | 32 | 1.150 | 42.0 | 0.178 | |
| 2006 | 734 | 4 | 0.891 | 6.3 | 0.770 | 13 | 1.211 | 15.8 | 0.395 | |
| 2007 | 3,290 | 30 | 0.675 | 55.4 | 0.102 | 55 | 0.632 | 93.6 | 0.066 | |
| 2008 | 843 | 8 | 1.032 | 12.8 | 0.629 | 38 | 4.145 | 50.7 | 0.191 | |
| 2009 | 2,448 | 19 | 0.667 | 26.7 | 0.225 | 57 | 2.405 | 81.9 | 0.092 | |
| 2010 | 6,820 | 71 | 0.583 | 132.4 | 0.055 | 121 | 0.854 | 198.7 | 0.041 | |
| Influenza | 2014 | 726,312 | 1052 | 0.085 | 1565.9 | 0.188 | 958 | 0.091 | 1482.0 | 0.004 |
| 2015 | 679,850 | 685 | 0.086 | 890.3 | 0.203 | 624 | 0.069 | 848.0 | 0.019 | |
| 2016 | 704,020 | 696 | 0.072 | 861.8 | 0.224 | 376 | 0.043 | 480.1 | 0.063 | |
| 2017 | 632,353 | 551 | 0.046 | 712.9 | 0.258 | 659 | 0.047 | 934.2 | 0.008 | |