| Literature DB >> 33398249 |
Nicola Bartolomeo1, Paolo Trerotoli1, Gabriella Serio1.
Abstract
To estimate the size of the novel coronavirus (COVID-19) outbreak in the early stage in Italy, this paper introduces the cumulated and weighted average daily growth rate (WR) to evaluate an epidemic curve. On the basis of an exponential decay model (EDM), we provide estimations of the WR in four-time intervals from February 27 to April 07, 2020. By calibrating the parameters of the EDM to the reported data in Hubei Province of China, we also attempt to forecast the evolution of the outbreak. We compare the EDM applied to WR and the Gompertz model, which is based on exponential decay and is often used to estimate cumulative events. Specifically, we assess the performance of each model to short-term forecast of the epidemic, and to predict the final epidemic size. Based on the official counts for confirmed cases, the model applied to data from February 27 until the 17th of March estimate that the cumulative number of infected could reach 131,280 (with a credibility interval 71,415-263,501) by April 25 (credibility interval April 12 to May 3). With the data available until the 24st of March the peak date should be reached on May 3 (April 23 to May 23) with 197,179 cumulative infections expected (130,033-315,269); with data available until the 31st of March the peak should be reached on May 4 (April 25 to May 18) with 202,210 cumulative infections expected (155.235-270,737); with data available until the 07st of April the peak should be reached on May 3 (April 26 to May 11) with 191,586 (160,861-232,023) cumulative infections expected. Based on the average mean absolute percentage error (MAPE), cumulated infections forecasts provided by the EDM applied to WR performed better across all scenarios than the Gompertz model. An exponential decay model applied to the cumulated and weighted average daily growth rate appears to be useful in estimating the number of cases and peak of the COVID-19 outbreak in Italy and the model was more reliable in the exponential growth phase. .Entities:
Keywords: CI, cumulative infections; Coronavirus; Covid-19; DGR, daily growth rate; DI, daily infections; Daily grow rate; EDM, exponential decay model; Exponential decay model; Italy; Short-term forecasts; WR, weighted and cumulated average of the daily growth rate
Year: 2020 PMID: 33398249 PMCID: PMC7773318 DOI: 10.1016/j.idm.2020.12.007
Source DB: PubMed Journal: Infect Dis Model ISSN: 2468-0427
Fig. 1CI, DGR, and observed and calculated in Italy during the reference period (February 27, 2020–April 07, 2020).
Descriptive statistics of the CI, DGR, and WR for each time phase.
| Time Model | N° Days | CI at end Date | DGR (%) | |||
|---|---|---|---|---|---|---|
| (end Date) | (from 27/02) | (from 650) | Range | Median [IQR] | Range | Median [IQR] |
| P1 (17/03) | 20 | 31,506 | 10.7–62.5 | 22.8 [19.1–25.7] | 18–45.4 | 26.0 [22.1–31.4] |
| P2 (24/03) | 27 | 69,176 | 8.1–62.5 | 20.2 [13.6–24.6] | 13.6–45.4 | 22.5 [17.7–28.3] |
| P3 (31/03) | 34 | 105,792 | 4.0–62.5 | 15.9 [8.8–23.3] | 9.5–45.4 | 20.7 [14.7–26.2] |
| P4 (07/04) | 41 | 135,606 | 2.3–62.5 | 13.4 [6.9–22.8] | 7.0–45.4 | 17.3 [11.3–26.0] |
DGR, daily growth rate; CI, cumulative infections; IQR, inter quartile range; P1–P4, time intervals; WRt, weighted and cumulated average of the daily growth rate.
Fig. 2CI, DGR, and observed and estimated with model 4) in Hubei Province (China) between January 29, 2020 and March 13, 2020.
Estimated parameters with the exponential decay model applied to (Eq. (5)) and the Gompertz model applied to the CI (Eq. (7)) for each time phase.
| Model | N° Days | RMSE | MAE | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Fixed = | Mean±Std.Err | Mean±Std.Err | Mean±Std.Err | |||||||
| 20 | 0.011 | 0.421 ± 0.011 | 0.050 ± 0.003 | – | 0.019 | 0.016 | ||||
| 27 | 0.011 | 0.414 ± 0.009 | 0.047 ± 0.002 | – | 0.018 | 0.014 | ||||
| 34 | 0.011 | 0.414 ± 0.007 | 0.046 ± 0.002 | – | 0.016 | 0.011 | ||||
| 41 | 0.011 | 0.415 ± 0.006 | 0.047 ± 0.001 | – | 0.014 | 0.010 | ||||
| CI–P1 | 20 | – | 602.2 ± 63.6 | 0.044 ± 0.004 | 0.309 ± 0.019 | 256.7 | 192.6 | |||
| CI–P2 | 27 | – | 498.7 ± 70.8 | 0.049 ± 0.003 | 0.338 ± 0.020 | 564.6 | 410.3 | |||
| CI–P3 | 34 | – | 200.3 ± 40.9 | 0.068 ± 0.002 | 0.480 ± 0.028 | 943.5 | 804.5 | |||
| CI–P4 | 41 | – | 118.2 ± 21.2 | 0.075 ± 0.001 | 0.559 ± 0.023 | 1016.3 | 832.2 | |||
p < .0001 CI, cumulative infections; P1–P4, time intervals; WRt, weighted and cumulated average of the daily growth rate; RMSE, root mean squared error; MAE, mean absolute error.
Fig. 3Fitted models based on the exponential decay model (Eq. (5); left column) and Gompertz model (Eq. (7); right column) using epidemic data for the four successive time phases P1, P2, P3, and P4, respectively. The expected values (continuous red line) and the 95% credibility interval (red dotted line) for the fitted models are displayed along with the observed data of the outbreak (empty black circles).
Fitting (a) and forecasting (b) performance statistics for the exponential decay model applied to (Eq. (5)) and the Gompertz model applied to the CI (Eq. (7)). For each time interval, we highlight the lowest MAPE (gray) for the size of the error.
| Model | N° Days | WR | CI | ||
|---|---|---|---|---|---|
| MAPE | rPers | MAPE | rPers | ||
| 20 | 0.056 | 0.968 | 0.036 | 1.000 | |
| 27 | 0.051 | 0.978 | 0.058 | 1.000 | |
| 34 | 0.044 | 0.986 | 0.141 | 1.000 | |
| 41 | 0.044 | 0.990 | 0.151 | 1.000 | |
| MAPE | rPers | MAPE | rPers | ||
| 35 | 0.075 | 0.998 | 0.371 | 0.984 | |
| 28 | 0.029 | 0.997 | 0.322 | 0.996 | |
| 21 | 0.037 | 0.999 | 0.040 | 0.999 | |
| 14 | 0.023 | 1.000 | 0.028 | 0.999 | |
Fig. 4Epidemic forecasts based on the exponential decay model (Eq. (5); left column) and Gompertz model (Eq. (7); right column) calibrated using the epidemic data for scenarios 1, 2, 3, and 4, respectively. The expected values (continuous red line) and the 95% credibility interval (red dotted line) for the calibrated models are displayed along with the observed data of the epidemic, both those used for the estimation of the models (empty black circles) and those after modeling (filled black circles).