| Literature DB >> 34207174 |
Luisa Ferrari1, Giuseppe Gerardi2, Giancarlo Manzi3, Alessandra Micheletti4, Federica Nicolussi3, Elia Biganzoli5, Silvia Salini3.
Abstract
In this paper, we develop a forecasting model for the spread of COVID-19 infection at a provincial (i.e., EU NUTS-3) level in Italy by using official data from the Italian Ministry of Health integrated with data extracted from daily official press conferences of regional authorities and local newspaper websites. This data integration is needed as COVID-19 death data are not available at the NUTS-3 level from official open data channels. An adjusted time-dependent SIRD model is used to predict the behavior of the epidemic; specifically, the number of susceptible, infected, deceased, recovered people and epidemiological parameters. Predictive model performance is evaluated using comparison with real data.Entities:
Keywords: COVID-19; EU NUTS-3 regions; Italy; SIRD-derived models; epidemic data
Mesh:
Year: 2021 PMID: 34207174 PMCID: PMC8296340 DOI: 10.3390/ijerph18126563
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Main data sources for provincial COVID-19 deaths.
| Region | Main Source |
|---|---|
| Valle d’Aosta | |
| Pedmont | |
| Lombardy | |
| Veneto | |
| Friuli-Venezia-Giulia | |
| Trentino-Alto-Adige | |
| Emilia-Romagna | |
| Liguria | |
| Tuscany | |
| Marche | |
| Umbria | |
| Lazio | |
| Abruzzo | |
| Molise | |
| Puglia | |
| Basilicata | |
| Calabria | |
| Sicily | |
| Sardinia |
Figure 1Official and retrieved data comparison.
Differences between CPA death counts and regional bulletins and local newspapers death counts—Marche region, period 1 April 2020–4 April 2020.
| Province | 1 April 2020 | 2 April 2020 | 3 April 2020 | 4 April 2020 |
|---|---|---|---|---|
| Ancona | 7 | 10 | 6 | 7 |
| Pesaro-Urbino | 10 | 20 | 7 | 14 |
| Fermo | 0 | 0 | 2 | 3 |
| Ascoli Piceno | 0 | 1 | 1 | 0 |
| Macerata | 9 | 0 | 1 | 1 |
| “From other regions” | 0 | 0 | 0 | 0 |
| Marche (from provincial deaths) | 26 | 31 | 17 | 25 |
| Marche (from CPA reporting) | 25 | 26 | 54 | 17 |
Differences between CPA death counts and regional bulletins and local newspapers death counts—Emilia-Romagna region, period 1 April 2020–4 April 2020.
| Province | 1 April 2020 | 2 April 2020 | 3 April 2020 | 4 April 2020 |
|---|---|---|---|---|
| Piacenza | 25 | 19 | 18 | 12 |
| Parma | 24 | 11 | 9 | 25 |
| Reggio-Emilia | 9 | 9 | 14 | 15 |
| Modena | 10 | 18 | 9 | 6 |
| Bologna | 3 | 7 | 31 | 10 |
| Ferrara | 1 | 3 | 3 | 1 |
| Ravenna | 4 | 1 | 0 | 2 |
| Forlì.Cesena | 4 | 3 | 2 | 1 |
| Rimini | 5 | 4 | 4 | 2 |
| “From other regions” | 3 | 4 | 1 | 1 |
| Emilia-Romagna (from provincial deaths) | 85 | 75 | 90 | 74 |
| Emilia-Romagna (from CPA reporting) | 88 | 79 | 91 | 75 |
Figure 2SIRD model compartments and flows.
Figure 3Comparison of index values on 10 September 2020, 10 October, 10 November, 10 December, for provinces where the number of deaths is available. Green pins are for , orange pins for , red pins for and black pins for .
Figure 4Adjusted SIRD model for Catania province—Forecast origin: 20 December 2020.
Figure 5Variables’ prediction intervals based on interval—Torino province starting from 12 October 2020. The shaded region represents the 90% confidence bands, while the dots represent the point predictions. Real values are always within the confidence bands.
MAPE values-DTW aggregation training with and without stringency index.
| Horizon Days | I | I | R | R | D | D |
|---|---|---|---|---|---|---|
| without s.i. | with s.i. | without s.i. | with s.i. | without s.i. | with s.i. | |
| 1 | 2.78 | 2.79 | 1.46 | 1.46 | 1.08 | 1.07 |
| 2 | 5.19 | 5.24 | 2.69 | 2.70 | 1.96 | 1.95 |
| 3 | 7.79 | 7.88 | 4.00 | 4.01 | 2.82 | 2.81 |
| 4 | 10.48 | 10.64 | 5.39 | 5.42 | 3.67 | 3.67 |
| 5 | 13.18 | 13.48 | 6.88 | 6.94 | 4.52 | 4.53 |
| 6 | 15.73 | 16.30 | 8.54 | 8.61 | 5.33 | 5.34 |
| 7 | 17.55 | 18.48 | 9.90 | 10.07 | 6.00 | 6.03 |
| 8 | 19.64 | 20.64 | 11.00 | 11.37 | 6.69 | 6.73 |
| 9 | 21.97 | 22.98 | 11.90 | 12.41 | 7.37 | 7.43 |
| 10 | 24.35 | 25.39 | 12.80 | 13.39 | 8.04 | 8.10 |
| 11 | 26.72 | 27.85 | 13.64 | 14.27 | 8.71 | 8.77 |
| 12 | 29.09 | 30.36 | 14.40 | 15.05 | 9.36 | 9.45 |
| 13 | 31.33 | 32.80 | 15.12 | 15.78 | 10.02 | 10.13 |
| 14 | 33.55 | 35.37 | 15.88 | 16.53 | 10.64 | 10.77 |
MAPE D-Aggregation training without Abruzzo, Basilicata, Lazio and Molise without stringency index.
| Horizon Days | MAPE D |
|---|---|
| 1 | 0.987 |
| 2 | 1.785 |
| 3 | 2.560 |
| 4 | 3.346 |
| 5 | 4.115 |
| 6 | 4.846 |
| 7 | 5.458 |
| 8 | 6.073 |
| 9 | 6.706 |
| 10 | 7.342 |
| 11 | 7.986 |
| 12 | 8.609 |
| 13 | 9.245 |
| 14 | 9.881 |
Figure 6Boxplots of MPE on currently infected.
Figure 7MAPE on weekly predictions over time. The dates on the horizontal axis represent the time window limit on which the training was done. The vertical axis displays the absolute error between real and predicted on the seventh day after that date for each province.
Figure 8Moran’s index computed on I using a row-standardized spatial weight matrix. The lower panel gives the significance levels corresponding to each day of the time window considered, and the horizontal dotted line is the p-value level.
Figure 9SIRD model predictions for Turin province using the STARMA model based on data up to September 10th and November 30th, 2020. (a) September 10th; (b) November 30th.