| Literature DB >> 32868965 |
Luca Bonacini1, Giovanni Gallo1,2, Fabrizio Patriarca1.
Abstract
Identifying structural breaks in the dynamics of COVID-19 contagion is crucial to promptly assess policies and evaluate the effectiveness of lockdown measures. However, official data record infections after a critical and unpredictable delay. Moreover, people react to the health risks of the virus and also anticipate lockdowns. All of this makes it complex to quickly and accurately detect changing patterns in the virus's infection dynamic. We propose a machine learning procedure to identify structural breaks in the time series of COVID-19 cases. We consider the case of Italy, an early-affected country that was unprepared for the situation, and detect the dates of structural breaks induced by three national lockdowns so as to evaluate their effects and identify some related policy issues. The strong but significantly delayed effect of the first lockdown suggests a relevant announcement effect. In contrast, the last lockdown had significantly less impact. The proposed methodology is robust as a real-time procedure for early detection of the structural breaks: the impact of the first two lockdowns could have been correctly identified just the day after they actually occurred. © Springer-Verlag GmbH Germany, part of Springer Nature 2020.Entities:
Keywords: COVID-19; Coronavirus; Lockdown; Machine learning
Year: 2020 PMID: 32868965 PMCID: PMC7449634 DOI: 10.1007/s00148-020-00799-x
Source DB: PubMed Journal: J Popul Econ ISSN: 0933-1433
Fig. 1Daily growth of COVID-19 deaths, hospitalizations, and positive cases at the national. Source: Civil Protection Department (2020). “Positive cases” refers to the overall number of COVID-19 cases, excluding those who died or recovered. The three vertical lines represent the days on which the school lockdown, main lockdown, and business lockdown were introduced, respectively
Fig. 2Google Trends for “Coronavirus Italia” in Italy. Source: Authors’ elaborations from https://trends.google.it
Fig. 3Daily energy consumption in Italy, weekends excluded. Source: Authors’ elaboration from https://www.terna.it
Data and variable descriptions
| Variable | Source (Year of reference) | Definition | Mean | Std. dev. |
|---|---|---|---|---|
| Daily growth in COVID-19 cases | Civil Protection Department ( | Dependent variable Difference between the overall COVID-19 cases at time | 30.07 | 61.97 |
| Number of deaths | Civil Protection Department ( | Number of people deceased with COVID-19 infection at the provincial level. As this information is available at the regional level only, the variable is calculated for each province weighting regional COVID-19 deaths by its share of regional COVID-19 cases. | 93.63 | 271.78 |
| Number of recovered | Civil Protection Department ( | Number of people recovered from COVID-19 infection at the provincial level. As this information is available at the regional level only, the variable is calculated for each province weighting regional COVID-19 recoveries by its share of regional COVID-19 cases. | 156.44 | 452.55 |
| Population density | ISTAT (2019) | Ratio between total provincial population and total surface area (km2) | 270.13 | 380.48 |
| Proximity to a hospital | Ministry of Economic Development (2014) | Share of provincial population living in a municipality with at least one 1st level DEA hospital (i.e., a hospital providing first aid, resuscitation, and general surgery services) | 0.333 | 0.171 |
| Proximity to a railway station | Ministry of Economic Development (2014) | Share of provincial population living in a municipality with at least one silver railway station (i.e., a station with morethan 2500 daily visitors on average) | 0.456 | 0.180 |
| Hospital dismissals by the elderly | ISTAT (2018) | Share of hospital dismissals of people aged 65 or above (average for 2016–2018) at the provincial level | 0.460 | 0.049 |
| Mortality for infectious diseases | ISTAT (2017) | Mortality rate for infectious diseases at the provincial level (× 10,000 inhabitants) | 2.488 | 0.957 |
| High school students | ISTAT (2018) | Share of students attending upper secondary schools at the provincial level out of the total population aged 64 or below | 0.058 | 0.007 |
| University students | ISTAT (2017) | Number of students attending universities at the provincial level out of the total population aged 64 or below | 0.025 | 0.026 |
| Nursing homes | ISTAT (2011) | Number of nursing homes at the provincial level (× 10,000 inhabitants) | 1.129 | 0.638 |
| Unemployment rate | ISTAT (2019) | Unemployment rate among people aged 15–74 at the provincial level | 0.104 | 0.057 |
| Poverty rate | INPS (2018) | Share of households declaring an ISEEa lower than 6000 euros out of the total provincial population of households | 0.072 | 0.039 |
aThe ISEE is an indicator combining household income and wealth and it is generally declared when applying for social benefits. It consists of the sum of household income and 20% of household wealth (in terms of both financial assets and property) divided by an ad hoc equivalence scale. The ISEE equivalence scale is equal to the number of household members raised to the power of 0.65
Fig. 4Akaike information criterion values by model specification and values of the t parameters. a School lockdown (LD1). b Main lockdown (LD2). c Business lockdown (LD3). The LD1 effectiveness day in models illustrated in panel b is set to 17 days after the introduction of LD1. The LD1 and LD2 effectiveness days in models illustrated in panel c are set to, respectively, 17 and 19 days after their introductions
Fig 6Daily swabs performed at the national level. Source: Civil Protection Department (2020)
Detection delay by lockdown and model specification
| Lockdown | Effectiveness delay (number of days from introduction) | ||||
|---|---|---|---|---|---|
| Model 3 | Model 5 | Model 6 | Model 7 | Model 8 | |
| School lockdown (LD1) | 17 | 17 | 17 | 17 | 17 |
| Main lockdown (LD2) | 19 | 21 | 19 | 19 | 19 |
| Business lockdown (LD3) | 10 | 18 | 10 | 10 | 10 |
Unlike Model 3, Model 5 includes a quadratic polynomial of COVID-19 cases at time t-1 and its interactions with lockdowns variables, but there are no time dummies. Model 6 adds time dummies to Model 5. In contrast to Model 3, Model 7 includes the number of COVID-19 deaths and recovered at the regional level instead of the provincial one. Model 8 adds to Model 3 the number of swab tests undertaken at the provincial level. As this information is available at the regional level only, the variable is calculated for each province weighting regional COVID-19 swab tests by its share of regional COVID-19 cases
Fig 7Fitted values of the daily growth in COVID-19 cases at the regional level. Fitted values are based on our best model specification (Model 3)
Effects of the three lockdowns on the daily growth of COVID-19 cases (fixed-effects panel model)
| Variables | Model 1 | Model 2 | Model 3 | Model 4 |
|---|---|---|---|---|
| COVID-19 cases | 0.120*** | 0.117*** | 0.125*** | 0.129*** |
| LD1 * COVID-19 cases | − 0.059*** | − 0.060*** | − 0.058*** | − 0.057*** |
| LD2 * COVID-19 cases | − 0.031*** | − 0.028*** | − 0.027*** | − 0.029*** |
| LD3 * COVID-19 cases | − 0.015*** | − 0.015*** | − 0.012*** | − 0.012*** |
| Number of deaths | 0.011 | 0.021 | ||
| Number of recovered | − 0.052** | − 0.067*** | ||
| Constant | 8.165** | 0.200 | 0.178 | 6.511** |
| Time dummies | No | Yes | Yes | No |
| Observations | 6313 | 6313 | 6313 | 6313 |
| 0.428 | 0.455 | 0.463 | 0.444 | |
| Number of provinces | 107 | 107 | 107 | 107 |
Standard errors are clustered by Italian province. ***p < 0.01, **p < 0.05, *p < 0.1
Lockdown effects on the daily growth in COVID-19 cases by subsample and definition of dependent variable (fixed-effects panel model)
| Variables | Model 3 | Only even days | Only odd days | No Lombard provinces | No provinces listed in the Prime Ministerial Decree of March 8, 2020 | COVID-19 cases per every 10,000 inhabitants | No Lombard provinces and COVID-19 cases per every 10,000 inhabitants | No provinces listed in the Decree of March 8, 2020, and COVID-19 cases per every 10,000 inhabitants |
|---|---|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | (7) | (8) | |
| COVID-19 cases | 0.125*** | 0.129*** | 0.121*** | 0.115*** | 0.126*** | 0.069*** | 0.081*** | 0.113*** |
| LD1 * COVID-19 cases | − 0.058*** | − 0.060*** | − 0.056*** | − 0.055*** | − 0.068*** | − 0.038*** | − 0.052*** | − 0.080*** |
| LD2 * COVID-19 cases | − 0.027*** | − 0.024*** | − 0.029*** | − 0.024*** | − 0.021*** | − 0.019*** | − 0.022*** | − 0.023*** |
| LD3 * COVID-19 cases | − 0.012*** | − 0.012*** | − 0.013*** | − 0.018*** | − 0.025*** | − 0.007** | − 0.015*** | − 0.021*** |
| Number of deaths | 0.011 | − 0.029 | 0.054 | 0.149** | 0.246*** | 0.05 | 0.206** | 0.295** |
| Number of recovered | − 0.052** | − 0.047* | − 0.059*** | − 0.056*** | − 0.056*** | − 0.019 | − 0.018 | − 0.019 |
| Constant | 0.178 | 0.172 | 1.979 | 0.367 | 0.105 | 0.008 | 0.01 | 0.004 |
| Time dummies | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| Observations | 6313 | 3210 | 3103 | 5605 | 4779 | 6313 | 5605 | 4779 |
| 0.463 | 0.461 | 0.475 | 0.391 | 0.410 | 0.250 | 0.241 | 0.210 | |
| Number of provinces | 107 | 107 | 107 | 95 | 81 | 107 | 95 | 81 |
Standard errors are clustered by Italian province. ***p < 0.01, **p < 0.05, *p < 0.1. Column 6 replicates estimates in Model 3 but all COVID-19 cases are considered in relative terms with respect to the provincial population. Specifically, both the dependent variable and the “COVID-19 cases at time t-1” variable are divided by the number of inhabitants at the provincial level and then multiplied by 10,000. Column 7 is the same as Column 6 but replicates the analysis in a subsample excluding 12 Lombard provinces. Column 8 is the same as Column 6 but replicates the analysis in a subsample excluding 26 provinces listed in the Prime Ministerial Decree of the 8th of March 2020.
Interactions of province-level characteristics (infrastructures, local health system, and diseases vulnerability) with lockdowns (fixed-effects panel model)
| Variables | Variable of interest (VoI) | ||||
|---|---|---|---|---|---|
| Population density | Proximity to a hospital | Proximity to a railway station | Hospital discharge of the elderly | Mortality for infectious diseases | |
| COVID-19 cases | 0.097*** | 0.078*** | 0.071*** | 0.411*** | 0.129*** |
| LD1 * COVID-19 cases | − 0.039*** | − 0.036*** | − 0.035** | − 0.196*** | − 0.060*** |
| LD2 * COVID-19 cases | − 0.023*** | − 0.027*** | − 0.030*** | − 0.088*** | − 0.038*** |
| LD3 * COVID-19 cases | − 0.006*** | − 0.007*** | − 0.005** | − 0.072*** | − 0.014*** |
| VoI * COVID-19 cases | 0.014*** | 0.045** | 0.054* | − 0.305*** | − 0.006 |
| VoI * LD1 * COVID-19 cases | − 0.008*** | − 0.024 | − 0.029 | 0.146** | 0.002 |
| VoI * LD2 * COVID-19 cases | − 0.001*** | − 0.001 | 0.003 | 0.065*** | 0.012*** |
| VoI * LD3 * COVID-19 cases | − 0.002*** | − 0.007*** | − 0.010*** | 0.066*** | 0.001 |
| Number of deaths | -0.016 | 0.028 | 0.051* | 0.015 | 0.025 |
| Number of recovered | − 0.063*** | − 0.044** | − 0.046** | − 0.063*** | − 0.055*** |
| Constant | 0.211 | 0.201 | 0.189 | 0.248 | 0.185 |
| Time dummies | Yes | Yes | Yes | Yes | Yes |
| Observations | 6313 | 6313 | 6313 | 6313 | 6313 |
| 0.493 | 0.492 | 0.493 | 0.482 | 0.469 | |
| Number of provinces | 107 | 107 | 107 | 107 | 107 |
Standard errors are clustered by province. All variables of interest are normalized to mean 1 before being interacted with lockdown variables. ***p < 0.01, **p < 0.05, *p < 0.1
Interactions of province-level characteristics (incidence of students, nursing homes, local labor market, and income levels) with lockdowns effects (fixed-effects panel model)
| Variables | Variable of interest (VoI) | ||||
|---|---|---|---|---|---|
| High school students | University students | Nursing homes | Unemployment rate | Poverty rate | |
| COVID-19 cases | 0.482*** | 0.083*** | 0.175*** | 0.110*** | 0.095*** |
| LD1 * COVID-19 cases | − 0.254** | − 0.039*** | − 0.090*** | − 0.058*** | − 0.045** |
| LD2 * COVID-19 cases | − 0.075*** | − 0.025*** | − 0.037*** | − 0.038*** | − 0.037*** |
| LD3 * COVID-19 cases | − 0.056** | − 0.007*** | − 0.018*** | − 0.003 | 0.001 |
| VoI * COVID-19 cases | − 0.390** | 0.032*** | − 0.067*** | 0.016 | 0.042 |
| VoI * LD1 * COVID-19 cases | 0.215** | − 0.018*** | 0.043*** | 0.002 | − 0.020 |
| VoI * LD2 * COVID-19 cases | 0.053* | − 0.002 | 0.013*** | 0.022*** | 0.016*** |
| VoI * LD3 * COVID-19 cases | 0.049* | − 0.005*** | 0.009* | − 0.021*** | − 0.022*** |
| Number of deaths | − 0.005 | 0.022 | 0.019 | 0.028 | 0.016 |
| Number of recovered | − 0.062*** | − 0.047*** | − 0.061*** | − 0.039** | − 0.042** |
| Constant | 0.193 | 0.225 | 0.239 | 0.191 | 0.188 |
| Time dummies | Yes | Yes | Yes | Yes | Yes |
| Observations | 6313 | 6313 | 6313 | 6313 | 6313 |
| 0.479 | 0.497 | 0.476 | 0.477 | 0.471 | |
| Number of provinces | 107 | 107 | 107 | 107 | 107 |
Standard errors are clustered by province. All variables of interest are normalized to mean 1 before being interacted with lockdown variables. ***p < 0.01, **p < 0.05, *p < 0.1
Fig. 5Effects of lockdowns on the daily growth of COVID-19 cases by time series length. Outlined areas represent confidence intervals at the 5% level. “Lagged cases” refers to the COVID-19 cases at time t-1, while “LD1,” “LD2,” and “LD3” stand for the three lockdown interaction terms in Table 1. The three vertical lines represent, respectively, the effectiveness days of the school lockdown, main lockdown, and business lockdown, as shown in Section 5