On the heterogeneous spread of COVID-19 in Chile.

Danton Freire-Flores^1,2, Nyna Llanovarced-Kawles^1,2, Anamaria Sanchez-Daza^2,3, Álvaro Olivera-Nappa^1,2.

Abstract

Non-pharmaceutical interventions (NPIs) have played a crucial role in controlling the spread of COVID-19. Nevertheless, NPI efficacy varies enormously between and within countries, mainly because of population and behavioral heterogeneity. In this work, we adapted a multi-group SEIRA model to study the spreading dynamics of COVID-19 in Chile, representing geographically separated regions of the country by different groups. We used national mobilization statistics to estimate the connectivity between regions and data from governmental repositories to obtain COVID-19 spreading and death rates in each region. We then assessed the effectiveness of different NPIs by studying the temporal evolution of the reproduction number R_t. Analysing data-driven and model-based estimates of R_t, we found a strong coupling between regions, highlighting the necessity of organized and coordinated actions to control the spread of SARS-CoV-2. Finally, we evaluated different scenarios to forecast the evolution of COVID-19 in the most densely populated regions, finding that an early lifting of restrictions will probably lead to new outbreaks.

Keywords: COVID-19; Chile; Epidemiological model; Inverse problems; Reproduction number; SARS-CoV-2; SEIRD model

Year: 2021 PMID: 34149204 PMCID: PMC8196305 DOI: 10.1016/j.chaos.2021.111156

Source DB: PubMed Journal: Chaos Solitons Fractals ISSN: 0960-0779 Impact factor: 5.944

Introduction

The Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) is the seventh reported coronavirus that can infect humans [1], [2]. As a consequence of the fast global spread and severe effects of the infectious disease caused by SARS-CoV-2, COVID-19 was declared a pandemic by the World Health Organization on 11 March 2020, resulting in 214 countries affected to date [3], [4]. After more than one year of living in a pandemic world and despite scientific efforts, effective treatments for COVID-19 are not yet available. Even though several countries are deploying vaccination plans, uncertainties related to vaccine efficacy and uptake suggest the necessity of keeping non-pharmaceutical interventions (NPIs) in place for preventing excess deaths and the emergence of escape variants [5], [6], [7]. Since the pandemic began, researchers have proposed different epidemiological models to evaluate and forecast the evolution of the disease. A significant part of those models derives from the renowned SIR model proposed by Kermack and McKendrick in 1927 [8], which compartmentalizes the population exposed to the virus into the variables susceptible (S), infected (I) and recovered (R), whose interaction determines the evolution of the disease over time. Even though helpful, SIR models rely on several hypotheses that are rather hard to meet [9], most of them related to homogeneity [10]. SIR models assume a “perfect mixing”, where all individuals are equally likely to meet and, from a continuous perspective, fractions of them do so all the time. Individuals are assumed to have the same transition rates, which translates to the same probability of being infected and the same average time to recover. Furthermore, SIR models do not include a latent period, one of the signature characteristics of COVID-19 [11].
Direct extensions of the SIR model include extra compartments to represent variables of epidemiological interest, such as SEIR models, which differentiate exposed individuals that are not yet infectious; SIRD models, which differentiate deaths; and SEIRA models, which consider asymptomatic carriers, among others [12], [13], [14], [15], [16], [17], [18], [19]. Viral spread depends not only on the biological properties of the virus but also on the behavior and susceptibility of the population where it propagates [20]. Therefore, a more realistic analysis of the propagation of the disease in a heterogeneous population usually requires models that include the interaction between the different sub-populations [21], [9], [22]. These sub-populations interact dynamically with each other, and the spreading modes are not necessarily isotropic [23]. Furthermore, our understanding of viral spread is greatly determined by data availability and quality, and the data have been shown to present significant delays [24], weekly modulation [25], and even weekend effects [26], [27]. Chile is an example of a country where typical SIR models would not work: its geopolitical centralization isolates and scatters the different regions, which behave as independent populations with varying rules of interaction [23], [24], [28]. Besides, the profound economic inequality among social classes, as reported by the Gini index of the World Bank [29], constitutes both physical and behavioral anisotropy for the spread of COVID-19. Chile has 16 regions with non-homogeneous connectivity, represented in the modeling by an interaction matrix in which each module can be weighted. This matrix is not necessarily symmetric; it considers the fractions of the communities that effectively interact, and it is modified by the spreading rate (β_i) of each sub-population according to the restriction measures that the government applies to each community, thereby representing the effective interaction in a given period.
In this work, we modified the multi-group SEIRA model presented in [23] to study the COVID-19 spreading dynamics in Chile. We incorporated a compartment for deaths and designed a tailored methodology to fit the parameters governing the dynamics. We incorporated real-world data for estimating connectivity matrices and solved the inverse problem for parameter fitting using data from official sources. We assessed the effectiveness of NPIs by studying how parameters evolved in the different epidemiological periods. We also studied the differences between data-driven and model-based estimates of the reproduction number to evaluate whether the observed dynamics are driven by local behavior or by the coupling with other regions. Using a Monte Carlo-inspired procedure [30], we estimated parameter variability and provided a statistically-based forecast of the pandemic’s evolution in Chile. We also studied three future scenarios assuming different levels of social distancing, which allow us to predict the effect of the various health policies implemented by the government; consequently, authorities can make better decisions to prevent the spreading [31].

Methodology

Overview

We detail the implementation workflow of the proposed SEIRD multi-group model in Fig. 1. We first collect and pre-process official data on the spread of COVID-19 in Chile from local authorities [32]. The interaction and mobility data required for modeling the connectivity between the sixteen regions of Chile are estimated from national statistics [33], [34], [35] and projections for 2020 [36]. We obtained default and initial parameter values for the model from regional reports and the literature. Then, we study the temporal evolution of the spreading and death rates in each region, using a combination of the simulated annealing and gradient descent algorithms to minimize the error between the reported (raw) and simulated curves. Finally, using these values and based on different social distancing measures, we present various possible scenarios for active infected and deceased cases within an established temporal horizon.

Fig. 1

Schematic representation of the inverse problem-solving pipeline.

Interaction dynamics between different regions

We summarize the interaction structure between the different sub-populations considered in the SEIRD model in a connection matrix. In this matrix, the entry (i, j) represents the fraction of individuals from the i-th sub-population that moves to the j-th. Note that the matrix is not necessarily symmetric, as the flux of individuals moving from i to j may differ from the flux moving from j to i. An example of this situation is the migration to centralized regions, to which a considerable fraction of individuals move while there is no considerable migration in the opposite direction. Therefore, the entry (i, j) generally differs from the entry (j, i). In our model, the connection matrix is also modulated by the local spreading rates β_i, thereby including the governmental restrictions valid in the different periods considered. For each pair of regions i and j, we estimated the entry as the fraction of individuals over fifteen years old from region i temporarily moving to region j because of work or studies, based on national mobility data [33], [34], [35], [36]. The displacement matrix comprises two factors. The first represents the fraction of the population that leaves the region, defined as the number of people from region i that travel to another region divided by its total population. The second represents the fraction of those individuals leaving region i that go to region j. Thus, we can estimate each entry as the product of these two factors. Given the geographical characteristics of Chile and its centralization, regions do not have the same level of displacement between them (Table S2).
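The two-factor construction can be sketched in Python. This is a minimal sketch under stated assumptions: the origin-destination table `trips`, the `population` vector, the function name, and the zero diagonal are all hypothetical choices for illustration, and the toy numbers are not the national statistics used in this work.

```python
def connectivity_matrix(trips, population):
    """Return a matrix whose entry (i, j) is the fraction of region i moving to j.

    trips[i][j]   : hypothetical count of residents of i travelling to j (i != j)
    population[i] : total inhabitants of region i
    """
    n = len(population)
    matrix = [[0.0] * n for _ in range(n)]  # diagonal left at zero (assumption)
    for i in range(n):
        leaving = sum(trips[i][j] for j in range(n) if j != i)
        if leaving == 0:
            continue  # nobody leaves region i, so its row stays at zero
        frac_leaving = leaving / population[i]      # first factor
        for j in range(n):
            if j != i:
                frac_to_j = trips[i][j] / leaving   # second factor
                matrix[i][j] = frac_leaving * frac_to_j
    return matrix


# Toy example with three regions; region 0 plays the role of a centralized hub.
trips = [[0, 10, 5],
         [400, 0, 20],
         [300, 30, 0]]
population = [7000, 2000, 1500]
conn = connectivity_matrix(trips, population)
```

In this toy case a fifth of region 1 travels to the hub, while only a small fraction of the hub travels to region 1, so the resulting matrix is asymmetric, as described in the text.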

Model description

Equations

In our model, the index i represents one of the sixteen regions of Chile, and we represent the spreading dynamics of SARS-CoV-2 using differential equations in a SEIRD compartmental model where the temporal scale is measured in days. Susceptible individuals (S) can acquire the virus after an effective contagion from an infected individual and are then moved to the exposed compartment (E). There, they spend, on average, a time 1/ϵ (latent period) until they become infectious (I). After an average infectious period, infectious individuals are moved to a final recovered (R) or death (D) compartment. The description, initial values, and sources for all the parameters in Eqs. (1) to (5) are shown in Table 1.

Table 1

Parameters description and initial values range assignment for the inverse-problem solution.

Parameter	Description	Value or range	Units	Source
α	Asymptomatic ratio of the population	14	%	[32]
βi	Spreading rate of the virus in region i	0.00 – 0.30	days−1	[49]
γ	Recovery rate	1/14	days−1	[50]
ϵ	Latent to infectious transition rate	1/5	days−1	[51], [52], [53]
θi	COVID-19 induced death rate in region i	0.00 – 0.02	days−1	[32]
ξ	Factor of behavioral virulence of asymptomatics	1	-	[54]

Assuming that the natural birth and death rates of the population can be neglected when compared with the COVID-19 induced death rates (which also act on much shorter timescales), the system of differential equations describing the dynamics is:
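A minimal Python sketch of a system of this kind is given here for orientation only. It assumes a standard multi-group SEIRD form, using the parameters of Table 1, and should not be read as the exact Eqs. (1) to (5) of this work: the coupling term, the unit diagonal of the hypothetical matrix `conn`, the omission of a separate asymptomatic class, and the forward-Euler integrator are all assumptions.

```python
def seird_rhs(state, beta, theta, conn, gamma=1 / 14, eps=1 / 5):
    """Time derivatives for n coupled regions; state[i] = [S, E, I, R, D]."""
    n = len(state)
    living = [sum(region) - region[4] for region in state]  # S + E + I + R
    derivs = []
    for i in range(n):
        S, E, I, R, D = state[i]
        # Assumed force of infection: local spreading rate times the
        # connectivity-weighted infectious fraction of every region.
        force = beta[i] * sum(conn[i][j] * state[j][2] / living[j]
                              for j in range(n))
        derivs.append([-force * S,
                       force * S - eps * E,
                       eps * E - (gamma + theta[i]) * I,
                       gamma * I,
                       theta[i] * I])
    return derivs


def simulate(state, beta, theta, conn, days, dt=0.1):
    """Forward-Euler integration (time in days); returns the final state."""
    for _ in range(int(round(days / dt))):
        d = seird_rhs(state, beta, theta, conn)
        state = [[x + dt * dx for x, dx in zip(row, drow)]
                 for row, drow in zip(state, d)]
    return state


# Two hypothetical regions; only the first one starts with infectious cases.
conn = [[1.0, 0.05], [0.20, 1.0]]
start = [[7_000_000.0, 0.0, 10.0, 0.0, 0.0],
         [2_000_000.0, 0.0, 0.0, 0.0, 0.0]]
end = simulate(start, beta=[0.29, 0.11], theta=[0.003, 0.004],
               conn=conn, days=30)
```

Even though the second region starts without cases, the coupling through `conn` seeds an outbreak there, which is the qualitative behavior that a multi-group model is meant to capture.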

Data treatment

The raw data presented are those reported daily by the Government of Chile and made available by the Ministry of Science [32]. We analysed the official daily reports [37], which cover a total of 244 days until November 2nd, 2020. Regional-level daily data start on March 3rd, 2020 and include new daily infected cases, total infected cases, and total deaths (D). However, a reporting inconsistency arose when the official reporting guidelines changed (June 1st), so the datasets contained systematic errors. We corrected those errors as explained in Supplementary Section 3. Using these corrected datasets (available as a supplementary dataset), we calculated total recovered (R) and active infected cases (I), and also defined variables accounting for the experimental time and the number of inhabitants per region. For parameter fitting, we defined fifteen epidemiological periods for the pandemic progression in Chile, whose limits match the relevant governmental measures detailed in Table 2 [38], [39], [40].

Table 2

Epidemiological periods.

Period	2020 dates	Remarkable government measures
I	Mar 3 – Apr 6	3/3 First case reported; 3/15 Mandatory school closure; 3/16 Borders were closed; 3/18 “State of exception*” enacted; 3/21 First deceased person; 3/25 Lockdown in some Metropolitan Region areas
II	Apr 7 – Apr 21	4/19 Government call for a “new normality”
III	Apr 22 – May 6	4/23 Schools were indefinitely closed
IV	May 7 – May 21	5/13 Announcement of lockdown for 90% of Metropolitan Region
V	May 22 – Jun 5	Health Ministry defines new COVID-19 case-reporting criteria: expansion of the dead case criterion; active contagious case since the onset of symptoms; positive PCRs in laboratory reports included as active cases
VI	Jun 6 – Jun 20	Change of Health Minister; 6/16 The “state of exception*” is extended for another 90 days; 6/17 31,412 previously dismissed cases were integrated into the official count; 6/18 Previously unnotified cases were added to the daily infections list due to an inclusion criterion change
VII	Jun 21 – Jul 5	Chile reached sixth place in the total number of confirmed cases worldwide; 7/4 Some communes of the Metropolitan Region completed 100 days in quarantine
VIII	Jul 6 – Jul 20	7/19 Announcement and enactment of the “Step by Step” lockdown release plan
IX	Jul 21 – Aug 4	7/28 Lockdown release for districts in the Metropolitan and Valparaíso Regions
X	Aug 6 – Aug 19	7/28 Transition step area broadened in the Metropolitan Region
XI	Aug 20 – Sep 3	8/28 Several districts of Bío-Bío and Maule Regions went back to lockdown
XII	Sep 4 – Sep 18	Communes of O’Higgins, Magallanes Regions, among others, move back to lockdown
XIII	Sep 19 – Oct 3	The state of exception due to catastrophe will remain in force by a presidential mandate for the next 79 days
XIV	Oct 4 – Oct 18	First schools reopening
XV	Oct 19 – Nov 2	10/25 Chile’s 2020 National Plebiscite

* State of exception: in which the government may transcend the rule of law in the name of the public good.

Initial guess and nested problems

The set of initial values for the parameters (Supplementary Table S3) is expected to reflect the behavior of the different population variables (SEIRD) in the different epidemiological periods, according to the confinement measures established by the government in every region. We perform a separate parameter fitting for every epidemiological period; the procedure could be extended to a higher number of periods if necessary. After obtaining the simulated curves for the SEIRD variables using the values in Table S3, we define which parameters will be estimated by the simulated annealing algorithm before proceeding to the formal parameter fitting. We considered the percentage of asymptomatic patients to be constant across regions, based on governmental reports [37]. Recovery and exposure rates are considered uniform at the country level, as the Chilean health system has not been overwhelmed to date. Therefore, the parameter fitting yields the local spreading and death rates (β_i and θ_i, respectively). We also leave the initial conditions of the first epidemiological period free, to be determined in the parameter fitting procedure. We numerically obtain the solution of the SEIRD model (Eq. (1)) for every parameter combination evaluated throughout the fitting process. Subsequently, we quantify a measure proportional to the mean squared error (MSE) between the simulated curves and the raw data reported for each region, combined in the cost functional that is minimized in each epidemiological period. The optimization algorithm selected was a combination of simulated annealing and gradient descent. Finally, the initial guess for the initial condition of the SEIRD variables in each region was defined as follows: the initial number of susceptible people (S) is taken as the total population of the region, while the remaining compartments start at zero, except in the Maule Region, which received non-zero initial values because the first COVID-19 case in Chile was reported there.
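The combination of simulated annealing and gradient descent can be illustrated with a small, self-contained Python sketch. It is not the MATLAB implementation used in this work: the cooling schedule, step sizes, learning rate, finite-difference gradient, and the quadratic `toy_cost` that stands in for the real cost functional are all assumptions chosen for brevity.

```python
import math
import random


def clip(value, bound):
    """Keep a parameter inside its (low, high) search range."""
    low, high = bound
    return min(max(value, low), high)


def anneal_then_descend(cost, x0, bounds, n_anneal=2000, n_descent=200,
                        temp0=0.01, step=0.05, lr=0.1, seed=0):
    rng = random.Random(seed)
    x, fx = list(x0), cost(x0)
    best, fbest = list(x), fx
    # Stage 1: simulated annealing explores the bounded parameter space.
    for k in range(n_anneal):
        temp = temp0 * (1 - k / n_anneal) + 1e-12   # linear cooling
        cand = [clip(v + rng.gauss(0, step * (b[1] - b[0])), b)
                for v, b in zip(x, bounds)]
        fc = cost(cand)
        if fc < fx or rng.random() < math.exp(-(fc - fx) / temp):
            x, fx = cand, fc
            if fx < fbest:
                best, fbest = list(x), fx
    # Stage 2: gradient descent with central differences refines the best point.
    x, h = best, 1e-6
    for _ in range(n_descent):
        grad = []
        for i in range(len(x)):
            up, down = list(x), list(x)
            up[i] += h
            down[i] -= h
            grad.append((cost(up) - cost(down)) / (2 * h))
        x = [clip(v - lr * g, b) for v, g, b in zip(x, grad, bounds)]
    return x


def toy_cost(p):
    """Hypothetical stand-in for the cost functional, minimal at (0.14, 0.0024)."""
    return (p[0] - 0.14) ** 2 + (p[1] - 0.0024) ** 2


# Search ranges follow Table 1: beta in [0, 0.30], theta in [0, 0.02].
beta_fit, theta_fit = anneal_then_descend(
    toy_cost, [0.30, 0.02], [(0.0, 0.30), (0.0, 0.02)])
```

The annealing stage is meant to avoid poor local minima, while the descent stage sharpens the estimate once a good basin has been found.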

Parameter fitting strategy: resolution of inverse problem

We determined the set of parameters that best describes the observed national dynamics (with regional resolution) by minimizing a cost functional for each of the epidemiological periods described in Table 2. This functional accounts for the total MSE (mean squared error) between the simulated numbers of infected (I), deceased (D), and recovered (R) cases and the raw data obtained from official repositories [37]. Structurally, the functional includes these three contributions. Because of differences in the number of inhabitants across regions, without an explicit correction the error contributed by a small region would be less than that contributed by a large region. To avoid unrealistic solutions in which small regions are left aside because of the algorithm’s blind drive to minimize the error, we included a weighting factor. Each contribution of each region was weighted by a factor directly proportional to the ratio between the total infected, recovered, or deceased calculated cases for the whole country on the last day of the epidemiological period and the respective number of infected, recovered, or deceased calculated cases in the region on the same day. The 1.7 exponent in this weighting factor was set empirically to balance the contributions of large and small regions, and a correction factor connects the different contributions in the functional. Given the difference in magnitude between deceased cases and active infected and recovered cases, the correction factor for deaths differs from the one used for infected and recovered cases. We then find the optimal parameters for the fit as the argument of a minimization problem, numerically minimizing the value of the functional for each epidemiological period. Note that each epidemiological period described in Table 2 has at least 14 daily data points for infections, recoveries, and deaths, thus constituting a total of at least 42 measures per region and period. On the other hand, we only have to fit two parameters per region and per epidemiological period (namely, β_i and θ_i), which limits the risk of overfitting.
The implementation of the mathematical model and resolution of the parameter-fitting inverse problem was performed in Matlab version R2018a.
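The weighted cost functional can be sketched as follows. This is a hedged Python illustration rather than the MATLAB code of this work: the dictionary layout, the function name, the default correction factors `corr`, and the use of the observed last-day counts in the weights are assumptions; only the three MSE contributions and the 1.7 exponent come from the description above.

```python
import copy


def weighted_cost(sim, data, corr=None, exponent=1.7):
    """Weighted sum of MSEs; sim/data map region -> {"I": [...], "R": [...], "D": [...]}."""
    corr = corr or {"I": 1.0, "R": 1.0, "D": 1.0}  # hypothetical correction factors
    total = 0.0
    for var in ("I", "R", "D"):
        # Country-wide count on the last day of the period.
        national_last = sum(series[var][-1] for series in data.values())
        for region, series in data.items():
            obs, mod = series[var], sim[region][var]
            mse = sum((m - o) ** 2 for m, o in zip(mod, obs)) / len(obs)
            regional_last = max(obs[-1], 1.0)      # guard against division by zero
            weight = (national_last / regional_last) ** exponent
            total += corr[var] * weight * mse
    return total


# Toy data: the same absolute error of one case, placed in a small or a large region.
data = {
    "big": {"I": [100.0, 200.0], "R": [50.0, 80.0], "D": [5.0, 8.0]},
    "small": {"I": [10.0, 20.0], "R": [5.0, 8.0], "D": [1.0, 2.0]},
}
sim_small_off = copy.deepcopy(data)
sim_small_off["small"]["I"][1] += 1
sim_big_off = copy.deepcopy(data)
sim_big_off["big"]["I"][1] += 1
cost_small_off = weighted_cost(sim_small_off, data)
cost_big_off = weighted_cost(sim_big_off, data)
```

With these weights the same absolute error costs more in the small region than in the large one, which keeps the optimizer from neglecting small regions.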

Variability assessment and forecasting

To obtain reliable values for the selected parameters, we performed Monte Carlo simulations [30] according to the methodology described in Contreras et al. [41]. In these simulations, we add white noise to the input data and fit the parameters using this mildly noisy signal. In doing so, we aim to minimize the contribution of potential errors underlying the data. This experiment therefore also serves as a sensitivity analysis of the method with respect to the data. The inverse problem for curve fitting is solved individually for each noisy realization, resulting in distributions of parameters. From those distributions, we statistically obtained average population parameters and their variability. Once we obtained values for both parameters and their uncertainty, we evaluated different forecast scenarios for the evolution of the COVID-19 pandemic in Chile.
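The noise-and-refit procedure can be sketched in Python as follows. This is an illustrative sketch only: `fit_growth_rate` is a toy stand-in for the full inverse problem, and the number of runs, the 5% multiplicative noise level, and the percentile-based interval are assumptions that are not taken from this work.

```python
import math
import random
import statistics


def fit_growth_rate(series):
    """Toy stand-in for the inverse problem: least-squares slope of log(cases)."""
    n = len(series)
    logs = [math.log(max(v, 1e-9)) for v in series]
    t_mean = (n - 1) / 2
    l_mean = sum(logs) / n
    num = sum((t - t_mean) * (l - l_mean) for t, l in enumerate(logs))
    den = sum((t - t_mean) ** 2 for t in range(n))
    return num / den


def monte_carlo_fit(series, fit, n_runs=500, rel_noise=0.05, seed=1):
    """Refit noisy copies of the data; return the median and a 95% interval."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_runs):
        noisy = [v * (1 + rng.gauss(0, rel_noise)) for v in series]
        estimates.append(fit(noisy))
    estimates.sort()
    low = estimates[int(0.025 * (n_runs - 1))]
    high = estimates[int(0.975 * (n_runs - 1))]
    return statistics.median(estimates), (low, high)


# Synthetic 15-day series growing at 0.1 per day.
series = [100 * math.exp(0.1 * t) for t in range(15)]
median_rate, (ci_low, ci_high) = monte_carlo_fit(series, fit_growth_rate)
```

The spread of the refitted values is what feeds the confidence intervals reported for each parameter; a narrow interval indicates that the fit is insensitive to mild noise in the data.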

Results and discussion

Monte Carlo simulations

From the Monte Carlo multiple-simulation experiment, we obtained distributions for both the β and θ parameters in each region and subsequently used them to numerically solve our model. Here we focus on active infected cases (I) and total deaths (D), since we believe these are of greater importance for policymakers. Tables 3 and 4 show the median values obtained for β and θ, respectively, with a 95% confidence interval, in the Metropolitan, Valparaíso and Bío-Bío Regions in each epidemiological period. Notably, these regions concentrate the largest number of inhabitants in the northern, central, and southern zones of the country, adding up to almost 60% of the total population of Chile. We observe that the highest values of β occurred in the first epidemiological period, when no social distancing measures had been established, followed by the IV epidemiological period for the Metropolitan, Valparaíso, and Bío-Bío Regions, approximately three weeks after the government announced a “safe return to work” or “the new normality” in the Metropolitan Region. We observe a change point in the general population behavior after the announcement, which was delayed to the IV epidemiological period because of the disease timeline (latency, incubation, recovery time) and significant delays in testing [24]. A higher rate of infections subsequently spread from the Metropolitan Region to the regions with the highest rate of transfers: Valparaíso and Bío-Bío. In each epidemiological period, β values do not present a high variability, which is reflected in the narrow 95% confidence intervals. The periods with higher variability correspond to those with higher β values, in which the number of infections surged and the fitting became more challenging.

Table 3

β median values with 95% confidence intervals for each epidemiological period for the Metropolitan, Valparaíso, and Bío-Bío Regions.

Epid. Period	Valparaíso		Metropolitan		Bío-Bío
	Median	CI	Median	CI	Median	CI
I	0.98	[0.95–0.98]	1.03	[1.01–1.04]	1.10	[1.06–1.11]
II	0.05	[0.05–0.05]	0.14	[0.14–0.15]	0.01	[0.01–0.01]
III	0.11	[0.11–0.12]	0.29	[0.29–0.29]	0.01	[0.01–0.01]
IV	0.21	[0.21–0.22]	0.19	[0.19–0.20]	0.26	[0.25–0.28]
V	0.10	[0.10–0.10]	0.14	[0.14–0.15]	0.14	[0.14–0.15]
VI	0.09	[0.09–0.09]	0.06	[0.06–0.06]	0.08	[0.07–0.08]
VII	0.05	[0.05–0.05]	0.05	[0.05–0.05]	0.14	[0.14–0.15]
VIII	0.05	[0.05–0.06]	0.01	[0.01–0.01]	0.04	[0.04–0.04]
IX	0.05	[0.05–0.05]	0.02	[0.02–0.02]	0.06	[0.06–0.06]
X	0.07	[0.07–0.07]	0.12	[0.12–0.13]	0.10	[0.10–0.11]
XI	0.09	[0.09–0.09]	0.01	[0.01–0.01]	0.10	[0.10–0.12]
XII	0.06	[0.06–0.07]	0.08	[0.08–0.09]	0.05	[0.05–0.05]
XIII	0.03	[0.03–0.03]	0.03	[0.03–0.04]	0.07	[0.07–0.07]
XIV	0.06	[0.06–0.06]	0.08	[0.08–0.08]	0.07	[0.07–0.07]
XV	0.05	[0.05–0.05]	0.07	[0.07–0.07]	0.10	[0.10–0.10]

Table 4

θ median values with 95% confidence intervals for each epidemiological period for the Metropolitan, Valparaíso, and Bío-Bío Regions.

Epid. Period	Valparaíso		Metropolitan		Bío-Bío
	Median	CI	Median	CI	Median	CI
I	0.0029	[0.0027–0.0033]	0.0031	[0.0027–0.0033]	0.0031	[0.0027–0.0033]
II	0.0024	[0.0023–0.0027]	0.0024	[0.0023–0.0027]	0.0010	[0.0009–0.0012]
III	0.0039	[0.0036–0.0044]	0.0033	[0.0028–0.0033]	0.0020	[0.0018–0.0022]
IV	0.0028	[0.0027–0.0033]	0.0027	[0.0027–0.0027]	0.0012	[0.0009–0.0014]
V	0.0019	[0.0018–0.0022]	0.0018	[0.0018–0.0018]	0.0020	[0.0018–0.0022]
VI	0.0018	[0.0018–0.0022]	0.0018	[0.0018–0.0018]	0.0008	[0.0005–0.0008]
VII	0.0019	[0.0018–0.0022]	0.0027	[0.0027–0.0027]	0.0010	[0.0009–0.0014]
VIII	0.0018	[0.0018–0.0022]	0.0027	[0.0027–0.0027]	0.0009	[0.0009–0.0009]
IX	0.0028	[0.0027–0.0033]	0.0027	[0.0027–0.0031]	0.0009	[0.0009–0.0012]
X	0.0020	[0.0018–0.0022]	0.0018	[0.0018–0.0022]	0.0009	[0.0009–0.0013]
XI	0.0030	[0.0027–0.0033]	0.0018	[0.0018–0.0022]	0.0009	[0.0009–0.0014]
XII	0.0020	[0.0018–0.0022]	0.0018	[0.0018–0.0022]	0.0009	[0.0009–0.0014]
XIII	0.0020	[0.0018–0.0022]	0.0019	[0.0018–0.0022]	0.0019	[0.0018–0.0022]
XIV	0.0031	[0.0027–0.0033]	0.0020	[0.0018–0.0021]	0.0012	[0.0009–0.0014]
XV	0.0032	[0.0027–0.0033]	0.0019	[0.0018–0.0022]	0.0021	[0.0018–0.0022]

These analyses are extended to each of the 13 remaining regions of the country, reporting median values of β and θ for each region and epidemiological period (Supplementary Tables S4 and S5). Raw data (parameter values for each realization of the Monte Carlo simulation) for all regions and epidemiological periods are reported as separate Supplementary Files for β and θ. The death rate θ should not be mistaken for the Infection Fatality Ratio (IFR), as the latter represents the fraction of individuals who die after being infected. If we focus on the infectious compartment I, the transition rates to recovery or death are given by γ and θ, with γ being at least one order of magnitude larger than θ (cf. Table 4). Estimating the IFR from these two rates, and using the inferred values of θ for all epidemiological periods and regions, we obtain a median IFR of 1.56%, which agrees well with official data [42]. A more detailed analysis is provided in the Supplementary Materials, Section S4. The NPI agenda of the government was different for each region, and the main criterion for enacting NPIs was the number of reported new cases. We observe that the spreading rate remained high in some regions during the first epidemiological periods before decreasing, because of the time required for the effects of lockdowns to become evident and the delays associated with disease progression. In those regions reporting the first COVID-19 cases in Chile, the spreading rate decreased faster because of the earlier establishment of NPIs. An example is the O’Higgins Region, which has the lowest average spreading rate over the fifteen epidemiological periods.
The death rate θ remained relatively low in all regions, since the country-level hospital capacity for the correct care of critical patients was not exceeded, and also because of the efficient inter-regional transfer of patients.
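As a worked illustration of how a death rate relates to a fatality ratio, the short Python sketch below assumes that the IFR is the fraction of exits from the infectious compartment that occur through death, that is, θ/(γ + θ). This competing-rates form is an assumption made here for illustration; the exact estimator used in this work is detailed in the Supplementary Materials and may differ.

```python
GAMMA = 1 / 14  # recovery rate from Table 1, in days^-1


def ifr_from_rates(theta, gamma=GAMMA):
    """Assumed form: share of infectious individuals that leave through death."""
    return theta / (gamma + theta)


# Metropolitan Region, period V (Table 4): theta = 0.0018 days^-1.
ifr_example = ifr_from_rates(0.0018)
print(f"IFR for theta = 0.0018: {100 * ifr_example:.2f}%")
```

For this single value of θ the sketch gives about 2.5%, which is of the same order as the 1.56% median reported over all regions and periods, and it shows why a daily rate of about 0.002 must not be read directly as a 0.2% fatality ratio.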

COVID-19 regional spreading dynamics

Using the different sets of parameters obtained in the Monte Carlo experiment, we project the simulated curves for active infected cases (I) and death cases (D) in each region over time, with two confidence levels (60% and 95%). Fig. 2 shows both the simulations and the raw data for the Metropolitan, Valparaíso, and Bío-Bío Regions.

Fig. 2

Simulated curves of active infected (top) and deceased (bottom) cases for the Valparaíso (A,B), Metropolitan (C,D) and Bío-Bío (E,F) Regions, along with 60% and 95% confidence intervals. Raw data for the evaluated time-frame are also presented (grey dots).

We observe that the simulated curves are in good agreement with the raw data, which always remain within the confidence intervals and present consistent trends. The color intensity of the shaded zones depends on the confidence level (95% and 60%, light and dark shadows, respectively), calculated from the β and θ distributions obtained in the Monte Carlo experiment. We observe a peak in the active cases in the Metropolitan Region between June and July, which corresponds to an increase in the number of death cases in the same period. Raw data for the active infected cases in the Valparaíso and Bío-Bío Regions have two peaks. The first one coincides with the date when active cases in the Metropolitan Region were at their peak: because of the high number of transfers between these regions, these active cases act as vectors of infection in the destination regions, thus increasing the number of infected and, therefore, deceased cases. In contrast, the second one is related to the fact that the measures adopted by the government are sectorized by region. In the Valparaíso and Bío-Bío Regions, the isolation measures were relaxed earlier than in the Metropolitan Region, resulting in a second increase in active infected cases. Curves for the other regions are presented in the supplementary material (Supplementary Figures 1 to 13). As the time-frame set for each epidemiological period does not necessarily match the timing of the different measures enacted in each region, some parts of the raw data drift from the median in certain epidemiological periods. Thus, it is crucial to consider the actual spreading dynamics between geographical regions to represent the scenario in each region more appropriately.
This drift could also be reduced by further shortening the number of days per period, but at the risk of overfitting to the raw data. The proposed SEIRD model was able to fit the data well in both low-population and heavily populated regions, showing the relevance of the 1.7 exponent described in the parameter fitting strategy section. Higher values for this exponent result in a better fit in the small regions, but at the cost of a less rigorous fit in the larger regions. In contrast, with values lower than 1.7, the opposite occurs. In Fig. 3, we present the results of a sensitivity analysis of the fitted set to perturbations in the parameters obtained for the last epidemiological period.

Fig. 3

Sensitivity analysis for the last epidemiological period. The red scale represents the variations in β values, from the first region (red) to the last (yellow), while the blue scale represents the variations in θ values, from the first region (blue) to the last one (cyan). (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

We observe that, when making these variations in each of the 32 parameters obtained (16 β and 16 θ), the calculated error increases, thus confirming that the parameters obtained by solving the inverse problem effectively minimize the error between the simulated curves and the experimental ones.
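A one-at-a-time perturbation check of this kind can be sketched in Python. The 10% relative change, the function names, and the quadratic `toy_cost` used in place of the real cost functional are hypothetical choices for illustration and are not taken from this work.

```python
def sensitivity(cost, params, rel_change=0.1):
    """Perturb each parameter up and down and report the change in the cost."""
    base = cost(params)
    report = []
    for i, value in enumerate(params):
        for sign in (-1, 1):
            perturbed = list(params)
            perturbed[i] = value * (1 + sign * rel_change)
            report.append((i, sign * rel_change, cost(perturbed) - base))
    return report


def toy_cost(p):
    """Hypothetical cost with its minimum at beta = 0.14, theta = 0.0024."""
    return (p[0] - 0.14) ** 2 + (p[1] - 0.0024) ** 2


report = sensitivity(toy_cost, [0.14, 0.0024])
```

If every entry of `report` shows a positive change, the fitted point is at least a local minimum along each parameter axis, which is the property the sensitivity figure is meant to confirm.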

On the values of R_t in coordinated government measures

The Effective Reproduction Number R_t represents the number of persons a single infected individual might infect in a population that is aware of the disease. It thus reflects the spread rate of the virus, and it varies depending on the policies implemented by the government, such as quarantines [43]. Based on the parameters obtained from the simulations, it is possible to calculate the value of R_t as the ratio between the spreading rate β and the recovery plus death rate (γ + θ), as demonstrated in Cintrón-Arias et al. [44]. This approach allows us to obtain the effective reproduction number driven only by the local population’s behavior, decoupled from other regions: R_t = β_i / (γ + θ_i). We obtain a data-driven value for the observed R_t by adapting the methodology presented in Contreras et al. [45] and Medina-Ortiz et al. [46]. The two values do not necessarily need to match, as they represent slightly different quantities. The observed R_t indicates whether the overall trend is the spread or the containment of the disease, is purely driven by data, and is affected by governmental testing and tracing plans [47]. More extensive testing will uncover unnoticed infection chains and will also increase the reported numbers, as the uncovering would be faster than the spread of the disease. On the other hand, the effective reproduction number accounts for local trends in contagion, regardless of whether those cases would be noticed or remain undetected throughout the disease timeline. Fig. 4 presents the R_t values obtained from official sources and from simulation results in the Metropolitan, Valparaíso, and Bío-Bío Regions, together with a 95% confidence interval. As shown, both values reflect a similar behavior. However, they are not identical, probably because the simulated and raw data differ due to the contribution of inter-region movements to the infection rate, which agrees with the results obtained when solving the inverse problem.
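The local, model-based estimate described above is a simple ratio, sketched here in Python. The function name is hypothetical; the input values are the Metropolitan Region medians from Tables 3 and 4, and γ is the recovery rate from Table 1.

```python
def local_rt(beta, theta, gamma=1 / 14):
    """Model-based R_t: spreading rate over the recovery plus death rate."""
    return beta / (gamma + theta)


# Metropolitan Region medians (Tables 3 and 4).
rt_period_iii = local_rt(beta=0.29, theta=0.0033)    # before the wide lockdown
rt_period_viii = local_rt(beta=0.01, theta=0.0027)   # during strict measures
print(round(rt_period_iii, 2), round(rt_period_viii, 2))
```

Under this definition the threshold R_t = 1 corresponds to β = γ + θ, roughly 0.07 per day for the values in Table 1, so periods with a fitted β well below that value indicate local containment.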

Fig. 4

R_t values calculated from raw data (red) and simulated data (blue) for the Valparaíso (A), Metropolitan (B) and Bío-Bío (C) Regions with 60% and 95% confidence intervals. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

These results for R_t show the correlation between the imposed social distancing measures and the values of the Effective Reproduction Number in the different regions. This value is higher in the III epidemiological period for the three regions presented and subsequently decreases as more restrictive measures are declared. The case of the Magallanes Region (Figure 26, Supplementary Material) is of particular interest: in the absence of proper coordination around social distancing measures, a high R_t is observed in mid-August, followed by a second outbreak of active infected cases that is more significant than the first one. Projecting these values into the future, a decrease in the virus spreading rate and a reduction in the number of active infected cases would be expected, since R_t < 1 implies a slow rate of spread and an exponentially decaying outbreak size.

Model forecasting

To evaluate the impact of governmental interventions, we simulated different scenarios to project the contagion trends that would have been observed had those interventions not taken place. Fig. 5 presents projections of infected cases in the Metropolitan Region if no further restrictions had been imposed after epidemiological periods II and III.

Fig. 5

Different simulated scenarios for active infected cases in the Metropolitan Region. A: Scenario I (cyan curve) corresponds to establishing a total quarantine from the first epidemiological period. In contrast, scenarios II (orange curve) and III (purple curve) represent the cases where no stricter social distancing measures were taken as of periods II and III, respectively. The vertical dashed red lines indicate the dates of periods I, II and III. Panels B and C show panel A at different scales to visualize the scope of the curves. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Because containment measures must be established as soon as possible to avoid excessive spread of the virus, the first three epidemiological periods were chosen to evaluate the immediate-quarantine scenario (I) and the scenarios without further measures (II and III). In scenario I, the number of active infected cases peaks at approximately 1500 (Fig. 5C), 30 days after the first reported case. The spreading rate $\beta$ associated with this scenario (Table 3) is 0.05, the value registered in epidemiological period VII, during which the Metropolitan Region was in total quarantine. In scenarios II and III, by contrast, active infections peak at over a million cases between days 150 and 200 (Fig. 5B). Scenario III yields the higher number of infections, mainly because of two factors: the number of active infected cases at the beginning of the period and the spreading rate, both of which are higher in the third period than in the second. The spreading rate $\beta$ (Table 3) is 0.29 in the third period and 0.14 in the second, and a higher initial number of active infections facilitates the spread of the virus when the spreading rate is high, which explains the behavior of the scenario II and III curves.
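The qualitative effect of the spreading rate on the epidemic peak can be sketched with a single-group SEIRD model integrated by forward Euler. This is a deliberately simplified stand-in for the multi-group model used in this work: it ignores inter-region coupling and the asymptomatic class, and the population size, initial conditions, and the rates `epsilon`, `gamma` and `theta` are hypothetical. Only the three spreading rates come from the text, so the sketch is not expected to reproduce the peak sizes or timings reported in Fig. 5.

```python
def simulate_seird(beta, days, n=7_000_000, i0=100,
                   epsilon=0.2, gamma=0.1, theta=0.002, dt=0.1):
    """Forward-Euler integration of a single-group SEIRD model.

    Returns the final state (s, e, i, r, d) and the list of active
    infections sampled once per day.
    """
    s, e, i, r, d = float(n - i0), 0.0, float(i0), 0.0, 0.0
    steps_per_day = round(1 / dt)
    active = [i]
    for _ in range(days):
        for _ in range(steps_per_day):
            new_exposed = beta * s * i / n   # S -> E
            new_infected = epsilon * e       # E -> I
            recovered = gamma * i            # I -> R
            deaths = theta * i               # I -> D
            s -= dt * new_exposed
            e += dt * (new_exposed - new_infected)
            i += dt * (new_infected - recovered - deaths)
            r += dt * recovered
            d += dt * deaths
        active.append(i)
    return (s, e, i, r, d), active


# Spreading rates quoted in the text for scenarios I, II and III.
scenario_betas = {"I": 0.05, "II": 0.14, "III": 0.29}

peaks = {}
for name, beta in scenario_betas.items():
    _, active = simulate_seird(beta, days=1000)
    peaks[name] = max(active)
    print(f"scenario {name}: peak of active infections = {peaks[name]:,.0f}")
```

Even in this reduced setting, the peak grows steeply with the spreading rate, while the quarantine-level rate never exceeds the initial number of cases under the assumed removal rates.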
Using the same strategy, we analysed different future scenarios based on the projected restrictions for each region. On July 19th, the Government of Chile announced the beginning of the “Step by Step” plan to lift the current social distancing measures gradually. This plan involved the gradual opening of the different districts and regions of the country based on the contagion rate in each one, the occupancy of ICU (intensive care unit) beds in hospitals, and the PCR test positivity rate, among other criteria. The steps are:

1. Quarantine: People cannot leave their homes.
2. Transition: People are allowed to leave their homes with restrictions, on weekdays only.
3. Preparation: Individuals are free to move, but group gatherings are not permitted.
4. Initial Opening: Group gatherings are permitted, with a restricted number of people.
5. Advanced Opening: Free group gatherings are permitted.

Each region that enters a new stage of the plan gains greater freedom of movement and, therefore, the number and intensity of contacts between people (and thereby the spreading rate $\beta_i$) could increase. Consequently, a weighting factor is assigned to each stage of the plan. This factor multiplies the $\beta_i$ value of the corresponding region to project the effects that the “Step by Step” program will have on the extrapolation of the SEIRD variables (Table S6) [48]. Figure 6 presents three different scenarios projected for the Metropolitan Region. These scenarios are projected in a time window until the end of 2020:

Fig. 6

Different scenarios proposed for the Metropolitan Region. The first scenario (A) represents the evolution of the active infected (top) and death (bottom) cases maintaining a lockdown from November 2nd (vertical red line) until the end of 2020, while the second (B) and third scenario (C) represent the current situation and a massive reopening respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Scenario 1: The Metropolitan Region remains in stage 1 (Quarantine).
Scenario 2: Current scenario. The Metropolitan Region advances to stage 4 (Initial Opening).
Scenario 3: Limit scenario, in which the spreading rate increases to such an extent that large outbreaks and a second wave of massive infections occur.

Based on the projections, we would expect a sustained decrease in case numbers if the quarantine regime continues. In the current scenario (Initial Opening phase), however, a second peak in contagion is expected. This wave would be smaller than the one observed in the first half of 2020, and the health system would therefore be able to handle it. Finally, in the third scenario (where a massive opening is proposed), we observe a more pronounced increase in case numbers, which will eventually exceed the number of infections observed at the beginning of the pandemic. Notably, this is more or less the current (March 2021) national trend (see, e.g., [4]). In this scenario, it is difficult to predict the occupancy of ICU beds in hospitals or the demand for mechanical respirators. Nonetheless, we would expect a saturation of the public health system.
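The stage-dependent weighting of the spreading rate used for these projections can be sketched as follows. The weights below are hypothetical placeholders (the actual factors are those of Table S6, which is not reproduced here), and the names `STAGE_WEIGHTS` and `projected_beta` are likewise illustrative. The base rate of 0.05 is the value quoted in this article for the Metropolitan Region under total quarantine.

```python
# Hypothetical weighting factors per stage of the "Step by Step" plan.
# These are NOT the values of Table S6; they only illustrate the mechanism.
STAGE_WEIGHTS = {
    "Quarantine": 1.0,
    "Transition": 1.2,
    "Preparation": 1.5,
    "Initial Opening": 1.8,
    "Advanced Opening": 2.2,
}


def projected_beta(beta_region, stage):
    """Scale a region's spreading rate by the weight of its current stage."""
    return beta_region * STAGE_WEIGHTS[stage]


# Spreading rate quoted in the text for the Metropolitan Region in total quarantine.
BETA_QUARANTINE = 0.05

projections = [projected_beta(BETA_QUARANTINE, stage) for stage in STAGE_WEIGHTS]
for stage, beta in zip(STAGE_WEIGHTS, projections):
    print(f"{stage:>16}: projected beta = {beta:.3f}")
```

The projected rate for each stage would then replace the fitted spreading rate when extrapolating the SEIRD variables for that region.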

Conclusions

COVID-19 spreading dynamics depend on multiple factors, including the biological and epidemiological aspects of SARS-CoV-2, human behavior, governmental interventions, and heterogeneities among the affected population. Several research groups have statistically analysed these factors during the progression of the pandemic, and we carefully included them in our modelling. The multi-group SEIRD model used here considers the heterogeneous distribution and dynamic displacement of the Chilean population, grouped into 16 regions, as well as a timeline of the different NPIs enacted by the government. The presented results show that our model fits the raw data efficiently, overcoming challenges such as discontinuities and high variability, and generates simulations that are easy to interpret and project with narrow confidence intervals. The measures imposed on a particular region can affect other regions to a greater or lesser extent, depending on their interaction, which highlights the need for coordinated governmental actions to control the spread of COVID-19. We have shown that the multi-group SEIRD model presented in this work is a useful tool to represent the contribution of each region in a heterogeneously populated country and is helpful for forecasting the short-term evolution of the different population groups.

CRediT authorship contribution statement

Danton Freire-Flores: Conceptualization, Methodology, Software, Formal analysis, Investigation, Writing - original draft, Writing - review & editing. Nyna Llanovarced-Kawles: Conceptualization, Formal analysis, Investigation, Data curation, Writing - original draft, Visualization, Writing - review & editing. Anamaria Sanchez-Daza: Conceptualization, Writing - original draft, Writing - review & editing, Project administration. Álvaro Olivera-Nappa: Project administration, Funding acquisition, Writing - review & editing.

Declaration of Competing Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

$X$	Arbitrary variable for representing a generic fraction
$n_i$	Base number of members of class $i$
$n_i^T$	Effective number of members of class $i$
$\alpha$	Asymptomatic ratio of the population
$\xi$	Extra factor of behavioral virulence of asymptomatic patients
$\Phi_{ij}$	Fraction of class $i$ in class $j$
$p_i$	Immunity ratio of newborns of class $i$
$\Lambda_i$	Net population growth rate of class $i$
$d_i$	Per capita base death rate of class $i$
$\beta_i$	Spreading rate of the virus in class $i$
$\epsilon_i$	Inverse of the incubation time in class $i$
$\gamma_i$	Recovery rate of class $i$
$\theta_i$	Pathogen-induced death rate in class $i$
$\Phi$	Interaction matrix
$T_k$	$k$th epidemiological period
$t_i^k, t_f^k$	First and last day of the $k$th epidemiological period
$e_i$	Fraction of people from region $i$ travelling to another region
$C_{ij}$	Fraction of those individuals leaving region $i$ that travel to region $j$
$w_{X_i,k}$	Empirical weighting factor for the $X_i$ variable in the $k$th epidemiological period

1. The estimation of the effective reproductive number from disease outbreak data.

Authors: Ariel Cintrón-Arias; Carlos Castillo-Chávez; Luís M A Bettencourt; Alun L Lloyd; H T Banks
Journal: Math Biosci Eng Date: 2009-04 Impact factor: 2.080

2. A SIR model assumption for the spread of COVID-19 in different communities.

Authors: Ian Cooper; Argha Mondal; Chris G Antonopoulos
Journal: Chaos Solitons Fractals Date: 2020-06-28 Impact factor: 9.922

3. Estimation of COVID-19 dynamics "on a back-of-envelope": Does the simplest SIR model provide quantitative parameters and predictions?

Authors: Eugene B Postnikov
Journal: Chaos Solitons Fractals Date: 2020-05-01 Impact factor: 5.944

4. Real-Time Estimation of R _t for Supporting Public-Health Policies Against COVID-19.

Authors: Sebastián Contreras; H Andrés Villavicencio; David Medina-Ortiz; Claudia P Saavedra; Álvaro Olivera-Nappa
Journal: Front Public Health Date: 2020-12-22

5. A Novel Synthetic Model of the Glucose-Insulin System for Patient-Wise Inference of Physiological Parameters From Small-Size OGTT Data.

Authors: Sebastián Contreras; David Medina-Ortiz; Carlos Conca; Álvaro Olivera-Nappa
Journal: Front Bioeng Biotechnol Date: 2020-03-13

6. The Incubation Period of Coronavirus Disease 2019 (COVID-19) From Publicly Reported Confirmed Cases: Estimation and Application.

Authors: Stephen A Lauer; Kyra H Grantz; Qifang Bi; Forrest K Jones; Qulu Zheng; Hannah R Meredith; Andrew S Azman; Nicholas G Reich; Justin Lessler
Journal: Ann Intern Med Date: 2020-03-10 Impact factor: 25.391