Alberto Aleta1, Qitong Hu2,3, Jiachen Ye2,3, Peng Ji2,3, Yamir Moreno4,5,1. 1. ISI Foundation, Via Chisola 5, 10126 Torino, Italy. 2. Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai 200433, China. 3. Research Institute of Intelligent and Complex Systems, Fudan University, Shanghai 200433, China. 4. Institute for Biocomputation and Physics of Complex Systems (BIFI), University of Zaragoza, 50018 Zaragoza, Spain. 5. Department of Theoretical Physics, University of Zaragoza, 50018 Zaragoza, Spain.
Abstract
Two months after it was firstly reported, the novel coronavirus disease COVID-19 spread worldwide. However, the vast majority of reported infections until February occurred in China. To assess the effect of early travel restrictions adopted by the health authorities in China, we have implemented an epidemic metapopulation model that is fed with mobility data corresponding to 2019 and 2020. This allows to compare two radically different scenarios, one with no travel restrictions and another in which mobility is reduced by a travel ban. Our findings indicate that i) travel restrictions might be an effective measure in the short term, however, ii) they are ineffective when it comes to completely eliminate the disease. The latter is due to the impossibility of removing the risk of seeding the disease to other regions. Furthermore, our study highlights the importance of developing more realistic models of behavioral changes when a disease outbreak is unfolding.
Two months after it was firstly reported, the novel coronavirus disease COVID-19 spread worldwide. However, the vast majority of reported infections until February occurred in China. To assess the effect of early travel restrictions adopted by the health authorities in China, we have implemented an epidemic metapopulation model that is fed with mobility data corresponding to 2019 and 2020. This allows to compare two radically different scenarios, one with no travel restrictions and another in which mobility is reduced by a travel ban. Our findings indicate that i) travel restrictions might be an effective measure in the short term, however, ii) they are ineffective when it comes to completely eliminate the disease. The latter is due to the impossibility of removing the risk of seeding the disease to other regions. Furthermore, our study highlights the importance of developing more realistic models of behavioral changes when a disease outbreak is unfolding.
In Dec. 31st, 2019, Chinese authorities reported an outbreak of a novel coronavirus disease, called COVID-19 by the World Health Organization. Due to the proximity of the Spring Festival, the Chinese Government implemented quarantine in Wuhan, where the outbreak started, as well as in several nearby cities since Jan. 23rd, 2020. As of Feb. 16th the outbreak had already infected 68,584 individuals in China of which 1666 died [1]. Much is still unknown about the characteristics of this pathogen. For instance, it remains unclear the animal source of this zoonotic disease, being bats and pangolins currently the two most likely sources [2], [3], [4]. It has also been proposed that more than half of the cases might have gone undetected by routinely screening passengers, due to the special characteristics of this disease [5], which makes it possible that infected individuals are asymptomatic while infectious.Several studies predict a much larger number of infections than the actual number reported by the authorities, claiming that only between 10% and 20% of the cases have been detected and reported [6], [7], [8], [9]. The reasons for such deviations between models and actual count of cases are diverse, for instance, the fact that the symptoms could be mild and similar to other flue-like diseases for some people, may induce infected individuals not to seek medical care [10]. On the other hand, on Feb. 13th, 14,840 new cases were reported [11], in contrast to 2022 cases counted during the previous day [12]. The reason was that previously to that day, only those cases that had been laboratory-confirmed were being recognized as so, whereas from that date onwards, also the clinically diagnosed cases were accounted for. Therefore, the model-based prediction of the number of infected individuals can plausibly be larger than the official reports.From a theoretical and computational point of view, there are groups that have proposed new epidemic models to properly account for the special characteristics of this disease [10], [13], [14]. However, our knowledge of the dynamics of the disease is too limited to be constrained to use such sophisticated models. In fact, in some of these works, the models are fitted to reproduce exactly the reported number of infected individuals, which, as noted before, can be counterproductive given that the actual number of infected individuals in the population is surely higher than those detected either by clinical diagnosis or in the laboratory. Lastly, there has also been intense research aimed at computing the probability that the outbreak extends beyond Wuhan to other cities in China, as well as to the rest of the world [6], [7], [15], [16], [17], [18]. These works use historical data in order to produce a risk assessment and obtain probabilities that the disease is imported in other populations. Likewise, modeling efforts have been directed towards gauging the effect of Wuhan’s quarantine on the spreading of the epidemic all over the country (which so far has determined to be a delay of 3 days in the arrival of the peak [7]) and worldwide.Given the unprecedented characteristics of this outbreak, we adopt a slightly different approach and study a data-driven metapopulation model that makes use of the actual flows of the population to properly measure the early effects of the travel reductions in China. Specifically, we built a basic metapopulation model [19], [20] of 31 regions in China (except HongKong, Macau and Taiwan). Inside each population, we considered that the individuals in the population interact following a homogeneous mixing scheme. This approach is similar to the ones proposed in [7] and [15]. However, in our case, we perform a data-driven simulation with the actual flows of individuals that have taken place during the period of study. That is to say, we do not rely on transportation data (which is most of the time given as the maximum flow capacity between subpopulations), but on a real dataset gathered for this occasion (see Materials and Methods). The results align with previous findings and indicate that travel restrictions do not have a significant impact in containing the expansion of the disease, though reducing the flow of individuals could lead to a delay in the importation of new cases in other subpopulations. Importantly, we show that using the real mobility patterns of the population is a key factor to understand the spreading of the disease. In this sense, our conclusions point out that despite the many advances in disease modeling during the last two decades, there still remain many open challenges, most of them related to how to sensibly incorporate human behavioral changes during the unfolding of an outbreak.
Model
We implement a stochastic SEIR-metapopulation model to simulate the spread of the epidemic across mainland China. The model can be divided in two discrete processes: the disease dynamics governed by a SEIR compartmental model; and the mobility of individuals. In a SEIR compartmental model individuals are assigned into each group according to their health status: susceptible (S) if they are susceptible to the disease; exposed (E) if they have been infected but cannot infect; infectious (I) once the incubation period is over and the individuals can infect others; and removed (R) when they are either recovered or deceased. Within each subpopulation (henceforth, region), the transition between compartments results from the following rules, iterated at each time step, corresponding to 1 day:S → E: Susceptible individuals in region i might get infected with probability where R
0 is the reproduction number, T the mean infectious time, N the number of individuals in region i and I the number of infected individuals in the region.E → I: Exposed individuals enter the infected compartment with a rate inversely proportional to the mean latent period, T.I → R: Infected individuals enter the removed compartment with a rate inversely proportional to the mean infectious period, T.We define the generation time as . On the other hand, the mobility of individuals is implemented through a data-driven approach. We have obtained the number of individuals leaving each region each day, as well as the probability, p, of going from each region i to region j (see Materials and Methods for a thorough description of the data). Hence, at each time step, we select individuals at random from within each region - excluding infectious individuals in i, which are supposed to be under quarantine or hospitalized - , and distribute them across the country according to such probabilities. Note that this implies that the disease will be propagated to other regions by exposed individuals. We also implemented a randomized version of the model for the mobility, in which the fraction of the population traveling from each region at each time step is 0.008 (estimated from the average fraction of individuals traveling during the first week of January 2020) and their destinations are chosen randomly with probability proportional to the population of the destination region.Furthermore, since we have data of from 2019 and 2020, we simulated the spreading of the disease in both years. In this way, we are able to asses the impact of travel reductions due to mobility restrictions across regions without making any - however sensible - assumptions about possible changes in individuals’ mobility. Nonetheless, this period of the year has some peculiarities due to the Spring Festival, an event that completely modifies the travel patterns of the population. In 2020, the Spring Festival was celebrated on Jan. 25th, while in 2019 it took place on Feb. 5th. For this reason, we align both simulations so that “day 0” will correspond in both years to the day of the Spring Festival (Jan. 25th and Feb. 5th respectively). Lastly, the simulation will run in both cases to 13 days after the Festival, which corresponds to Feb. 5th in 2020 and Feb. 16th in 2019, so that exogenous effects such as the strong social distancing measures enforced by the government or the new way of defining a positive case do not enter into play.
Results and discussions
Fig. 1 clearly shows that the changes induced by the epidemic in the overall flow and movement of the population throughout China are not restricted just to the region in which the city of Wuhan is located (Hubei). In 2019, a large number of individuals moved right before the Spring Festival. Then, it reached a minimum at the Spring Festival and, later on, a large number of individuals moved again, see Fig. 1A. On the other hand, in 2020, the situation was similar only until Jan. 23rd, when the travel restrictions in Wuhan were implemented. Two features are worth highlighting. First, when the travel restrictions were implemented, the flow of population had peaked already just before the Spring festival. This is key to understand why, despite the reduction in mobility, the daily number of new cases continued to increase for weeks. Secondly, most of the second wave observed in 2019 - which corresponds to travel back to the original region - had not taken place yet (see Fig. 1B), and thus, there was still a high risk that a subsequent large outbreak could occur. The existence of recurrent local outbreaks is a feature that some models have anticipated [20]. Further indications that the population has not yet reached the original distribution by region are provided in Fig. 1C and D, where we show the evolution of the number of individuals living in each region (initial data obtained from the Chinese Yearbook, see Materials and Methods). Clearly, while in 2019 most of the population returned to their regions, in 2020 this had not taken place yet.
Fig. 1
Movement patterns of individuals across Mainland China in 2019 and 2020. Panels A and B represent the percentage of individuals traveling from each region compared to such fraction on Jan. 1st in 2020 (Jan. 12th in 2019). Additionally, panels C and D show the population of each region assuming that the number of individuals traveling from each region is given by the flow data and that their destinations are randomly selected according to the mixing patterns p.
Movement patterns of individuals across Mainland China in 2019 and 2020. Panels A and B represent the percentage of individuals traveling from each region compared to such fraction on Jan. 1st in 2020 (Jan. 12th in 2019). Additionally, panels C and D show the population of each region assuming that the number of individuals traveling from each region is given by the flow data and that their destinations are randomly selected according to the mixing patterns p.To parameterize the metapopulation epidemiological model we follow Chinazzi et al. [7] and set a generation time of 7.5 days and a reproduction number R
0 equal to 2.4. The latent period is set to 3 days [21], which is compatible with pre-symptomatic transmission of the disease, since the incubation period is between 5 and 6 days [7]. The outbreak is seeded by introducing 40 exposed individuals on Dec. 1st 2019 (Dec. 12th 2018 for 2019) [7]. Then, the simulations run for 66 days in both cases and we extract the cumulative number of infected cases in each region as a function of time. Note that as there were no travel restrictions in 2019, one can see the results obtained with the 2019 data as the more plausible outcome for the 2020 outbreak without travel restrictions. In other words, by comparing 2019 with 2020, we can factor out the impact of the early travel reduction in the city of Wuhan and the subsequent changes in the mobility patterns of the population.Fig. 2 shows the cumulative number of infected individuals for Hubei and for the rest of mainland China. The large majority of cases, in all situations considered, are contained within Hubei province. The results in Fig. 2B show that travel restrictions have a small impact in the temporal evolution of the disease in the rest of the country(compare 2019 with 2020). As it can also be deduced from the trend of the curves in Fig. 2B, there is no indication that the growth in the number of cases will evolve following a different functional form. This contrasts with the results that would be obtained if individuals moved at random (with a probability proportional to the size of the subpopulations).
Fig. 2
Predicted cumulative number of cases: in Hubei (panel A) and the rest of mainland China (panel B) using mobility data of 2019 (solid lines, scenario equivalent to no travel restrictions), of 2020 (dashed lines, with travel restrictions considered) and under the assumption of random movement of the population (dashed-dotted lines). Dots represent the actual total value in Hubei reported by the authorities.
Predicted cumulative number of cases: in Hubei (panel A) and the rest of mainland China (panel B) using mobility data of 2019 (solid lines, scenario equivalent to no travel restrictions), of 2020 (dashed lines, with travel restrictions considered) and under the assumption of random movement of the population (dashed-dotted lines). Dots represent the actual total value in Hubei reported by the authorities.Fig. 2 A also shows that there is a large difference between the cumulative number of cases predicted in 2020 and the reported one. Although, as previously discussed, this has been reported by several other studies, to ensure that the methodology is correct, and to further analyze the effect of the mobility patterns, in Fig. 3
we show the correlation between the real values of infected individuals and the simulated ones for 2020. We obtain a Pearson correlation of 0.80 implying that the assumptions behind the model, albeit simplistic, can correctly describe the basic dynamics of the epidemic. Furthermore, we also see that the data-driven model is able to predict better the dynamics than its random counterpart. In Fig. S1, we perform a sensitivity analysis on the different parameter choices. For sets of parameters that induce a slower spreading (such as smaller R
0 or larger T) the disease spreads outside Hubei only after the restrictions have been implemented. This, in turn, enlarges the differences between 2019 and 2020. Conversely, for sets of parameters that produce a faster spreading, the disease spreads outside Hubei before the restrictions are implemented, producing a larger incidence in those regions. In either case, the correlation between the model predictions and the observed values is high, signaling that the real mobility patterns of the population are a key element to understand the spreading of the disease.
Fig. 3
Correlation between model and reality. Predicted cumulative number of cases in each region, except Hubei, compared to the real number reported by the authorities by Feb. 5th. Left: estimations obtained using 2020 data. Right: estimations obtained using random mobility data. In both cases the dashed line represents the identity line.
Fig. S1
(Top Panels) Predicted cumulative number of cases in Hubei and the rest of mainland China using mobility data of 2019 (solid lines, scenario equivalent to no travel restrictions), and 2020 (dashed lines, with travel restrictions considered) for several parameter values of the epidemic model as indicated. Dots represent the actual total value reported by the authorities. (Bottom Panels) Predicted cumulative number of cases in each region, except Hubei, compared to the real number reported by the authorities by Feb. 5th, 2020 for several disease parameters as indicated.
Correlation between model and reality. Predicted cumulative number of cases in each region, except Hubei, compared to the real number reported by the authorities by Feb. 5th. Left: estimations obtained using 2020 data. Right: estimations obtained using random mobility data. In both cases the dashed line represents the identity line.Lastly, we have also studied the difference in the predicted number of cases in each region between 2019 and 2020 at the end of the simulation period. Results are reported in Table 1
, where we show the mean number of infected individuals per region. As it can be seen in the table, the estimated total number of cases is quite similar in both cases. However, the distribution across the country is fairly different. Indeed, while there are many areas where the number of infected individuals would be larger in 2019, there are others where it would be smaller.
Table 1
Reported value of the total number of infected individuals by Feb. 5th per region compared to the values predicted by the model using data from 2019 (Feb. 16th) and 2020 (Feb. 5th). The mean value of the prediction, obtained from 104 independent runs of the model, alongside the 95% confidence interval is shown.
Region
Reported value
2019 simulated
2020 simulated
no travel restriction
travel restriction
Henan
815
6887 [4095-10,203]
4938 [2796-7633]
Hubei
19,665
348,540 [214,850-504,160]
361,876 [221,094-526,075]
Hunan
711
9647 [5839-14,095]
4580 [2589-7013]
Guangdong
944
5765 [3464-8528]
2681 [1482-4183]
Jiangxi
600
2531 [1403-3927]
2342 [1217-3746]
Chongqing
389
4909 [2933-7228]
2263 [1211-3606]
Anhui
591
2148 [1164-3396]
2021 [1030-3294]
Jiangsu
373
1443 [732-2346]
1566 [775-2608]
Zhejiang
954
1003 [523-1643]
1093 [542-1835]
Shaanxi
173
1809 [1008-2811]
1143 [520-2206]
Sichuan
321
2061 [1127-3230]
1460 [706-2472]
Beijing
274
482 [257-778]
320 [143-564]
Shanghai
254
805 [453-1259]
296 [131-529]
Shandong
343
731 [305-1429]
996 [415-1871]
Fujian
215
716 [308-1134]
828 [354-1526]
Guizhou
69
494 [184-1007]
789 [305-1495]
Hebei
157
711 [301-1337]
950 [417-1713]
Guangxi
168
647 [282-1224]
815 [341-1525]
Yunnan
128
373 [128-811]
510 [164-1091]
Shanxi
90
734 [383-1244]
476 [154-1005]
Hainan
100
155 [30-426]
260 [69-606]
Liaoning
89
217 [63-571]
272 [67-668]
Gansu
62
238 [67-605]
359 [92-832]
Tianjin
70
183 [68-408]
167 [35-441]
Heilongjiang
227
144 [30-445]
205 [40-549]
Xinjiang
36
103 [18-352]
148 [15-443]
Inner Mongolia
46
134 [39-373]
155 [23-451]
Jilin
59
150 [44-406]
144 [21-435]
Qinghai
18
43 [6-177]
62 [1-286]
Ningxia
40
53 [7-221]
75 [2-317]
Tibet
1
11 [0-56]
21 [0-165]
Total
27,982
393,869 [242,817-569,435]
393,812 [241,466-572,552]
Reported value of the total number of infected individuals by Feb. 5th per region compared to the values predicted by the model using data from 2019 (Feb. 16th) and 2020 (Feb. 5th). The mean value of the prediction, obtained from 104 independent runs of the model, alongside the 95% confidence interval is shown.To elucidate the reasons behind this behavior, in Fig. 4
A we show the relative difference in the incidence per region, comparing 2019 and 2020. We observe that regions near Hubei tend to have smaller incidences in the simulation with data from 2020. Conversely, regions further away from Hubei tend to have larger incidences in 2020. We can compare this distribution with Fig. 4B, where we show the total number of individuals who travel from Hubei to any other region from Jan. 1st to Feb. 5th in 2020 with the ones that did the same from Jan. 12th to Feb. 16th in 2019. We can see that regions closer to Hubei had a smaller number of visitors in 2020, while the ones further away from Hubei had more visitors. This explains the observed differences in the simulations and highlights the importance of using real and updated data to properly account for the behavior of individuals.
Fig. 4
Expansion of the disease. A: relative difference of the incidence in 2020 and 2019 in each region by Feb. 5th (Feb. 16th in 2019). B: relative difference in the total number of travelers that went from Hubei to any other region for the period under consideration.
Expansion of the disease. A: relative difference of the incidence in 2020 and 2019 in each region by Feb. 5th (Feb. 16th in 2019). B: relative difference in the total number of travelers that went from Hubei to any other region for the period under consideration.Summarizing, we have studied a data-driven metapopulation model that allows assessing the effect of early travel restrictions in Wuhan and Hubei province. Even if our modeling framework is simpler than other more sophisticated implementations of the disease dynamics, our results are in line with several available studies in that: i) travel restrictions have limited efficacy unless they are applied very early, and ii) reducing the travel does not appear to have a long term impact on the spreading of the disease if not accompanied by other measures. We also note the effect of travel restrictions might be underestimated because the large majority of people had already moved before these mobility restrictions were implemented (as it can be seen in Fig. 1). Our study is limited in several aspects that can constitute future research goals. First, the geographic resolution allowed by the mobility data used here is low. Considering large regions have the undesired effect that one can not add structure to the population and therefore the dynamics within each subpopulation is constrained by the homogeneous mixing hypothesis. This limitation could be overcome if less granular spatial and temporal data becomes available. Secondly, and perhaps more important as it currently represents a scientific challenge, we have assumed that the transmissibility does not change during the whole simulation period. This implies that changes in behavioral patterns of the population are not fully accounted for nor they can be completely disentangled from those associated with travel restrictions. Understanding how to deal with such behavioral changes is key for the development of more realistic descriptions of the large-scale spreading of diseases. Finally, another critical feature of current models that needs to be improved in future research is the use of disease parameters - notably R
0 - that are constant both in time and across populations.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Authors: Alberto Aleta; Andreia N S Hisi; Sandro Meloni; Chiara Poletto; Vittoria Colizza; Yamir Moreno Journal: R Soc Open Sci Date: 2017-03-15 Impact factor: 2.963
Authors: Huaiyu Tian; Yonghong Liu; Yidan Li; Chieh-Hsi Wu; Bin Chen; Moritz U G Kraemer; Bingying Li; Jun Cai; Bo Xu; Qiqi Yang; Ben Wang; Peng Yang; Yujun Cui; Yimeng Song; Pai Zheng; Quanyi Wang; Ottar N Bjornstad; Ruifu Yang; Bryan T Grenfell; Oliver G Pybus; Christopher Dye Journal: Science Date: 2020-03-31 Impact factor: 47.728
Authors: Gurjit S Randhawa; Maximillian P M Soltysiak; Hadi El Roz; Camila P E de Souza; Kathleen A Hill; Lila Kari Journal: PLoS One Date: 2020-04-24 Impact factor: 3.240
Authors: Matteo Chinazzi; Jessica T Davis; Marco Ajelli; Corrado Gioannini; Maria Litvinova; Stefano Merler; Ana Pastore Y Piontti; Kunpeng Mu; Luca Rossi; Kaiyuan Sun; Cécile Viboud; Xinyue Xiong; Hongjie Yu; M Elizabeth Halloran; Ira M Longini; Alessandro Vespignani Journal: Science Date: 2020-03-06 Impact factor: 47.728
Authors: Jacob Burns; Ani Movsisyan; Jan M Stratil; Renke Lars Biallas; Michaela Coenen; Karl Mf Emmert-Fees; Karin Geffert; Sabine Hoffmann; Olaf Horstick; Michael Laxy; Carmen Klinger; Suzie Kratzer; Tim Litwin; Susan Norris; Lisa M Pfadenhauer; Peter von Philipsborn; Kerstin Sell; Julia Stadelmaier; Ben Verboom; Stephan Voss; Katharina Wabnitz; Eva Rehfuess Journal: Cochrane Database Syst Rev Date: 2021-03-25
Authors: Ani Movsisyan; Jacob Burns; Renke Biallas; Michaela Coenen; Karin Geffert; Olaf Horstick; Irma Klerings; Lisa Maria Pfadenhauer; Peter von Philipsborn; Kerstin Sell; Brigitte Strahwald; Jan M Stratil; Stephan Voss; Eva Rehfuess Journal: BMJ Open Date: 2021-04-09 Impact factor: 2.692
Authors: Olalekan A Uthman; Olatunji O Adetokunboh; Charles Shey Wiysonge; Sameh Al-Awlaqi; Johanna Hanefeld; Charbel El Bcheraoui Journal: Front Public Health Date: 2022-02-25