Literature DB >> 35143548

On mobility trends analysis of COVID-19 dissemination in Mexico City.

Kernel Prieto1, M Victoria Chávez-Hernández2, Jhoana P Romero-Leiton3.   

Abstract

This work presents a tool for forecasting the spread of the new coronavirus in Mexico City, which is based on a mathematical model with a metapopulation structure that uses Bayesian statistics and is inspired by a data-driven approach. The daily mobility of people in Mexico City is mathematically represented by an origin-destination matrix using the open mobility data from Google and the Transportation Mexican Survey. This matrix is incorporated in a compartmental model. We calibrate the model against borough-level incidence data collected between 27 February 2020 and 27 October 2020, while using Bayesian inference to estimate critical epidemiological characteristics associated with the coronavirus spread. Given that working with metapopulation models leads to rather high computational time consumption, and parameter estimation of these models may lead to high memory RAM consumption, we do a clustering analysis that is based on mobility trends to work on these clusters of borough separately instead of taken all of the boroughs together at once. This clustering analysis can be implemented in smaller or larger scales in different parts of the world. In addition, this clustering analysis is divided into the phases that the government of Mexico City has set up to restrict individual movement in the city. We also calculate the reproductive number in Mexico City using the next generation operator method and the inferred model parameters obtaining that this threshold is in the interval (1.2713, 1.3054). Our analysis of mobility trends can be helpful when making public health decisions.

Entities:  

Mesh:

Year:  2022        PMID: 35143548      PMCID: PMC8830699          DOI: 10.1371/journal.pone.0263367

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The coronavirus disease 2019 (COVID-19) is caused by a novel coronavirus. The coronaviruses are a family of viruses that cause infection in humans and animals. The diseases that are by a coronavirus are zoonotic [1]. In particular, the coronaviruses that affect humans (HCoV) can produce clinical symptoms, such as the Severe Acute Respiratory Syndrome (SARS) viruses and Middle East Respiratory Syndrome (MERS-CoV) [2]. COVID-19 was first identified amid an outbreak of respiratory illness cases in Wuhan City, Hubei Province, China. This disease was initially reported to the WHO on 31 December 2019. On 11 March 2020, the WHO declared COVID-19 to be a global pandemic [3]. From the beginning of the epidemic to 21 January 2021, more than 97,890,676 cases and 2,094,459 deaths have been reported globally. The first case of COVID-19 in South America was registered in Brazil on 26 February 2020. The first death from this infection in this region was announced in Argentina on 7 March 2020. The virus then arrived in Mexico, where by 21 January 2021 there have been almost 1,688,944 confirmed cases and 144,371 deaths. To date, many researchers around the world have focused on understanding the transmission dynamics of COVID-19 disease using mathematical and statistical models and methods, see for example [4-12]. In this work, we will focus on those models that incorporate information on human movement. The relationship between human mobility and the transmission of coronavirus disease in the United States has been studied in [13, 14]. Metapopulation models are not only among the simplest spatial models but they are also the most applicable to modelling many human diseases [15]. The metapopulation concept is to subdivide the entire population into distinct sub-populations, each of which has independent dynamics together with limited interaction between the sub-populations. This approach has been used to great effect within the ecological literature [16] and it has recently been used to model the spread of COVID-19; see, for example, [17-21]. In this work, we calibrate the metapopulation model proposed by Li et al. in [21], similar to [22], using incidence data reported in [23]. Consequently, we first describe the mathematical model that we have used; then, we compute the number of trips that are produced and attracted in each borough of Mexico City using data about these trips in 2017 [24], which then we combine with the rates of reduction or increase in mobility during the pandemic reported by Google [25] and the government of Mexico City [26]. Later, by using Bayesian inference, we solve the associated inverse problem to predict the dynamics of the spread of cases, similar to the following references [27-33]. Our conclusions are presented in the last section.

Computation of the mobility matrices

To incorporate mobility in the transmission model, the produced and attracted trips in the boroughs of Mexico City are considered (see Fig 1 and Table 1). Mexico City is the capital of Mexico. It has around 9 million inhabitants and a floating population of over 22 million, who are composed of daily commuters and international visitors. Mexico City is among the top 10 most crowded cities in the world [34]. It also has a large number of corporate headquarters and a large transport network, which is composed of 20 different modes of transport.
Fig 1

Boroughs of Mexico City.

Table 1

Boroughs of Mexico City.

Id.NameId.NameId.NameId.Name
1Azcapotzalco5Iztacalco9Álvaro Obregón14Cuauhtémoc
2Coyoacán6Iztapalapa10Tláhuac15Miguel Hidalgo
3Cuajimalpa de7La Magdalena11Tlalpan16Venustiano
MorelosContreras12XochimilcoCarranza
4Gustavo A. Madero8Milpa Alta13Benito Juárez
Mobility between the zones in Table 1 is represented in a two-dimensional arrangement, which is known as the origin-destination matrix (O-D matrix) M = {M}, i, j = 1, …, 16, where M represents the number of trips from zone i to zone j. Origin-destination matrices are usually obtained every 10 years from surveys. In Mexico City, the last survey was carried out in 2017 [24]. The information available identifies, among other things: if the trip was made on a weekday or if it was made during the weekend, the transport mode used, the purpose and the time. It is important to notice that, due to the complexity of mobility in Mexico City, the O-D matrix does not have to be symmetric nor the sum of the row i has to be equal to the sum of the column i. This can be explained because the O-D matrix captures chains of trips that may begin one day and end within the next few days. For instance, trips of people who leave their home to go to zone i to work and then go to zone j to see a movie before returning home, or people that work in zone i for few consecutive days (see [35]). In this paper, we consider all of the trips and they are identified by the mode of transport that was used in the area of interest (see Table 2).
Table 2

Transport modes used in the area of interest.

CarRTP/M1Metrobus/MexibusMotorcycle taxi
Collective/microBicycleLight railSchool transportation
Taxi appBusSuburbanPersonal transportation
TaxiMotorcycleWalkOther
SubwayTrolleybusPedicab
There are several methodologies to update the O-D matrix in the literature. Most of them combine known information with current data observed, such as the number of trips in some segments of the transit network [36]. There are also some approaches that project the trips to/from each zone based on the projected economic growth in those areas [37]. Nevertheless, given the pandemic situation that we are experiencing today and that we have current available data about the increase or decrease in mobility for some modes of transport, transit stations and parking lots, we consider the 2017 O-D matrix as a reference matrix and we update it to a scenario in 2020 using the daily mobility reports provided by Google [25] and the government of Mexico City [26]. According to [24], Tables 3 to 6 represent the number of trips between these zones; for instance, the mean number of trips whose origin is Coyoacán (id = 2) and destination is Iztapalapa (id = 6) during a week day is 228,272 (see Table 3) and the mean number of trips whose origin is Tláhuac (id = 10) and destination is Cuauhtémoc (id = 14) is 21,881 (see Table 6) during a day on the weekend.
Table 3

Mean number of trips from zones 1-16 to zones 1-9 during a week day.

Id123456789
14775602110875641206766475250261557024149
2211118864281101048039322302282724930517446108741
373361046230180597705397108673491722100180
412073550796969916773562342752676260259027846
5648632393535822936361938181660156314519190
6252442217501030856688177845276647193261039564521
7168550941414825761389800826078944183675
8016022722590145108723362181753901
9241801081291028922883618012619908017155771034673
1017305096919947283824313411921792280111070
111103423736710663179581437082757539111720395685
1221481007852637606275864685670303128414993
1317200126058146455994459956163732291879714161543
14882521182922029226429884725253810224159495107543
15998685224643942810132645073629110722972115902
1615795319713324103651561921023973239119922583
Sum9203642115717551003250767688438042031425381733481591996195
Table 6

Mean number of trips from zones 1-16 to zones 10-16 during a day of the weekend.

10111213141516Sum
11501578615429701633315384611181590982
22685515038860418641496795319583199951243890
34933714856821813863188503176314540
463701643038512884018149837641716211511072
531409292808040741564001373733730559377
69937252020300839429418128840162685412595911
71013379413235147901431544523892335044
813264101822345127041162932641256236080
956195494810135875766551260836264141138236
103451711720450337147402188165635956621265
111507854669477342413365876621318106911110259
125189876431452836174913305187376028784776
13147533885518884232506856712849620327777589
1422350601413031084877427114923121143061472455
1571561959310801274939465733993528238772196
1662001032655812004911677726215344904764371
Sum6202331109945787742789505149370677594777025614828043
In Tables 3–6, the last row represents the total number of trips attracted by each zone and the last column in Tables 4 and 6 represents the total number of trips generated by each zone. This way we can see in Table 4 that during a week day 849,911 trips are attracted to zone 10 and 540,274 trips are generated from zone 7.
Table 4

Mean number of trips from zones 1-16 to zones 10-16 during a week day.

Id10111213141516Sum
11918101392112172839308310245016775927875
24800424281810341412437911814052754342312126322
313911177921181621021397427013179548805
4744618848641960994261785813671005612503147
5772813951754660375852002473058757889956
61335608436942750167620260158750021008424206849
720865075679502952322641112002466540274
8218931840231875101911167031372059349990
9107889965615231164890106156120847256402007668
1051776326394504592397726811143037563907658
1122491893910124478783028962739742153781804876
125234612153469422033487428531336266621183845
1322689741813226641087615051066226413651440092
142489489400462271456778214671512551520442400086
1513294389431235266648149602542212379051368050
16662014267500139229150324366864919381084416
Sum89491118093471184418144966124114241377974109736524289909
To compute the new number of trips made using the subway, RTP/M1, trolleybus, light rail or suburban, we used the corresponding rates given by the government of Mexico City. To compute the number of trips made using collective/micro or buses, we used the rates of transit stations given by Google and for personal transportation we used the workplaces rate. To compute the number of trips made by bicycle, we used the rates for Ecobici and for metrobus/mexibus we used an average of both the rates of metrobus and mexibus in Mexico City. The other transport modes remain the same as in 2017. Fig 2 shows the variations in the mobility indices from 27 February to 31 November, for each mode of transport that was modified.
Fig 2

Variations in the mobility indices from 27 February, 2020 to 31 November, 2020.

The population of each borough is also considered in our mathematical model. According to [38], these populations in 2020 are given in Table 7.
Table 7

Population in 2020 for each borough of Mexico City.

IdBoroughPopulation
1Azcapotzalco414,711
2Coyoacán628,063
3Cuajumalpa de Morelos199,224
4Gustavo A. Madero1,185,772
5Iztacalco384,326
6Iztapalapa1,815,786
7La Magdalena Contreras239,086
8Milpa Alta137,927
9Álvaro Obreón726,664
10Tláhuac305,076
11Tlalpan574,577
12Xochimilco407,885
13Benito Juárez385,439
14Cuauhtémoc531,831
15Miguel Hidalgo372,889
16Venustiano Carranza430,978
Note that both, the mobility rates from Google and the government of Mexico City and the reference O-D matrix (the one from 2017) are given per day. This way, the population N could be considered as constant; but only per day. In this work, a phenomenon whose duration is of the order of months is studied, so that for the entire period of estimation N is considered as a variable. Furthermore, although in all boroughs the largest number of trips are carried out within the same borough, the distribution of trips to/from the other boroughs does not follow the same pattern in all cases. For example, the Coyoacán borough attracts the highest number of trips from the Tlalpan borough and the least amount of trips from Cuajimalpa; meanwhile, the borough of Iztapalapa attracts the highest number of trips from the Cuauhtémoc borough and the least amount of trips from La Magdalena Contreras. This phenomenon can be explained by the prevailing economic activity in each borough and the transportation connectivity with the others. In order to obtain an stable algorithm for modeling correctly the migration of people, we use the Fratar method to balance all the origin-destination matrices [39].

Clusters

In this section, we describe the clustering analysis that we implemented on Mexico City based on mobility data. This analysis is presented not only to try to find some possible socioeconomic relations between some boroughs, but because there exist computational challenges which can be avoided creating clusters. Firstly, in order to solve model (1) for the whole Mexico City, the program uses around 50GB in RAM during the compilation process using the Stan package. Secondly, the computation time for estimating the parameters is around 3 days. We have used a computer with Ubuntu 20.04, 64 GB in RAM and 12 cores. Therefore, we propose in this section how it could be avoided both of these computational challenges. Solving model (1) for each cluster would reduce the amount of RAM memory used and the computation time. Moreover, this strategy could be implemented simultaneously using the t-walk package for example, since the t-walk package only uses one core for execution program. During the pandemic, the Mexican government has scheduled four phases depending of level of contagion risk. These phases corresponding to the following periods: phase 1: from 27 February February to 22 March; phase 2: from 23 March to 19 April; phase 3: from 20 April to 28 June; phase 4: from 29 June to 27 October. The mobility network was analysed using the community detection module Louvain inside the igraph R package [40]. For more details about the igraph R package, see [41]. Thus, using the Louvain community detection algorithm, we are able to identify that Mexico City’s network has a modular structure, with three communities, as shown in Figs 3 and 4. From Figs 3 and 4 we observe that the communities in the first and fourth phases of the pandemic are the same, and the second and third of the pandemic are the same. The community 1 of the first phase of the pandemic is composed of boroughs 1, 4, 14, 15 and 16; community 2 is composed of boroughs 2, 3, 7, 8, 9, 11, 12 and 13; and, community 3 is composed of boroughs 5, 6 and 10. The community 1 of the second phase of the pandemic is composed of boroughs 1, 4, 14, 15 and 16; community 2 is composed of boroughs 2, 5, 6, 8, 10, 11, and 12; and, community 3 is composed of boroughs 3, 7, 9 and 13.
Fig 3

Borough clusters of Mexico City.

(A): Borough clusters for the first period of the pandemic. (B): Borough clusters for the second period of the pandemic.

Fig 4

Borough clusters of Mexico City.

(A): Borough clusters for the third period of the pandemic. (B): Borough clusters for the fourth period of the pandemic.

Borough clusters of Mexico City.

(A): Borough clusters for the first period of the pandemic. (B): Borough clusters for the second period of the pandemic. (A): Borough clusters for the third period of the pandemic. (B): Borough clusters for the fourth period of the pandemic.

Mathematical model

As we mention in the introduction, the transmission model incorporates information on human movement within the following Susceptible, Exposed, Infected, Recovered (SEIR) metapopulation structure [21]: where S, E, A, I and N are the susceptible, exposed, undocumented infected, documented infected and the total population in borough i at time t, respectively, and denotes the fixed population in borough i given by Table 7. Spatial coupling within the model is represented by the daily number of people traveling from city j to city i (M) an a multiplicative scale factor , reflecting the under-reporting of human movement. It is also assumed that documented infected individuals (I) do not move between boroughs, although these individuals can move between boroughs during the latency period. The total population N in each borough is reset each new day as the sum of and the inflow term θ∑ M, minus the outflow term θ∑ M. We note that distinction between daytime and nighttime in the transmission model 1 is implemented in [35]. A complete description of the parameters involved in the model (1), the respective range of values proposed in [21] and their measurement units can be found in Table 8. We have set a minimum value for the denominator N − I or N − I as equal to 103 in order to avoid instabilities.
Table 8

Parameter description and values proposed in [21] of the state equations given on (1).

ParameterDescriptionProposed RangeUnits
β s Transmission rate due to documented infected individuals.[0., 1.]None
β a Transmission rate due to undocumented individuals.[0., 1.]None
1/θMultiplicative factor (reflect under-reporting of human movement).[0., 1.]Trips/person
1/αAverage latency period.[2, 10] Days
1/γAverage duration of infection.[2, 20] Days
ρ Fraction undocumented/documented people.[0.,.4]None

The values are given in days as time unit.

The values are given in days as time unit.

Parameter estimation

For parameter estimation, we use the daily reported dataset [23]. We use Bayesian inference to solve the inverse problem associated to the system of Ordinary Differential Equations (ODEs) given on (1), similarly to [33]. Some references using this method of parameter estimation can be found in [42-53]. Let us denote the vector of state variables in the zone i as x = (S, E, A, I) ∈ (L2[0, T]), where n = 4 denotes the number of state variables and the vector of parameters in the zone i as , where m = 10 denotes the dimension number of parameters to estimate. Thus, we can write the model (1) as the following Cauchy problem Problem (2), defines a mapping Φ() = x from parameters to state variables x, where where denotes the non-negative real numbers. We assume that Φ has a Fréchet derivative. Usually, not all states of the system can actually be directed measured, i.e., the data consists of measurements of some state variables at a discrete set of points t1, …, t, e.g. in epidemiology, these data consist of number of cases of confirmed infected people. This defines a linear observation mapping from state variables to data , where s ≤ n is the number of observed variables and k is the number of sample points. Let us define as F() = Ψ(Φ()), called the forward problem. Thus, the inverse problem is formulated as a standard optimization problem such that x = Φ() holds, with yobs is the observable data which has error measurements of size . Problem (3) may be solved using numerical tools to deal with a non-linear least-squares problem [54-58]. In this work, we implement Bayesian inference to solve the inverse problem given on (3). From the Bayesian perspective, all of the state variables x and parameters are considered as random variables and the data yobs is fixed. For the random variables x and , the joint probability distribution density of the data x and the parameters , denoted by π(, x), is given by π(, x) = π(x|)π(), where π(x|)π() is the conditional probability distribution, which is also called the likelihood function, and π(x|) is the prior distribution, which involves the prior information of parameters . Given x = yobs, the conditional probability distribution π(|yobs), which is called the posterior distribution of , is given by the Bayes’ theorem: If an additive noise is assumed where is the noise due to discretisation, the model error and the measurement error. If the noise probability distribution π() is known, and are independent, then All of the available information regarding the unknown parameter is codified into a prior distribution π(), which specifies our belief in a parameter before observing the data. All of the available information regarding how we obtained the measured data is codified into the likelihood distribution π(yobs|). This likelihood can be seen as an objective or cost function because it punishes deviations of the model from the data. To solve the associated inverse problem (4), one may use the maximum a posterior (MAP) We used the dataset in the zone i as , which correspond to the susceptible, exposed, documented infected and undocumented infected in the zone i, respectively. A Poisson distribution with respect to the time is typically used to account for the discrete nature of these counts. However, the variance of each component of the dataset yobs is larger than its mean, which indicates that there is over-dispersion of the data. Thus, a more appropriate likelihood distribution is to use the Negative Binomial (NB) because it has an additional parameter that allows the variance to exceed the mean [50, 51, 59]. The NB is a mixture of Poisson and Gamma distributions, where the rate parameter of the Poisson distribution itself follows a Gamma distribution [59, 60]. We note that although there are different mathematical expressions for the NB depending on the author or source, they are equivalent. Because of this multiple representation of the NB in the literature, one must ensure to use the NB distribution accordingly to the source. Here, we have used the following expression for the NB distribution where μ is the mean of the random variable and ϕ is the over-dispersion parameter; that is, We recall that the Poisson distribution has mean and variance equal to μ, so μ2/ϕ > 0 is the additional variance of the NB with respect to the Poisson distribution. The inverse of the parameter ϕ, controls the over-dispersion. Thus, it is important to select its support adequately for parameter estimation. In addition, there are alternative forms of the NB distribution. We have used the first option neg_bin of the NB distribution of Stan [61]. We acknowledge that some scientists have had success with the second alternative representation of the NB distribution [47]. We assume independent NB distributed noise (i.e., all dependency in the data is codified into the contact tracing model). In other words, the positive definite noise covariance matrix is assumed to be diagonal. Therefore, using the Bayes formula, the likelihood is where i denotes the borough index. As mentioned earlier, we approximate the likelihood probability distribution corresponding to diagnosed cases with a NB distribution where the index j denotes the number of days, i the number of the boroughs, and ϕ are the parameters corresponding to the over-dispersion parameter of the NB distribution (5) respect to each borough. For independent observations, the likelihood distribution π(y|) is given by the product of the individual probability densities of the observations where the mean μ of the NB distribution , is given by the solution I(t) of the model (1) at time t = t. For the prior distribution, we select the LogNormal distribution for β and β parameters, Gamma distributions for α and γ parameters and Uniform distributions for the other parameters to estimate: ρ, and initial conditions . The hyperparameters and their support corresponding to all the distributions of the parameters to estimate are given on the table’s range Table 8. The posterior distribution π(|yobs) given by (4) does not have an analytical closed form since the likelihood function, which depends on the solution of the non-linear model given on (1), does not have an explicit solution. Then, we explore the posterior distribution using two methods: first, the Stan Statistics package [61] within its version the Automatic Differentiation Variational Inference (ADVI) method,; and second, the general purpose Markov Chain Monte Carlo Metropolis-Hasting (MCMC-MH) algorithm t- walk [62]. Both algorithms generate samples from the posterior distribution π(|yobs) that can then be used to estimate marginal posterior densities, mean, credible intervals, percentiles, variances, and others. We the reader refer to [63] for a more complex description of the MCMC-MH algorithms. Fig 5 shows the credible intervals of parameters of model (1) within 95% Highest-Posterior Density (HPD) using the ADVI-Fullrank method of Stan package [61]. Table 9 shows the posterior mean and quantiles of all the estimated parameters of model (1) using the ADVI-Fullrank method of Stan package. Table 10 shows the posterior mean and quantiles of all over-dispersion parameters ϕ of the Negative Binomial distribution (6). Fig 6 shows the joint probability density distributions of the estimated parameters of model (1) within 95% (HPD). The blue lines represent the medians. Figs 7–9 show the fit of confirmed COVID-19 cases of all of the boroughs of Mexico City using the Stan [61]. Fig 10 shows Credible intervals of parameters of model (1) within 95% Highest-Posterior Density (HPD) using the t-walk Package [62]. Note that the result obtained with the t-walk package are preliminary because we only performed 60,000 iterations, with 30,000 of them as burn-in and it was obtained without balancing the Origen-Destination matrices. We performed this limited quantity of iterations because the computational time consumption is significantly large for each 1,000 of iterations. However, we will perform more iterations in the near future. Using both packages, we did a fit for the first 245 days of the pandemic in Mexico City, starting 27 February, and we have performed predictions from 245–275 days, corresponding to 28 October to 27 November and compared with the true cases in this last period 28 October to 27 November. We assumed that the mobility from 28 October to 27 November is the same as from 28 September to 27 October, i.e., we assumed the same mobility cluster for the projection period. We set up a minimum borough fraction equal to 0.6 to limit the borough to fall below their population size. Our future work will analyse the identifiability of the parameters of model (1), as suggested in [49, 64, 65]. Specifically, the ρ parameter because it is multiplied by the period of incubation of the disease, α. Thus, estimating both parameters simultaneously may lead to non-identifiability difficulty. we have uploaded all the codes and source data used in this paper to the following Github link for a detailed review.
Fig 5

Credible intervals of parameters of the model (1) within 95% Highest-Posterior Density (HPD) using the Stan package [61].

Table 9

Parameter estimation of β, β, ρ, α, γ, θ and initial conditions of the model (1).

MeanStdMin2.5%50%97.5%Max
β s 0.18170.06880.04330.12990.17190.22150.4822
β a 0.27970.07240.09030.22740.27430.32590.5665
ρ 0.01860.00560.00570.01460.01780.02200.0481
α 0.17200.03340.11030.14850.16700.19100.3067
γ 0.22100.05700.08040.17980.21610.25970.4116
θ 0.72790.21120.04090.61130.77860.89760.9963
E 10 215.5595165.69889.402589.7281172.8620297.3270901.8550
A 10 546.1551275.584613.5082310.5730565.1880793.2150995.1790
I 10 1.99002.08740.04810.68621.36932.603522.6618
E 20 253.3650193.91203.8161102.6910201.1370366.7250936.9310
A 20 295.6783195.30059.7637135.7270260.0070417.1160873.4510
I 20 3.49614.54450.03850.91001.94254.067540.6196
E 30 185.7097145.45841.996078.6950147.7400247.2920835.4920
A 30 507.7907263.51926.1474283.0790514.8280736.8860984.3130
I 30 4.22975.84220.03780.96112.20604.958961.0710
E 40 683.1201253.476318.5321517.4730743.1590900.7840999.2030
A 40 496.4247271.02626.0559258.5220495.7330728.1480987.1500
I 40 30.866329.65720.02695.249519.780752.131899.6383
E 50 552.5027299.41804.4051284.7780577.9160837.3670999.1490
A 50 562.7039282.28211.3356315.3790605.4310809.2500997.2760
I 50 4.99656.96200.03881.00972.49186.074261.8987
E 60 450.4365279.10411.4010200.7940420.6680688.3850992.9970
A 60 547.0965270.47779.5825315.7010568.5270783.6100992.2320
I 60 23.401526.03010.01733.706312.101336.030899.6128
E 70 115.0333110.98392.850338.329776.6577151.7080795.5370
A 70 515.9971278.43866.2676281.7120534.1290755.6920997.6560
I 70 3.72886.00630.01800.69171.66314.076655.2651
E 80 224.1849186.98983.037075.5207172.3130321.0610925.9820
A 80 569.0899287.32111.8729328.3040597.3180833.6790999.2210
I 80 3.48494.86960.03570.79351.71304.242544.0485
E 90 188.5786164.56133.309261.5219134.4440269.3350807.6380
A 90 687.0370275.991310.8989500.1660765.8410926.9680999.2580
I 90 16.515320.73360.02582.37117.469822.826296.2754
E 100 356.7580263.39862.5233125.7860297.0560550.1360987.3250
A 100 746.1666233.514056.2971610.6280829.0390934.2400999.0530
I 100 31.603432.89510.00173.335316.658255.709699.9472
E 110 281.1917236.36082.778090.8408206.4170432.5200970.9040
A 110 252.0265216.57671.461774.8393192.5350375.3250980.7710
I 110 4.09545.96130.01780.84081.92024.566054.8201
E 120 130.6564131.67142.029536.738285.9181175.2460823.6720
A 120 189.1367168.34343.820762.8391135.1090254.0920847.2030
I 120 8.760312.76190.03381.19133.913010.203794.1881
E 130 135.1793130.85472.458739.740989.4389187.5800786.0350
A 130 233.7060218.52372.523063.5877160.4860337.5610966.4870
I 130 17.245223.44790.00131.24476.549323.553499.6969
E 140 288.4052241.84652.094985.1094216.5890441.7050967.2260
A 140 132.8599136.08761.501336.528786.8088176.9500817.9200
I 140 1.12261.14400.02900.40210.76891.452613.1217
E 150 194.1867170.33162.612663.5594138.1430273.0570818.0280
A 150 629.8427301.48545.1016382.9950700.1620909.0470999.4200
I 150 31.914732.69070.01143.282218.526856.728299.9608
E 160 615.1693319.68560.5475323.0720703.2440916.7530999.8430
A 160 590.1725297.53087.9860345.1590634.7790868.4580999.4370
I 160 34.333334.68190.01003.841918.855964.881299.9809
Table 10

Over-dispersion parameters estimation of the Negative Binomial distribution (6).

MeanStdMin2.5%50%97.5%Max
ϕ 1 2.19910.32901.28221.96012.16922.41303.5188
ϕ 2 1.89430.36391.08831.63401.84872.11943.9229
ϕ 3 2.07230.35721.24651.80682.05202.29423.2018
ϕ 4 1.44080.26640.83471.24761.42031.60043.0505
ϕ 5 1.85770.29991.12891.63301.84092.04252.9154
ϕ 6 1.28100.21200.68391.14061.25951.40092.1667
ϕ 7 2.01100.32941.05951.77421.98212.21693.3500
ϕ 8 1.54530.35380.74031.29151.51451.75833.3937
ϕ 9 2.16330.44671.12521.84102.10922.41734.3067
ϕ 10 1.38690.29240.65781.18161.36101.55932.4075
ϕ 11 2.41840.43981.17072.10762.37092.68324.0595
ϕ 12 1.82980.40350.88281.54611.78472.06684.1388
ϕ 13 2.62040.60581.24492.21692.56232.97256.2347
ϕ 14 2.19330.38041.30561.91112.15542.41543.8624
ϕ 15 2.19770.49871.09551.83622.15342.50103.9972
ϕ 16 1.61670.32320.85051.39261.59581.80853.0021
Fig 6

Joint probability density distributions of the estimated parameters of model (1) within 95% (HPD).

The blue lines represent the medians.

Fig 7

Fit of confirmed COVID-19 cases of the boroughs 1 to 6 using the Stan package [61].

Top row from left-hand to right-hand: the fit for the confirmed cases of the Districts Azcapotzalco and Coyoacan. The tomato colour bars represent the confirmed cases, the blue and purple solid lines represent the median and the mode, respectively, and the shaded area represent the %95 probability bands for the expected value for the state variable of Documented Infecteds. Middle row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Cuajimalpa de Morelos and Gustavo A. Madero. Bottom row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Iztacalco and Iztapalapa.

Fig 9

Fit of confirmed COVID-19 cases of the boroughs 13 to 16 using the Stan package [61].

Top row from left-hand to right-hand: the fit for the confirmed COVID-19 cases of the Districts Benito Juarez and Cuauhtemoc. The tomato colour bars reprent the confirmed COVID-19 cases, the blue solid line reprent the median and the shaded area represent the %95 probability bands for the expected value for the state variable of Documented Infecteds. Bottom row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Miguel Hidalgo and Venustiano Carranza.

Fig 10

Credible intervals of parameters of model (1) within 95% Highest-Posterior Density (HPD) using the t-walk package [62].

Joint probability density distributions of the estimated parameters of model (1) within 95% (HPD).

The blue lines represent the medians.

Fit of confirmed COVID-19 cases of the boroughs 1 to 6 using the Stan package [61].

Top row from left-hand to right-hand: the fit for the confirmed cases of the Districts Azcapotzalco and Coyoacan. The tomato colour bars represent the confirmed cases, the blue and purple solid lines represent the median and the mode, respectively, and the shaded area represent the %95 probability bands for the expected value for the state variable of Documented Infecteds. Middle row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Cuajimalpa de Morelos and Gustavo A. Madero. Bottom row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Iztacalco and Iztapalapa.

Fit of confirmed COVID-19 cases of the boroughs 7 to 12 using the Stan package [61].

Top row from left-hand to right-hand: the fit for the confirmed COVID-19 cases of the Districts La Magdalena Contreras and Milpa Alta. The tomato colour bars represent the confirmed COVID-19 cases, the blue solid line represent the median and the shaded area represent the %95 probability bands for the expected value for the state variable of Documented Infecteds. Middle row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Alvaro Obregon and Tlahuac. Bottom row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Tlalpan and Xochimilco.

Fit of confirmed COVID-19 cases of the boroughs 13 to 16 using the Stan package [61].

Top row from left-hand to right-hand: the fit for the confirmed COVID-19 cases of the Districts Benito Juarez and Cuauhtemoc. The tomato colour bars reprent the confirmed COVID-19 cases, the blue solid line reprent the median and the shaded area represent the %95 probability bands for the expected value for the state variable of Documented Infecteds. Bottom row from left-hand to right-hand: the fit for the diagnosed cases of the Districts Miguel Hidalgo and Venustiano Carranza.

The basic reproduction number estimation

The basic reproduction number, which is commonly denoted by , is the average number of secondary infections generated by a single infective during the curse of the infection in a whole susceptible population. We calculate the reproductive number in Mexico City using the inferred parameters. Define X = (E, A, I) and using the next generation operator method [66] on the system (1), the Jacobian matrices and of system (1) are given by The disease free equilibrium (DFE) of system (1) is X0 = (0, 0, 0, N, 0), we then have and Therefore, the next-generation matrix is K = FV−1, from where R is computed as the leading eigenvalue of matrix K; that is, Table 8 shows the range of values for the parameters involved in the expression (8) obtained using the Stan package [61]. With those values, we obtain a %95 credible interval for .

Discussion

In this work, we analyse a networked dynamic metapopulation model of the coronavirus dissemination in Mexico City using ODEs and Bayesian statistics. We present an explanation of how to estimate the mobility per day between the boroughs that compound the Mexico City, both on a weekday and on weekends; combining available information from the origin-destination survey carried out in 2017 with the current mobility indices that Google and the government of Mexico City report, depending on the mode of transport used to make each trip (e.g., bus, subway, car, etc.) and then we use the Fratar method to balance the daily origin-destination matrices. We also present a clustering analysis of the boroughs which compound Mexico City based on mobility data from Google and the Transportation Mexican Survey. From Figs 3 and 4, we can identify three different clusters during the each phase of the pandemic. We point out that the same cluster analysis done for Mexico City, could be implemented for a broader area, the metropolitan area named Valle de Mexico, which is rather important for the whole country. We consider that this clustering analysis which is based on individual movement may be crucial to efficiently model a human pandemic on the same scale as presented here, or at a smaller scale. From Fig 5, the transmission rate of symptomatic was 0.19 within 95% Credible Interval (CI) [0.06, 0.42], and the transmission rate of asymptomatic was 0.27 within 95% CI [0.14, 0.40], which is in concordance with the value estimated of 0.25 in [67], in that study the transmission rate was not separated in symptomatics and asyntomatics. The fraction of undocumented infections, ρ, was 0.027 within 95% CI [0.02, 0.04]. The estimated latency period, 1/α, is 5.96 days within 95% CI [3.60, 8.93] days, which is in concordance with the value used of 5.99, 6.0, 5.1 and 5.0 in [50, 52, 68, 69], and the estimated recovery period, 1/γ, is 4.86 days within 95% CI [3.15, 9.33] days, which is lower in comparison with the ones used of 5.97 and 10.81 (asymptomatic and symptomatic class, respectively) in [69], 10.0 in [52], 7.0 in [50] and 5.0, 10.8 and 14.0 (reported infectious, symptomatic and asymptomatic class, respectively) in [68]. The inter-borough scale factor was 0.77 within 95% CI [0.61, 0.89], this value indicates that the mean number of trips made by a person is between three and four in one day, which makes sense with complete trips to get out of home, do some activities (e.g., work, shopping, or services), and return home. The results of the inferred parameters of model (1) and the population size of the boroughs (e.g., Iztapalapa and Gustavo A. Madero) help to explain the fast dispersion of COVID-19 and indicate the challenge of finding strategies to contain it. We have compared the parameter values inferred with respect to those used for Mexico City. As mentioned in Section, we will analyse the identifiability of the parameters of model (1) (i.e., the ρ parameter) because this parameter is multiplied by the period of incubation of the disease, α. Thus, estimating both parameters simultaneously may lead to a non-identifiability difficulty. We may observe this non-identifiability in Figs 5 and 10; that is, different combinations of the model parameters lead to the same “energy” value of the system 1. In particular, we can observe that a different combination of the estimated parameters values obtained with the methods ADVI-Fullrank and ADVI-Meanfield give very similar fitted curves for diagnosed cases. We also observe that the recovery period time is more in accordance with the values used in [50, 52, 68, 69] but the latency period is lower than the ones used there. We note that the parameters, β, β of the model (1) are considered as global; that is, they are assumed the same for all the boroughs of Mexico City and all the transportation modes. In the near future, we will explore a more robust model that will consist of local parameters of transmission β, β, instead of globally (i.e., a pair of transmission rates β, β for each borough). Furthermore, we will consider those transmission rates, β, β, to be dependent on time as in [52, 70]. In addition, we will consider the interstate and international mobility from/to Mexico City. We will take into account imported cases from the other 31 states of Mexico. We will also consider the cases imported from overseas by airplane passengers, and will do a global and local sensitivity analysis of model (1). Finally, we will investigate a spatio-temporal model based on a diffusion partial differential equation model combined with individual movement trends.
Table 5

Mean number of trips from zones 1-16 to zones 1-9 during a day of the weekend.

123456789
13311411368831146847158341060712033789658
21169655175048382584919910120115305551695252884
321744091175522300732717323448249265008
4691382435641421007667166052942882587411786
545452017537291641721022812857011187938682
61104612412073253274612850216706014991780043020
71311290993224191612004804158791323251829
8252136680925793759416771432752146
98721520446821913616935840811508973938579592
10140626844757553132531003561831131386297
116821147822286917403895555210326141215855182
12262258174103446755281279023341255249751
131056263766712729544390228468815222311985047
145659666481137741843275703018047313321898860055
155326617612178693733313774368014374410059194
1698001947932067208333526679132627311422571
Sum5810971233169316749152151055654225731963278692478751122702
  33 in total

1.  Parameter estimation of some epidemic models. The case of recurrent epidemics caused by respiratory syncytial virus.

Authors:  Marcos A Capistrán; Miguel A Moreles; Bruno Lara
Journal:  Bull Math Biol       Date:  2009-07-01       Impact factor: 1.758

2.  Approximate Bayesian computation for spatial SEIR(S) epidemic models.

Authors:  Grant D Brown; Aaron T Porter; Jacob J Oleson; Jessica A Hinman
Journal:  Spat Spatiotemporal Epidemiol       Date:  2017-11-22

3.  Contemporary statistical inference for infectious disease models using Stan.

Authors:  Anastasia Chatzilena; Edwin van Leeuwen; Oliver Ratmann; Marc Baguelin; Nikolaos Demiris
Journal:  Epidemics       Date:  2019-10-05       Impact factor: 4.396

4.  Bayesian uncertainty quantification for transmissibility of influenza, norovirus and Ebola using information geometry.

Authors:  Thomas House; Ashley Ford; Shiwei Lan; Samuel Bilson; Elizabeth Buckingham-Jeffery; Mark Girolami
Journal:  J R Soc Interface       Date:  2016-08       Impact factor: 4.118

5.  Estimating the reproductive number and the outbreak size of COVID-19 in Korea.

Authors:  Sunhwa Choi; Moran Ki
Journal:  Epidemiol Health       Date:  2020-03-12

6.  A Bayesian Monte Carlo approach for predicting the spread of infectious diseases.

Authors:  Olivera Stojanović; Johannes Leugering; Gordon Pipa; Stéphane Ghozzi; Alexander Ullrich
Journal:  PLoS One       Date:  2019-12-18       Impact factor: 3.240

7.  Fluid dynamics and epidemiology: Seasonality and transmission dynamics.

Authors:  Talib Dbouk; Dimitris Drikakis
Journal:  Phys Fluids (1994)       Date:  2021-02-02       Impact factor: 3.521

8.  A mathematical model for the novel coronavirus epidemic in Wuhan, China.

Authors:  Cha Yu Yang; Jin Wang
Journal:  Math Biosci Eng       Date:  2020-03-11       Impact factor: 2.080

9.  Assessing parameter identifiability in compartmental dynamic models using a computational approach: application to infectious disease transmission models.

Authors:  Kimberlyn Roosa; Gerardo Chowell
Journal:  Theor Biol Med Model       Date:  2019-01-14       Impact factor: 2.432

10.  Monitoring Italian COVID-19 spread by a forced SEIRD model.

Authors:  Elena Loli Piccolomini; Fabiana Zama
Journal:  PLoS One       Date:  2020-08-06       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.