Literature DB >> 33162673

Predicting the number of total COVID-19 cases and deaths in Brazil by the Gompertz model.

Jemy A Mandujano Valle1.   

Abstract

In this work, we estimate the total number of infected and deaths by COVID-19 in Brazil and two Brazilian States (Rio de Janeiro and Sao Paulo). To obtain the unknown data, we use an iterative method in the Gompertz model, whose formulation is well known in the field of biology. Based on data collected from the Ministry of Health from February 26, 2020, to July 2, 2020, we predict, from July 3 to 9 and at the end of the epidemic, the number of infected and killed for the whole country and for the Brazilian states of Sao Paulo and Rio de Janeiro. We estimate, until July 9, 2020, a total of 1,709,755 cases and 65,384 deaths in Brazil, 331,718 cases and 15,621 deaths in Sao Paulo, 134,454 cases and 11,574 deaths in Rio de Janeiro. We also estimate the basic reproduction number R 0 for Brazil and its two states. The estimated values ( R 0 ) were 1.3, 1.3, and 1.4 for Brazil, Sao Paulo, and Rio de Janeiro, respectively. The results show a good fit between the observed data and those obtained by the Gompertz. The proposed methodology can also be applied to other countries and Brazilian states, and we provide an executable as well as the source code for a straightforward application of the method on such data. © Springer Nature B.V. 2020.

Entities:  

Keywords:  Covid-19; Inverse problem; Minimal error method; Model Gompertz

Year:  2020        PMID: 33162673      PMCID: PMC7606032          DOI: 10.1007/s11071-020-06056-w

Source DB:  PubMed          Journal:  Nonlinear Dyn        ISSN: 0924-090X            Impact factor:   5.022


Introduction

The World Health Organization declared the coronavirus disease 2019 a pandemic on March 11, pointing to the over 118,000 cases in over 110 countries and territories around the world at that time [18]. The COVID-19 pandemic in Brazil began on February 26, 2020, when a man from Sao Paulo who returned from Italy tested positive for the virus. The first case of COVID-19 in Rio de Janeiro was confirmed on March 5, 2020. Nowadays, Brazil is considered the epidemic center of Latin America, occupying the second place in the total number of cases and, more recently, in the total number of deaths. Currently (July 9, 2020), Sao Paulo and Rio de Janeiro are the states with the highest number of deaths by the new coronavirus in Brazil, according to data from the Brazilian Health Ministry. In the past few months, a considerable number of studies related to the evolution of COVID-19 in the world have been submitted and published. In the following, we describe some of these works. Ahmadi et al. [1] developed mathematical models to predict the number of COVID-19 cases in Iran from April 3, 2020, to May 13, 2020. The unknown parameters in these models were estimated by running the fminsearch, a MATLAB function, which is a least-squares algorithm. Torrealba et al. [22] analyzed the modeling and prediction of COVID-19 in Mexico, from an initial approximation, and using the Gauss–Newton algorithm, the authors estimated parameters in the Gompertz and Logistic models. Articles [12, 15, 16] estimated the number of total COVID-19 cases and deaths in world using the Gompertz model. In [11], they analyzed the number of deaths by COVID-19 with social distancing in Brazil, Sao Paulo. Using the SEIR model, the authors recommend temporary lockdowns, however, great the economic and social costs. In [2], the authors studied the impact on the evolution of cases in Rio de Janeiro, Brazil, considering a Susceptible-Infectious-Quarantined-Recovered (SIQR) model with containment. In this work, we model the near-future trajectory of the cumulative number of infections and deaths by COVID-19 for Brazil and two Brazilian states, given the cumulative number of infected cases and deaths from February 26, 2020 to July 2, 2020. Data were obtained from the Brazilian Ministry of Health until July 2 (https://covid.saude.gov.br), and we considered the cumulative reports from the date on which the first case was notified in Brazil and in each analyzed state. We also estimate the basic reproduction number , which represents the average number of secondary infections generated by each infected person. To estimate the unknown parameters, we use the Gompertz model and propose a gradient type iterative method (Minimal error method). The Gompertz model is one of the particular cases of the Richards model, as well as Brody, negative exponential, logistic and von Bertalanffy models, see [20, 21]. The growth functions can be grouped in three main categories: those without inflection point (Brody and negative models), those with sigmoidal shape and a fixed inflection point (Gompertz, logistic, and von Bertalanffy models), and those with a flexible inflection point (Richards model). The logistic, Gompertz, and von Bertalanffy models exhibit inflection points at about 50, 37, and 30% of the upper asymptote, respectively, which means that the Gompertz and von Bertalanffy processes are asymmetric, whereas the logistic is a symmetric process, see [19]. The Gompertz and the logistic models are the most frequently used sigmoid functions, and the literature on these models is extensive [3, 21]. In general, the cumulative number of deaths and cases by COVID-19 presents an asymmetrical sigmoidal growth curve. Therefore, using an inappropriate growth curve can have a substantial impact on forecasting [3]. The Gompertz model [5] was proposed by Benjamin Gompertz in 1825. Since then, this exponential model has been used to describe growth in plants, animals, bacteria, and cancer cells [22]. The Gompertz differential equation has the following form,where t represents time; N(t) is the cumulative population size at time t; r the intrinsic growth rate of model; K is the maximum value of model (N) when t goes to infinity; represents the initial time; and is the initial population or condition. The analytic solution from (1) iswhere . The turning point is the time at which the rate of accumulation changes from increasing to decreasing or vice versa and can be easily located by finding the inflection point of the epidemic curve, that is, the moment at which the trajectory begins to decline. Clearly, this quantity is of epidemiological importance, indicating either the beginning (i.e., the moment of acceleration after deceleration) or end (i.e., the moment of deceleration after acceleration) of a phase [6]. From Eq. (2), it is trivial to show that the inflection point is given byAt the inflection point, the number of infected or killed is given by . In this paper, we consider the discrete function, from Eq. (2) and for ,where i represents time in days and is the cumulative number of infected cases (or deaths) at day i. Estimates of parameters unknown Second and third columns are estimated values for the Gompertz model (4). Note that K is the number of infected (or killed) at the end of the pandemic. The fourth column describes the regression coefficient according to Eq. (8). The fifth column presents the date when the number of cases or deaths reaches its peak (i.e., the moment of deceleration after acceleration), see Eq. (3). Finally, the sixth column is the basic reproduction number, see Remark (2) In this paper, we consider that equality (3) is the inflection point of Eq. (4). We denote . The inverse problem is to estimate , from (4), given .

Method: inverse problem to the Gompertz model

Here, we consider an application of the minimal error method (MEM) to the inverse problem at the Gompertz model. The MEM is a variant of the conjugate gradient method [4, 10, 17], and this approach was used to estimate unknown parameters in a computational neuroscience models [24, 25]. Also, [14] solves the Cauchy problem in linear elasticity with MEM. Knowing the cumulative number of infections or deaths from COVID-19 , we want to determine assuming that Eq. (4) holds. The transpose of a vector is denoted by . Let the nonlinear operator defined bywhere are multivariable functions with real value and solves (4). To obtain an approximation for , given and , we used the minimal error iterationwhere is the directional derivative of computed at , and is its adjoint. We also definewhere solves (4), replacing by . Note that . It is possible to show that, under certain conditions (we assume that is the case), converges to a solution of ; see [9, Theorem 3.21]. Estimated number of cases and deaths by days in Brazil and its two states The cases and deaths are cumulative. The mean absolute percentage error (MAPE) values for the cumulative number of cases and deaths in Brazil are 2.0% and 3.9%, respectively. Considering the data from Sao Paulo, the MAPEs for the cumulative number of cases and deaths are 4.3% and 6.9%, respectively. Finally, the MAPEs for the cumulative number of cases and deaths in Rio de Janeiro are 4.4% and 3.4%, respectively Plots for Brazil. Subplot a shows extrapolated infection trajectory from February 26 to July 9. Subplot b presents the number of COVID-19 deaths from March 17 to July 9. The blue curve shows the exact data. The red line represents the estimate of the blue curve with the Gompertz model. The green asterisk is the inflection point From Eq. (5), we obtain an approximation for , but the adjoint is not known. In this paper, we obtain the unknown operator (see Appendix) together with the following iterationwhereThe regression coefficient is used to evaluate the fitting ability of various methods and can be obtained by the following equation,whereis the average of the cumulative confirmed COVID-19 cases (or deaths). Results for Sao Paulo. Subplots a and b are the cumulative number of infected individuals and deaths from February 26 to July 17 and March 17 to July 17, respectively. See Figure 1 for a description of each variable
Fig. 1

Plots for Brazil. Subplot a shows extrapolated infection trajectory from February 26 to July 9. Subplot b presents the number of COVID-19 deaths from March 17 to July 9. The blue curve shows the exact data. The red line represents the estimate of the blue curve with the Gompertz model. The green asterisk is the inflection point

Plots for Rio de Janeiro. Subplots a and b show the cumulative number of infected individuals and deaths from March 5 to July 17 and March 19 to July 17, respectively. See Figure 1 for a description of each variable

Remark 1

From differential Eq. (1) and since is known, we consider the following initial approximation for The numerical scheme for our method poposed is in Algorithm 1.

Remark 2

The basic reproduction number is the most important parameter to analyze any epidemic model for any disease. In a practical sense, is the average number of secondary infections caused by one infected individual during his/her entire infectious period. An epidemic occurs only if a single individual can spread its illness to more than one individual . If , on average an infectious individual infects less than one person and the contagion is expected to stop spreading. Our result can be used to compute the basic reproduction numbersee [6–8, 13], where r denotes the intrinsic growth rate in the Gompertz model and is the generation time of disease transmission. In this paper, we consider .

Results

In this section, we will provide the results obtained for the whole country (Brazil) and for the Brazilian states of Sao Paulo and Rio de Janeiro. Table 1 describes the estimated values for the Gompertz model and also presents the regression coefficient, the inflection point, and the basic reproduction number.
Table 1

Estimates of parameters unknown

Kr\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R^2$$\end{document}R2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {Peak day}\;(t_{i.p})$$\end{document}Peak day(ti.p)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_0$$\end{document}R0
Brazil
Cases4,618,8930.02040.999July 91.3
Deaths91,4400.03090.999June 4
Sao Paulo
Cases820,2020.02020.997July 41.3
Deaths20,8020.03110.994May 30
Rio de Janeiro
Cases255,8540.02350.998June 201.4
Deaths17,5850.02750.998June 7

Second and third columns are estimated values for the Gompertz model (4). Note that K is the number of infected (or killed) at the end of the pandemic. The fourth column describes the regression coefficient according to Eq. (8). The fifth column presents the date when the number of cases or deaths reaches its peak (i.e., the moment of deceleration after acceleration), see Eq. (3). Finally, the sixth column is the basic reproduction number, see Remark (2)

The estimated daily values for the cumulative number of confirmed cases and deaths, from July 3 to 9, are illustrated in Table 2. Figures 1, 2, 3 illustrate the relationship between observed values and those predicted by the Gompertz model. The tables and figures show the efficiency of the proposed iterative method.
Table 2

Estimated number of cases and deaths by days in Brazil and its two states

DateTotal casesTotal deaths
Prediction RealPredictionReal
Brazil
July 31,501,9701,539,08161,05963,174
July 41,536,4741,577,00461,81564,265
July 51,571,0501,603,05562,55764,867
July 61,605,6811,623,28463,28565,487
July 71,640,3531,668,58963,99866,741
July 81,675,0491,713,16064,69867,964
July 91,709,7551,755,77965,38469,184
Sao Paulo
July 3295,141310,70214,72915,694
July 4301,245312,53014,88615,996
July 5307,349320,17915,03916,078
July 6313,451323,07015,19016,134
July 7319,548332,70815,33716,475
July 8325,637341,36515,48016,788
July 9331,718349,71515,62117,118
Rio de Janeiro
July 3121,965118,95610,73810,500
July 4124,083120,44010,88210,624
July 5126,188121,29211,02510,667
July 6128,278121,87911,16510,698
July 7130,353124,08611,30410,881
July 8132,412126,32911,44010,970
July 9134,454128,32411,57411,115

The cases and deaths are cumulative. The mean absolute percentage error (MAPE) values for the cumulative number of cases and deaths in Brazil are 2.0% and 3.9%, respectively. Considering the data from Sao Paulo, the MAPEs for the cumulative number of cases and deaths are 4.3% and 6.9%, respectively. Finally, the MAPEs for the cumulative number of cases and deaths in Rio de Janeiro are 4.4% and 3.4%, respectively

Fig. 2

Results for Sao Paulo. Subplots a and b are the cumulative number of infected individuals and deaths from February 26 to July 17 and March 17 to July 17, respectively. See Figure 1 for a description of each variable

Fig. 3

Plots for Rio de Janeiro. Subplots a and b show the cumulative number of infected individuals and deaths from March 5 to July 17 and March 19 to July 17, respectively. See Figure 1 for a description of each variable

Conclusion

In this article, we propose an iterative method to estimate the unknown parameters in the Gompertz model. The results of the modeling show a good fit between the estimated and the observed data. We obtained an estimate of for the cumulative number of infected and killed in Brazil. For Sao Paulo, we obtained an approximation of and for the cumulative number of infected and deaths, respectively. For Rio de Janeiro, the estimate was for the cumulative number of infected and deaths. The basic reproduction number ( or the average number of infections caused by one typically infectious individual ) depends on the growth rate (r), which was estimated in this work, and the duration of infectiousness . Considering , the basic reproduction number was 1.3, 1.3, and 1.4 for Brazil, Sao Paulo, and Rio de Janeiro, respectively. Applying our proposed methodology for China, we obtained , given the number of infected cases accumulated from February 2 to July 2, based on data collected from the world health organization. We did the same for Italy and obtained . Many works obtained these values for China and Italy. Therefore, our estimates for the basic reproduction number are acceptable. The inflection point of the curve provides vital information about changing trends in the epidemic and may possibly indicate changes in intervention and control. In this paper, the inflection points for the cumulative number of infected cases were July 9, July 4, and June 21 for Brazil, Sao Paulo, and Rio de Janeiro, respectively. Regarding the methodology used, there are several methods for estimating or determining parameters in a mathematical model. We can divide these methods into two groups, iterative and non-iterative methods. In this paper, we used a gradient type iterative method. This method can recover unknown parameters with a non-uniform distribution (non-constant functions) given noisy data. On the other hand, like all iterative methods, it depends on an initial estimate. If this initial approximation is far from the solution, the iteration will diverge. Although it is difficult to determine an initial estimate for some problems, in this work we found an appropriate initial guess, see Remark 1.
  11 in total

1.  A unified approach to the Richards-model family for use in growth analyses: why we need only two model forms.

Authors:  Even Tjørve; Kathleen M C Tjørve
Journal:  J Theor Biol       Date:  2010-09-08       Impact factor: 2.691

2.  A computational approach for the inverse problem of neuronal conductances determination.

Authors:  Jemy A Mandujano Valle; Alexandre L Madureira; Antonio Leitão
Journal:  J Comput Neurosci       Date:  2020-07-06       Impact factor: 1.621

3.  Early outbreak of 2009 influenza A (H1N1) in Mexico prior to identification of pH1N1 virus.

Authors:  Ying-Hen Hsieh; Stefan Ma; Jorge X Velasco Hernandez; Vernon J Lee; Wei Yen Lim
Journal:  PLoS One       Date:  2011-08-31       Impact factor: 3.240

Review 4.  Deciphering death: a commentary on Gompertz (1825) 'On the nature of the function expressive of the law of human mortality, and on a new mode of determining the value of life contingencies'.

Authors:  Thomas B L Kirkwood
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-04-19       Impact factor: 6.237

Review 5.  The use of Gompertz models in growth analyses, and new Gompertz-model approach: An addition to the Unified-Richards family.

Authors:  Kathleen M C Tjørve; Even Tjørve
Journal:  PLoS One       Date:  2017-06-05       Impact factor: 3.240

6.  COVID-19 spreading in Rio de Janeiro, Brazil: Do the policies of social isolation really work?

Authors:  Nuno Crokidakis
Journal:  Chaos Solitons Fractals       Date:  2020-05-23       Impact factor: 5.944

7.  Estimation of COVID-19 spread curves integrating global data and borrowing information.

Authors:  Se Yoon Lee; Bowen Lei; Bani Mallick
Journal:  PLoS One       Date:  2020-07-29       Impact factor: 3.240

8.  Modeling and forecasting trend of COVID-19 epidemic in Iran until May 13, 2020.

Authors:  Ali Ahmadi; Yasin Fadaei; Majid Shirani; Fereydoon Rahmani
Journal:  Med J Islam Repub Iran       Date:  2020-03-31

9.  Model Selection and Evaluation Based on Emerging Infectious Disease Data Sets including A/H1N1 and Ebola.

Authors:  Wendi Liu; Sanyi Tang; Yanni Xiao
Journal:  Comput Math Methods Med       Date:  2015-09-15       Impact factor: 2.238

10.  Modeling and prediction of COVID-19 in Mexico applying mathematical and computational models.

Authors:  O Torrealba-Rodriguez; R A Conde-Gutiérrez; A L Hernández-Javier
Journal:  Chaos Solitons Fractals       Date:  2020-05-29       Impact factor: 5.944

View more
  4 in total

1.  Forecasting COVID19 Reliability of the Countries by Using Non-Homogeneous Poisson Process Models.

Authors:  Nevin Guler Dincer; Serdar Demir; Muhammet Oğuzhan Yalçin
Journal:  New Gener Comput       Date:  2022-07-03       Impact factor: 1.180

2.  Estimating the state of epidemics spreading with graph neural networks.

Authors:  Abhishek Tomy; Matteo Razzanelli; Francesco Di Lauro; Daniela Rus; Cosimo Della Santina
Journal:  Nonlinear Dyn       Date:  2022-01-21       Impact factor: 5.741

3.  EKF-SIRD model algorithm for predicting the coronavirus (COVID-19) spreading dynamics.

Authors:  Abdennour Sebbagh; Sihem Kechida
Journal:  Sci Rep       Date:  2022-08-04       Impact factor: 4.996

4.  Numerical simulation and stability analysis of a novel reaction-diffusion COVID-19 model.

Authors:  Nauman Ahmed; Amr Elsonbaty; Ali Raza; Muhammad Rafiq; Waleed Adel
Journal:  Nonlinear Dyn       Date:  2021-06-28       Impact factor: 5.741

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.