Deep learning forecasting using time-varying parameters of the SIRD model for Covid-19.

Arthur Bousquet¹, William H Conrad², Said Omer Sadat¹, Nelli Vardanyan¹, Youngjoon Hong³.

Abstract

Accurate epidemiological models are necessary for governments, organizations, and individuals to respond appropriately to the ongoing novel coronavirus pandemic. One informative metric epidemiological models provide is the basic reproduction number ($R_0$), which indicates whether the infected population is growing ($R_0 > 1$) or shrinking ($R_0 < 1$). We introduce a novel algorithm that combines the susceptible-infected-recovered-dead model (SIRD model) with the long short-term memory (LSTM) neural network, allowing for real-time forecasting and time-dependent parameter estimates, including the contact rate, $\beta$, and deceased rate, $\mu$. With an accurate prediction of $\beta$ and $\mu$, we can directly derive $R_0$ and find a numerical solution of compartmental models, such as the SIR-type models. By incorporating the epidemiological dynamics of the SIRD model into the LSTM network, the new algorithm improves forecasting accuracy. Furthermore, we use mobility data from cellphones and the positive test rate in our prediction model, and we also present a vaccination model. Leveraging mobility and vaccination schedules is important for capturing behavioral changes by individuals, as well as by policymakers, in response to the pandemic.

Year: 2022 PMID: 35194090 PMCID: PMC8863886 DOI: 10.1038/s41598-022-06992-0

Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379

Introduction

According to the World Health Organization (WHO), over 5 million people worldwide have died from Covid-19[1]. Public health interventions have limited incidence and mortality of this disease from an early stage[2]. Governments, public health institutions, and the public at large benefit from statistical models that help determine which approaches are effective at controlling the virus and predict when strong measures are needed to slow its transmission. For instance, recent studies have shown the benefits of both voluntary and government-induced social distancing measures[3]. A key metric to predict epidemic progression is the basic reproduction number ($R_0$)[4]. Both compartmental models and networked models have been used to predict $R_0$[5-14]. Compartmental models such as the SIRD (susceptible, infectious, recovered, or dead) model and its variants are used to predict $R_0$ for infectious diseases[15,16], because the number of susceptible, infectious, recovered, and dead people in a population can be readily estimated from publicly available data. The epidemiological parameters $\beta$ (contact rate), $\gamma$ (recovery rate), and $\mu$ (deceased rate) in the SIRD model can be estimated from the numbers of susceptible, infectious, recovered, and dead individuals. These parameters are then used to determine $R_0$.

The SIRD model (see Eq. 2) can be solved in different ways. One popular method is to solve the ordinary differential equations (ODEs) numerically. In this case, one needs to know the parameters ($\beta$, $\gamma$, and $\mu$) of the system of differential equations. However, assuming that these parameters are time-independent constants may not be realistic. For example, the contact rate, $\beta$, varies with many time-dependent factors such as mobility and lockdown policy. Hence, time-independent parameters may give outdated information, which introduces a prediction error.

Another method to solve the equations is to use neural networks by considering the system as a time series[17] with a recurrent neural network[18]. This approach does not ensure that the model follows the dynamics of compartmental models, and the neural network is required to predict twice as many variables. More importantly, this approach does not provide the reproduction number directly.

Recently, related studies about Covid-19 incorporated mobility datasets to aid in pandemic modeling[19-21]. For example, James and Menzies[19] used Apple mobility data to examine the relationship between daily Covid-19 cases and national equity index price on a country-by-country basis. Yilmazkuday[20] studied the relationship between country-specific changes in mobility, from the Google mobility dataset, and the number of Covid-19 cases. Also, a metapopulation SEIR model was investigated in[21] that integrated fine-grained dynamic mobility from Safegraph data to simulate the spread of Covid-19. Each of these studies demonstrated that by integrating these mobility data, the SEIR model can accurately fit the real case trajectory, despite substantial changes in the behavior of the population over time.

In this work, we combine a compartmental model with a recurrent neural network that incorporates mobility data as well as the positive test rate. We (1) predict the time-dependent parameters $\beta$ and $\mu$ using a neural network; (2) forecast the infection rates when mobility decreases or increases; and (3) forecast the change in infection rate based on different vaccination schedules. The goal of this paper is to provide a method to predict the time-varying parameters $\beta$ and $\mu$ (and hence $R_0$) as well as to solve the SIRD equations. The method under consideration combines the two aforementioned approaches. We first introduce a version of recurrent neural networks to predict the time-varying parameters $\beta$ and $\mu$. Since $\gamma$ is assumed to be constant, one can easily find $R_0$ from the neural network.
We then obtain the compartments, S, I, R, and D, by numerically solving the SIRD equations over a certain time period (e.g. 7 days). To test the performance of our approach, we used publicly available data for four countries (France, United Kingdom, Germany, and South Korea) provided by Johns Hopkins University. An illustration of the algorithm is provided in Fig. 9. We also include two additional datasets: mobility data from cellphones and the positive test rate. Studies have shown that both mobility and positive test rate influence the spread of Covid-19 considerably[22-25].

Figure 9

A description of the combined SIRD–LSTM model structure with Covid-19 community mobility (mobility) and positive test rate (Pos. Test Rate) to generate forecasts of the time-varying parameters $\beta$ and $\mu$. The ODE solver based on the fourth-order Runge–Kutta method makes use of the predicted parameters in the numerical discretization.

In this paper, we present an accurate computational scheme to predict the reproduction number $R_0$, which enables Covid-19 forecasting. We use this scheme to forecast different scenarios by increasing or decreasing the mobility parameter. In doing so, our model can help study the effect of government-imposed lockdowns on $R_0$. Furthermore, we make use of a SIRD model with vaccination to see how vaccination affects the spread of the virus. Among many other vaccination models[26,27], our study focuses on the model introduced in[28,29] as it is sufficient to capture important dynamics in the experiments. By varying the parameters related to the vaccination rates, our simulations show how the vaccination rate affects the number of infectious cases. Such experiments can show how different public health interventions may affect the outcome of the epidemic.

Results

In this section, we describe a sequence of numerical experiments with our algorithm, which is further detailed in the Methodology section below. First, we present the estimated values of our time-dependent parameters $\beta$ and $\mu$ obtained with the Levenberg–Marquardt algorithm. Then, the accuracy of the algorithm is demonstrated using in-sample data and out-of-sample predictions for the next 10 weeks. Lastly, forecasting depending on mobility and vaccination rate is examined. In summary, our main contributions consist of three key findings: (i) our SIRD–LSTM combined network outperforms classical prediction models; (ii) we incorporate the mobility and vaccination as inputs of our neural network to increase the accuracy of our parameter predictions; (iii) we forecast Covid-19 trends when mobility decreases or increases.

Parameter Estimates

A significant finding of our paper is that treating the parameters $\beta$ and $\mu$ as time-dependent increases model accuracy. Figure 1 shows $\beta$ and $\mu$ for four countries (France, United Kingdom, Germany, and South Korea) generated by the Levenberg–Marquardt algorithm. From this, we can find the basic reproduction number, $R_0$, with $R_0 = \beta/\gamma$, which is useful to study the dynamics of the infectious class[30]. We compare real infection data from France, the United Kingdom, Germany, and South Korea with a SIRD model using constant or time-dependent $\beta$ and $\mu$. Figure 2 shows the difference between constant $\beta$ and $\mu$, estimated with the Levenberg–Marquardt algorithm over one year, and $\beta$ and $\mu$ estimated over just 1 week. The time-dependent model more accurately forecasts the infection rate over seven days across each country regardless of the time period. Therefore, it is necessary to consider $\beta$ and $\mu$ as time-dependent variables.

Figure 1

Predicted time-varying parameters $\beta$ (contact rate), $\mu$ (deceased rate), and reproduction number $R_0$ for each country derived from the Levenberg–Marquardt algorithm.

Figure 2

Comparison between the number of infected, I, from our data and the I predicted by the SIRD model using constant or time-dependent $\beta$ and $\mu$.

Accuracy of our model

To test the forecasting capability of the SIRD–LSTM combined network, we compare the number of predicted confirmed Covid-19 cases under various measures for within-sample scenarios. The in-sample fit of the model is an essential indicator of the validity of the model's parameter predictions, whereas the out-of-sample forecasts can provide an important guideline for decision makers and policymakers. Figure 3 depicts the predicted time-varying parameters $\beta$ and $\mu$ compared with those from the dataset. We randomly choose test data from among 365 days and use them as a test set. To measure the accuracy, we use the relative errors (Eq. 1) of $\beta$, $\mu$, S, I, R, and D, computed between the true values from the dataset and the corresponding predicted values from our algorithm. We observe that the predicted and true parameters are close to each other. Table 1 gives quantitative results on the accuracy of our computation.
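
The error measure can be illustrated with a minimal sketch. The exact formula of Eq. (1) is not recoverable from this text, so the code below assumes one common definition, the relative $L^2$ error $\|y - \hat{y}\|_2 / \|y\|_2$; the function name and the toy numbers are hypothetical and not taken from the paper.

```python
import math


def relative_l2_error(true_values, predicted_values):
    """Relative L2 error ||y - y_hat||_2 / ||y||_2 (assumed form of Eq. 1)."""
    if len(true_values) != len(predicted_values):
        raise ValueError("sequences must have the same length")
    diff_norm = math.sqrt(
        sum((y - y_hat) ** 2 for y, y_hat in zip(true_values, predicted_values))
    )
    true_norm = math.sqrt(sum(y ** 2 for y in true_values))
    if true_norm == 0.0:
        raise ValueError("true values have zero norm")
    return diff_norm / true_norm


# Toy contact-rate series (made up for illustration only).
beta_true = [0.20, 0.22, 0.25, 0.21]
beta_pred = [0.21, 0.22, 0.24, 0.21]
print(relative_l2_error(beta_true, beta_pred))
```

The same function would apply unchanged to $\mu$ and to each compartment S, I, R, and D.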

Figure 3

Comparison between $\beta$ and $\mu$ computed by the Levenberg–Marquardt (LM) algorithm, which is considered as our true data, and $\beta$ and $\mu$ predicted from the LSTM networks for France, United Kingdom, Germany, and South Korea.

Table 1

Relative errors over 2 weeks for $\beta$ and $\mu$, and for the SIRD implementation with the LSTM networks.

Country	$\beta$	$\mu$	S	I	R	D
France	$3.17 \times 10^{-3}$	$1.02 \times 10^{-1}$	$9.4 \times 10^{-5}$	$4.2 \times 10^{-2}$	$6.3 \times 10^{-3}$	$6.3 \times 10^{-2}$
United Kingdom	$3.13 \times 10^{-3}$	$9.26 \times 10^{-2}$	$1.7 \times 10^{-4}$	$3.8 \times 10^{-2}$	$1.8 \times 10^{-3}$	$2.2 \times 10^{-2}$
Germany	$3.11 \times 10^{-3}$	$9.71 \times 10^{-2}$	$5.4 \times 10^{-5}$	$2.1 \times 10^{-2}$	$1.1 \times 10^{-2}$	$1.3 \times 10^{-1}$
Korea	$6.29 \times 10^{-2}$	$1.73 \times 10^{-1}$	$8.0 \times 10^{-7}$	$9.4 \times 10^{-3}$	$2.0 \times 10^{-3}$	$3.4 \times 10^{-2}$

SIRD stands for susceptible (S), infectious (I), removed (R), deceased (D) individuals. The definition of relative error is stated in (1).

Table 1 shows that the relative error of $\beta$ is between $3.11 \times 10^{-3}$ and $6.29 \times 10^{-2}$, and the relative error of $\mu$ is between $9.26 \times 10^{-2}$ and $1.73 \times 10^{-1}$. The relative error over 2 weeks of the compartments S, I, R, and D is also displayed in Table 1. Figure 4 depicts mobility, positive test rate, cumulative infectious individuals, and contact ratio against time. The positive test rate and cumulative infectious individuals follow similar trends, whereas mobility and the positive test rate do not. The countries under consideration enforced lockdowns as cumulative infectious individuals increased. Hence, the trend plots reveal that greater mobility leads to an increase in infectious individuals.

Figure 4

Plots of the normalized mobility values, positive test rate (p), infectious individuals (I), and contact ratio ($\beta$) against time (days) for France, United Kingdom, Germany, and South Korea.

Out-of-sample forecast

We next conduct an out-of-sample forecast analysis of our SIRD–LSTM combined model. Figure 5 shows a prediction of $R_0$ for each country using $\beta$ generated by the LSTM networks. By forecasting $\beta$ and $\mu$, we show in Fig. 6 a short-term prediction of the SIRD model up to 10 weeks. In the simulation, we assume that the positive test rate and mobility remain the same as the final observation in the dataset. Both the SIRD and vaccinated SIRD models are computed and shown in Fig. 6. In France, Germany, and South Korea, the infection curves for the next 10 weeks are increasing, while the infection curve for the next 10 weeks tends to decrease slightly in the United Kingdom. In fact, various sources reported in May 2021 that the vaccination strategy and lockdowns in the United Kingdom were successful[31].

Figure 5

Ten-week prediction of the reproduction number, $R_0$, using $\beta$ generated by the LSTM network for each country.

Figure 6

The SIRD prediction with the LSTM network for France, United Kingdom, Germany, and South Korea. The plots display I, R, and D prediction against the time (Days) for 10 weeks.

Forecasting depending on mobility

Policymakers have sought to decrease the rate of infection in their populations by decreasing population mobility through lockdowns and, more recently, by increasing vaccinations. Here, we model the effect of decreasing mobility and increasing vaccination rate on infection rate. If the mobility is increased by 30% of the normal mobility (baseline mobility), the model shows that the peak of infectious individuals increases drastically; see Fig. 7. The mobility data show how visits to places are changing compared to the baseline. A baseline day represents a normal value for that day of the week. The baseline day is the median value from the 5 weeks Jan 3–Feb 6, 2020; for more information, see e.g.[32]. Figure 7 shows that in France, South Korea, and Germany, increased mobility results in a drastic change in the number of new Covid-19 cases. On the other hand, if mobility is decreased by 30% of the normal mobility, the model predicts that the peak of infectious individuals decreases compared to the baseline mobility.
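
The baseline definition above can be made concrete with a short sketch: for each day of the week, the baseline is the median over Jan 3–Feb 6, 2020, and mobility is reported as a percent change from that baseline. The function names and the visit counts are hypothetical; the published mobility reports provide the percent changes directly, so this is only meant to show the arithmetic.

```python
from datetime import date, timedelta
from statistics import median

BASELINE_START = date(2020, 1, 3)
BASELINE_END = date(2020, 2, 6)


def weekday_baseline(visits_by_date):
    """Median visits per weekday (0 = Monday) over the 5-week baseline period."""
    by_weekday = {}
    for day, visits in visits_by_date.items():
        if BASELINE_START <= day <= BASELINE_END:
            by_weekday.setdefault(day.weekday(), []).append(visits)
    return {weekday: median(values) for weekday, values in by_weekday.items()}


def percent_change(day, visits, baseline):
    """Percent change of one day's visits relative to the same weekday's baseline."""
    base = baseline[day.weekday()]
    return 100.0 * (visits - base) / base


# Synthetic visit counts (made up): 100 visits on every baseline day.
visits = {BASELINE_START + timedelta(days=k): 100.0 for k in range(35)}
lockdown_day = date(2020, 4, 1)
visits[lockdown_day] = 70.0

baseline = weekday_baseline(visits)
change = percent_change(lockdown_day, visits[lockdown_day], baseline)
print(change)  # -30.0
```

A scenario such as "mobility decreased by 30%" then corresponds to feeding a value of -30 relative to the baseline into the forecasting model.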

Figure 7

Forecasting the number of Covid-19 infections for France, the United Kingdom, Germany, and South Korea under mobility increased or decreased by 30% relative to normal mobility (baseline mobility). Mobility data are real-time cell phone/mobile device locations for each country collected from[32]. The three plotted series stand for infections with normal, 30% increased, and 30% decreased mobility, respectively. The vaccination model is used for the simulations.

Forecasting depending on the vaccination rate

In addition, with vaccination, the Covid-19 cases are noticeably decreasing for all of the countries under study. Countries whose reproduction number ($R_0$) is close to 1, such as the United Kingdom and South Korea, show a stronger vaccination effect than the other countries. Figure 8 displays forecasts of infectious cases under various vaccination schedules within 70 days. In the experiment, we assume that the vaccine is distributed evenly over time. The plots reveal that high vaccination rates are important in reducing the number of infectious cases. Figure 7 shows the model's forecast for infections with different mobility levels in each country. Given mobility information, the combined SIRD–LSTM model can predict the time-varying parameters $\beta$ and $\mu$. With those predicted parameters, the number of infectious individuals is computed with or without vaccination. Based on the projected forecasts, we observe that a continuation of quarantine-level mobility will result in low case counts.

Figure 8

Forecasting of the number of Covid-19 infections for France, the United Kingdom, Germany, and South Korea under various vaccination schedules. The legend labels indicate the percentage of the population that is vaccinated.

Discussion

We introduced a novel algorithm that incorporates deep learning and compartmental models allowing for forecasts and evaluation of the current Covid-19 outbreak worldwide. We combined the SIRD model with the LSTM network and observed advantages of real-time forecasting and parameter estimation. The new algorithm integrates the forecasting accuracy of LSTM networks with the epidemiological model dynamics of the SIR-type model. Compared to the classical SIRD model in the literature, we forecast time-varying parameters predicted by the LSTM neural network. To forecast the parameters, mobility and positive test rate data are used in the architecture. We find that these inputs are important in improving the model’s ability to fit the data. In addition, incorporating these data is essential for capturing behavioral changes by individuals in response to the pandemic as well as to observe the effect of policy decisions to increase vaccination and decrease mobility. As in other approaches, we conduct our research on publicly available datasets. We demonstrate how a new algorithm can be developed to better exploit quantitative measures in the fight against Covid-19. By utilizing reliable metrics and infection dynamics, we provide an approach that is deeply data-driven and computer-based. The proposed simulations can provide a tool for forecasting the effects of different mobility scenarios. Furthermore, as the proposed algorithm is compatible and generalizable, this allows for additional compartments in the SIR model or additional input datasets in the network which makes the method accessible to policymakers. Our developments point towards several extensions of great importance. In particular, we evaluated the impact of the imposition and relaxation of lockdown measures by inputting these changes into the LSTM neural network. 
We found that employing lockdown rules for each country can help to capture interesting regional dynamics of Covid-19 and may give specific information to policymakers. Another direction is to study advanced deep learning architectures such as attention mechanisms or transformers[33]. These modern architectures may allow better investigation not only of gains in forecasting performance but also of how the highly nonlinear capabilities of the neural network can be used to conduct inference on latent parameters of the SIR model.

Methodology

In this section, we explore our numerical method and prediction algorithm considered in this research. To begin, we describe the compartmental models, the SIRD equations, and the Runge–Kutta method. Then, we present the Levenberg–Marquardt algorithm. Lastly, we illustrate the combined SIRD–LSTM architecture which is the heart of our approach. We confirm that all methods were performed in accordance with the relevant guidelines and regulations.

Compartmental model: SIRD model

In this study, we represent the spread of Covid-19 using the susceptible-infected-recovered-dead (SIRD) model. Compartmental models have been used to simplify the mathematical modeling of infectious diseases[34,35]. One of the best-known (and simplest) models is the SIR model, and many models, including SIRD, are derivatives of this basic form[36-38]. The SIRD model predicts how a disease spreads, the total number infected, or the duration of an epidemic, and estimates important epidemiological parameters such as the reproduction number. In the compartmental model, the population is assigned to compartments with the following labels:

S(t): the number of individuals susceptible to contracting the infection at time t;
I(t): the number of individuals that are alive and infected at time t;
R(t): the cumulative number of individuals that recovered from the disease up to time t;
D(t): the cumulative number of individuals that died from the disease up to time t.

In addition, N is the total number of people in the area at time t, with $N = S(t) + I(t) + R(t) + D(t)$. The SIRD model is given by the following expressions[15]:

$$
\begin{aligned}
\frac{dS}{dt} &= -\frac{\beta S I}{N}, \\
\frac{dI}{dt} &= \frac{\beta S I}{N} - \gamma I - \mu I, \\
\frac{dR}{dt} &= \gamma I, \\
\frac{dD}{dt} &= \mu I,
\end{aligned}
\tag{2}
$$

where the parameter $\beta$, called the contact ratio, represents the effective contact rate, i.e. the expected number of people infected by an infectious person, and $\gamma$ is defined as the recovery rate, i.e. the expected number of people removed from the infected state. The ratio of $\beta$ and $\gamma$ is called the reproduction number, i.e. $R_0 = \beta/\gamma$. The reproduction number ($R_0$) gives the average number of secondary infections coming from an infected person. The parameter $\mu$ is defined as the deceased rate. We assume that the recovered subjects are no longer susceptible to infection; the number of deaths due to other reasons is neglected. Further, the region under consideration is assumed to be isolated from other regions. This is a reasonable assumption, as containment measures such as travel restrictions have been enforced in most countries.

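
As a minimal sketch of these dynamics, the following code evaluates the right-hand side of a standard SIRD system and the reproduction number. It assumes the usual form in which new infections are $\beta S I / N$ with $N = S + I + R + D$; the function names and all numerical values are illustrative and are not taken from the paper.

```python
def sird_rhs(state, beta, gamma, mu):
    """Time derivatives (dS, dI, dR, dD) of a standard SIRD system."""
    s, i, r, d = state
    n = s + i + r + d  # total population, assumed closed
    new_infections = beta * s * i / n
    return (
        -new_infections,                         # dS/dt
        new_infections - gamma * i - mu * i,     # dI/dt
        gamma * i,                               # dR/dt
        mu * i,                                  # dD/dt
    )


def reproduction_number(beta, gamma):
    """Reproduction number as the ratio of contact rate to recovery rate."""
    return beta / gamma


# Hypothetical state and parameters, chosen only for illustration.
state = (990_000.0, 8_000.0, 1_900.0, 100.0)
derivs = sird_rhs(state, beta=0.3, gamma=0.1, mu=0.01)
print(derivs)
print(reproduction_number(0.3, 0.1))
```

Because every individual who leaves one compartment enters another, the four derivatives sum to zero, which is a quick sanity check on any implementation.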
By introducing the vaccination rate, the S(t) and R(t) terms can be modified for the vaccination model. We add a vaccination rate and a vaccine efficacy factor to our SIRD model to study an extended SIRD model with vaccination; efficacy values for the Moderna and Pfizer vaccines are reported in[39]. More precisely, we introduce a multiplier factor. The resulting SIRD model incorporating vaccination follows[28,29].

With the SIRD model, we train a deep neural network to predict $\beta$ and $\mu$. Subsequently, the SIRD model with vaccination provides the dynamics under vaccination with the predicted parameters $\beta$ and $\mu$. The contact rate, $\beta$, and death rate, $\mu$, of many acute infectious diseases vary significantly in time and frequently exhibit significant seasonal dependence[40,41]. Epidemiological models can be used to predict the contact and death rates, which are important for measuring the spread of disease. A substantial body of research predicts the contact and death rates, $\beta$ and $\mu$, of infectious diseases via the discrete compartmental model[42-44]. The rest of this section introduces an algorithm to compute the time-dependent parameters directly from our data and the discrete SIRD model.

Levenberg–Marquardt algorithm

To estimate the contact rate, $\beta$, and the death rate, $\mu$, we use the Levenberg–Marquardt algorithm. To apply the algorithm, we solve the SIRD equations using a numerical approximation. In the present study, we use the fourth-order Runge–Kutta method (RK4), which gives a discrete version of the SIRD model: for simplicity, (2) is recast in a compact form before RK4 is applied. Given a dataset, we use the Levenberg–Marquardt algorithm to find the parameters of the model curve by least-squares curve fitting[45]. We note that the Covid-19 dataset for each country is obtained from the Google mobility report[32].
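
The procedure can be sketched in plain Python. This is not the authors' implementation: it assumes the standard SIRD right-hand side, a daily RK4 step, a fixed recovery rate `gamma = 0.1` (a made-up value), a forward-difference Jacobian, and a basic multiplicative damping rule. It fits $\beta$ and $\mu$ to synthetic data generated from known parameters.

```python
def sird_rhs(y, beta, gamma, mu):
    """Right-hand side of the SIRD system for state y = [S, I, R, D]."""
    s, i, _, _ = y
    new_inf = beta * s * i / sum(y)
    return [-new_inf, new_inf - (gamma + mu) * i, gamma * i, mu * i]


def rk4_step(y, h, beta, gamma, mu):
    """Advance the state by one classical fourth-order Runge-Kutta step."""
    def shifted(k, c):
        return [yi + c * ki for yi, ki in zip(y, k)]
    k1 = sird_rhs(y, beta, gamma, mu)
    k2 = sird_rhs(shifted(k1, h / 2), beta, gamma, mu)
    k3 = sird_rhs(shifted(k2, h / 2), beta, gamma, mu)
    k4 = sird_rhs(shifted(k3, h), beta, gamma, mu)
    return [yi + h / 6 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]


def simulate(y0, days, beta, gamma, mu):
    """Daily trajectory [y(0), ..., y(days)] with step size of one day."""
    traj = [list(y0)]
    for _ in range(days):
        traj.append(rk4_step(traj[-1], 1.0, beta, gamma, mu))
    return traj


def fit_beta_mu(data, gamma, guess, max_iter=100):
    """Levenberg-Marquardt fit of (beta, mu) to daily [S, I, R, D] rows."""
    n = sum(data[0])

    def residuals(p):
        model = simulate(data[0], len(data) - 1, p[0], gamma, p[1])
        # Scale by population so all compartments have comparable size.
        return [(m - o) / n for rm, ro in zip(model, data) for m, o in zip(rm, ro)]

    p, lam = list(guess), 1e-3
    r = residuals(p)
    cost = sum(v * v for v in r)
    for _ in range(max_iter):
        if cost < 1e-24:
            break
        cols = []  # forward-difference Jacobian, one column per parameter
        for j in range(2):
            q = list(p)
            q[j] += 1e-7
            cols.append([(a - b) / 1e-7 for a, b in zip(residuals(q), r)])
        # Damped normal equations: (J^T J + lam * diag(J^T J)) dp = -J^T r
        a11 = sum(c * c for c in cols[0]) * (1 + lam)
        a22 = sum(c * c for c in cols[1]) * (1 + lam)
        a12 = sum(c * d for c, d in zip(cols[0], cols[1]))
        g1 = sum(c * v for c, v in zip(cols[0], r))
        g2 = sum(c * v for c, v in zip(cols[1], r))
        det = a11 * a22 - a12 * a12
        trial = [p[0] + (-g1 * a22 + g2 * a12) / det,
                 p[1] + (-g2 * a11 + g1 * a12) / det]
        r_trial = residuals(trial)
        cost_trial = sum(v * v for v in r_trial)
        if cost_trial < cost:   # accept the step and relax the damping
            p, r, cost, lam = trial, r_trial, cost_trial, lam / 10
        else:                   # reject the step and increase the damping
            lam *= 10
    return p


# Synthetic two-week dataset generated with known (made-up) parameters.
y0 = [990_000.0, 10_000.0, 0.0, 0.0]
data = simulate(y0, 14, beta=0.25, gamma=0.1, mu=0.01)
beta_fit, mu_fit = fit_beta_mu(data, gamma=0.1, guess=(0.2, 0.02))
print(beta_fit, mu_fit)
```

Repeating the fit on consecutive short windows (for example one week at a time) yields a time series of $\beta$ and $\mu$, which is the kind of time-dependent estimate the text describes. In practice a library routine such as a tested least-squares solver would usually be preferred over a hand-written loop.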

Neural network architecture

Long short-term memory (LSTM) networks are variants of the recurrent neural network (RNN) that are capable of learning long-term dependencies. They were introduced by Hochreiter and Schmidhuber[46] and are widely used in many fields such as time series prediction[47], speech recognition[48], and robot control[49], among many other applications. In theory, classic RNNs can keep track of arbitrary long-term dependencies in the input sequences. However, there is a computational drawback to the standard RNNs, in which the repeating module has a very simple structure, such as a single layer. When training a classical RNN with back-propagation, the back-propagated gradients may tend to zero (the vanishing gradient problem), so the RNN effectively remembers data for just a short duration of time. In other words, information needed shortly after it is seen can be recovered, but once many more inputs are fed in, it may be lost. This issue can be resolved by applying a variant of RNNs such as the LSTM network. LSTMs are explicitly designed to avoid the long-term dependency problem, as remembering information for long periods is practically their default behavior. The compact form of the LSTM with a forget gate can be described by the following system of equations:

$$
\begin{aligned}
f_t &= \sigma(W_f x_t + U_f h_{t-1} + b_f), \\
i_t &= \sigma(W_i x_t + U_i h_{t-1} + b_i), \\
o_t &= \sigma(W_o x_t + U_o h_{t-1} + b_o), \\
\tilde{c}_t &= \tanh(W_c x_t + U_c h_{t-1} + b_c), \\
c_t &= f_t \odot c_{t-1} + i_t \odot \tilde{c}_t, \\
h_t &= o_t \odot \tanh(c_t),
\end{aligned}
$$

where $x_t$ is the input vector, $h_t$ is the hidden state, $c_t$ is a memory cell, and $i_t$, $f_t$, and $o_t$ denote the input, forget, and output gates, respectively; $\sigma$ is the sigmoid function, and $W$, $U$, and $b$ are weight matrices and bias vectors. For more details, see for instance[46,50,51]. Here, the operator $\odot$ denotes the Hadamard product (element-wise product), and the subscript t indexes the time step. In the proposed neural network, we couple the SIRD model (2) and the LSTM network.
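
To make the gate mechanics concrete, here is a minimal sketch of a single forward step of an LSTM cell with a forget gate, written with plain Python lists. It follows the textbook formulation (sigmoid gates, tanh candidate) rather than the authors' trained network; the parameter layout, the helper names, and the four input features are assumptions for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(x, h, W, U, b):
    """Compute W x + U h + b for list-based matrices and vectors."""
    return [
        sum(w * xj for w, xj in zip(W_row, x))
        + sum(u * hj for u, hj in zip(U_row, h))
        + b_k
        for W_row, U_row, b_k in zip(W, U, b)
    ]


def lstm_step(x, h_prev, c_prev, params):
    """One LSTM step; params maps 'f', 'i', 'o', 'c' to (W, U, b) triples."""
    f = [sigmoid(z) for z in affine(x, h_prev, *params["f"])]    # forget gate
    i = [sigmoid(z) for z in affine(x, h_prev, *params["i"])]    # input gate
    o = [sigmoid(z) for z in affine(x, h_prev, *params["o"])]    # output gate
    g = [math.tanh(z) for z in affine(x, h_prev, *params["c"])]  # candidate cell
    # Element-wise (Hadamard) updates of the memory cell and hidden state.
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c


def zero_params(n_in, n_hidden):
    """All-zero weights and biases, convenient for checking the arithmetic."""
    return {
        key: (
            [[0.0] * n_in for _ in range(n_hidden)],
            [[0.0] * n_hidden for _ in range(n_hidden)],
            [0.0] * n_hidden,
        )
        for key in "fioc"
    }


# Hypothetical input: (beta, mu, positive test rate, mobility change).
x_t = [0.25, 0.01, 0.05, -0.3]
h_t, c_t = lstm_step(x_t, [0.0, 0.0], [1.0, -2.0], zero_params(4, 2))
print(h_t, c_t)
```

With all-zero weights every gate equals 0.5 and the candidate is 0, so the cell state is simply halved. This shows how the forget gate scales what is carried over from the previous step.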
Estimates of the contact rate $\beta$ and the deceased rate $\delta$ are first obtained by curve fitting with the Levenberg–Marquardt algorithm. With these estimates, the input data include the positive test rate (the percentage of all coronavirus tests performed that are positive) and the mobility trend at time $t$ obtained from Google’s mobility reports. The reports chart movement trends over time by geography, across different categories of places such as retail and recreation, groceries and pharmacies, parks, transit stations, workplaces, and residential. The output of the LSTM network is the pair ($\beta$, $\delta$). As cost functions, we use the mean-squared forecasting error as well as the mean absolute percentage error. The network structure and the activation of each hidden unit in the hidden layers are determined by the neurons in the previous layers. The activity of each layer is given by a nonlinear activation function such as a sigmoid or ReLU function. The final output of the coupled model is obtained by combining the network output of confirmed cases with the SIR model forecast.

More precisely, the collective dataset generated from the SIRD model is used as input for the LSTM, whose outputs provide the parameters $\beta$ and $\delta$ for the next time period. By predicting the parameters, we are able to solve the SIRD model, which gives the susceptible, infectious, recovered, and dead populations for the next time period. The coupled models given in Fig. 9 illustrate the Neural LSTM-SIRD architecture. The network architecture we use is an LSTM with ReLU activation functions, trained with the Adam optimizer and a mean-squared error loss function. The model is not constrained to a particular setup, and we could search over various hyperparameters, such as the number of neurons, with similar results.
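The last step of the coupling, solving the SIRD model with the predicted parameters, can be sketched in plain Python. The sketch assumes the common form of the SIRD equations, with contact rate `beta`, recovery rate `gamma`, deceased rate `delta`, and a constant total population; this is assumed to match Eq. (2) but is not taken from it. It also assumes that `gamma` is held fixed and that each predicted (`beta`, `delta`) pair is held constant over one fourth-order Runge–Kutta step. All function names are hypothetical, and the two error metrics are the usual definitions of mean-squared error and mean absolute percentage error.

```python
def sird_rhs(state, beta, gamma, delta):
    """Right-hand side of the (assumed) SIRD system for state = (S, I, R, D)."""
    S, I, R, D = state
    N = S + I + R + D                  # total population, constant by construction
    new_infections = beta * S * I / N
    return (-new_infections,
            new_infections - (gamma + delta) * I,
            gamma * I,
            delta * I)


def _axpy(u, v, s):
    """Return u + s * v, component-wise, for tuples u and v."""
    return tuple(a + s * b for a, b in zip(u, v))


def rk4_step(state, beta, gamma, delta, dt=1.0):
    """Advance the SIRD state by one classical fourth-order Runge-Kutta step."""
    k1 = sird_rhs(state, beta, gamma, delta)
    k2 = sird_rhs(_axpy(state, k1, dt / 2), beta, gamma, delta)
    k3 = sird_rhs(_axpy(state, k2, dt / 2), beta, gamma, delta)
    k4 = sird_rhs(_axpy(state, k3, dt), beta, gamma, delta)
    return tuple(y + dt / 6 * (a + 2 * b + 2 * c + d)
                 for y, a, b, c, d in zip(state, k1, k2, k3, k4))


def forecast(state0, betas, deltas, gamma, dt=1.0):
    """Roll the model forward, one RK4 step per predicted (beta, delta) pair."""
    trajectory = [tuple(state0)]
    for beta, delta in zip(betas, deltas):
        trajectory.append(rk4_step(trajectory[-1], beta, gamma, delta, dt))
    return trajectory


def mse(pred, obs):
    """Mean-squared error between forecast and observed values."""
    return sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs)


def mape(pred, obs):
    """Mean absolute percentage error (in percent); obs must be nonzero."""
    return 100.0 * sum(abs((o - p) / o) for p, o in zip(pred, obs)) / len(obs)
```

In use, the sequences `betas` and `deltas` would come from the LSTM output for the next time period, and the infectious or dead component of the resulting trajectory could then be compared with reported counts using `mse` or `mape`.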

Data

We collected data from the following sources:
Covid-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University, https://github.com/CSSEGISandData/COVID-19 (see[52]).
Our World in Data, https://github.com/owid/covid-19-data/tree/master/public/data (see[53]).
Google Mobility Report, https://www.google.com/covid19/mobility/.

28 in total

1. Learning to forget: continual prediction with LSTM.

Authors: F A Gers; J Schmidhuber; F Cummins
Journal: Neural Comput Date: 2000-10 Impact factor: 2.026

2. Modeling Epidemics With Compartmental Models.

Authors: Juliana Tolles; ThaiBinh Luong
Journal: JAMA Date: 2020-06-23 Impact factor: 56.272

3. How will country-based mitigation measures influence the course of the COVID-19 epidemic?

Authors: Roy M Anderson; Hans Heesterbeek; Don Klinkenberg; T Déirdre Hollingsworth
Journal: Lancet Date: 2020-03-09 Impact factor: 79.321

4. A compartmental model that predicts the effect of social distancing and vaccination on controlling COVID-19.

Authors: Mohammadali Dashtbali; Mehdi Mirzaie
Journal: Sci Rep Date: 2021-04-14 Impact factor: 4.379

5. Heterogeneous interventions reduce the spread of COVID-19 in simulations on real mobility data.

Authors: Haotian Wang; Abhirup Ghosh; Jiaxin Ding; Rik Sarkar; Jie Gao
Journal: Sci Rep Date: 2021-04-08 Impact factor: 4.379

6. Data-driven optimized control of the COVID-19 epidemics.

Authors: Afroza Shirin; Yen Ting Lin; Francesco Sorrentino
Journal: Sci Rep Date: 2021-03-22 Impact factor: 4.379

7. Plug-and-play inference for disease dynamics: measles in large and small populations as a case study.

Authors: Daihai He; Edward L Ionides; Aaron A King
Journal: J R Soc Interface Date: 2009-06-17 Impact factor: 4.118

8. An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China.

Authors: Huaiyu Tian; Yonghong Liu; Yidan Li; Chieh-Hsi Wu; Bin Chen; Moritz U G Kraemer; Bingying Li; Jun Cai; Bo Xu; Qiqi Yang; Ben Wang; Peng Yang; Yujun Cui; Yimeng Song; Pai Zheng; Quanyi Wang; Ottar N Bjornstad; Ruifu Yang; Bryan T Grenfell; Oliver G Pybus; Christopher Dye
Journal: Science Date: 2020-03-31 Impact factor: 47.728

9. An interactive web-based dashboard to track COVID-19 in real time.

Authors: Ensheng Dong; Hongru Du; Lauren Gardner
Journal: Lancet Infect Dis Date: 2020-02-19 Impact factor: 25.071

10. Effectiveness of Pfizer-BioNTech and Moderna Vaccines Against COVID-19 Among Hospitalized Adults Aged ≥65 Years - United States, January-March 2021.

Authors: Mark W Tenforde; Samantha M Olson; Wesley H Self; H Keipp Talbot; Christopher J Lindsell; Jay S Steingrub; Nathan I Shapiro; Adit A Ginde; David J Douin; Matthew E Prekker; Samuel M Brown; Ithan D Peltan; Michelle N Gong; Amira Mohamed; Akram Khan; Matthew C Exline; D Clark Files; Kevin W Gibbs; William B Stubblefield; Jonathan D Casey; Todd W Rice; Carlos G Grijalva; David N Hager; Arber Shehu; Nida Qadir; Steven Y Chang; Jennifer G Wilson; Manjusha Gaglani; Kempapura Murthy; Nicole Calhoun; Arnold S Monto; Emily T Martin; Anurag Malani; Richard K Zimmerman; Fernanda P Silveira; Donald B Middleton; Yuwei Zhu; Dayna Wyatt; Meagan Stephenson; Adrienne Baughman; Kelsey N Womack; Kimberly W Hart; Miwako Kobayashi; Jennifer R Verani; Manish M Patel
Journal: MMWR Morb Mortal Wkly Rep Date: 2021-05-07 Impact factor: 35.301