Literature DB >> 34092919

RLIM: a recursive and latent infection model for the prediction of US COVID-19 infections and turning points.

Xiang Yu1, Lihua Lu1, Jianyi Shen1, Jiandun Li1, Wei Xiao1, Yangquan Chen2.   

Abstract

Initially found in Hubei, Wuhan, and identified as a novel virus of the coronavirus family by the WHO, COVID-19 has spread worldwide at exponential speed, causing millions of deaths and public fear. Currently, the USA, India, Brazil, and other parts of the world are experiencing a secondary wave of COVID-19. However, the medical, mathematical, and pharmaceutical aspects of its transmission, incubation, and recovery processes are still unclear. The classical susceptible-infected-recovered model has limitations in describing the dynamic behavior of COVID-19. Hence, it is necessary to introduce a recursive, latent model to predict the number of future COVID-19 infection cases in the USA. In this article, a dynamic recursive and latent infection model (RLIM) based on the classical SEIR model is proposed to predict the number of COVID-19 infections. Given COVID-19 infection and recovery data for a certain period, the RLIM is able to fit current values and produce an optimal set of parameters with a minimum error rate according to actual reported numbers. With these optimal parameters assigned, the RLIM model then becomes able to produce predictions of infection numbers within a certain period. To locate the turning point of COVID-19 transmission, an initial value for the secondary infection rate is given to the RLIM algorithm for calculation. RLIM will then calculate the secondary infection rates of a continuous time series with an iterative search strategy to speed up the convergence of the prediction outcomes and minimize the maximum square errors. Compared with other forecast algorithms, RLIM is able to adapt the COVID-19 infection curve faster and more accurately and, more importantly, provides a way to identify the turning point in virus transmission by searching for the equilibrium between recoveries and new infections. Simulations of four US states show that with the secondary infection rate ω initially set to 0.5 within the selected latent period of 14 days, RLIM is able to minimize this value at 0.07 and reach an equilibrium condition. A successful forecast is generated using New York state's COVID-19 transmission, in which a turning point is predicted to emerge on January 31, 2021. Supplementary Information: The online version contains supplementary material available at 10.1007/s11071-021-06520-1.
© The Author(s), under exclusive licence to Springer Nature B.V. 2021.

Entities:  

Keywords:  COVID-19; Recursive time series; SEIR; Secondary infections; Turning point

Year:  2021        PMID: 34092919      PMCID: PMC8166369          DOI: 10.1007/s11071-021-06520-1

Source DB:  PubMed          Journal:  Nonlinear Dyn        ISSN: 0924-090X            Impact factor:   5.022


Introduction

Since its first appearance in Hubei, Wuhan, China, in December 2019, a novel virus named COVID-19 has affected millions of people worldwide, causing unpredicted economic losses and public fear. To date, the origin, incubation time, and transmission speed of COVID-19 have not been clarified. Numerous attempts from medical, clinical, and mathematical perspectives have been made to analyze the dramatic increase in infections brought by COVID-19 and predict its transmission trends. A number of COVID-19-related studies developed their mathematical modeling based on the susceptible–infected–removed (SIR) model, which was originally proposed by Kermack and McKendrick [13] to analyze black death virus transmission occurring in London, the UK, and pestilence in Mumbai, India, in 1666 and 1906, respectively. Theoretically, this model divides the progress of virus transmission into three phases–susceptible, infected, and removed–and relates mathematical parameters with the characteristics of each stage. For example, a mathematical parameter, , was assumed between susceptible and infected to identify the percentage of the healthy and vulnerable population that transform into a positively infected patient. has been associated with , the basic reproductive number, which is widely used by clinical experts to express the average speed of transmission for a specific virus. Another important indicator, , has been widely applied to record the percentage that move from infection to recovery or death. The reciprocal of indicates the median incubation period of COVID-19 transmission, which has attracted much interest from the scientific community. Regarding the incubation period of COVID-19, a number of research findings have also been published: Yu et al. [21] investigated COVID-19-infection cases reported in China and other countries and recorded incubation periods ranging from 7 to 14 days; Lai [14] collected exposure periods for 125 Chinese patients, and the estimation indicated that the median incubation period was 4.75. Zhu [22] assumed that the latent period and the infectious period are approximately equal to the incubation period and the length of stay in the hospital and preliminarily concluded that the value of the latent period and the infectious period is 5 and 10 days, respectively. Adhikari [1] asserted that the average incubation duration of COVID-19 was , ranging from 2 to 11 days (with 95% confidence interval, 4.1 to 7). Although numerous mathematical models have been developed to address the dynamics of COVID-19, very few focus on the secondary infections caused by recovery. Many of these models treat COVID-19 as a respiratory disease that requires immediate medical attention but does not last for long or cause secondary effects. However, long-lasting illnesses and secondary outbreaks in the USA, UK, Brazil, and India all indicate that COVID-19 symptoms cannot be treated as terminating in a manner similar to flu. For example, Sabino et al. [18] observed the resurgence of COVID-19 in January 2021 in Brazil and asserted that one of the main reasons behind this resurgence was that immunity against COVID-19 infection had already begun to wane by December 2020. Thus, the recovered group could still be infected or become a virus carrier. According to the COVID tracking project [8], the definition of “COVID-19 recovery” varies among different US regions, ranging from, for example, “symptom improvement” to “hospital discharges” or even “days since diagnosis”. In addition, there is no clear evidence that “recovered” patients are subsequently immune to COVID-19. Thus, it is reasonable and necessary to assume that a portion of them, after a certain period of time, will move from the immune group to the susceptible group. A recent scientific report from Christian Gaebler et al. [9] proves that the humeral memory response to COVID-19 will last between 1.3 and 6.3 months after infection without vaccine support. Okhuese [16] attempted to estimate the probability of COVID-19 reinfection by searching the equilibrium state of the SEIRUS model. In his simulation report, after 12 days, the rate of recovery and rate of infection will meet and reach an equilibrium state. However, his model merely considered incorrectly executed PCR tests, which is not sufficiently accurate to describe current COVID-19 transmission in the USA. According to Altan and Karasu [2], X-ray images are able to provide better results than RT-PCR tests in the diagnosis of COVID-19 disease. One of the essential questions to be answered by a forecasting algorithm is when and how a turning point will appear. A turning point within COVID-19 transmission contains valuable information to help governments, clinical services, and scientists model the transmission and prepare. Yang et al. [20] successfully predicted the peak of COVID-19’s first wave in China in late February under public health interventions. Many studies, such as [7] and [21], relate COVID-19 transmission’s turning point with political or public affairs, such as city lockdowns and school closure. Recently, some researchers claimed that the turning point and end of an expanding epidemic cannot be precisely forecasted [3] because COVID-19 transmission is highly dynamic and unstable, and the forecasting results are sensitive to small variations in parameters. The recursive and latent infection model (RLIM) algorithm proposed in this paper can provide a reliable estimation of the COVID-19 transmission turning point due to three factors. First, the authors chose a period of 14 days for prediction, which is not long enough for new effects to emerge and affect the results. Second, the turning point in our system was predicted with validated data records, and the trend in these data records was carefully observed to guarantee their smoothness. Finally, a detailed investigation of state-level regulations was made for the target states and dates to ensure that no political events occurred (such as a state lockdown or hospital emergencies). In this article, we develop and present the RLIM, a novel COVID-19 transmission model. The main contributions of this paper are as follows: (1) Developed a novel method to forecast the number of infections in the upcoming 14 days based on historical infection and recovery data. This method is able to efficiently locate the relationship between historical data and infection data and optimize the parameters of the RLIM model in a short period. Evidence from our experiment proves that the key parameters converge within a certain period in the optimized RLIM model, thereby locating the optimal parameters. (2) Given an infection-recovery dataset for COVID-19, this method is able to promptly locate the turning point with an iterative search strategy. Our experiment shows that within a period of 60 days, an RLIM model with an optimized set of parameters based on historical data (see contribution point 1) is able to predict the secondary infection rates for the coming week with an optimization strategy that minimizes the MSE (maximum square error) between the reported number of infections and RLIM predictions. The remainder of this manuscript is organized as follows. Section 2 discusses the implemented mathematical modeling, equations and algorithms. Section 3 describes the simulation settings, software and scientific packages utilized by the RLIM program. Section 4 discusses the data and simulation results for four US states’ COVID-19 and provides predictions on their infections between mid-January and mid-February. Section 5 summarizes the work and offers further discussion.

Method

This section discusses the algorithm in three steps. The first step introduces the mathematics behind RLIM in detail, explaining how it evolves from the classical SIR model and describes COVID-19’s infection process in a series of equations. The next step assigns mathematical symbols to parameters in RLIM and implements these equations into sequential procedures in our algorithm. Finally, a performance measure on RLIM is proposed to evaluate how the algorithm runs on COVID-19 dataset.

RLIM: mathematics

In this paper, a modified COVID-19 transmission model is proposed based on the original SIR model by Kermack and McKendrick [13]. They proposed the susceptible–infected–removed model and used it to successfully explain the 1665–1666 plague in London and the 1906 pestilence at Mumbai, India. The SIR model diagram is shown in Fig. 1.
Fig. 1

The SIR model

The SIR model The transmission process is described by Eqs. (1), (2), and (3).Their SIR model is only feasible in an ideal epidemic transmission environment because it does not consider the time variance in the infection rate or recovery rate . Additionally, it requires no disease control—any political or clinical intervention is forbidden, and such transmission behavior rarely appears. However, based on these theoretical assumptions, many revised models, such as SEIR [19], SEIRUS [16], mechanic-statistic SEIR [17], and deep learning SEIR [12], have been proposed and developed by researchers adopting different epidemic transmission characteristics and human interventions. A description of these models can be found in Hethcote’s review [10]. RLIM is inspired by research from Jianping Huang’s team at Lanzhou university [11]. Their model, named as Global Prediction system for COVID-19 Pandemic (GPCP), adds 4 states of disease from SIR model: insusceptible state (P), potentially infected state (E), quarantined state (Q), and mortality state (D). The GPCP disease transmission model is described by Eqs. (4)–(10).RLIM, based on our observations and given facts from news reports analysis, adds a symbol in the transmission loop. represents the probability that a patient who had recovered from COVID-19 for a certain period is again identified by respiratory tests or antibody tests as a virus carrier. Following this definition, is used between statuses R and SI, representing the transmission possibility between the recovered group and the secondary infected group (Fig. 2).
Fig. 2

RLIM model diagram

Following the assumptions above, for the RLIM, the equation series is modified as Eqs. (11)–(15). RLIM model diagram Compared with GPCP model, the RLIM has the following advantages: (1) The RLIM simplifies the classical susceptible–infected–quarantined–immune process into a susceptible–infection process due to the maturity of the COVID-19 detection system through PCR tests or other nucleic acid amplification tests approved by CDSE [4]. Given an accurate number of confirmed infections, RLIM focuses on differentiating first-time and secondary infections brought by different groups to achieve more accurate prediction results. (2) The RLIM improves the GPCP model with recursive state SI and parameter to avoid the problem of forward transmission only. Without a recursive state and the existence of parameters, the number of new infections will decrease regardless of the actions taken, and this process would be contradictory to the current US COVID-19 transmission records. (3) Introduce the latency parameter to indicate the median reinfection period. In RLIM, is initialized with a value of 14 according to WHO’s instructions and scientific reports. This parameter correlates with the recovery policy in many US states: patients in the hospital will be automatically treated as recovered after a certain period. To apply Eqs. (11) and (12) in our algorithm, a transform into discrete data series shall be implemented as Eqs. (16) and (17). Replace I(t) and R(t) with fourth stage Eqs. (16) and  (17) into Eq. (14) and we have Eq. (18).In Eq. (16), a relationship between coefficients [a, b, c, d] and [e, f, g, h] is established; thus, RLIM is able to predict the infected number of cases given historical number of recovery, previous infections, and assumptions of infection rate, recovery rate, and secondary infection period. A detailed description of the corresponding algorithm flow and diagram will be discussed in is given in Fig. 3 and Sect. 2.2.
Fig. 3

The RLIM model diagram

The RLIM model diagram

RLIM: algorithm

The RLIM algorithm calculates predicted infection numbers according to Eqs. (16)–(18) and optimizes the difference between actual data recorded by the COVID tracking project and the predicted numbers returned from our model. Initially, the predicted recovery numbers are calculated by the fourth-order method based on the real data series of a selected state from the USA between November 2020 and January 2021.

Notations

The notations used throughout this article are described in Table 1. The coefficients (e, f, g, h) associated with this recovery function will then be transformed into other coefficients (a, b, c, d), with the preassigned recovery rate .
Table 1

Notations in RLIM algorithm

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_k$$\end{document}IkNumber of newly infected cases on \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k^{th}$$\end{document}kth date.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_k$$\end{document}RkNumber of newly recovered cases on \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k^{th}$$\end{document}kth date.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{p,k}$$\end{document}Ip,kNumber of predicted infected cases on \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k^{th}$$\end{document}kth date.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{p,k}$$\end{document}Rp,kNumber of predicted recovered cases on \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k^{th}$$\end{document}kth date.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\omega $$\end{document}ωProbability of secondary infections after \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\delta $$\end{document}δ days.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\lambda $$\end{document}λProbability of recovery from infected cases.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\tau $$\end{document}τTime interval between recovery and secondary infection in days.
efghCoefficients returned by fourth method on recovered cases.
abcdCoefficients calculated by Eq. (16) on infected cases.
Notations in RLIM algorithm

RLIM algorithm

The RLIM algorithm calculates predicted infection numbers according to Eqs. (16)–(18) and optimizes the difference between actual data recorded by the COVID tracking project and the predicted numbers returned from our model. Initially, the predicted recovery numbers are calculated by the fourth-order method based on the real data series of a selected state from the USA between November 2020 and January 2021. The coefficients (e, f, g, h) associated with this recovery function will then be transformed into other coefficients (a, b, c, d), with the preassigned recovery rate  (default value 0.01) and secondary infection rate  (default value 0.01). With coefficients (a, b, c, d) assigned, the number of newly infected cases within this state can be calculated. Comparing these predicted numbers with actual data, one can evaluate and justify whether this round of prediction is accurate or not. Our RLIM model will continuously search for the optimal infection series and then determine the optimal associated with this state.

Performance measure

RLIM’s performance measure is calculated as the difference between its predictive output and actual value . Three performance indicators are given: the mean square error (MSE), standard deviation, and average forecasting error rate (AFER). Because different US states have quite different numbers of infections, ranging from hundreds to thousands, these indicators will be uniform between 0 and 1 to justify the performance. Mean square error (MSE) The average of squared difference between RLIM’s predictive output and actual value can be calculated in (19). Root mean square error (RMSE) The root mean square error is also used to evaluate RLIM’s prediction quality. Its formulation is in Eq. (20). Average forecasting error rate (AFER) The average forecasting error rate is the percentage of error, which represents the relative difference between the predicted output and the actual value . It is a cumulative statistic capturing deviation between two time series. The AFER is calculated in (21). RILM simulation report on infections, New York, November 2020–January 2021

Experimental setup

In this section, the data set, the source code, and the software packages which have been used in RLIM are explicitly listed for researchers who are interested in our research and have the intention to re-produce our simulations.

Data source

The data source directly applied in our simulation is from [6]. This data set contains US state-level data on COVID-19, starting from April 2020 until December 2020. In this article, New Jersey (NJ), New York (NY), South Dakota (SD), and Virginia (VG) are selected because they all have daily tracking recovery reports. RLIM relies heavily on accurate and reliable recovery case reports, and these states have highly credible recovery data sources. Parameter initializations for RLIM are the same for all the states: (1) Data fitting period: November 15, 2020–January; 15, 2021; (2) Prediction: January 16, 2021–February; 15, 2021; (3) Recovery rate: ; (4) Secondary infection rate: ; (5) Latency period: ;

Software implementation

The programming language inside RILM is PYTHON version 3.7, and the essential software package used is SCIPY version 1.5.4. Two software modules are inherited from SCIPY: integrate and optimize. RILM utilizes the integrate function to calculate the MSE and the optimize function to fit the real recovery data into fourth-order parameters. RILM simulation report on recoveries, New York, November 2020–January 2021

Code availability

RLIM software is publicly available on GitHub [5], with all codes and implementations available for research. The simulation results are also available upon request.

Results and discussion

In this section, the authors present our simulation results in figures and tables and discuss how RLIM located the turning points for four US states. Our discussion starts from New York state, where RLIM successfully located the turning point through COVID-19’s epidemic data records. Then, we present a detailed discussion of New York’s turning point to show its relationship of the re-infection rate, which we believe is the key. Finally, we describe RLIM’s performance on the other US states with predictions of their COVID-19 transmission trends.

Prediction with MSE/RMSE/AFER

Observations from Fig. 4 indicate that RLIM successfully fits the reported data records from mid-November until mid-January and provides predictions for the upcoming weeks. Delving into the 2 columns of Fig. 4, a conclusion can be drawn that different assignments to the secondary infection rate will result in different outcomes and prediction series. For example, a equal to 0.2 will produce a curve of infection numbers with a peak number of 18549, while in the case of 0.3, the peak value is forecast to be 24866. The simulation results suggest that adoption of an iterative search strategy enables RLIM to match real numbers from COVID-19 infection reports. Thus, the relative error between predictions and observations should be minimized. The other advantage of RLIM is its ability to foretell the turning point within a certain period. Forecasts of the turning point on November 31, 2021, are also marked in the left-column figures. (A discussion of the turning point calculation is presented in Sect. 4.2.)
Fig. 4

RILM simulation outputs, New York, USA

RILM simulation outputs, New York, USA In Table 2, the number of new infections in New York State caused by COVID-19 is predicted starting on November 15, 2020. The data records of infections were collected from the COVID tracking project during November 2020.
Table 2

RILM simulation report on infections, New York, November 2020–January 2021

No.Mid-Nov. 2020Mid-Dec. 2020Mid-Jan. 2021
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_k$$\end{document}Ik\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{p,k}$$\end{document}Ip,k\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_k$$\end{document}Ik\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{p,k}$$\end{document}Ip,k\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_k$$\end{document}Ik\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$I_{p,k}$$\end{document}Ip,k
136492569103539345N/A17324
23490269399989632N/A17480
350882827109149919N/A17626
4529429701269710207N/A17762
553103123991910495N/A17889
654683285995710783N/A18005
759723455900711070N/A18112
853923634971611356N/A18207
9590638211193711641N/A18291
10488140161256811925N/A18365
11626542181244612206N/A18426
12693344271080612486N/A18475
1381764643762312763N/A18512
14606348661040713038N/A18537
15672350941143813309N/A18549
16681953291342213577N/A18547
17728555691680213842N/A18532
18897358141649714102N/A18503
19985560651507414358N/A18460
201127163201136814610N/A18403
211076165791120914857N/A18331
22970268431266615098N/A18244
23730271101664815334N/A18141
24933573801763615564N/A18023
251060076541883215789N/A17889
261017879311694316006N/A17738
271059582101535516217N/A17571
281112984911371416421N/A17388
291019487741521416618N/A17186
30904490591457716807
311366116988
MSE5845242.5674630069.903
RMSE2417.6936462151.759722
AFER(%)29.0246109714.96828112
The ranges of infection numbers between November, December, and January are [2600, 9000], [9300, 17,000], and [17,300, 18,500]. Observations from Fig. 4 indicate that the value fits infected cases well. Regarding the MSE, RMSE, and AFER indicators, the RLIM reaches 5.8 million absolute errors over 60 days with an AFER of 29.02%. In December 2020, it obtained a better MSE of 4.6 million and a lower AFER of 14.97%. Considering that RLIM’s objective is to fit the actual infection numbers and predict the trend, it can be concluded that RLIM achieves a satisfactory result in fitting the actual data and converging the MSE. RLIM’s prediction results suggest that COVID-19 transmission in New York state will reach an equilibrium after January 31, 2021, with new infections remaining at a level of 1.8 k per day. New infections will not bring an abrupt change in numbers, so clinical services such as hospitals will not require extra measures. In Table 3, the number of new recoveries in New York state is also predicted for the months of November, December, and January, with MSE/RMSE/AFER calculated against authentic data records. In November 2020, the MSE of recovery case prediction reached 27893.26, with an AFER of 38.8%. For predictions in December 2020, the MSE increased to 43394.77, but AFER improves to 25.6%. Predictions of recovery data indicate that new recoveries will remain at the level of eight hundred per day, with a flattened tail after mid-January 2021.
Table 3

RILM simulation report on recoveries, New York, November 2020–January 2021

No.Mid-Nov 2020Mid-Dec. 2020Mid-Jan. 2021
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_k$$\end{document}Rk\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{p,k}$$\end{document}Rp,k\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_k$$\end{document}Rk\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{p,k}$$\end{document}Rp,k\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_k$$\end{document}Rk\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R_{p,k}$$\end{document}Rp,k
1120101599363N/A737
2114104683375N/A746
3200108639388N/A754
4285112522400N/A762
5259117600413N/A769
6265122600426N/A776
7276128406438N/A783
8200134672451N/A789
9194141743464N/A794
10300148750477N/A800
11338155706490N/A805
12384163527502N/A809
13215171425515N/A813
14349179434527N/A816
15269188853540N/A819
16252197834552N/A821
17393206839565N/A823
18300216860577N/A825
19337226574589N/A825
20635236537601N/A825
21376247640613N/A825
22400257800624N/A824
23335268864636N/A822
24505280901647N/A819
25511291891658N/A816
26552303947669N/A813
27595314618680N/A808
28619326541690N/A803
29300338882700N/A797
30470351956710
31940719
MSE27893.2666743394.77419
RMSE167.012774208.3141238
AFER(%)38.8062785925.62371705

The turning point

From the RLIM’s output on New York state’s infection and recovery numbers, one can observe that the turning point of this state’s COVID-19 transmission occurs around January 30, 2021. Predictions indicate that from mid-January, New York’s infections will slowly increase from 17,324 to 18,549 and then fall back to 17,816 in mid-February. This turning point appears because the secondary infection rate , whose range is [0.1, 0.55], experiences high fluctuations in early November 2020 and then drops below 0.4 during December, while after Christmas 2020, it becomes stable around a value of 0.25. The process of how the RLIM algorithm begins from an initial value and quickly evolves within one month to reach a turning point is clearly illustrated in Fig. 5. The secondary infection rate is given a value of 0.5 on November 15, 2020, and the goal is to reach an equilibrium state where decreases to 0.07, the same as the value of . The optimistic case is that it will follow a straight pathway to reach 0.07 on a certain date. However, its path needs to be justified according to the actual secondary infection data reported from New York state, recorded as the series of blue points in Fig. 5. Our model produces a series of fitted measurements, which are marked in black in Fig. 5. These measurements represent optimal secondary infection values, chosen by an iterative search algorithm, with a near-optimal error based on the real values. The model forecast starts on January 14, 2021, and successfully reaches an equilibrium state after 2 weeks. It is reasonable to assert that the RLIM model is able to find an optimal curve for the secondary infection rate with an acceptable MSE and follows the curve to predict the forthcoming secondary infection rate. According to the simulation reports in Table 2, RLIM’s prediction results reduce the absolute error of new infections from 5.8 to 4.6 million, with an AFER from 29 to 15%.
Fig. 5

New York predictions with turning points

New York predictions with turning points RLIM simulation outputs, New Jersey (above), South Dakota (middle), and Virginia

New Jersey, South Dakota and Virginia

The simulation results shown in Fig. 6 indicate three scenarios: moderate increase (New Jersey) at the top, moderate decrease (South Dakota) in the middle, and exponential increase (Virginia) at the bottom. The optimal secondary infection rate for these states is marked above (0.13 for New Jersey, 0.056 for South Dakota and 0.19 for Virginia). Observations from these states’ infection and recovery data indicate no strong correlations between and COVID-19 transmission trends. Revisiting Eqs. (11) and (15) from Sect. 2.1 explains that in RLIM, affects the incremental steps of infection cases positively and recovery cases negatively. However, it is still valuable to predict the turning point when it approaches the value of the recovery rate. Thus, one can conclude that if the recovery rate remains stable during the periods of  (in RLIM, equals 14), then RLIM will approach it during a period of time and lock down the turning point. This will greatly reduce the time needed for scientists to elaborate on COVID-19’s behavior.
Fig. 6

RLIM simulation outputs, New Jersey (above), South Dakota (middle), and Virginia

Conclusion and future work

This research proposes a recursive, latent, dynamic virus transmission model based on the classical SEIR model. This model, named RLIM, is able to fit the COVID-19 transmission data of the USA and efficiently locate the transmission turning point. Introducing a new parameter into the classical SEIR model, RLIM is able to predict newly infected cases based on recovered data and historical COVID-19 records. Experimental results for New York, New Jersey, South Dakota, and Virginia prove that given a reasonable initial value of , RLIM is able to predict 30-day infections and recoveries with a reasonable error rate. RLIM also provides an estimation of in the time domain, suggesting that it is valuable to explore its approximation and locate the future turning point in COVID-19 transmission. Simulations on New York, dated from mid-November 2020 until the end of January 2021, provide valuable information for ’s curve and predict that it reaches an equilibrium state on the 31st of January. Our conclusion, based on the RLIM results, indicates that starting in February, New York state’s COVID-19 transmission will enter an equilibrium state. One of RLIM’s advantages is that it does not include environmental factors, such as weather changes, hospital capacity, or city lockdowns. Thus, it is suitable for the prediction of COVID-19 transmission without additional information. Our model is effective for virus modeling including a second wave of COVID-19 epidemic transmission, with key factors such as incubation period and infection rate statistically determined in advance. Compared with other predictive algorithms, RLIM predicts infection numbers based on optimal parameter set from 14-day historical records. The authors believe that this time span would be beneficial to avoid data overfit issue as mentioned in [3]. In a word, RLIM is believed to propose a novel yet effective solution in COVID-19 prediction and turning point estimation. A promising field of application is to integrate RLIM with machine learning techniques. RLIM’s recursive, latent status is suitable for description with a back propagation process inside a neural network, so it can be easily equipped with self-learning abilities. Another interesting yet unexplored subject is to use RLIM in prediction of COVID-19’s vaccine impact. Researchers may use RLIM to evaluate different kinds of vaccines’ impact on COVID-19 transmission. RLIM is able to generate optimal set for pre and post vaccine inoculation group, and with these parameters’ visualized, researchers and governments are able to justify the certain kind of vaccine’s effectiveness on COVID-19 transmission. Below is the link to the electronic supplementary material. Supplementary material 1 (pdf 33 KB)
  1 in total

1.  VSHR: A Mathematical Model for the Prediction of Second-Wave COVID-19 Epidemics in Malaysia.

Authors:  Xiang Yu; Lihua Lu; Jiangfan Guo; Haihuan Qin; Chunlei Ji
Journal:  Comput Math Methods Med       Date:  2022-01-18       Impact factor: 2.238

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.