Literature DB >> 35411129

Forecasting of a complex phenomenon using stochastic data-based techniques under non-conventional schemes: The SARS-CoV-2 virus spread case.

Daniel E Mendoza^1,2, Ana Ochoa-Sánchez^3,4, Esteban P Samaniego^2,5.

Abstract

Epidemics are complex dynamical processes that are difficult to model. As revealed by the SARS-CoV-2 pandemic, the social behavior and policy decisions contribute to the rapidly changing behavior of the virus' spread during outbreaks and recessions. In practice, reliable forecasting estimations are needed, especially during early contagion stages when knowledge and data are insipient. When stochastic models are used to address the problem, it is necessary to consider new modeling strategies. Such strategies should aim to predict the different contagious phases and fast changes between recessions and outbreaks. At the same time, it is desirable to take advantage of existing modeling frameworks, knowledge and tools. In that line, we take Autoregressive models with exogenous variables (ARX) and Vector autoregressive (VAR) techniques as a basis. We then consider analogies with epidemic's differential equations to define the structure of the models. To predict recessions and outbreaks, the possibility of updating the model's parameters and stochastic structures is considered, providing non-stationarity properties and flexibility for accommodating the incoming data to the models. The Generalized-Random-Walk (GRW) and the State-Dependent-Parameter (SDP) techniques shape the parameters' variability. The stochastic structures are identified following the Akaike (AIC) criterion. The models use the daily rates of infected, death, and healed individuals, which are the most common and accurate data retrieved in the early stages. Additionally, different experiments aim to explore the individual and complementary role of these variables. The results show that although both the ARX-based and VAR-based techniques have good statistical accuracy for seven-day ahead predictions, some ARX models can anticipate outbreaks and recessions. We argue that short-time predictions for complex problems could be attained through stochastic models that mimic the fundamentals of dynamic equations, updating their parameters and structures according to incoming data.

Entities: Chemical

Keywords: Autoregressive-with-exogenous-variables; Differential-equations; Non-stationary; Outbreaks recessions; Vector-autoregressive

Year: 2022 PMID： 35411129 PMCID： PMC8986496 DOI： 10.1016/j.chaos.2022.112097

Source DB: PubMed Journal: Chaos Solitons Fractals ISSN： 0960-0779 Impact factor: 9.922

Introduction

Complex dynamical systems are commonly described by a set of differential equations containing the phenomenon's qualitative features [1]. However, while the processes-based models provide essential insights into the dynamics and non-linear interactions involved in complex problems, for some phenomena, the quantity and nature of such interacting factors could significantly complexify the phenomenon's evolution. We are especially interested in cases for which rapid variations might occur. This is likely to happen when the phenomenon is sensitive to highly volatile factors that are prone to fast changes in their states, which makes analysis and prediction challenging [2]. A phenomenon of the kind described above is the case of epidemics' spread. Systems of ordinary differential equations describing the evolution of contagious diseases have been available for a while and still have a great scientific interest [3], [4], [5]. The solutions of these deterministic equations provide information about the epidemic advance and its coarse-grained features [6]. An interesting example is provided by [7], who successfully applied a set of ordinary differential equations that consider the effect of asymptomatic patients over the SARS-CoV-2 virus spread [8], [9]. However, the COVD-19 pandemic shows us that social behavior and policy decisions could cause serious discrepancies between deterministic models and observations (e.g., [10], [11], [12], [13], [14], [15], [16], [17], [18]). The COVID-19 pandemic has involved severe human health, social and economic problems. Moreover, during the first months of the virus's spread, the consequences were catastrophic, collapsing the health systems in several places, partly due to the lack of predictive assessments [19], [20]. Alternatives were presented for this complex task, such as, for example, the approach proposed by [21], which uses fractal theory combined with fuzzy logic for time series forecasting. Also, [22], [23] have performed spatial analysis and classifications of the contagious dynamics. However, in line with [24], the efforts should continue, and they should tackle short and long time series predictions, as well as spatial analysis to effectively and objectively inform decision-makers about policies and coordination strategies. Apart from being complex, pandemics are of serious risk at their beginning, thus limiting the time for sophisticated applications or the development of deterministic theories. Therefore, it is desirable to use existing knowledge and modeling frameworks to deal with the problem. Although originated in a different context, an example of an applied technique that uses an existing framework in an innovative fashion is the “Forecasting at scale” approach proposed by [25], [26], [27], [28]. This technique is devised as a flexible tool working in a general mathematical structure, capable of identifying changing points during the training phase for inferring future curve inflections. Other alternatives are, for instance, the well-known Neural Networks, with interesting applications such as the one proposed by Melin et al. [23], [29], who combined it with fuzzy logic to predict the virus spread in Mexico, or the approach presented by Namasudra et al. [30], who applied nonlinear neural networks. As a motivation here, we apply the “Forecasting at scale” technique and the Autoregressive Neural Networks (NNA) for the forecasting of the number of infections variable (as indicated in Fig. 1 ) in COVID-19 pandemics. Both techniques are implemented as R-packages with the names of “nnetar” and “Prophet” [25], [31]. We performed sequential forecasting tasks for several periods of seven days (i.e., seven-day ahead forecasting) using the COVID-19 data sets of Iraq and Iran, which show pronounced recessions and outbreaks in their curves, making them challenging for prediction. Each sequential forecasting was attained by calibrated models using the complete set of past observations relative to the forecasted period. This entails that both the Prophet and the ANN parameters are re-estimated for each forecasted period as more data were progressively integrated into the observations set for the training task. The forecasting results in (Fig. 1) evidences the difficulty in anticipating recessions and outbreaks.

Fig. 1

SARS-CoV-2 contagious curves for Iraq and Iran. Dotted lines identify seven-days periods. Dashes lines are forecasted resulted. Orange for ANN and grey for Prophet model. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.) Due to the complexity of predicting COVID-19 cases, the need for testing different stochastic strategies is suggested [32]. Nonetheless, considering the unusual statistical characteristics of a virus' spread, such as the fact that it follows fat-tail distributions that limit the prediction by probabilistic approaches [33], these stochastic models must scape the conventional applications as mentioned by Santosh [34]. In this context, we regard a strategy as unconventional if it relies, at least partly, on the possibility of combining existing knowledge and tools in an innovative manner. The continuous development of these strategies will facilitate fast predictive implementations in future pandemics. Nonetheless, two questions arise before devising such innovations: 1) How to deal with the complex changing conditions of the problem? and 2) When using stochastic approaches, which should be the underlying theory constraining the models? Regarding the first question, it may be that fast fluctuations of the phenomenon (such as the triggering of outbreaks and sudden recessions) depend on some natural states of the contagious curves or depend significantly on the surrounding conditions in time. Thus, the parameters of any model for the forecasting task should be determined to a great degree by the information closer to the forecast time. For capturing the complex changing conditions, the parameters should update their values progressively. Moreover, the stochastic structures themselves should be updated. As a consequence, the simultaneous updating of both the parameters and stochastic structures should provide the modeling strategy with inherent non-stationary features and greater flexibility for accommodating the incoming data and training it for short-term forecasting tasks. Examples of techniques dealing with shaping the models' parameters in time, or making them dependent on states, are the well-known Generalized-random-walk (GRW) and the State-dependent parameters (SDP) theories. Regarding the second question, we choose to frame the stochastic structures considering the governing ordinary differential equations of epidemics (in this case, the Susceptible-Infected-Recovered-Deceased SIRD equations). The main idea is to constrain these structures so that they are not entirely oblivious of conceptual or dynamical considerations. The aforementioned pursues, in some sense, the Data-based-modeling's (DBM) philosophy, which in many cases reached high forecasting performances, partly because they are linked to differential equations mechanisms (please refer to [35] book and the examples given therein). Autoregressions with exogenous variables (ARX) and Vector autoregressions (VAR) are useful stochastic techniques because their structures could be devised to incorporate the form of the differential equations. More precisely, some the SIRD equations are considered to some extent in order to define the structure of the models proposed here [35], [36], [37], [38]. Furthermore, the GRW and SDP techniques are merged with the ARX into the Recursive-time series theory, as detailed in Young [35], and implemented in the Matlab package CAPTAIN (which is freely available in http://wp.lancs.ac.uk/captaintoolbox/). It is worth mentioning that the CAPTAIN tool does not provide VAR models. However, under certain assumptions (detailed later in this document), the GRW can be merged with VAR techniques using the available CAPTAIN options. In addition, some important considerations for the proposed methodology are the following: Since the CAPTAIN tools enables different GRW options – such as Random-Walk (RW), Integrated-Random Walk (IRW) – each ARX-based and VAR-based model will be used considering several combinations of the GRW options. The stochastic structures are analogous to the differential equations and they use the daily rate of deaths and healings as exogenous variables handling the number of infections curve in the ARX model. For the VAR model, the three variables are treated as endogenous variables. We restrict the prediction period to a window of seven days in the future, motivated by the time-average period for incubation of the virus into the human body [39]. We hypothesize that the exogenous variables could be sensitive to spread's changes in the previous days (seven days), carrying with them implicit information of recessions and outbreaks in advance. To test this hypothesis and at the same time to unveil the role that different variables have on the problem, we conducted an experiment through the exploration of different model types, which are. a) ARX models considering one and two lagged exogenous variables. b) Different VAR models' combinations between the endogenous variables. In summary, we propose a novel modeling strategy for the complex COVID-19 forecasting task based on existing tools and knowledge. The stochastic models are based on analogies to the epidemic's differential equations. Moreover, they simultaneously perform the updating of their parameters and structures for fitting the incoming data. In addition, we explore different ARX and VAR model configurations with different GRW techniques to unveil the usefulness and roles that the variables have on the anticipation of outbreaks and recessions. To the authors' knowledge, all this is a non-conventional strategy, which takes advantage and adapts the existing knowledge and tools to the problem of forecasting a complex phenomenon. This systematic modeling approach might support practical applications and decision making, especially when knowledge and data are insipient during the early contagion stages. Furthermore, the method could help forecast other complex problems.

Materials

Data

We used daily data retrieved by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University (https://github.com/CSSEGISandData/COVID-19). From all the collected time-series information, our main interest lies in three different data sets: Accumulated infections (I ): the daily incremental number of confirmed diagnosed individuals with the virus Accumulated recoveries (H) and deaths (D): the daily incremental number of healed and dead victims of the virus Infection-curve (I): The infected individuals calculated by subtracting H and D from the I , for each i-th day. Since the (I) data-set describes the change of infections through time (here referred to as the epidemic-curve), the modeling process will be focused on emulating this variable. The (H) and (D) data-sets will play the role of the exogenous variables in the ARX models and endogenous variables in the VAR techniques. The I information will be used for estimating non-linear functions in the modeling process, as explained in Section 3.4. Data from Iraq, Iran (Fig. 1) show interesting dynamics in the contagious evolution, which are challenging to be emulated, offering the opportunity for evaluating different models' abilities. Fig. 1 shows the daily infection curves and accumulated infections, recoveries, and deaths of Iraq and Iran populations. Although in different scales in the number of affections, both curves present the typical growing phases and descending parts probably related to the first trigger on the number of infections and the posterior restrictive measures for alleviating the conditions. Interestingly, new outbreaks in both curves are present after the descending phase. These rapidly changing conditions make the forecasting's task a challenge, which motivates the experiments carried out in this work.

Methods

The SIRD model

The Susceptible-Infected-Recovered-Deceased (SIRD) model is frequently used for modeling epidemics [3], [4], [5]. The model follows the well-known low-mass-action principle for describing a dynamical interaction between the four different compartments in a system of ordinary differential equations. The (N) denotes the total population of individuals involved in the process, while (S), (I) (H), and (D) indicates the susceptible, infected, healed, and dead individuals with their respective initial conditions (S , I , H , D ). The β, γ, and μ are the rates of infection, healings, and deaths, respectively estimated from data. The healed (H) and deaths (D) can be aggregated into a single variableR(t) = H(t) + D(t); where (R) denotes the removed cases from the population. The model assumes a constant balance in the flux of individuals through the compartmentsS(t) + I(t) + R(t) = N. The latter is proper for epidemics with faster time evolution than the balance between birth and death rates (vital dynamics process). Since the transmission speed of the SARS-CoV-2 virus is highly similar to its predecessors (SARS-CoV and MERS-CoV) [40], [41], the vital-dynamics process could be omitted without significant detriment. The model also assumes that the removed cases (R) do not interact with the remaining susceptible (S) population. This property is directly related to the acquired immunity of recovered individuals (at least for a period larger than our short-term forecasting experiment), protecting them from new contagious and limiting the spread of the virus, as has been argued for the SARS-CoV-2 case [42]. The assumptions made by the SIRD justify the study of the SARS-CoV-2 contagious dynamics by analyzing the SIRD mathematical properties and solutions [43], [44], [45], [46], [47], [48]. However, rather than solving the set of differential equations directly, we use them as theoretical support. Thus, we build a restated mathematical expression, which is more convenient here, as detailed below. When its equivalent balance replaces the component of susceptible individualsS(t) = N − I(t) − R(t), and the Eqs. (1b), (1c), (1d) are combined in a single mathematical form, we obtain the Eq. (2a). This expression indicates that the rate of change of infections (I) in an instant (t) is proportional to the present quantity of infected individuals (I), minus the rate of change of the total removed cases (). It is worth noticing that (a) is considered a non-stationary parameter, synthesizing the β, I, R, and N variability. Eq. (2a) can be reformulated in a transfer function form by introducing the differential operator (s) and the auxiliary variable (b) (see Eq. (2b)). Furthermore, under certain assumptions, the previous continuous expression can be approximated following a discrete transfer function, which is analog to the continuous transfer function, as shown by Eq. (2c). For the sake of brevity, we have focused on the most critical aspects of the theory supporting our work. However, for more details about the analogies and relations between the continuous and discrete forms, the reader should refer to the book of [35]. Here, it is sufficient to mention that Eq. (2c) is the autoregressive moving average with exogenous variables (AMARX) model. For simplicity, we will use the related autoregressive with exogenous variables (ARX) model.

The ARX approach

We degenerate Eq. (2c) to the expression (3a), also known as autoregressive with exogenous variables (ARX) model. Such that: In Eq. (3a), () denotes the daily rate of change of the removed individuals (exogenous variable), which is approximated by . The (e) indicates a white noise and zero-mean error in the k-th daily time step. The (τ ) is a pure time delay parameter acting on the exogenous-variable, which we established according to the restricted minimum period of seven days for allowing forecasting tasks. The A(z ) and B(z ) denotes the transfer function's polynomials of the backward shift operator (z), which build auto-regressive structures of (I), plus regressive forms of () information. Eq. (3a) supports some of the modeling hypotheses to be tested here. For instance, the parameters n and m, which define the regressive structures, must be identified according to some statistical criteria. Commonly, the identification process uses the total available data, hopefully deriving a suitable model structure that generalizes the main aspects of the phenomenon. However, as mentioned previously, attempting to capture the complex dynamic elements of the epidemic in the early stages, the identification process should evolve according to the updated data retrieved by the monitoring systems. Eqs. (3a), (3b) also suggest exploring how sensitive are the exogenous variables () and () in leading the model and how they interact with it. For that purpose, the experimentation must consider the following modeling options. 1) Pure auto-regressive (AR) models, considering the exogenous variable R = I. 2) An ARXH model, assuming the exogenous component as R = H. 3) An ARXD model assuming that R = D. 4) An ARXH+D model including the total removed cases R = H + D. 5) And finally, an ARXH&D model considering two independent exogenous variables into it, shown in Eq. (3b). Such that: As noticed, Eq. (3b) has two exogenous variables. The B(z ) and C(z ) are polynomials acting on () and (), while (τ ) and (τ ) are their pure time delay parameters. Here, three parameters define the model's structure (n.m,p), while (e) denotes a white noise and zero mean error in the k-th daily time step again. The discrete ARXH&D transfer function does not have a transfer function analogy to the continuous SIRD equations (i.e., not in the sense that ARX of Eq. (3a) has with Eq. (2b). Nonetheless, intending to test whether such a following analogical meaning conceptualized herein has predictive advantages over other alternatives, Eq. (3b) is an interesting contrasting experiment. It is essential to mention that since parameter (a) in Eq. (2b) is non-stationary – because it synthesizes great variability of several factors – this non-stationarity should be inserted in the discrete ARX approximations. In that respect, the Generalized-Random-Walk (GRW) and the State-dependent-parameter (SDP) are adequate complementary techniques for tackling the non-stationarity problem. These GRW and SDP techniques are described briefly below using model (3b) (ARXH&D) as a vehicle for explaining in a more general sense how these techniques can deal with the non-stationarity of the ARX's parameters. From the latter, it is only a matter of adding or removing the corresponding elements for the rest of the ARX models,

The GRW technique for time variable parameters (TVP)

The backward shift operator in Eq. (3b) is lifted, obtaining in this way the expanded form of the ARXH&D model (see Eq. (4a))where: The elements in (z) are the daily historical observations of the variable I, , and, while the elements in (ρ) vector are the parameters in the linear ARXH&D model, whose notation is conveniently redefined to follow an ordinal sequence (ρi). In turn, each (ρi) parameter is determined by a two dimensional state-vector ρ i(k) = [l (k) d (k)]T, which follows a Generalized-Random-Walk (GRW) process (see eq. 4b) defining their non-stationarity, as follows.where: In the Eq. (4b), each η i(k) = [η1,i(k) η2,i(k)]T is also a two dimensional zero mean vector, with their noise defined by a diagonal covariance matrix Q η,i. Each α, β, γ, δ, ε, as well as the elements of Q η,i (or usually the NVR matrix Q nvr = Q η, i / σ2) are the so-called hyperparameters, which are time-invariant, and are estimated from data through optimization. Such optimization uses likelihood functions based on the innovations obtained from the Kalman-Filter process. Once the hyperparameters are optimized, the recursive Kalman-Filter and the Fixed-Interval-Smoothing (KF/FIS) technique gives the desired time-variability property to each parameter in the model. For more details about these techniques, the reader should refer to specialized literature [35]. The GRW process includes several random walk types, from which the most important are: the random walk (RW), the integrated random walk (IRW), and smoothed random walk (SRW). The ARX model with constant parameters, the RW and the IRW process, can be set out in the MATLAB-CAPTAIN toolbox, which is available at http://wp.lancs.ac.uk/captaintoolbox/ [49]. Actually, for the reasons explained in the technical notes below, we experiment with different modeling approaches combining the available GRW options to evaluate their effects on the phenomenon.

The SDP technique

Briefly, the State-Dependent-Parameter technique redefines the output variable (I) of Eq. (4a) to make it dependent on some state-variable (χ). The non-parametric functions for each parameter (ρ ), respecting to (χ), can be estimated by the KF/FIS algorithm and its GRW techniques, which are finally parameterized through curve-fitting procedures. Furthermore, the algorithm can be extended for considering different state-variables (χi) through the well-known Back-fitting algorithm and the Modified-Dependent-Variables (MDVs). This work does not provide a detailed explanation of this technique, but the reader could refer to [35], [50] for the extended minutia. In summary, the model is defined as follows: We conveniently rewrite the linear model (4a) in (5a) for coupling it with the expression (5b), which denotes a vector containing the collection of the parameters (ρ ). In turn, each element (ρ ) is a function of a (χi) state variable, which is defined in k-th time step. Of course, the observation vector (z) is built from autoregressive information of the (I) epidemic curve, plus regressive structures of () and () daily rates observations, of orders n, m, and p, respectively. Regarding the (χ) state variables, the accumulated infections (I ) is setting here as the independent variable defining the functions for all the parameters of the autoregressive components (i.e., the parameters ρ ). The (I ) is equivalent to (I + R) expression, whose variability is included in the parameter (a) of Eq. (2a). Because of the latter, it is a logical assumption to make each parameter in the A-polynomial (see Eqs. (3a), (3b)) dependent on the I variable state. Nonetheless, it is worth mentioning that we are implicitly assuming that the variability of I absorbs the non-stationarity of (β) in the SIR model, and the population (N) remains constant for simplicity. Extending the modeling experimentation, we will consider that the set of parameters of both exogenous variables () and () (see the parameters of B and C polynomials in Eq. (3a), (3b) will act as state variables for themselves. This experimentation is motivated by the possibility that abrupt time changes related to outbreaks or recessions might be more sensible to be detected in the () and () daily-rates fast fluctuations. The latter hypothesis was also one of the reasons for including such a daily-rates as exogenous variables instead of using the daily soft evolutions of (H) and (D) data. The SDP is also freely available in the MATLAB-CAPTAIN toolbox at http://wp.lancs.ac.uk/captaintoolbox/ [49]. Therefore, similar to the ARX modeling, we will use the mentioned toolbox for a comprehensive modeling experiment considering different combinations between constant, RW and IRW options, which mainly define the smoothing shape process of the non-parametric functions estimated by the Back-fitting algorithm. The estimated SDP functions' parameterization is usually attained through fitting curve techniques with meaningful physical or conceptual forms. Nonetheless, for the sake of simplicity, we use linear and quadratic interpolations between two and three consecutive points of the non-parametric functions as surrogate parameterizations. Suppose any state variable value χ(k) falls outside the non-parametric function's domain in the prediction phase. In that case, we use linear and quadratic extrapolations using seven data points from the extremes of the curve. Similar options have been used for other purposes [51].

The VAR technique

One main limitation of the ARX model is the unidirectional relation between the exogenous variables (D) and (H) and the output variable (I). However, as described by the SIRD equations, the phenomenon's behavior is an interaction between the different compartments, and they have a simultaneous influence on each other in their evolutive processes. In that regard, the autoregressive vector framework can consider such feedback between these variables, treating each one as endogenous. In other words, a Vector-Autoregressive (VAR) technique comprises one equation per variable into the system been described [52]. Consider the set of Eqs. (2b), (1c), (1d). Thereon, following a similar argument as explained in Section 3.1, and discretizing () and (), one can arrive at a set of ARX discrete analogical approximations as expressed by Eqs. (6a), (6b), (6c) in the expanded form.where ψ(k) = [D(k − 1) I(k − τ ) I(k − τ − 1) I(k − τ − 2). …I(k − τ − s)] The reader is already familiarized with the notation and meaning of Eqs. (6a), (6b), (6c). Nonetheless, it is worth mentioning that the (e), (e ), and (e ) errors are considered white noise processes that may be contemporaneously correlated (i.e., correlated just for the k-th time step). In the context considered herein, the discrete Eq. (6a) and its similarity with the continuous form 2b was already justified. Also, the regressive structures of the (I) variable in Eqs. (6a), (6b) surrogate the time-variability nature of (γ) and (μ) parameters in the differential Eqs. (1c), (1d). The set of Eqs. (6a), (6b), (6c) interact with each other, allowing recursive predictions. Although the models have no restrictions about the future prediction's horizons, we have limited these to a seven days ahead horizon for comparison purposes with the remaining modeling alternatives. Eq. (6a) imposes a minimum time delay (τ ) of one month to ensure the first k-th prediction of (I) as a function of. After that, (H) and (D) can be estimated through Eqs. (6b), (6c), respectively. In turn, the last H and D estimations are used for a new estimation of (I), continuing in this manner until the desired prediction future step. For the case herein, we implemented these models (6a) to (6c_ using the TVP concept. Thus, the parameters in the (ρ), (μ), and (ξ) vectors follow the GRW process (as explained in Section 3.3). The latter modeling perspective is not directly available in the MATLAB-CAPTAIN toolbox. However, we implemented an algorithm for including more than one endogenous variable working recursively between them, using the same computational framework. Since we use the CAPTAIN framework for our purposes, the hyperparameter optimization is reached individually for each equation in the system through their error innovations obtained from the Kalman-Filter process. The latter rules out the possibility of any cointegrated process herein, which is a more complex modeling process that escapes the scope of this work. Reminding that R(t) = H(t) + D(t), and following one of the objectives herein, we explore the role of each variable inside the modeling through the next experiments: 1) A model assuming that R = H (and renaming τr by τh), excluding the Eq. (6c) (labeled as VARH). 2) A model in which R = D (and renaming τr by τd), excluding the Eq. (6b) (labeled as VARD). 3) A model in which R = H + D, excluding Eq. (6c), and considering Eq. (6b) as R (labeled as VARH+D). 4) A model including Eqs. (6a), (6b), (6c) simultaneously (labeled as VARH&D). And finally, 5) an extended model option in which for the vector (z) of Eq. (6a), the variable (R) is separated on individual components of (H) and (D), and the (φ) and (ψ) vectors of Eqs. (6b), (6c) include historical information of the endogenous variables. The latter model is labeled as VARH&D-extended, with specific the vectors of Eqs. (6a), (6b), (6c) in the following forms: VARH&D-extended is not justified as analogical forms related to the differential equations. Thus, this model contrasts the results against those VAR models defined here as analog to the differential equations.

Modeling process, considerations and technical notes

So far, we have outlined different techniques, which will be the vehicle for our purposes. Nonetheless, it is necessary to precise the modeling process, as well as some important technical aspects. For instance, since the problem is complex, we implemented an adaptive modeling strategy, in which the structure of the models (i.e., m, n, p, q, w, r, s, and τ, depending on ARX or VAR) changes for best fitting to the historical and new incoming data. As mentioned, this adaptive strategy attempts to emulate the natural conditions of the epidemic evolution. This emulation practice helps evaluate the potential of the models in supporting the decisions through continuous monitoring systems, using the most common data retrieved in the early stages of the epidemic phenomenon. The first structures of ARX and VAR models were identified using 21 data observations to avoid computational errors. After, the models' structures were updated according to the inclusion of every seven data observations sequentially incorporated. The process is repeated until reaching the final seven-day string. Each forecasted seven-day string was attained using the corresponding models' structures identified previously to the predicted period. In the end, periodical predictions every seven days were obtained and contrasted whit its observations. Although the central core of the modeling process was summarized in the previous lines, it is essential to mention some other important aspects, which we pointed out in the following paragraphs. For the ARX and VAR techniques, the identification of the models' structures was based on the statistical selection from a wide variety of combinations between m. n, p, q, w, r, s, and τ parameters, building a model that better fitted the observed data. We use the well-known Akaike-Information-Criterion (AIC) as the statistical parameter for selecting the most convenient structure [53], [54]. To avoid the explosion of combinations in the identification task, we impose a maximum bound of seven steps for m. n, p, q, w, r, s, and τ for the model's parameters. Of course, the latter bound for the ARX models was established after lagging the data for a minimum of seven days, allowing the seven-step-ahead forecasting using historical observations. While the forecasting estimations were attained by lagging the exogenous variables a minimum of seven days in the ARX modeling, the forecasted estimations using VAR techniques are attained by integrating sequentially one-step-ahead forecasted values as new inputs. The latter ARX's and VAR's forecasting strategies are known as “DirRec” and “Recursive,” respectively [55]. The latter constitutes an interesting contrasting experiment for evaluating the forecasting estimations using only observations as inputs (in the ARX's models), versus the forecasting using predicted values surrogating observed data (as in VAR's models). It is recalled that only the GRW technique was used for modeling the parameters' non-stationarity for the VAR models, while for the ARX kind of models, both GRW and SDP techniques were tried. In all the cases, we explore the modeling of the parameters considering three different combinations: RW, IRW and stationarity (SP) (i.e., constant parameter). For the sake of simplicity, we built the combinations in such a way that the same GRW option is established for the entire set of parameters belonging to the same polynomial (i.e., A, B, or C polynomial depending if ARX or VAR model kind). The idea of combining the available GRW options for modeling the parameters' non-stationarity is motivated by the fact that a user in practice could not know the best approach a priori in that regard. Therefore, it would be reasonable to test several GRW combinations for the parameters on each of the five ARX and VAR experiments (detailed 3.2, 3.5). Since different GRW render different forecasting estimations, we select the 50-quantile (median) to synthesize the models' predictive capability (predictive accuracy). Additionally, the 25 and 75 quantiles are considered indicators of the variability between the different predictions attained under different GRW combinations. A comparison against the model with the best-fitted approach through all the predicted strings was made to validate the median as a reasonable indicator of the predictive capacity. In the authors' opinion, the adopted modeling strategies, such as the adaptive model's structures and combinations of different GRW and SDP techniques, constitute an unconventional endeavor to objectively tackle such a phenomenon's predictive complexity.

Results and discussion

To analyze the forecasting abilities on different curve phases of the contagious curve, we segmented the data into several strings of seven values each and numbered them, as shown in Fig. 2, Fig. 3 . For Iraq (Fig. 2), strings 1 and 2 correspond to the initial growing phase of the epidemics. String 3 contains a maximum, followed by a decrement in the number of cases as shown by string 4. String 5 is the segment where an inflection point occurs before a new outbreak arises in string 6. String 7 and 8 are the growing evolution of the contagious curve of the second outbreak.

Fig. 2

Fig. 3

Seven-days forecasting of Iran's infection curve. Vertical dotted lines indicate data strings groups of seven days. Blue-dots indicate the median estimations calculated from the forecasted combination using different GRW options. Green-lines are the best fitted models for all the strings, following a specific GRW process. Red-dots indicates the 25% percentile and 75% percentile values from all forecasted values. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Seven-days forecasting of Iraq's infection curve. Vertical dotted lines delineate data strings groups of seven days. Blue-dots indicate the median estimations calculated from the forecasted combination using different GRW options. Green-lines are the best fitted models for all the strings, following a specific GRW process. Red-dots indicates the 25% percentile and 75% percentile values from all forecasted values. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) Seven-days forecasting of Iran's infection curve. Vertical dotted lines indicate data strings groups of seven days. Blue-dots indicate the median estimations calculated from the forecasted combination using different GRW options. Green-lines are the best fitted models for all the strings, following a specific GRW process. Red-dots indicates the 25% percentile and 75% percentile values from all forecasted values. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) For Iran (Fig. 3), string 1 corresponds to the initial growing phase of the epidemics. After a sharp peak, the curve experiments a decrement in the number of cases during the strings 2, 3, and 4. String 5 shows a slight inflection change in the curve, followed by a new increment in the number of cases that continues for the strings 6, 7, 8, 9, and 10. Interestingly, strings 8 and 9 show a slow-down in the contagious during the second increment, accelerating in string 10. The results referent to the different string data portions are discussed in the following paragraphs for both study cases, Iraq's and Iran's virus spread.

Predictions for Iraq's case

As indicated by Table 1a and Table 1b , when using the GRW techniques in both the ARX and VAR models, the median's prediction curve (calculated from the models using different GRW combinations) is statistically similar to the best predictions attained under a specific GRW with the best performance. Since in real-time, modelers would not know an optimum GRW option a priori, the median of all the forecasted values could be considered as a reliable predictive curve. The latter is an alternative against the blindness regarding an optimum GRW combination for the ARX and VAR predictions.

Table 1a

Coefficient of determination (R2) for Iraq (ARX models).

Parameter's modeling	AR	ARX_H	ARX_D	ARX_H+D	ARX_H&D
GRW-TVP	0.934(0.929)	0.888(0.926)	0.910(0.911)	0.906(0.932)	0.608(0.683)
SDP	0.605(0.724)	0.696(0.769)	0.224(0.765)	0.804(0.598)	0.573(0.781)

The values outside and inside the brackets contain the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 1_Tables a, b), estimated and chosen from all the GRW combinations, respectively.

Table 1b

Coefficient of determination (R2) for Iraq (VAR models).

Parameter's modeling	VAR_H	VAR_D	VAR_H+D	VAR_H&D	VAR_H&D-extended
GRW-TVP	0.862(0.888)	0.927(0.931)	0.911(0.88)	0.701(0.772)	0.702(0.763)

The values outside and inside the brackets contain the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 1_Tables c, d, e), estimated and chosen from all the GRW combinations, respectively.

Coefficient of determination (R2) for Iraq (ARX models). The values outside and inside the brackets contain the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 1_Tables a, b), estimated and chosen from all the GRW combinations, respectively. Coefficient of determination (R2) for Iraq (VAR models). The values outside and inside the brackets contain the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 1_Tables c, d, e), estimated and chosen from all the GRW combinations, respectively. Table 1a indicates that pure autoregressive (AR) models using GRW techniques are statistically more accurate than those using external variables for the ARX kind. The AR median's prediction curve captures the initial growing epidemics, ameliorating the overshooting effect of the peak (strings 1, 2, 3, and 4 in Fig. 2). More interesting, after the descendent part (strings 4 and 5), the AR model seems to be able to capture the second outbreak that suddenly appears on the curve (string 6) and continuing its growth (strings 7 and 8). The latter supports the hypothesis about the implicit information in the historical observations and their potentials in supporting predictive tasks of complex dynamics features when used as input variables in a model. Although not with the best performance, the ARXH+D modeling with GRW techniques seems to shape the peak (strings 2, 3, and 4) and the second outbreak of the curve (string 6), as shown by Fig. 2. According to ARXH and ARXD in Fig. 2, the role played individually by the daily rates of H and D variables for representing the latter effects are very similar. Interestingly, the variability of the ARXD's predictions – represented by the percentile 25 and 75 in Fig. 2 –is narrower than the alternative ARX models. The latter could be related to the identification process (Appendix 1_Table a), in which the structures' evolution reached by the ARXD are statistically efficient, especially when included the critical curvatures (i.e., strings 5, 6, and 7) sequentially. Because the number of deaths during an epidemic is probably the most reliable register, the fact that the models' predictivity respond to this variable could be advantageous. Splitting the exogenous daily rates for constructing the ARXH&D does not increase the critical strings' predictive performance in the contagious curve (i.e., strings 3 and 6). The previous is interesting since such a forecasting accuracy is opposite to the fitting in the identification process, reaching efficient values during both, R2 and AIC parameters (Appendix 1_Table b). Since the fitting is accurate (as shown by R2 and AIC, respectively), the forecasting failure could be more related to the transfer functions not being in a direct analogy with the continuous forms derived from the differential equations. Because of the latter, the model is over-parameterized. It is worth mentioning that although the ARX models using the SDP technique do not attain the best forecasting effects (Table 1a), these show an interesting predictive accuracy during and posterior the second outbreak (i.e., for strings 6, 7, and 8), especially for the AR and for the ARXH+D models. The SDP technique can unravel the implicit information carried by the historical and the exogenous variables, whose non-parametric functions allow the dynamic curve to capture the non-linear changes. Nonetheless, since the ability of SDP relies on the number of observations, it might be that for early stages the models have enough observations to more widely identify the critical states that cause severe changes. Therefore, the model should be tested in later contagious phases, but this is an open future task. When daily rates are separated for constructing the ARXH&D the predictive capacity does not improve in any sense. The latter supports the hypothesis that stochastic approximations must be analogous to the differential equations describing the phenomenon, regardless of the technique behind the parameter modeling (i.e., GRW or SDP). This could be because ARX stochastic structures following analogies with the differential equations provide the models with conceptual robustness, implicitly helping with the problem of over-parameterization and, consequently, the over-fitting problem. On the other hand, as shown in Table 1b, VAR models have similar performances to those attained by the ARX models using GRW techniques. The best model performance is reached when the interacting components are given between the contagious variable (I) and the daily rates of deaths (D), determining the VARD model. Also, as indicated by Fig. 2, the prediction's variability using different GRW combinations is narrower for the same VARD model than its alternatives. As in ARX models, the low prediction variability could be related to the high statistical fitting and efficiency reached during the identification process (Appendix 1_Table c). The mentioned is notorious even when the critical strings with pronounced curvatures (i.e., strings 5, 6, and 7) are included in the identification process. Since variables (I) and (D) are endogenous in the VAR models, they can be simultaneously predicted, as shown in Appendix 3. This simultaneous prediction capacity is in the best interest of health systems during an epidemic process. Fig. 2 reveals that the peak and the second outbreak curvatures (i.e., strings 3, 4, 5, and 6) are captured by VARH, VARD, and VARH+D, but those predictions are not as efficient as the ones attained by the ARX models. Although the VARD model shows better statistics than its relative VAR models, both exogenous daily rate (H) seems to play an essential role in predicting these complex peaks and outbreaks as shown by the model VARH in Fig. 2. Furthermore, other important features such as the descendent section of the contagious curve (strings 4 and 5) are well captured by the VARH+D model, which is a notorious ability compared with the remaining VAR options. Interestingly, the fitting statistics of VARH+D structures are lower than those reached by the VARH and VARD models (Appendix 1_Table d). Making the daily rates H and D endogenous variables interacting separately with the contagious (I) variable as specified by VARH&D model (see the extended model in Section 3.5) do not increase the predictive capacity (Table 1b and Fig. 2). Moreover, the inclusion of more historical information into each endogenous variable in the VARH&D-extended alternative does not improve any predictive capacity either, despite its good fitting performance (R2) and efficiency (AIC) during the identification process (Appendix 1_Table e). The latter and the ARX results indicate that better predictions are attained when the model's structures follow similar analogies to the differential equations describing the phenomenon. These forecasting results obey to the overall strategy proposed here, which is sympathetic with the ARX and VAR models. In other words, for the short-term forecasting problem, the ARX and VAR techniques that follow differential equations on their structures, are capable to anticipate rapid curve changes if the parameters (following GRW models) and stochastic structures are updating at the same time.

Predictions for Iran's case

Table 2a and Table 2b indicates that the medians attained from the GRW combinations are statistically similar to the best predictions obtained by a specific GRW with the best performance. These results support the stated for the Iraq case about the median predictive curve as a reliable indicator of the model's predictive capacity, surrogating the predictions attained by an optimum model.

Table 2a

Coefficient of determination (R2) for Iran (VAR models).

Parameter's modeling	AR	ARX_H	ARX_D	ARX_H+D	ARX_H&D
GRW-TVP	0.885(0.891)	0.889(0.909)	0.947(0.951)	0.907(0.915)	0.882(0.944)
SDP	0.847(0.681)	0.729(0.873)	0.842(0.833)	0.890(0.907)	0.846(0.86)

The values outside and inside the brackets contains the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 2_Tables f, g), estimated and chosen from al the GRW combinations, respectively.

Table 2b

Coefficient of determination (R2) for Iran (VAR models).

Parameter's modeling	VAR_H	VAR_D	VAR_H+D	VAR_H&D	VAR_H&D-extended
GRW-TVP	0.947(0.942)	0.906(0.955)	0.916(0.917)	0.854(0.803)	0.863(0.76)

The values outside and inside the brackets contains the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 1_Tables h, i, j), estimated and chosen from al the GRW combinations, respectively.

Coefficient of determination (R2) for Iran (VAR models). The values outside and inside the brackets contains the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 2_Tables f, g), estimated and chosen from al the GRW combinations, respectively. Coefficient of determination (R2) for Iran (VAR models). The values outside and inside the brackets contains the R2 coefficient for the median and the best model (using a specific GRW combination as specified in Appendix 1_Tables h, i, j), estimated and chosen from al the GRW combinations, respectively. Table 2a reveals an ARXD model with the best predictions compared to ARXH and ARXH+D. Fig. 3 graphically confirms the ARXD's predictive ability in all the forecasting strings. This predictive ability is linked to the fitted statistics attained during the training process (Appendix 2_Table f). This ARXD model captures the particular curve dynamics, such as the slope changes in the curve from the strings 5 to 6 (Fig. 3). Also, the prediction variability from the different GRW combinations is narrower than the alternative ARX models, which is the same report for the Iraq case. Although not with the best prediction accuracy and fitted performance (Table 2a and Appendix 2_Table f, respectively), the AR model reveals abilities to capture the pronounced curvatures in the strings 5, 6, and 7 shown in Fig. 3. The latter ability of the AR model for capturing the curvature changes is even better than the model ARXD, which confirms that historical data of the contagious curve encodes useful information for predicting new breakouts during the epidemic evolution. The statistics shown in Table 2a for the ARX models using SDP techniques are more efficient than those estimated for the Iraq case (Table 2a). Still, Fig. 3 shows serious inaccuracies, especially on strings 1 to 6. However, interesting results are shown for the strings 6 to 10, especially for the ARXH and ARXD models. The latter supports the argument derived from the Iraq predictions that although SDP is not helpful in the early stages of the phenomenon, it can be beneficial when the number of observations increases in future phases. The latter has the potential for predicting the strong non-linear behavior in the curves. As in Iraq's case, splitting the exogenous H and D daily for the model ARXH&D does not improve the predictive capacity in both quantitative and qualitative (Table 2a and Fig. 3). Since the fitting statistics are relatively high with the other ARX models (Appendix 2_Table g), the deficient predictions could result from an over-fitting effect. The over-fitting responds to the excess of parameters representing the mathematical structure of the phenomena. Thus, as long as the model follows analogies directly connected with the phenomenon's differential equations, it attains better predictions under a more efficient stochastic form. On the other hand, VAR models applied over the Iran data have similar statistics to those using ARX models for predictions. Here, the best predictive performance is reached when the predictions are recursively given by an interaction between the components of the contagious curve (I) and the recovered individuals (H), as shown by Table 2b for the VARH model. This result is the opposite of the ARX models' results and does not follow the same predictive features of Iraq's modeling case. The VARH model seems to alleviate the overshooting effect at the peak of the contagious curve. (i.e., strings 1 and 2 Fig. 3). Although VARH, VARD, and VARH+D achieved acceptable predictive accuracies (Table 2b), they cannot predict the intense changing slopes in the contagious curve, except the particular accuracy of prediction in string 8 given by the VARD model. It is worth mentioning that string 8 is a complicated zone for being adjusted when using VARD and VARD+H techniques (according to Appendix 2_Tables h and i). Reminding that ARX models work under the “DirRec” scheme for predictions, while the VAR models estimate the predictions using the “Recursive” scheme, the Iran cases give important insights about the differences in predictions for both types of forecasting approaches. In that regard, although the VAR models provide good predictions according to the statistical parameters, the ARX models show the potential for capturing the slope changes of the curve better than the VAR models. The latter is also replicated in the Iraq data. Thus, one could say that “DirRec” approaches to the forecasting problem attain good results in both quantitative and qualitative aspects of prediction. As for Iraq's case, daily rates H and D endogenous variables interacting separately with the contagious (I) variable as specified by VARH&D model do not increase the predictive capacity as evidence by Table 2b and Fig. 3. Moreover, more historical information into each endogenous variable in the VARH&D-extended model does not improve any predictive capacity, despite its good fitting performance (R2) and efficiency (AIC) during the identification process shown in Appendix 2_Tables i and j.

Summary and conclusions

Based on existing tools and knowledge, we propose a novel modeling strategy for the complex SARS-CoV-2 forecasting task. We used Autoregressive with exogenous variables (ARX) and Vector Autoregressions (VAR) techniques. The stochastic structures of ARX and VAR techniques follow analogies with the epidemic's differential equations, thus making them not entirely oblivious of conceptual or dynamical interpretations. To tackle the problem that entails an accurate forecasting task of rapid variations, the models simultaneously perform the updating of their parameters and structures for fitting the incoming data. In addition, we explore different ARX and VAR model configurations to unveil the usefulness and roles that the variables have in anticipating outbreaks and recessions. The experiment was addressed using the Iraq and Iraq data. Due to the complexity, the time forecasting horizon was set in seven days, and the parameters in the model's equations were considered non-stationary. The well-known Generalized-Random-Walk (GRW), as well as the State-variable-parameter (SDP) techniques, were used for shaping parameters non-stationary for the ARX models, while for the VAR models, a scheme using the GRW techniques was tried. The use of GRW techniques is motivated because they are compatible with the hypothesis that fast fluctuations in the contagious curve depend significantly on the surrounding previous conditions before the event. The SDP techniques are compatible with the alternative hypothesis that rapid fluctuations in the contagious curve depend on some natural states. Different GRW and SDP combinations render different forecasting estimations that are finally represented by the median. The investigation limits its applicability to the early stages of the phenomenon's evolution in Iraq and Iran, emulating the problem of data scarcity and uncertainty under high-demand circumstances of predictive skills during stages in which rapid variations can occur (such as sudden recessions and outbreaks). Motivated by this, the authors consider an adaptive fashion of the model's structures simultaneously with the parameters' updating for tackling the rapid dynamic variations of the phenomenon. The nature of ARX and VAR models allows the evaluation of different forecasting approaches. On the one hand, the ARX models use both observed historical data of exogenous variables and recursive estimations retrieved from the endogenous variable through its autoregressive structures, in an approach known as “DirRec” for the forecasting task. On the other hand, the VAR techniques' feat the interaction between each endogenous variable recursively through one-step-ahead estimations until the desired prediction horizon. According to the general argot, the last forecasting procedure is known as “Recursive”. This experiment suggests that the forecasting task of a short-term horizon is possible for a complex phenomenon with a high dynamical variability when using the ARX models and VAR models with GRW techniques. The results show that the forecasting estimations represented by the medians of all the GRW combinations are similar to the best models among those combinations, thus providing reliability to the criterion used in the strategy. Moreover, the models can capture qualitative features such as the intense curvature changes, especially those minimums related to new outbreaks in the contagious curve. Although SDP techniques applied in the ARX models do not attain good predictions in the early stages of the phenomenon, evidence suggests its capacity for accurately capturing the non-linear qualitative features as observations increase. The ARX models and GRW techniques seem to be more skillful in predicting strong curvatures than the VAR models using the same GRW approaches. Such ability appears to rely partly on the main difference between “DirRec” and “Recursive” methods, explained above. Interestingly, historical observations of contagious and daily rates of deaths that are closer to rapid curve changes (recessions and outbreaks), seem to play an essential role in the models for anticipating these qualitative features. One of the most important aspects derived from the experimental modeling is the suggestion that better predictions are attained when stochastic approaches are analogous to the differential equations governing the phenomenon. The reason could be that these analogies implicitly provide the models with conceptual robustness, also control over-parameterizations and over-fittings a priori. Moreover, for predicting a phenomenon of such complexity, it was necessary to consider the time variability of the model's parameters and consider an adaptive structure according to the retrieved information. To the authors' knowledge, the latter is a novel approach for emulating this kind of problem with uncertain conditions in critical stages of its evolution, using the most common and available data. In summary, the processes proposed for the forecasting strategy are complementary. On the one hand, although the central role of the analogies between ARX and VAR stochastic structures with differential equations is to provide the models with conceptual flavor, this implicitly provides the techniques with efficiency and robustness, ultimately controlling over-parameterizations and over-fitting. On the other hand, the success of GRW techniques suggests that rapid curve variations more likely depend on the surrounding observations when these changes occur, as hypothesized in the introduction. Finally, updating the stochastic structures supports capturing rapid changes that could entail strong discontinuities in the evolution of the infection. In the end, this strategic stochastic modeling with adaptive structures is approved as a promising alternative for estimating predictions in complex phenomena, such as the case of an epidemic spread, with the potential for anticipating rapid curve changes as such recessions and outbreaks. This experience could be helpful for supporting decisions in continuous monitoring systems. Future work could consider the following activities: 1) The evaluation of the SDP abilities, using more information for training the models. The suggestion relies on the fact that although SDP does not help in the early stages of the contagious process, it could be useful in advanced states when the model accounts for more detailed observations for the learning process. 2) The analysis of possible mechanisms behind the stochastic structures. This is suggested because ARX and VAR have analogies to differential equations. Thus, these analogies could be translated into explicit dynamic interpretations. 3) Other alternative techniques should be tested using a similar strategy, such as Autoregressive Neural-Networks or Fractal theory. However, other alternatives might require more extended work about developing theoretical aspects and software for allowing certain advantages that the recursive time series theory already has implemented.

CRediT authorship contribution statement

Conceptualization, D.M., A.O, E.S; methodology, D.M.; software, D.M; formal analysis, D.M, A.O, E.S; writing-original draft preparation, D.M.; writing-review and editing, D.M, A.O, E.S.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Table a

. Iraq_ARX models: identified structures and fitted statistics.

	AR model					ARX_H model					ARX_D model
Data portion	n	m	τ	R²	AIC	n	m	τ_h	R²	AIC	n	m	τ_d	R²	AIC
String 1	2	1	10	0.913	4.970	2	3	7	0.502	6.915	1	4	8	0.910	5.204
String 2	1	2	10	0.968	5.366	2	1	10	0.844	6.962	4	3	7	0.940	6.304
String 3	4	6	10	0.992	5.917	1	6	10	0.981	6.600	1	3	8	0.961	7.131
String 4	1	4	11	0.950	8.105	1	6	14	0.937	8.435	1	1	7	0.971	7.429
String 5	1	7	8	0.863	9.298	4	2	8	0.754	9.798	2	1	7	0.955	7.967
String 6	5	3	9	0.664	10.057	3	2	14	0.408	10.514	2	2	7	0.957	7.845
String 7	5	6	8	0.506	10.495	5	6	14	0.518	10.471	1	1	7	0.847	9.035
String 8	1	4	10	0.815	9.488	1	1	13	0.532	10.331	5	5	7	0.969	7.858
Best_GWR	C	RW				C	RW				RW	C			TVP
Best_GWR	RW	C				C	RW				RW	C			SDP

Notations of headers follow Eq. (3a). The model's structures evolve according to the data strings sequentially included in the identification process. R2 and AIC are the Coefficient-of-determination and Akaike-Information-Criteria, respectively, estimated during the identification process. The last line indicates the best GRW combinations.

Table b

. Iraq_ARX models: identified structures and fitted statistics.

	ARX_H+D model					ARX_H&D model
Data portion	n	m	τ_h+d	R²	AIC	n	m	p	τ_h	τ_d	R²	AIC
String 1	1	1	9	0.652	6.257	1	1	1	10	9	0.960	4.193
String 2	7	1	7	0.982	5.147	1	7	3	9	8	0.998	3.251
String 3	4	4	10	0.990	5.962	1	7	7	11	10	0.997	5.142
String 4	1	6	14	0.952	8.172	3	2	7	11	10	0.995	6.168
String 5	6	4	8	0.768	9.907	3	7	7	9	7	0.992	6.843
String 6	7	1	12	0.492	10.470	2	7	2	12	7	0.985	7.042
String 7	1	3	10	0.431	10.412	6	7	3	14	7	0.973	7.751
String 8	1	1	7	0.511	10.375	3	2	3	9	7	0.972	7.698
Best_GWR	C	RW				RW	C	C				TVP
Best_GWR	RW	RW				RW	C	C				SDP

Notations of headers follow Eqs. (3a), (3b). The model's structures evolve according to the data strings sequentially included in the identification process. R2 and AIC are the Coefficient-of-determination and Akaike-Information-Criteria, respectively, estimated during the identification process. The last line indicates the best GRW combinations.

Table c

. Iraq_VAR models: identified structures and fitted statistics.

	VAR_H model								VAR_D model
Data portion	n	m	–	q	τ_h	τ_Ih	R²	AIC	n	m	–	s	τ_d	τ_Id	R²	AIC
String 1	2	2	1	2	2	4	0.758(0.978)	6.093(2.117)	1	3	1	4	1	2	0.904(0.990)	5.168(−0.997)
String 2	1	3	1	1	2	4	0.989(0.978)	4.347(3.452)	4	3	1	1	4	0	0.988(0.980)	4.662(0.983)
String 3	7	2	1	1	2	5	0.995(0.992)	5.310(3.690)	2	7	1	5	4	0	0.997(0.979)	4.946(2.218)
String 4	2	2	1	5	4	1	0.920(0.997)	8.530(4.426)	5	2	1	3	1	0	0.995(0.983)	5.983(2.425)
String 5	7	7	1	7	6	0	0.915(0.999)	9.064(4.191)	3	2	1	5	1	0	0.980(0.988)	7.255(2.500)
String 6	5	6	1	2	6	2	0.711(0.999)	10.016(4.855)	2	3	1	5	2	0	0.993(0.988)	6.096(2.613)
String 7	1	6	1	4	1	0	0.738(0.999)	9.732(4.670)	2	2	1	6	2	0	0.990(0.990)	6.401(2.595)
String 8	1	5	1	7	1	0	0.731(0.999)	9.895(5.123)	3	2	1	7	1	0	0.992(0.992)	6.359(2.543)
Best_GWR	C	RW	IRW	RW					C	RW	C	IRW				TVP

Table d

. Iraq_VAR models: Identified structures and fitted statistics.

	VAR_H+D model								VAR_H&D
Data portion	n	m	–	r	τ_r	τ_Ir	R²	AIC	–	q	–	s	τ_Ih	τ_Id	R²	AIC
String 1	2	1	1	1	3	3	0.898(0.982)	5.125(2.292)	1	2	1	4	4	2	0.898(0.978)[0.990]	5.125(2.117)[−0.997]
String 2	4	1	1	1	2	4	0.987(0.982)	4.626(3.753)	1	1	1	1	4	0	0.987(0.978)[0.980]	4.626(3.452)[0.983]
String 3	2	5	1	1	2	7	0.995(0.992)	5.250(4.196)	1	1	1	5	5	0	0.995(0.992)[0.979]	5.250(3.690)[2.218]
String 4	3	1	1	2	1	7	0.927(0.997)	8.440(4.464)	1	5	1	3	1	0	0.927(0.997)[0.983]	8.440(4.426)[2.425]
String 5	4	7	1	6	2	1	0.797(0.999)	9.817(4.566)	1	7	1	5	0	0	0.797(0.999)[0.988]	9.817(4.191)[2.500]
String 6	1	3	1	1	2	5	0.624(0.999)	10.023(5.082)	1	2	1	5	2	0	0.624(0.999)[0.988]	10.023(4.855)[2.613]
String 7	1	5	1	3	2	1	0.571(0.999)	10.193(4.958)	1	4	1	6	0	0	0.571(0.999)[0.990]	10.193(4.670)[2.595]
String 8	1	6	1	4	1	3	0.750(0.999)	9.850(5.323)	1	7	1	7	0	0	0.750(0.999)[0.992]	9.850(5.123)[2.543]
Best_GWR	C	RW	IRW	C					IRW	C	IRW	C				TVP

Notations of headers follow Eqs. (6a), (6b), (6c). The model's structures evolve according to the data strings sequentially included in the identification process. R2 and AIC are the Coefficient-of-determination and Akaike-Information-Criteria, respectively. Values outside and inside () are the statistics of the equation components (6a), (6b) for VARH+D. For including Eqs. (6a), (6b), (6c) simultaneously in the VARH&D the parameters q, s, τIh, τId are coupled to the n, m and τr parameters. The values outside (), inside (), and inside [] are the statistics of Eqs. (6a), (6b), (6c), estimated during the identification process. The last line indicates the best GRW combinations.

Table e

. Iraq_VAR models: Identified structures and fitted statistics.

	VAR_H&D-extended model
Data portion	n	m	p	w	q	u	s	τ_h	τ_d	τ_Ih	τ_Id	R²	AIC
String 1	1	2	2	2	1	1	4	2	1	4	2	0.975(0.979)[0.990]	3.931(2.082)[−0.997]
String 2	1	2	3	1	1	3	1	1	5	4	6	0.995(0.978)[0.987]	3.766(3.452)[0.669]
String 3	4	7	7	2	3	1	5	1	4	1	0	0.998(0.994)[0.979]	3.042(3.591)[2.218]
String 4	2	6	1	4	5	1	3	5	2	3	0	0.998(0.998)[0.983]	4.920(4.248)[2.425]
String 5	3	7	3	3	2	1	5	7	2	2	0	0.998(0.999)[0.988]	5.011(3.970)[2.500]
String 6	4	2	3	3	2	1	5	3	3	0	0	0.998(0.999)[0.988]	5.156(4.690)[2.613]
String 7	2	2	3	3	4	2	7	2	1	2	0	0.997(1.000)[0.992]	5.404(4.470)[2.503]
String 8	2	2	3	4	3	2	5	2	1	0	1	0.997(1.000)[0.992]	5.475(4.886)[2.478]
Best_GWR	RW	C	IRW	C	C	C	C						TVP

Table f

. Iran_ARX models: identified structures and fitted statistics.

AR model					ARX_H model					ARX_D model
Data portion	n	m	τ	R²	AIC	n	m	τ_h	R²	AIC	n	m	τ_d	R²	AIC
String 1	1	4	11	0.976	13.622	1	6	8	0.977	13.715	1	3	12	0.916	14.802
String 2	1	2	11	0.978	14.525	1	4	8	0.978	14.615	2	3	8	0.977	14.685
String 3	2	5	13	0.942	16.078	2	1	12	0.939	15.929	2	6	11	0.986	14.676
String 4	4	2	14	0.924	16.160	2	1	12	0.594	17.715	3	6	12	0.995	13.538
String 5	1	5	8	0.654	17.519	5	1	12	0.540	17.804	3	2	14	0.990	13.896
String 6	1	4	8	0.504	17.712	3	4	9	0.472	17.837	6	5	12	0.983	14.500
String 7	1	4	8	0.586	17.418	2	2	11	0.107	18.157	3	4	8	0.982	14.331
String 8	1	6	11	0.552	18.079	2	7	12	0.503	17.596	2	7	12	0.957	15.142
String 9	1	7	10	0.589	17.299	3	7	12	0.489	17.564	7	5	14	0.861	16.308
String 10	1	7	9	0.566	17.304	4	2	14	0.673	16.976	1	7	7	0.634	17.133
Best_GWR	RW	C				RW	C				RW	RW			TVP
Best_GWR	RW	C				RW	C				RW	C			SDP

Table g

. Iran_ARX models: identified structures and fitted statistics.

	ARX_H+D model					ARX_H&D model
Data portion	n	m	τ_h+d	R²	AIC	n	m	p	τ_h	τ_d	R²	AIC
String 1	2	1	8	0.963	13.931	2	1	1	8	12	0.968	13.852
String 2	1	5	10	0.985	14.304	1	5	1	8	13	0.996	13.170
String 3	4	1	9	0.952	15.812	5	4	1	12	7	0.998	12.795
String 4	2	3	13	0.666	17.606	2	3	7	13	10	0.996	13.480
String 5	1	7	11	0.459	18.041	2	1	6	8	13	0.993	13.678
String 6	3	3	13	0.722	17.164	3	5	2	14	14	0.992	13.803
String 7	3	7	12	0.275	18.125	3	5	2	14	14	0.992	13.673
String 8	1	8	7	0.326	17.903	4	4	7	14	11	0.988	14.048
String 9	2	7	11	0.130	18.073	6	1	4	10	12	0.980	14.363
String 10	3	8	11	0.821	16.489	4	1	5	12	10	0.981	14.208
Best_GWR	RW	C				RW	C	IRW				TVP
Best_GWR	RW	C				RW	C	C				SDP

Table h

. Iran_VAR models: identified structures and fitted statistics.

	VAR_H model								VAR_D model
Data portion	n	m	–	q	τ_h	τ_Ih	R2	AIC	n	m	–	s	τ_d	τ_Id	R2	AIC
String 1	1	5	1	2	4	1	0.985(0.989)	13.235(11.887)	2	3	1	1	1	3	0.923(0.999)	14.795(6.831)
String 2	5	2	1	1	5	6	0.993(0.996)	13.597(11.673)	2	4	1	5	2	5	0.989(0.997)	13.990(8.511)
String 3	2	7	1	2	2	1	0.933(0.995)	16.333(13.191)	3	4	1	6	1	1	0.975(0.988)	15.266(10.356)
String 4	4	3	1	6	7	2	0.825(0.997)	17.042(13.880)	2	2	1	7	7	1	0.970(0.992)	15.168(10.254)
String 5	3	7	1	2	7	4	0.539(0.995)	17.955(14.655)	7	5	1	4	2	0	0.974(0.990)	15.159(10.678)
String 6	6	5	1	2	5	3	0.555(0.997)	17.797(14.547)	3	4	1	4	1	0	0.981(0.989)	14.500(10.893)
String 7	2	1	1	2	6	2	0.463(0.998)	17.620(14.410)	3	4	1	4	1	0	0.967(0.994)	14.945(10.458)
String 8	3	1	1	3	3	1	0.530(0.999)	17.411(14.294)	2	4	1	4	1	0	0.958(0.994)	15.048(10.439)
String 9	3	1	1	2	7	3	0.708(0.999)	16.862(14.295)	3	3	1	4	6	0	0.946(0.995)	15.215(10.495)
String 10	5	4	1	1	5	6	0.856(0.999)	16.229(14.520)	4	3	1	5	7	0	0.940(0.995)	15.298(10.553)
Best_GWR	RW	RW	IRW	IRW					C	IRW	C	C				TVP

Table i

. Iran_VAR models: identified structures and fitted statistics.

	VAR_H+D model								VAR_H&D
Data portion	n	m	–	r	τ_r	τ_Ir	R²	AIC	–	q	–	s	τ_Ih	τ_Id	R²	AIC
String 1	1	5	1	1	3	1	0.966(0.993)	14.055(11.843)	1	2	1	1	1	1	0.966(0.989)[0.999]	14.055(11.887)[6.831]
String 2	2	3	1	6	7	3	0.989(0.997)	13.985(12.045)	1	1	1	5	6	2	0.989(0.996)[0.997]	13.985(11.673)[8.511]
String 3	2	2	1	3	7	7	0.946(0.997)	15.873(13.074)	1	2	1	6	1	1	0.946(0.995)[0.988]	15.873(13.191)[10.356]
String 4	3	2	1	4	7	5	0.596(0.997)	17.795(14.023)	1	6	1	7	2	7	0.596(0.997)[0.992]	17.795(13.880)[10.254]
String 5	3	7	1	1	7	7	0.728(0.996)	17.428(14.747)	1	2	1	4	4	2	0.728(0.995)[0.990]	17.428(14.655)[10.678]
String 6	5	4	1	4	1	1	0.580(0.997)	17.675(14.709)	1	2	1	4	3	1	0.580(0.997)[0.989]	17.675(14.547)[10.893]
String 7	2	1	1	3	4	1	0.381(0.998)	17.763(14.518)	1	2	1	4	2	1	0.381(0.998)[0.994]	17.763(14.410)[10.458]
String 8	1	6	1	2	4	3	0.174(0.999)	18.054(14.461)	1	3	1	4	1	1	0.174(0.999)[0.994]	18.054(14.294)[10.439]
String 9	6	4	1	3	5	0	0.966(0.993)	14.055(11.843)	1	2	1	4	3	6	0.966(0.989)[0.999]	14.055(11.887)[6.831]
String 10	5	4	1	3	5	1	0.989(0.997)	13.985(12.045)	1	1	1	5	6	7	0.989(0.996)[0.997]	13.985(11.673)[8.511]
Best_GWR	RW	RW	RW	RW					IRW	IRW	C	C				TVP

Table j

. Iran_VAR models: identified structures and fitted statistics.

	VAR_H&D-extended model
Data portion	n	m	p	w	q	u	s	τ_h	τ_d	τ_Ih	τ_Id	R²	AIC
String 1	1	2	2	1	2	2	3	2	7	1	1	0.998(0.989)[0.999]	11.321(11.887)[6.096]
String 2	2	1	6	1	1	2	1	2	1	6	1	0.999(0.996)[0.997]	12.190(11.673)[8.267]
String 3	1	7	7	1	2	4	5	6	2	1	1	0.994(0.995)[0.999]	14.143(13.191)[7.654]
String 4	1	7	7	1	6	2	4	1	2	2	0	0.986(0.997)[0.999]	14.843(13.880)[8.528]
String 5	2	1	6	2	2	1	4	3	6	0	0	0.985(0.996)[0.990]	14.490(14.616)[10.678]
String 6	3	1	5	2	2	3	7	3	6	2	0	0.988(0.997)[0.996]	14.131(14.470)[9.976]
String 7	3	1	4	2	2	1	4	2	7	1	0	0.980(0.998)[0.994]	14.485(14.398)[10.458]
String 8	2	1	5	1	3	1	4	3	6	1	0	0.961(0.999)[0.994]	15.038(14.294)[10.439]
String 9	4	1	3	2	2	1	4	7	5	0	0	0.971(0.999)[0.995]	14.644(14.275)[10.495]
String 10	5	6	3	1	1	2	5	5	1	6	0	0.987(0.999)[0.996]	13.899(14.520)[10.378]
Best_GWR	RW	C	RW	C	RW	C	RW						TVP

27 in total

1. A note on the derivation of epidemic final sizes.

Authors: Joel C Miller
Journal: Bull Math Biol Date: 2012-07-25 Impact factor: 1.758

2. Mathematical models of SIR disease spread with combined non-sexual and sexual transmission routes.

Authors: Joel C Miller
Journal: Infect Dis Model Date: 2017-01-11

3. Health system collapse 45 days after the detection of COVID-19 in Ceará, Northeast Brazil: a preliminary analysis.

Authors: Daniele Rocha Queiros Lemos; Sarah Mendes D'Angelo; Luis Arthur Brasil Gadelha Farias; Magda Moura Almeida; Ricristhi Gonçalves Gomes; Geovana Praça Pinto; Josafa Nascimento Cavalcante Filho; Levi Ximenes Feijão; Ana Rita Paulo Cardoso; Thaisy Brasil Ricarte Lima; Pâmela Maria Costa Linhares; Liana Perdigão Mello; Tania Mara Coelho; Luciano Pamplona de Góes Cavalcanti
Journal: Rev Soc Bras Med Trop Date: 2020-07-03 Impact factor: 1.581

4. Airborne SARS-CoV-2: Weighing the Evidence for Its Role in Community Transmission.

Authors: Samuel Pecho-Silva; Kovy Arteaga-Livias; Alfonso J Rodriguez-Morales
Journal: J Prev Med Public Health Date: 2020-05-29

5. Analysis of Spatial Spread Relationships of Coronavirus (COVID-19) Pandemic in the World using Self Organizing Maps.

Authors: Patricia Melin; Julio Cesar Monica; Daniela Sanchez; Oscar Castillo
Journal: Chaos Solitons Fractals Date: 2020-05-21 Impact factor: 5.944

6. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.

Authors: Joseph T Wu; Kathy Leung; Gabriel M Leung
Journal: Lancet Date: 2020-01-31 Impact factor: 79.321

Forecasting of a complex phenomenon using stochastic data-based techniques under non-conventional schemes: The SARS-CoV-2 virus spread case.

Introduction

Materials

Data

Methods

The SIRD model

The ARX approach

The GRW technique for time variable parameters (TVP)

The SDP technique

The VAR technique

Modeling process, considerations and technical notes

Results and discussion

Predictions for Iraq's case

Predictions for Iran's case

Summary and conclusions

CRediT authorship contribution statement

Declaration of competing interest

1. A note on the derivation of epidemic final sizes.

2. Mathematical models of SIR disease spread with combined non-sexual and sexual transmission routes.

3. Health system collapse 45 days after the detection of COVID-19 in Ceará, Northeast Brazil: a preliminary analysis.

4. Airborne SARS-CoV-2: Weighing the Evidence for Its Role in Community Transmission.

5. Analysis of Spatial Spread Relationships of Coronavirus (COVID-19) Pandemic in the World using Self Organizing Maps.

6. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study.

7. Nonlinear Neural Network Based Forecasting Model for Predicting COVID-19 Cases.

8. Estimation of the time-varying reproduction number of COVID-19 outbreak in China.

9. COVID-19 Prediction Models and Unexploited Data.