Literature DB >> 35645547

Modeling Medical Data by Flexible Integer-Valued AR(1) Process with Zero-and-One-Inflated Geometric Innovations.

Zohreh Mohammadi¹, Zahra Sajjadnia², Maryam Sharafi², Naushad Mamode Khan³.

Abstract

In this paper, we introduce a new stationary first-order integer-valued autoregressive process (INAR) with zero-and-one-inflated geometric innovations that is useful for modeling medical practical data. Basic probabilistic and statistical properties of the model are discussed. Conditional least squares and maximum likelihood estimators are proposed to estimate the model parameters. The performance of the estimation methods is assessed by some Monte Carlo simulation experiments. The zero-and-one-inflated INAR process is subsequently applied to analyze two medical series that include the number of new COVID-19-infected series from Barbados and Poliomyelitis data. The proposed model is compared with other popular competing zero-inflated and zero-and-one-inflated INAR models on the basis of some goodness-of-fit statistics and selection criteria, where it shows to provide better fitting and hence can be considered as another important commendable model in the class of INAR models.

Entities: Chemical

Keywords: Binomial thinning operator; Estimation; Geometric distribution; INAR process; Runs; Zero-and-one-inflated geometric distribution

Year: 2022 PMID： 35645547 PMCID： PMC9124749 DOI： 10.1007/s40995-022-01297-3

Source DB: PubMed Journal: Iran J Sci Technol Trans A Sci ISSN： 1028-6276 Impact factor: 1.553

Introduction

Time series of counts are emerging in almost every domain of applications now, be in economics, medicine, or life sciences. Some examples include the monthly cases of crimes and offenses as studied in Bakouch and Ristić (2010), Ristić et al. (2009, 2012), Bourguignon and Vasconcellos (2015), Mamode Khan et al. (2020a), the daily number of newly infected and deaths due to SARs-Cov 2 patients (Mamode Khan et al. 2020b), the weekly number of syphilis cases, (Bourguignon et al. 2018), the number of daily fatal road traffic accidents (Pedeli and Karlis 2011), the tick by tick intra-day transactions of stocks (Pedeli and Karlis 2013; Sunecher et al. 2018) and amongst others. In such applications, the counting series are usually characterized by frequent low figures that include mainly zeros and ones and this happens mostly when the unit of the collection is at a very micro level. Likewise, the daily SARs-Cov 2 death and newly infected series in small island developing states like Barbados, Guinea-Bissau, Sao Tome consist of mainly 0’s and 1’s. The same remark can be made to the sex offenses, arsenic, domestic violence data that are available at http://www.forecastingprinciples.com. The interested reader may consult more examples in Li et al. (2015) and the references therein. Such excess of zeros or ones leads to overdispersion in the series. This paper, therefore, proposes an integer-valued time series model of auto-regressive nature of order 1 to model such data series but with a zero-and-one-inflated type innovation structure. McKenzie (1985) and Al-Osh and Alzadi (1987), independently, introduced the integer-valued autoregressive (INAR) process with one lag using a binomial thinning operator as followswhere , is a sequence of independent and identically distributed integer-valued random variables, called innovations, with independent of for all , and . The binomial thinning operator “” is defined by Steutel and van Harn (1979) as , where the counting series is a sequence of independent and identically distributed Bernoulli random variables with . From the results of Al-Osh and Alzadi (1987), we have that and are the conditions of stationarity and non-stationarity of the process , respectively. Also, () implies the independence (dependence) of the observations of . The following representation for the marginal distribution of the INAR(1) model, provided by Al-Osh and Alzadi (1987), is expressed in terms of the innovation sequence Modeling of INAR(1) time series based on (1) was first introduced using the Poisson marginal distribution by Al-Osh and Alzadi (1987) and McKenzie (1988), denoted by PINAR(1). It is a simple model and is appropriate for modeling equidispersed time series data. In many practical scenarios, as discussed above, data are overdispersed. To cater for this phenomenon in counting series, Alzadi and Al-Osh (1988) considered INAR(1) processes with geometric marginal distribution for time series of overdispersed counts. Other useful overdispersed INAR(1) models have been proposed, such as the negative binomial INAR(1) process (McKenzie 1986), generalized Poisson INAR(1) process (Alzadi and Al-Osh 1993). Jazi et al. (2012a) and Schweer and Weiß (2014) introduced an overdispersed INAR(1) model with geometric and compound Poisson innovations, respectively. Basically, there is an ongoing vast literature on the handling of the overdispersion in the simple INAR process (Weiß 2008; Awale et al. 2021; Huang and Zhu 2021; Weiß 2020). However, we note that the construction of the INAR process, in addition to the self-decomposability properties, becomes simpler with assuming the distribution of the innovation series, and without compromising on the marginal distribution of the counting series (See Bourguignon et al. 2019; Livio et al. 2018). In fact, Livio et al. (2018) confirms that such a later INAR process with the pre-specified innovation yields lower AICs than other competing INAR(1)s in Mohammadpour et al. (2018). On the other hand, where the data set contains a large number of zeros, Jazi et al. (2012b) introduced an INAR(1) process with zero-inflated Poisson innovations and showed that the marginal distribution of the process is also zero-inflated. However, in the construction of the INAR(1) process, it is not always direct to derive the distribution of the counting series similar to the distribution of the zero-inflated innovation series. In this sense, Barreto-Souza (2015), Bakouch and Ristić (2010) and Bourguignon et al. (2018) studied novel INAR(1) models with zero-modified geometric and zero-truncated Poisson marginal distribution, respectively, similar to the construction process in Livio et al. (2018). Furthermore, Li et al. (2015) developed the mixed INAR(1) process with zero-inflated generalized power series innovations, while Bakouch et al. (2018) investigated the zero-inflated geometric INAR(1) process with random coefficient until recently, Sharafi et al. (2020) proposed the INAR(1) model with zero-modified Poisson–Lindley innovations. However, when a data set is subject to zero inflation along with one-inflation, the previous models are not very useful. In this research, we restrict our attention to modeling such data. Qi et al. (2019) introduced a stationary INAR(1) process with zero-and-one-inflated Poisson innovations. Also, Mohammadi et al. (2021) introduced the ZOIPLINAR(1) model which is the stationary INAR(1) model with zero-and-one-inflated Poisson–Lindley distributed innovations. The geometric distribution is one of the most important distributions used to analyze count data. Many authors such as McKenzie (1986), Ristić et al. (2009, 2012), Jazi et al. (2012a, b) used geometric distribution to analyze count time series data. This fact motivated us to introduce the flexible INAR(1) model with zero-and-one-inflated geometric innovations to model count data, especially in the analyzing of the COVID-19 real data. It should be mentioned that in the COVID-19 data time series analysis based on the PACF plot, in most applications, it seems that order 1 is not suitable and the higher-order time series are needed. Recently, Foroughi et al. (2021) introduced a new portmanteau test to examine the null hypothesis versus the alternative for and a wide group of INAR processes, called generalized INAR. They developed some portmanteau test statistics to check the adequacy of the fitted model. In this paper, we use the above test statistics to check the adequacy of our introduced model which is applied to the practical data example. The paper is organized as follows. In Sect. 2, we introduce and construct a flexible INAR(1) model and obtain some of its statistical and conditional properties. Section 3 is devoted to parameter estimation of the model which is included two estimation methods, maximum likelihood, and conditional least square estimators. In Sect. 4, we present some simulation experiments and real-life data applications to assess the performance of the proposed zero-and-one-inflated INAR model.

Model Construction and Properties

In this section, we introduce a flexible INAR(1) process with zero-and-one-inflated geometric-distributed innovations denoted by INARZOIG(1) and present some of its properties. Based on the Eq. (1), we define the INARZOIG(1) as follow:where and the innovation process is said to have zero-and-one-inflated geometric distribution, denoted by ZOIG, with the following probability mass function (pmf),where , . The parameter is the mean of the traditional geometric distribution and the parameters and denote the unknown proportions for incorporating extra zeros and ones than those allowed by the considered a traditional geometric distribution, respectively. Also, is independent of for all and it is independent of the counting series contained in the binomial thinning operator “.” Based on Du and Li (1991) and Dion et al. (1995), it can be easily shown that this process is stationary if and only if . This process is reduced into INARZIG(1) when and INAROIG(1) when , respectively. In the following proposition, some moments and conditional moments of the INARZOIG(1) process are summarized for the coming use.

Proposition 1

Let be the process defined by (3). Thenwhere , and . , , , , , , The proof of Proposition 1 is similar to Theorem 1 in Qi et al. (2019), we omit the details here and refer the reader to Qi et al. (2019). Using conditional mean is one of the most common techniques for forecasting time series processes. In the next proposition, conditional mean and variance of INARZOIG(1) process is obtained.

Proposition 2

For INARZOIG(1) process, the (h + 1)-step ahead forecast which is conditional mean, and the conditional variance areandrespectively.

Proof

The proof of Proposition 2 is given in Appendix.□ It is clear that as , which is the unconditional mean of the process. Also, the (h + 1)-step ahead conditional variance converges to as . According to Proposition 1, the Fisher index of dispersion for the model can be calculated aswhere , then the dispersion of the INARZOIG(1) process is similar to the dispersion of its innovation process, i.e., it is overdispersed (underdispersed) if the innovations is overdispersed (underdispersed).

Remark 1

After some calculation, it is easy to show that is overdispersed if , it is underdispersed if and it is equidispersed if . Since the model (3) forms a stationary discrete-time Markov chain, the transition probabilities obtained as (see, e.g., Weiß 2008):where is the pmf of defined by (4) and i, j = 0, 1, .... Hence, the marginal probability function of of INARZOIG(1) is obtained as:Also, the joint probability of the processes using the first-order dependence can be calculated as:

Distributions of Lengths of Zeros and Ones

Mood (1940) presented a definition of the number of the “succession” of similar events preceded and succeeded by different events, and called it “the Run.” In this section, we find the expected length of the runs of zeros and the lengths of the runs of ones for the INARZOIG(1) process.

Theorem 1

The expected length of the runs of zeros for the INARZOIG(1) process is and the expected length of the runs of ones for the INARZOIG(1) process is

Proof

The zero-to-zero transition probability for the INARZOIG(1) process is obtained as:Therefore, the transition probabilities from zero to nonzero for the INARZOIG(1) process can be obtained asSince the run length of zeros is defined as the number of zeros between two nonzero values, it can be shown that it follows from a geometric distribution with the parameter , and hence, the expected run length of zeros in the process is . The expected run length of ones can be obtained similarly.□ The expected length of the runs of zeros is independent of . If or , we obtain the expected length of the runs for the INAROIG(1) or INARZIG(1) process, respectively.

Theorem 2

The proportion of zeros in the INARZOIG(1) process is given by and the proportion of ones in the INARZOIG(1) process is Using part (f) of Proposition 1 and based on the following relationship between the probability generating function (pgf) and pmf,where denotes the kth derivative of the pgf , the proof is completed by calculating the following statements.and□

Parameter Estimation

Let be observations from the model (3) and denote the parameter vector. In the study of integer-valued time series, different estimation methods are applied. In this section, we are going to estimate the parameters of the INARZOIG(1) model using conditional maximum likelihood (CML) and conditional least squares (CLS) estimation methods.

Conditional Maximum Likelihood Estimation

For simplicity of notations, we can write the likelihood function through the joint probability function (10) aswhere is the pmf of and is the conditional pmf. To overcome the complexity of the marginal distribution, a simple approach is to find the conditional pmf conditioned on the first observation , essentially ignoring the dependency on the initial value and obtain the conditional maximum likelihood (CML) estimate given as an estimate of by maximizing the conditional log-likelihood.over . Since there is no closed form for the CML estimates, these estimates are achieved using numerical methods. The asymptotic properties of the CML estimators follow from Freeland and McCabe (2004).

Conditional Least Squares Estimation

In this subsection, we describe the estimation of the unknown parameters of the INARZOIG(1) process using the two-step CLS estimation method proposed by Karlsen and Tjøstheim (1988) which is conducted by the following two steps. Step 1 Let , then the conditional least square (CLS) estimators of the parameters and are obtained by minimizing the functionwhere , and are given byandStep 2 Let and , . Thenwhere Therefore, the CLS criterion function for can be written asThe CLS estimator of are obtained by numerical solution of (22). Step 3 Based on the results from Steps 1 and 2, the estimator of can be obtained by considering the following equation:Therefore, the resulting CLS estimators is . To study the asymptotic behavior of the estimators, we make the following assumptions, (C1) is a stationary and ergodic process. (C2)

Proposition 3

Under the assumptions (C1) and (C2), the CLS estimator is strongly consistent and asymptotically normal,where denote the true value of , , , and .

Proposition 4

Under the assumptions (C1) and (C2), the CLS estimator is strongly consistent and asymptotically normal,where denotes the true value of , , , and Based on Propositions 3 and 4 and Theorem 3.2 in Nicholls and Quinn (1982), we have the following proposition.

Proposition 5

Under the assumptions (C1) and (C2), the CLS estimator is strongly consistent and asymptotically normal,where and denotes the true value of Based on the above proposition, we state the strong consistency and asymptotic normality of in the following proposition.

Proposition 6

Under the assumptions (C1) and (C2), the CLS estimator is strongly consistent and asymptotically normal,where denotes the true values of andand . The brief proofs of Propositions 3–6 are given in Appendix.

Numerical Illustration

This part of the paper includes two subsections. In the first part, the performance of the estimation methods, which are presented in the previous section, is evaluated through a simulation study. Moreover, the empirical distribution of the simulated sample path in points zero and one are compared with the results of the Eqs. (15) and (16). To ensure the practical performance of the proposed process, the second part is focused on two real-life application series: the number of daily infected cases due to COVID-19 in Barbados, available in https://ourworldindata.org/covid-cases and the Poliomyelitis data from Zeger (1988) and Maiti et al. (2018).

Simulation

To conduct the simulation study, we need to generate a random sample from the INARZOIG(1) process. Based on the second stochastic representation in Zhang et al. (2016), we first generate a random sample from ZOIG and then simulate from INARZOIG(1) model. The simulation comprised the following steps: Step 1 Generate form , Step 2. From Bernoulli(p) generate , Step 3. From generate , Step 4. Use for i = 1, ..., n, generate , where and . According to the above algorithm, we generate a random sample (with n = 1000) from the INARZOIG(1) process with , , and . The sample path and barplot of the marginal distribution of this simulated count time series is presented in Fig. 1.

Fig. 1

Barplots of limiting marginal distribution and sample paths of the simulated INARZOIG(1) process for , , and

Barplots of limiting marginal distribution and sample paths of the simulated INARZOIG(1) process for , , and As can be seen from Fig. 1, for all values of and larger values of , the sample path tends to have larger values. But for all values of and smaller values of , the process has a strong tendency to return to zero or one values with less mean and variance which is clear from parts (b) and (d) of Proposition 1. In addition, Fig. 1 shows that the number of zeros and ones increases by decreasing the values of . To compare the performance of the CML and the CLS estimators, we simulate the data for n = 50, 100, 200, 500, 1000, , , and with 10,000 replications. Mean and mean squared error (MSE) of the estimates are computed to evaluate the estimates. The function “nlminb” in “R” is used to obtaining these estimates. The results of the simulation are given in Tables 1 and 2. These tables show that the CML estimate is performed better than CLS estimate because of smaller MSE (except for a few cases).

Table 1

Mean and MSE for CML and CLS estimators for

n	Method	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}α		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\phi }_0$$\end{document}ϕ0		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\phi }_1$$\end{document}ϕ1		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta $$\end{document}θ
n	Method	Mean	MSE	Mean	MSE	Mean	MSE	Mean	MSE
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.1$$\end{document}ϕ0=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1
50	CML	0.1914	0.0121	0.1287	0.0146	0.1244	0.0111	1.0248	0.0384
50	CLS	0.1994	0.0147	0.0994	0.0033	0.1010	0.0033	1.0062	0.0928
100	CML	0.1965	0.0069	0.1134	0.0083	0.1096	0.0059	1.0104	0.0155
100	CLS	0.1916	0.0094	0.0992	0.0032	0.0996	0.0033	1.1016	0.0518
200	CML	0.1977	0.0035	0.1051	0.0049	0.1025	0.0033	1.0039	0.0076
200	CLS	0.1915	0.0054	0.0998	0.0032	0.1007	0.0033	1.0144	0.0286
500	CML	0.1992	0.0014	0.1007	0.0022	0.1008	0.0013	1.0004	0.0026
500	CLS	0.1955	0.0022	0.0997	0.0033	0.1001	0.0033	1.0101	0.0151
1000	CML	0.1993	0.0006	0.0995	0.0011	0.1002	0.0006	1.0001	0.0012
1000	CLS	0.1978	0.0011	0.0999	0.0033	0.1000	0.0032	1.0088	0.0104
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.4$$\end{document}ϕ0=0.4, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1
50	CML	0.1853	0.0139	0.4024	0.0198	0.1115	0.0080	1.0062	0.0201
50	CLS	0.1981	0.0152	0.2507	0.0428	0.0997	0.0033	0.8215	0.1700
100	CML	0.1918	0.0071	0.3998	0.0109	0.1061	0.0046	0.9999	0.0094
100	CLS	0.1902	0.0100	0.2499	0.0437	0.1003	0.0033	0.8245	0.1242
200	CML	0.1967	0.0036	0.3985	0.0060	0.1027	0.0025	0.9984	0.0056
200	CLS	0.1924	0.0059	0.2488	0.0437	0.1006	0.0033	0.8169	0.0977
500	CML	0.1978	0.0014	0.3983	0.0023	0.1005	0.0009	0.9989	0.0024
500	CLS	0.1960	0.0025	0.2507	0.0431	0.0985	0.0033	0.8145	0.0826
1000	CML	0.1997	0.0006	0.3998	0.0011	0.1003	0.0004	0.9995	0.0011
1000	CLS	0.1986	0.0012	0.2506	0.0431	0.0992	0.0032	0.8123	0.0782

Table 2

Mean and MSE for CML and CLS estimators for

n	Method	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha $$\end{document}α		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\phi }_0$$\end{document}ϕ0		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\phi }_1$$\end{document}ϕ1		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta $$\end{document}θ
n	Method	Mean	MSE	Mean	MSE	Mean	MSE	Mean	MSE
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.1$$\end{document}ϕ0=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1
50	CML	0.1912	0.0124	0.1292	0.0144	0.1217	0.0111	1.0259	0.0393
50	CLS	0.1943	0.0136	0.1005	0.0033	0.0992	0.0033	3.0492	0.6482
100	CML	0.1944	0.0067	0.1125	0.0081	0.1102	0.0059	1.0100	0.0161
100	CLS	0.1918	0.0088	0.0993	0.0033	0.1005	0.0033	3.0630	0.3971
200	CML	0.1993	0.0016	0.1014	0.0037	0.1010	0.0028	3.0014	0.0020
200	CLS	0.1925	0.0049	0.0996	0.0033	0.1004	0.0033	3.0539	0.2405
500	CML	0.1995	0.0006	0.0999	0.0015	0.1006	0.0012	2.9992	0.0004
500	CLS	0.1961	0.0021	0.1004	0.0033	0.0996	0.0033	3.0403	0.1432
1000	CML	0.1996	0.0003	0.1002	0.0007	0.1002	0.0005	2.9996	0.0001
1000	CLS	0.1982	0.0010	0.1003	0.0033	0.0993	0.0033	3.0319	0.1055
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.4$$\end{document}ϕ0=0.4, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1
50	CML	0.1973	0.0064	0.3951	0.0139	0.1089	0.0066	2.9966	0.0135
50	CLS	0.1912	0.0146	0.2505	0.0431	0.0991	0.0033	2.4848	1.3653
100	CML	0.1987	0.0031	0.3978	0.0067	0.1043	0.0036	2.9991	0.0022
100	CLS	0.1874	0.0091	0.2521	0.0427	0.1002	0.0033	2.4954	0.9963
200	CML	0.1991	0.0014	0.3992	0.0033	0.1014	0.0019	2.9999	0.0004
200	CLS	0.1919	0.0051	0.2513	0.0042	0.0993	0.0033	2.4793	0.8224
500	CML	0.2001	0.0005	0.4002	0.0013	0.1009	0.0007	2.9998	0.0001
500	CLS	0.1965	0.0021	0.2473	0.0443	0.0994	0.0033	2.4504	0.7604
1000	CML	0.2002	0.0002	0.3997	0.0006	0.1004	0.0003	3.0001	0.00001
1000	CLS	0.1985	0.0011	0.2479	0.0440	0.1003	0.0033	2.4443	0.7251

Mean and MSE for CML and CLS estimators for Mean and MSE for CML and CLS estimators for In Table 3, we compare the empirical distribution of the simulated sample path with Eqs. (15) and (16) and it can be seen that for different values of n and other parameters of the model the estimated values of the proportion of zeros and ones are near to the theoretical values of them.

Table 3

Estimated values of the proportion of zeros and ones in the simulated data from INARZOIG(1) processes for different values of n

n	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{P}_{0}$$\end{document}P^0	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{P}_{1}$$\end{document}P^1	n	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{P}_{0}$$\end{document}P^0	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{P}_{1}$$\end{document}P^1
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.1$$\end{document}ϕ0=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta =1$$\end{document}θ=1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=0)=0.4$$\end{document}P(Xt=0)=0.4, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=1)=0.31$$\end{document}P(Xt=1)=0.31			\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.1$$\end{document}ϕ0=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta =3$$\end{document}θ=3 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=0)=0.18$$\end{document}P(Xt=0)=0.18, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=1)=0.21$$\end{document}P(Xt=1)=0.21
50	0.4	0.32	50	0.18	0.16
100	0.38	0.30	100	0.21	0.19
200	0.41	0.31	200	0.17	0.21
500	0.39	0.33	500	0.18	0.22
1000	0.41	0.31	1000	0.18	0.21
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.4$$\end{document}ϕ0=0.4, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta =1$$\end{document}θ=1 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=0)=0.57$$\end{document}P(Xt=0)=0.57, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=1)=0.26$$\end{document}P(Xt=1)=0.26			\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\alpha =0.2$$\end{document}α=0.2, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _0=0.4$$\end{document}ϕ0=0.4, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\phi _1=0.1$$\end{document}ϕ1=0.1, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta =3$$\end{document}θ=3 \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=0)=0.39$$\end{document}P(Xt=0)=0.39, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P(X_{t}=1)=0.22$$\end{document}P(Xt=1)=0.22
50	0.60	0.24	50	0.40	0.20
100	0.55	0.27	100	0.40	0.30
200	0.57	0.26	200	0.36	0.26
500	0.58	0.27	500	0.41	0.22
1000	0.58	0.27	1000	0.40	0.23

Estimated values of the proportion of zeros and ones in the simulated data from INARZOIG(1) processes for different values of n , , , , , , , , , , , , , , , ,

Real Data

In this subsection, using two real-life data sets, we show the applicability of the INARZOIG(1). In the first example, we use the data of new infected cases in Barbados from March 17, 2020, until January 02, 2021, and in the second, we considered the Poliomyelitis data which are the monthly cases in the USA from 1970 to 1983. To compare INARZOIG(1) model with various INAR(1) models such as OMGINAR(1) (one modified geometric INAR(1) model), PINAR(1) (INAR(1) process with Poisson-distributed innovations), ZIPINAR(1) (INAR(1) process with zero-inflated Poisson-distributed innovations), OIPINAR(1) (INAR(1) process with one-inflated Poisson-distributed innovations), ZOIPINAR(1) (INAR(1) process with zero–one-inflated Poisson-distributed innovations), ZOIPLINAR(1) (INAR(1) process with zero–one-inflated Poisson–Lindley distributed innovations, INARG(1) (INAR(1) process with geometric-distributed innovations), INARZIG(1) (INAR(1) process with zero-inflated geometric-distributed innovations) and INAROIG(1) (INAR(1) process with one-inflated geometric-distributed innovations) for these data sets. We use the AIC (Akaike information criterion), loglik (log-likelihood function), AICc (corrected version of the AIC), BIC (Bayesian information criterion), PMAE(h) (predicted mean absolute error), and the PTP(h) (percentage of true prediction ) criteria where the last two criteria are the h-step ahead forecasting accuracy measures. To calculate the last two measures, we divide the data into two parts. The first part is used to fit the considered models, and the second part which is the last 20 observations is used to compute the and then the PMAE(h) and PTP(h) are computed for h = 1.

COVID-19 Data in Barbados

In this subsection, using a real data set, we show the applicability of the INARZOIG(1). We use the data of new infected cases in Barbados from the 17th of March 2020 until the 2nd of January 2021. This data set has 292 observations for which 148 (51%) of observations are zero and 64 (22%) of observations are one, and the other 80 (27%) of observations had infected cases more than one. The mean and variance of observations are 1.35 and 5.60, respectively, and hence, the Fisher index of them is given as 4.15 and it shows that the data are overdispersed. The barplot, series plot, ACF and PACF are plotted in Figs. 2 and 3, respectively. It is noted that the PACF yields significant lags with values greater than one, and it seems that the INAR with an order greater than 1 is suitable for the data. To determine the order of the process, we use the portmanteau tests introduced by Foroughi et al. (2021) with m = 2, 3, 4, 8, 12. We want to test the null hypothesis that the data set follows the INAR(1) versus the alternative hypothesis that the data follow the INAR(p) with . The p values are reported in the Table 4 and show that order 1 is appropriate for this data set.

Fig. 2

Barplot and series plot of the new infected cases in Barbados

Fig. 3

ACF and PACF of the new infected cases in Barbados

Table 4

The p values of the portmanteau tests for different values of m

m	Test statistics
m	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q_{\mathrm{LB}}$$\end{document}QLB	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q_{\mathrm{BP}}$$\end{document}QBP	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Q_{\mathrm{LM}}$$\end{document}QLM
2	0.3669	0.3703	0.3661
3	0.4645	0.4386	0.4381
4	0.3779	0.3829	0.3824
8	0.3458	0.3528	0.3522
12	0.4927	0.4995	0.4986

Barplot and series plot of the new infected cases in Barbados ACF and PACF of the new infected cases in Barbados The p values of the portmanteau tests for different values of m The relative frequencies of zeros and ones and Fig. 2 show the inflation in zeros and ones. The extra zeros and ones motivated us to use INARZOIG for this data set. To show the adequacy of the INARZOIG(1), we compare the proposed model with OMGINAR(1), PINAR(1), ZIPINAR(1), OIPINAR(1), ZOIPINAR(1), ZOIPLINAR(1), INARG(1), INARZIG(1) and INAROIG(1) based on the mentioned criteria. The results in the Table 5, show that the INARZOIG(1) has the best fit since it has the largest Loglik and smallest values of other criteria except for BIC which are indicated by bold numbers. In the sense of BIC, INARZIG(1) is the best model and to show that INARZOIG(1) is more suitable for this data set than the INARZIG(1), we use the likelihood ratio test (LRT) with the following hypothesis.The LRT statistics is equal to 3.937 and the critical value at level 0.05 is equal to 3.841. Hence, we can conclude that the null hypothesis rejects and the zero-and-one-inflated distribution is more suitable than zero-inflated model for this data set. Also, we calculated two forecasting accuracy measures; however, they are the same for all models and PMAE is equal to 4.45 and PTP is equal to 20.

Table 5

Parameter estimations and their standard errors and Loglik, AIC, AICc and BIC criteria for compared models that are fitted to daily new infected cases of COVID-19 in Barbados

Model	Estimated values (SE)	AIC	Loglik	AICc	BIC
PINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha } = 0.1482 (0.0305)$$\end{document}α^=0.1482(0.0305)	1184.856	− 590.428	1184.897	1192.210
PINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\lambda }=1.1493 (0.0712)$$\end{document}λ^=1.1493(0.0712)
ZIPINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.1903 (0.0314)$$\end{document}α^=0.1903(0.0314)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\lambda }=2.7531 (0.1892)$$\end{document}λ^=2.7531(0.1892)	992.236	− 493.118	992.319	1003.266
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_0 =0.6033 (0.0350)$$\end{document}ϕ^0=0.6033(0.0350)
OIPINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.1858 (0.0311)$$\end{document}α^=0.1858(0.0311)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\lambda }=1.0747 (0.0590)$$\end{document}λ^=1.0747(0.0590)	1147.017	− 570.509	1147.100	1158.047
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_1= 0.0000 (0.0433)$$\end{document}ϕ^1=0.0000(0.0433)
ZOIPINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.1669 (0.0370)$$\end{document}α^=0.1669(0.0370)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\lambda }=3.9909 (0.3100)$$\end{document}λ^=3.9909(0.3100)	949.333	− 470.666	949.471	964.039
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_0 =0.5890 (0.0350)$$\end{document}ϕ^0=0.5890(0.0350)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_1= 0.1723 (0.0303)$$\end{document}ϕ^1=0.1723(0.0303)
ZOIPLINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.1391 (0.0393)$$\end{document}α^=0.1391(0.0393)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }= 0.6411 (0.0816)$$\end{document}θ^=0.6411(0.0816)	908.542	− 450.271	908.682	923.249
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\phi }_0= 0.4793 (0.0507)$$\end{document}ϕ^0=0.4793(0.0507)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\phi }_1= 0.0970 (0.0367)$$\end{document}ϕ^1=0.0970(0.0367)
INARG(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.0763 (0.0398) $$\end{document}α^=0.0763(0.0398)	933.106	− 464.553	933.148	940.460
INARG(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }=1.2472 (0.1105)$$\end{document}θ^=1.2472(0.1105)
INARZIG(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.1445 (0.0371)$$\end{document}α^=0.1445(0.0371)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }= 1.8385 (0.2208)$$\end{document}θ^=1.8385(0.2208)	908.344	− 451.172	908.428	919.375
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_0=0.3720 (0.0624)$$\end{document}ϕ^0=0.3720(0.0624)
INAROIG(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }=0.1078 (0.0399) $$\end{document}α^=0.1078(0.0399)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta } =1.1887 (0.1035)$$\end{document}θ^=1.1887(0.1035)	930.972	− 462.486	931.553	942.002
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_1=0.000 (0.0389)$$\end{document}ϕ^1=0.000(0.0389)
INARZOIG(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.1381 (0.0393)$$\end{document}α^=0.1381(0.0393)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }= 2.1965 (0.0587)$$\end{document}θ^=2.1965(0.0587)	906.407	− 449.204	906.547	921.114
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_0 = 0.4284 (0.0383)$$\end{document}ϕ^0=0.4284(0.0383)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\hat{\phi }}_1= 0.0772 (0.0334)$$\end{document}ϕ^1=0.0772(0.0334)

Parameter estimations and their standard errors and Loglik, AIC, AICc and BIC criteria for compared models that are fitted to daily new infected cases of COVID-19 in Barbados The last figure shows the daily new infected cases of COVID-19 in Barbados and their predicted values using INARZOIG(1). As can be seen, the predicted values are closed to the original data, which indicates the good performance of the proposed fitted model in the sense of forecasting (Fig. 4).

Fig. 4

Daily new infected cases of COVID-19 in Barbados and their predicted values using INARZOIG(1)

Poliomyelitis Data

In this subsection, we considered the Poliomyelitis data which are the monthly cases in the USA from 1970 to 1983. These data were analyzed by Zeger (1988) for the first time. This data set has 168 observations for which 64 (38%) of observations are zero and 55 (32%) of observations are one, and the other 49 (30%) of observations had monthly cases more than one. The mean and variance of observations are 1.33 and 3.50, respectively, and hence, the Fisher index of them is given as 2.63. The value of the Fisher index indicates that the data are overdispersed. Recently, Maiti et al. (2018) considered these data and fitted most of the existing INAR(1) models including Poisson INAR(1), overdispersed models such as geometric INAR(1) and compound Poisson INAR(1), zero-inflated models like zero-inflated and zero-modified INAR(1) and their proposed sub-model, the one-modified geometric INAR(1)(OMGINAR(1)). Using some goodness of fit criteria and 1-step ahead forecasting accuracy measures, they showed that OMGINAR(1) had the best fit among all considered models. Now, we analyze the data further. First, we plot the barplot and the series plot in Fig. 5. These figures and the frequencies of the observed zeros and ones show the extra number of zeros and ones. This fact and the overdispersion of the data, motivated us to fit the INARZOIG(1) model into this data set. The ACF and PACF of the data are plotted in Fig. 6.

Fig. 5

Barplot and series plot of monthly cases of poliomyelitis data in the USA from 1970 to 1983

Fig. 6

ACF and PACF of the monthly cases of poliomyelitis data in the USA from 1970 to 1983

Barplot and series plot of monthly cases of poliomyelitis data in the USA from 1970 to 1983 ACF and PACF of the monthly cases of poliomyelitis data in the USA from 1970 to 1983 Based on the conclusions of Maiti et al. (2018) about the considered data set, we compare our model with OMGINAR(1) and used the reported criteria in that paper for this model. Also, we considered the ZOIPLINAR(1), introduced by Mohammadi et al. (2021), as another alternative to compare with. We use the Loglik, AIC, AICc, and BIC criteria, and the results are reported in Table 6. As can be seen, the INARZOIG(1) model has the largest Loglik and smallest AIC, AICc, but the value of the BIC of the OMGINAR(1) is the smallest BIC. Nevertheless, based on Raftery (1995), since the difference between these values is less than 2, it is not significant and the other criteria show that the INARZOIG(1) is more suitable for this data set. We can conclude that our introduced model has the best fit on this data set; however, the forecasting accuracy measures are the same when PMAE is equal to 0.95 and PTP is equal to 45 for all considered models. Moreover, from Fig. 7 that shows the plot of the Poliomyelitis data and their predicted values, it can be seen that the predicted values are found to be almost close to the real data. This figure indicates the good performance of the INARZOIG(1) in the sense of forecasting, too.

Table 6

Parameter estimation and Loglik, AIC, BIC and AICc criteria for compared models for Poliomyelitis data

Model	Estimated values	Loglik	AIC	AICc	BIC
OMGINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.068$$\end{document}α^=0.068
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{p}=0.882$$\end{document}p^=0.882	− 264.005	534.01	534.1563	543.3819
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }=0.577$$\end{document}θ^=0.577
ZOIPLINAR(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.0845 (0.0493)$$\end{document}α^=0.0845(0.0493)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }= 0.9116 (0.1613)$$\end{document}θ^=0.9116(0.1613)	− 262.411	532.823	533.0685	545.318
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\phi }_0= 0.1887 (0.0970)$$\end{document}ϕ^0=0.1887(0.0970)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\phi }_1= 0.1881 (0.0660)$$\end{document}ϕ^1=0.1881(0.0660)
INARZOIG(1)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\alpha }= 0.0817 (0.0496)$$\end{document}α^=0.0817(0.0496)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\theta }= 1.4812 (0.3066)$$\end{document}θ^=1.4812(0.3066)	− 262.0769	532.1538	532.3992	544.6497
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\phi }_0= 0.1124 (0.1151)$$\end{document}ϕ^0=0.1124(0.1151)
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{\phi }_1= 0.1656 (0.0691)$$\end{document}ϕ^1=0.1656(0.0691)

Fig. 7

Poliomyelitis data and their predicted values using INARZOIG(1)

Parameter estimation and Loglik, AIC, BIC and AICc criteria for compared models for Poliomyelitis data Poliomyelitis data and their predicted values using INARZOIG(1)

Conclusion

This paper analyzes the zero-and-one-inflated time series using a flexible INAR(1) model based on the zero-and-one-inflated geometric-distributed innovation. The main properties of this novel INAR(1) process are established, and its model parameters are estimated via the CML and CLS approaches. The performance of the two estimation techniques is assessed through some Monte Carlo experiments wherein both approaches are shown to provide consistent estimates, but with the CML approach providing lesser biased estimates. Furthermore, the INARZOIG(1) model is applied to analyze the COVID-19 series from Barbados, which is found to consist of a more frequent number of zeros and ones. Also, using the portmanteau test we indicate that order 1 is suitable for this data set. As a next example, this model is applied to another real data set which is the monthly cases in the USA from 1970 to 1983 that was analyzed by Zeger (1988) for the first time. Under the data applications, the INARZOIG(1) model is shown to provide better fitting criteria than the existing competing models. Evidently, the statistical performance of the INARZOIG(1) depends on the nature of the data as well, but overall, the INARZOIG(1) model has a worthy contribution to the class of INAR models.

3 in total

Modeling Medical Data by Flexible Integer-Valued AR(1) Process with Zero-and-One-Inflated Geometric Innovations.

Introduction

Model Construction and Properties

Proposition 1

Proposition 2

Proof

Remark 1

Distributions of Lengths of Zeros and Ones

Theorem 1

Proof

Theorem 2

Parameter Estimation

Conditional Maximum Likelihood Estimation

Conditional Least Squares Estimation

Proposition 3

Proposition 4

Proposition 5

Proposition 6

Numerical Illustration

Simulation

Real Data

COVID-19 Data in Barbados

Poliomyelitis Data

Conclusion

1. Studying the trend of the novel coronavirus series in Mauritius and its implications.

2. A New Extension of Thinning-Based Integer-Valued Autoregressive Models for Count Data.

3. A New First-Order Integer-Valued Autoregressive Model with Bell Innovations.