| Literature DB >> 35645547 |
Zohreh Mohammadi1, Zahra Sajjadnia2, Maryam Sharafi2, Naushad Mamode Khan3.
Abstract
In this paper, we introduce a new stationary first-order integer-valued autoregressive process (INAR) with zero-and-one-inflated geometric innovations that is useful for modeling medical practical data. Basic probabilistic and statistical properties of the model are discussed. Conditional least squares and maximum likelihood estimators are proposed to estimate the model parameters. The performance of the estimation methods is assessed by some Monte Carlo simulation experiments. The zero-and-one-inflated INAR process is subsequently applied to analyze two medical series that include the number of new COVID-19-infected series from Barbados and Poliomyelitis data. The proposed model is compared with other popular competing zero-inflated and zero-and-one-inflated INAR models on the basis of some goodness-of-fit statistics and selection criteria, where it shows to provide better fitting and hence can be considered as another important commendable model in the class of INAR models.Entities:
Keywords: Binomial thinning operator; Estimation; Geometric distribution; INAR process; Runs; Zero-and-one-inflated geometric distribution
Year: 2022 PMID: 35645547 PMCID: PMC9124749 DOI: 10.1007/s40995-022-01297-3
Source DB: PubMed Journal: Iran J Sci Technol Trans A Sci ISSN: 1028-6276 Impact factor: 1.553
Fig. 1Barplots of limiting marginal distribution and sample paths of the simulated INARZOIG(1) process for , , and
Mean and MSE for CML and CLS estimators for
| Method | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | MSE | Mean | MSE | Mean | MSE | Mean | MSE | ||
| 50 | CML | 0.1914 | 0.0121 | 0.1287 | 0.0146 | 0.1244 | 0.0111 | 1.0248 | 0.0384 |
| CLS | 0.1994 | 0.0147 | 0.0994 | 0.0033 | 0.1010 | 0.0033 | 1.0062 | 0.0928 | |
| 100 | CML | 0.1965 | 0.0069 | 0.1134 | 0.0083 | 0.1096 | 0.0059 | 1.0104 | 0.0155 |
| CLS | 0.1916 | 0.0094 | 0.0992 | 0.0032 | 0.0996 | 0.0033 | 1.1016 | 0.0518 | |
| 200 | CML | 0.1977 | 0.0035 | 0.1051 | 0.0049 | 0.1025 | 0.0033 | 1.0039 | 0.0076 |
| CLS | 0.1915 | 0.0054 | 0.0998 | 0.0032 | 0.1007 | 0.0033 | 1.0144 | 0.0286 | |
| 500 | CML | 0.1992 | 0.0014 | 0.1007 | 0.0022 | 0.1008 | 0.0013 | 1.0004 | 0.0026 |
| CLS | 0.1955 | 0.0022 | 0.0997 | 0.0033 | 0.1001 | 0.0033 | 1.0101 | 0.0151 | |
| 1000 | CML | 0.1993 | 0.0006 | 0.0995 | 0.0011 | 0.1002 | 0.0006 | 1.0001 | 0.0012 |
| CLS | 0.1978 | 0.0011 | 0.0999 | 0.0033 | 0.1000 | 0.0032 | 1.0088 | 0.0104 | |
| 50 | CML | 0.1853 | 0.0139 | 0.4024 | 0.0198 | 0.1115 | 0.0080 | 1.0062 | 0.0201 |
| CLS | 0.1981 | 0.0152 | 0.2507 | 0.0428 | 0.0997 | 0.0033 | 0.8215 | 0.1700 | |
| 100 | CML | 0.1918 | 0.0071 | 0.3998 | 0.0109 | 0.1061 | 0.0046 | 0.9999 | 0.0094 |
| CLS | 0.1902 | 0.0100 | 0.2499 | 0.0437 | 0.1003 | 0.0033 | 0.8245 | 0.1242 | |
| 200 | CML | 0.1967 | 0.0036 | 0.3985 | 0.0060 | 0.1027 | 0.0025 | 0.9984 | 0.0056 |
| CLS | 0.1924 | 0.0059 | 0.2488 | 0.0437 | 0.1006 | 0.0033 | 0.8169 | 0.0977 | |
| 500 | CML | 0.1978 | 0.0014 | 0.3983 | 0.0023 | 0.1005 | 0.0009 | 0.9989 | 0.0024 |
| CLS | 0.1960 | 0.0025 | 0.2507 | 0.0431 | 0.0985 | 0.0033 | 0.8145 | 0.0826 | |
| 1000 | CML | 0.1997 | 0.0006 | 0.3998 | 0.0011 | 0.1003 | 0.0004 | 0.9995 | 0.0011 |
| CLS | 0.1986 | 0.0012 | 0.2506 | 0.0431 | 0.0992 | 0.0032 | 0.8123 | 0.0782 | |
Mean and MSE for CML and CLS estimators for
| Method | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Mean | MSE | Mean | MSE | Mean | MSE | Mean | MSE | ||
| 50 | CML | 0.1912 | 0.0124 | 0.1292 | 0.0144 | 0.1217 | 0.0111 | 1.0259 | 0.0393 |
| CLS | 0.1943 | 0.0136 | 0.1005 | 0.0033 | 0.0992 | 0.0033 | 3.0492 | 0.6482 | |
| 100 | CML | 0.1944 | 0.0067 | 0.1125 | 0.0081 | 0.1102 | 0.0059 | 1.0100 | 0.0161 |
| CLS | 0.1918 | 0.0088 | 0.0993 | 0.0033 | 0.1005 | 0.0033 | 3.0630 | 0.3971 | |
| 200 | CML | 0.1993 | 0.0016 | 0.1014 | 0.0037 | 0.1010 | 0.0028 | 3.0014 | 0.0020 |
| CLS | 0.1925 | 0.0049 | 0.0996 | 0.0033 | 0.1004 | 0.0033 | 3.0539 | 0.2405 | |
| 500 | CML | 0.1995 | 0.0006 | 0.0999 | 0.0015 | 0.1006 | 0.0012 | 2.9992 | 0.0004 |
| CLS | 0.1961 | 0.0021 | 0.1004 | 0.0033 | 0.0996 | 0.0033 | 3.0403 | 0.1432 | |
| 1000 | CML | 0.1996 | 0.0003 | 0.1002 | 0.0007 | 0.1002 | 0.0005 | 2.9996 | 0.0001 |
| CLS | 0.1982 | 0.0010 | 0.1003 | 0.0033 | 0.0993 | 0.0033 | 3.0319 | 0.1055 | |
| 50 | CML | 0.1973 | 0.0064 | 0.3951 | 0.0139 | 0.1089 | 0.0066 | 2.9966 | 0.0135 |
| CLS | 0.1912 | 0.0146 | 0.2505 | 0.0431 | 0.0991 | 0.0033 | 2.4848 | 1.3653 | |
| 100 | CML | 0.1987 | 0.0031 | 0.3978 | 0.0067 | 0.1043 | 0.0036 | 2.9991 | 0.0022 |
| CLS | 0.1874 | 0.0091 | 0.2521 | 0.0427 | 0.1002 | 0.0033 | 2.4954 | 0.9963 | |
| 200 | CML | 0.1991 | 0.0014 | 0.3992 | 0.0033 | 0.1014 | 0.0019 | 2.9999 | 0.0004 |
| CLS | 0.1919 | 0.0051 | 0.2513 | 0.0042 | 0.0993 | 0.0033 | 2.4793 | 0.8224 | |
| 500 | CML | 0.2001 | 0.0005 | 0.4002 | 0.0013 | 0.1009 | 0.0007 | 2.9998 | 0.0001 |
| CLS | 0.1965 | 0.0021 | 0.2473 | 0.0443 | 0.0994 | 0.0033 | 2.4504 | 0.7604 | |
| 1000 | CML | 0.2002 | 0.0002 | 0.3997 | 0.0006 | 0.1004 | 0.0003 | 3.0001 | 0.00001 |
| CLS | 0.1985 | 0.0011 | 0.2479 | 0.0440 | 0.1003 | 0.0033 | 2.4443 | 0.7251 | |
Estimated values of the proportion of zeros and ones in the simulated data from INARZOIG(1) processes for different values of n
| 50 | 0.4 | 0.32 | 50 | 0.18 | 0.16 |
| 100 | 0.38 | 0.30 | 100 | 0.21 | 0.19 |
| 200 | 0.41 | 0.31 | 200 | 0.17 | 0.21 |
| 500 | 0.39 | 0.33 | 500 | 0.18 | 0.22 |
| 1000 | 0.41 | 0.31 | 1000 | 0.18 | 0.21 |
| 50 | 0.60 | 0.24 | 50 | 0.40 | 0.20 |
| 100 | 0.55 | 0.27 | 100 | 0.40 | 0.30 |
| 200 | 0.57 | 0.26 | 200 | 0.36 | 0.26 |
| 500 | 0.58 | 0.27 | 500 | 0.41 | 0.22 |
| 1000 | 0.58 | 0.27 | 1000 | 0.40 | 0.23 |
Fig. 2Barplot and series plot of the new infected cases in Barbados
Fig. 3ACF and PACF of the new infected cases in Barbados
The p values of the portmanteau tests for different values of m
| Test statistics | |||
|---|---|---|---|
| 2 | 0.3669 | 0.3703 | 0.3661 |
| 3 | 0.4645 | 0.4386 | 0.4381 |
| 4 | 0.3779 | 0.3829 | 0.3824 |
| 8 | 0.3458 | 0.3528 | 0.3522 |
| 12 | 0.4927 | 0.4995 | 0.4986 |
Parameter estimations and their standard errors and Loglik, AIC, AICc and BIC criteria for compared models that are fitted to daily new infected cases of COVID-19 in Barbados
| Model | Estimated values (SE) | AIC | Loglik | AICc | BIC |
|---|---|---|---|---|---|
| PINAR(1) | 1184.856 | − 590.428 | 1184.897 | 1192.210 | |
| ZIPINAR(1) | |||||
| 992.236 | − 493.118 | 992.319 | 1003.266 | ||
| OIPINAR(1) | |||||
| 1147.017 | − 570.509 | 1147.100 | 1158.047 | ||
| ZOIPINAR(1) | |||||
| 949.333 | − 470.666 | 949.471 | 964.039 | ||
| ZOIPLINAR(1) | |||||
| 908.542 | − 450.271 | 908.682 | 923.249 | ||
| INARG(1) | 933.106 | − 464.553 | 933.148 | 940.460 | |
| INARZIG(1) | |||||
| 908.344 | − 451.172 | 908.428 | |||
| INAROIG(1) | |||||
| 930.972 | − 462.486 | 931.553 | 942.002 | ||
| INARZOIG(1) | |||||
| − | |||||
Fig. 4Daily new infected cases of COVID-19 in Barbados and their predicted values using INARZOIG(1)
Fig. 5Barplot and series plot of monthly cases of poliomyelitis data in the USA from 1970 to 1983
Fig. 6ACF and PACF of the monthly cases of poliomyelitis data in the USA from 1970 to 1983
Parameter estimation and Loglik, AIC, BIC and AICc criteria for compared models for Poliomyelitis data
| Model | Estimated values | Loglik | AIC | AICc | BIC |
|---|---|---|---|---|---|
| OMGINAR(1) | |||||
| − 264.005 | 534.01 | 534.1563 | 543.3819 | ||
| ZOIPLINAR(1) | |||||
| − 262.411 | 532.823 | 533.0685 | 545.318 | ||
| INARZOIG(1) | |||||
| − 262.0769 | 532.1538 | 532.3992 | 544.6497 | ||
Fig. 7Poliomyelitis data and their predicted values using INARZOIG(1)