Literature DB >> 35941888

Modified unbiased estimators for population variance: An application for COVID-19 deaths in Russia.

Hatice Oncel Cekim1.   

Abstract

The article deal with the class unbiased forms of the variance estimators using Hartley-Ross type in the simple random sampling. The mean squared error (since it is an unbiased estimator, variance is calculated) of the suggested, up to the first order of approximation, is derived. The proposed estimator using COVID-19 data in Russia has been proven to be more efficient than the considered estimators under the conditions. Thus, it allows us to see the variance estimator that best predicts the change according to federal regions in Russia's number of COVID-19 deaths. This study also provides an unbiased family of estimators that are more efficient than existing estimators, which estimate the variance of the total death number of COVID-19 based on the daily new cases number. The most important difference of this article from other studies in the literature is that it is the first study to examine the variance of the COVID-19 cumulative mortality value in terms of variance estimation using the simple random sampling method with the auxiliary variable.
© 2022 John Wiley & Sons, Ltd.

Entities:  

Keywords:  COVID‐19; Hartley–Ross type estimator; death number; unbiased estimator; variance estimator

Year:  2022        PMID: 35941888      PMCID: PMC9349872          DOI: 10.1002/cpe.7169

Source DB:  PubMed          Journal:  Concurr Comput        ISSN: 1532-0626            Impact factor:   1.831


INTRODUCTION

Since January 2020, the Coronavirus disease 2019 (COVID‐19), which has spread from Wuhan, China, affecting all countries around the world, has been a serious global crisis. On March 11, WHO declared COVID‐19 a “global epidemic”. COVID‐19 has increased rapidly since the first week it emerged in Russia. The total number of cases and the number of deaths reached approximately 11 million and 325,000 as of January 2022 (https://www.worldometers.info/coronavirus/country/Russia). However, the number of cases and mortality may differ from city to city, region to region or federal state to federal state. This may be because each part (interested unit) has different population density, health care, safeguard measures, substructure or climatic. The best parameter value that can show this difference is variance. Variance is one of the most commonly used measures of variability to describe the change in a data set. For this reason, it is desirable to know the variance prior to many studies. In cases where the variance is not known prior, it can be estimated with minor errors with the help of sampling. For this purpose, many variance estimators are proposed in sampling methods. The superiority of the proposed estimators is evaluated with many statistical criteria. Generally, it is required to provide the features of the best estimator, such as having the minimum mean squared error (MSE) value and having a zero biased value, that is, unbiased. This study proposes a family of unbiased estimators with a minor MSE. These proposed variance estimators can be of great importance in any field where variance (change in series) is actively used. In other words, the estimators are vital for all sectors; for example, the tourism sector as a country needs to plan significant scale building works or for aircraft companies as the purchase of new aircraft would require managerial decisions to be made well in advance. It is a guide for those who want to know a predictive range of variation that can be used for the variance of death numbers in the Coronavirus pandemic that has recently affected the world. For this purpose, the effectiveness of the proposed estimators in this study is desired to be investigated using the COVID‐19 data of Russia. Estimating the variance in the total number of deaths from COVID‐19 by federal states of Russia provides us to see the differences in the total number of deaths between states. Mathematical models used in studies on the COVID‐19 pandemic are methods to arrive at results such as daily cases, cumulative cases, number of deaths, risk of death and incubation period. Estimators with sampling methods are one of the different mathematical models to describe COVID‐19. Zaman et al. propose families of exponential estimators that yield more efficient results than existing estimators for estimating the population mean of COVID‐19 risk. Shahzad et al. presented some new estimators for estimating the population variance of recovery time from COVID‐19 using L‐Moments and the calibration approach. It also provided new calibration estimators with L‐Moments for the variance estimation of the total recovery time depending on COVID‐19 data in the stratified random sampling method from January 22 to August 23, 2020, by Shahzad et al. In sampling methodology, the accuracy of an estimator at the estimation process can be raised with an influential work of the constant term and the auxiliary information. The variance estimators are the most commonly used estimators of the population parameter. Since the classical variance estimators are biased and complex, some studies are trying to develop variance estimators. In this regard, a class of unbiased variance estimators is obtained for various distributions by Chen et al. under ranked set sampling. Haq et al. developed combined and separate estimators for estimating population variance using supplementary information in another paper. In other variance assessing studies, Sing and Solanki and Lone and Tailor obtained estimators (under conditions) with less MSE than classical unbiased estimators. work developed estimators to estimate the finite population variance of the study variable with the help of the Hartley–Ross type method. Later, ln type estimators of the population variance were suggested by Cekim and Kadilar , for different sampling methods.

EVALUATION OF ESTIMATORS IN LITERATURE

Many authors have defined self‐developed estimators whose explanations are included in the usual population mean to improve the variance estimation. In the year 1999, Upadhyaya and Singh introduced the estimator with the help of the kurtosis . Then, Kadilar and Cingi defined the ratio estimators to , used the coefficient of variation as well as additionally . These estimators respectively follow as: Here, and are the population variances, while and are the unbiased estimators. Also, expressing and , it is similarly shown in for samples. The bias and MSE equations of these estimators are calculated as the following equations, respectively: and Here, , and Considering their estimators in (1) and biased in (2), Hartley–Ross type estimators are proposed by Kadilar and Cekim Here instead of in (2) and . They obtained the variance equation of , in (5), as where positive integers and ,

SUGGESTED ESTIMATORS

Yadav et al., taking into account the variance estimator proposed by using the correlation coefficient and quartiles, these values are replaced by different population parameter values, and the following estimator family is obtained: where is a constant chosen so that the MSE of reaches its minimum value. Also, expressing , and , it is similarly shown in for samples. To get the bias of the estimator in (6), 's term are defined as such that The biases for these estimators are obtained as shown: where , , By the definition of Hartley–Ross type estimators, the amount of bias must be subtracted from the considered estimator to compute an unbiased estimator. We remove Equation (7) from Equation (6) and suggest the following estimator to derive an unbiased estimator. Similarly, the sample parameter is used instead of in (8). To get the variance of the estimator in (8), 's term are defined as The variance equation is calculated to the estimator according to the necessary operations as: where We assumed that can be extended into the operations performed to reach the variance equality in (9). The optimum value of is obtained as to reach the minimum variance value. When the optimum value of is substituted in Equation (9), the minimum variance is obtained as

EFFICIENCY COMPARISONS OF MENTIONED ESTIMATORS

This section shows that the proposed Hartley–Ross type unbiased estimator class is the best estimator under certain conditions. By comparing the variance or MSE values of the estimators, it can be found under which conditions they are the best. The following conditions are obtained by comparing the class of suggested estimators with the mentioned estimators in this study. First condition: if Second condition: if These conditions can be reproduced by comparing different estimators in the literature with the proposed estimators.

DATA DESCRIPTION AND PRACTICE

Data

Since COVID‐19 came to Russia, new cases, recoveries and deaths are published daily for Russia's assessment of the COVID‐19 situation by the federal region. Confirmed data used in this study were retrieved from Reference 17 in September 2021. Sampling methods use auxiliary variable information to obtain more efficient estimators. Therefore, the total number of COVID‐19 deaths and daily new cases as the dependent variable () and the auxiliary variable (), respectively, are considered in the application. Descriptive statistics of the federal region in Russia are given in Table 1.
TABLE 1

Data statistics

Y=1337.02 β2(x)=67.08 N=85
X=99.59 β2(y)=32.76 n=20
Cx=3.25 Sy=2566.00 ρ=0.90
Cy=1.92 Sx=324.05
Data statistics Figure 1 shows the number of daily new cases and total deaths for COVID‐19 in the regions. COVID‐19 new cases and cumulative deaths are highest in Moscow. Then, the areas with the highest two variables are Saint Petersburg, Moscow Oblast and Rostov Oblast, respectively. Among 85 regions, Jewish Autonomous Oblast, Nenets Autonomous Okrug and Chukotka Autonomous Okrug have the smallest new cases value. With a cumulative death value of 4, the Nenets Autonomous Okrug has the lowest among other states.
FIGURE 1

The number of new cases and cumulative deaths in COVID‐19 for the federal states of Russia

The number of new cases and cumulative deaths in COVID‐19 for the federal states of Russia

Emprical study

This section aims to find the best estimator of the variance value of COVID‐19 deaths in Russia among the estimators mentioned in the manuscript. When the correlation coefficient between COVID‐19 cumulative new case and death values is evaluated, these two variables show a high correlation of 0.90. In other words, the COVID‐19 new case and death values are closely related. Predicting COVID‐19 cumulative death can use information about daily new cases. In this context, a new family of predictors has been proposed to estimate the COVID‐19 total mortality variance with this auxiliary feature. With the help of these proposed estimators, the MSE (i.e., variance as it is an unbiased estimator) estimate of the cumulative death variance of COVID‐19 is calculated theoretically and numerically. Using Equations (3), (5), and (11), the variance and MSE values of the estimators for this data set are computed in Table 2.
TABLE 2

The variance and MSE values of the suggested and existing estimators

EstimatorsVarEstimatorsVarEstimatorsMSE
sHR12 1.15216E+14 sKC12 1.25295E+14 s12 1.69565E+14
sHR22 1.15216E+14 sKC22 1.25448E+14 s22 1.69598E+14
sHR32 1.15216E+14 sKC32 1.24811E+14 s32 1.69463E+14
sHR42 1.15216E+14 sKC42 1.25449E+14 s42 1.69598E+14
sHR52 1.15216E+14 sKC52 1.25448E+14 s52 1.69595E+14
sHR62 1.15216E+14 sKC62 1.25294E+14 s62 1.69525E+14
sHR72 1.15216E+14 sKC72 1.25449E+14 s72 1.69587E+14
sHR82 1.15216E+14 sKC82 1.25440E+14 s82 1.69597E+14
sHR92 1.15216E+14 sKC92 1.25285E+14 s92 1.69568E+14
The variance and MSE values of the suggested and existing estimators In Table 2 results, the variance values of the proposed estimators are smaller than both the variance and MSE values of the existing estimators in the paper. In addition, for the COVID‐19 data used in the study, it is seen from Table 2 that the best estimator with the smallest variance value is all suggested estimators among the mentioned estimators. The numerical results of the calculations in Table 2 are presented differently in Figure 2. Also, it is seen from Figure 2 that the proposed estimator has better values.
FIGURE 2

The variance and MSE values of the considered estimators

The variance and MSE values of the considered estimators

CONCLUSION

In this study, the unbiased estimators class is proposed for variance, the most commonly used indicator of population variation. The most preferred Hartley–Ross method is used to obtain these estimators' unbiased estimators. It is supported by a real data study that the proposed estimator is the best estimator under certain conditions. The variance of the COVID‐19 death numbers, which includes 85 federal areas of Russia, is examined in the real data study. The new case numbers are considered an auxiliary variable to explore the change in the number of deaths. In line with the obtained results, it is seen that the variance estimator that best predicts the change in the number of COVID‐19 deaths in Russia is the estimators suggested in the study. In future work, the class of proposed estimators can be diversified by substituting different parameters for the population parameters in . By using these proposed estimators, it will be possible to increase the variance estimates in future studies for the number of COVID‐19 deaths in different countries and different periods.

CONFLICT OF INTEREST

The author declares that there are no conflicts of interest regarding the publication of this article.
  3 in total

1.  Analysis and Estimation of COVID-19 Spreading in Russia Based on ARIMA Model.

Authors:  Lanlan Fang; Dingjian Wang; Guixia Pan
Journal:  SN Compr Clin Med       Date:  2020-10-09

2.  Climatic factors influence the spread of COVID-19 in Russia.

Authors:  Malay Pramanik; Parmeshwar Udmale; Praffulit Bisht; Koushik Chowdhury; Sylvia Szabo; Indrajit Pal
Journal:  Int J Environ Health Res       Date:  2020-07-16       Impact factor: 3.411

3.  Modified unbiased estimators for population variance: An application for COVID-19 deaths in Russia.

Authors:  Hatice Oncel Cekim
Journal:  Concurr Comput       Date:  2022-07-13       Impact factor: 1.831

  3 in total
  1 in total

1.  Modified unbiased estimators for population variance: An application for COVID-19 deaths in Russia.

Authors:  Hatice Oncel Cekim
Journal:  Concurr Comput       Date:  2022-07-13       Impact factor: 1.831

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.