Literature DB >> 34855840

Vague data analysis using neutrosophic Jarque-Bera test.

Muhammad Aslam1, Rehan Ahmad Khan Sherwani2, Muhammad Saleem3.   

Abstract

In decision-making problems, the researchers' application of parametric tests is the first choice due to their wide applicability, reliability, and validity. The common parametric tests require the validation of the normality assumption even for large sample sizes in some cases. Jarque-Bera test is among one of the methods available in the literature used to serve the purpose. One of the Jarque-Bera test restrictions is the computational limitations available only for the data in exact form. The operational procedure of the test is helpless for the interval-valued data. The interval-valued data generally occurs in situations under fuzzy logic or indeterminate state of the outcome variable and is often called neutrosophic form. The present research modifies the existing statistic of the Jarque-Bera test for the interval-valued data. The modified design and operational procedure of the newly proposed Jarque-Bera test will be useful to assess the normality of a data set under the neutrosophic environment. The proposed neutrosophic Jarque-Bera test is applied and compared with its existing form with the help of a numerical example of real gold mines data generated under the fuzzy environment. The study's findings suggested that the proposed test is effective, informative, and suitable to be applied in indeterminacy compared to the existing Jarque-Bera test.

Entities:  

Mesh:

Year:  2021        PMID: 34855840      PMCID: PMC8638977          DOI: 10.1371/journal.pone.0260689

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


1 Introduction

The standard statistical tests from the parametric domain play a vital role in decision-making problems and are popular in social sciences [1]. The outcomes of the tests are considered reliable and valid for the population under investigation. These parametric tests help understand the research problems for better decision-making, prediction, and estimation purposes. The fruits of the tests are only juicy when the standard assumptions under the parametric tests validate. One of the standard assumptions of these tests is the validation of the normality assumption. The analysis and recommendations without checking the normality of the data mislead the decision-makers. Several tests have been proposed to assess the normality of data. Jarque-Bera (JB) test is famous goodness of fit test used to assess the distributional structure of data. The validation of the normality assumption under the JB test relies on the principle of matched skewness and kurtosis of the sample data with the normal distribution. The test is applied for testing the null hypothesis that there is no significant difference between the data in hand and the normal distribution versus the alternative hypothesis that a significant difference exists. Several authors applied this test in various fields. [2] discussed the power of the JB test. [3] presented the modification of the JB test. [4] discuss the power of various statistical normality tests. [5] worked on the modification of the JB test for the multivariate data. [5] applied the JB test for the face recognition problem. More details about the test and other analyses can be seen in [2, 6–11]. One of the JB test restrictions is the computational limitation available only for the data in exact form. When the observations in the data are fuzzy, the existing JB test under classical statistics cannot be applied for testing the normality of the data. The data based on the fuzzy logic are often in the interval-valued form [12, 13]. An extension of the fuzzy logic and interval-based approach is called the neutrosophic logic [14]. The neutrosophic logic can provide information about the measure of indeterminacy. [15] introduced the neutrosophic logic. The other forms of fuzzy sets are singular valued fuzzy sets [16], hesitant fuzzy sets [17], Zadeh fuzzy sets [18], intuitionistic fuzzy sets [19] etc. The applications of the neutrosophic logic can be seen in [20-36]. Based on neutrosophic logic, neutrosophic statistics were introduced by [37]. Neutrosophic statistics is the generalization of classical statistics applied when the data is measured from a complex or indeterminate environment. References [38, 39] provided the methods to analyze the data having neutrosophic numbers. Reference [40] introduced the area of neutrosophic statistical quality control. References [41, 42] introduced tests of normality under neutrosophic statistics. The details about the neutrosophic statistics can be seen in [43, 44]. [45] applied the JB test using the fuzzy approach in forecasting solar radiation. Reference [46] applied this test for prediction stock closing prices. [47] preened a novel distance measure method and applied it in gold mines data. For more details, the reader may read [48-60]. Motivating from the computational limitations of the existing JB test for the exact form data, we proposed a modified version of the present JB test for the fuzzy or interval-valued data. The proposed JB test is a generalized form of the existing JB test from classical statistics as it possesses the ability to deal with both exact and fuzzy forms data sets. Gold mines data is one of the data sets like water level, temperature, stock exchange, melting points, etc., that may possess the indeterminate or neutrosophic form. We will present the application of the proposed test with the help of gold mines data taken from [47]. The efficiency of the proposed JB test will be compared with the existing test. The use of the developed JB test will be beneficial in situations where the observations under a problem are not certain, fuzzy, indeterminate, interval-valued, or in neutrosophic form.

2 Preliminary

Let zN = aN+bNIN; INϵ[IL, IU] be a neutrosophic random variable with determinate part aN and indeterminate part bNIN. Note here that the neutrosophic random variable becomes the traditional random variable if IL = 0. Suppose that nNϵ[nL, nU] be neutrosophic sample size. By following [38], the neutrosophic average for zNϵ[zL, zU] is given as where and The neutrosophic difference between zN and is given as The neutrosophic sum of square (NSS) is given by The neutrosophic measure of skewness k3Nϵ[k3L, k3U] is given as The neutrosophic measure of kurtosis k4Nϵ[k4L, k4U] is given as Note here that SNϵ[SL, SU] presents the neutrosophic standard deviation and defined as follows

3 Jarque–Bera test under neutrosophic statistics

The JB test is used to confirm the normality of a data set before applying the famous standard statistical tests like t-test, z-test, or F-test. The test is based on the null hypothesis there is no difference between the data under study and the normal distribution versus the alternative hypothesis that difference exists. The test statistic of the JB test is a function of skewness and kurtosis. Suppose S, K and n are the sample skewness, kurtosis and the sample size for a data set then the statistic used for JB test under classical statistics is defined as: The test-statistic will be useful when the data are in exact form and helpless in the case of interval-valued data. We modify the JB-statistic defined in (7) for the interval-valued data. Now, the modified JB test under neutrosophic environment used the null hypothesis H0N that the neutrosophic data is the same as the neutrosophic normal distribution versus the alternative hypothesis H1N that the neutrosophic data is different from the neutrosophic normal distribution. The operational procedure of the proposed JB test under neutrosophic statistics is stated in the following steps: Step-1: Select a neutrosophic random sample of size nNϵ[nL, nU]. Compute the averages of the determined part ai(i = 1,2,…,nL) and indeterminate part bi(i = 1,2,…,nU) as follows Step-2: The neutrosophic average of a neutrosophic random variable is calculated as Step-3: The difference between zN and will be computed as Step-4: Compute the sum of square (SS) as follows Step-5: Compute k3Nϵ[k3L, k3U] and k4Nϵ[k4L, k4U]. Step-6: Compute JB statistic under neutrosophic statistic JBNϵ[JBL, JBU] using the following formula The statistic JBN proposed in (12) follows asymptotically to a Chi-square distribution with two degrees of freedom. The normal distribution has a skewness zero and kurtosis three indicates that for a normal distribution, the value of JBN is zero and any excess value of JBN from zero will indicate the deviation from normality. Step-7: choose the tabulated value from the Chi-square table and accept H0N if JBNϵ[JBL, JBU] less than the tabulated value at the level of significance α.

4 Application in cleaner production data

This section will present the computational aspects of the proposed methodology of the newly developed JB test under a neutrosophic environment. The application of the proposed JBN test is given with the help of cleaner production data from the gold mines. [47] discussed the cleaner production data for gold mines based on experts’ evaluation evidence under fuzzy theory. The availability of the gold mines data under fuzzy logic motivates us to use the data for the application purposes for the present research. According to [47], the decision-maker is interested in selecting a suitable center from the three centers C1, C2 and C3 on the basis of five characteristics of gold mines data. [47] presented a comprehensive way to select the best center based on their decision criteria but did not perform the normality test before using the methods. To test the data normality, we will use only the characteristic management level C1. According to [47], "it indicates the production process and equipment level, which contains the mining technology and production equipment. The data of gold mines of the characteristics G1, G2 and G3 of center C1 is selected from [47] and reported in Table 1. It can be seen that the data is in the indeterminacy interval; therefore, we will apply the proposed test to check the normality of the data first.
Table 1

The gold mines data of characteristic C1.

G1G2G3
aibiaibiaibi
0.320.430.10.180.220.4
0.090.150.130.230.120.1
0.220.310.290.320.280.39
0.130.440.080.50.230.4
The proposed test using the real data for three centers G1, G2 and G3, respectively is implemented as follows Step-1: Select a neutrosophic random sample of size nNϵ[4,4]. The averages of the determined part ai(i = 1,2,…,nL) and indeterminate part bi(i = 1,2,…,nU) are: and and and The statistics indicate the average performance of the three methods G1, G2 and G3 laid down by the experts with respective average indeterminacy levels for the selection of the gold mines center, e.g., the average performance of the cleaner production gold mines for the G1 method is 0.19 with a 0.3325 average uncertainty level. Step-2: The neutrosophic averages of neutrosophic random variables are given as ; INϵ[0,0.05] ; INϵ[0,0.05] ; INϵ[0,0.05] Step-3: The difference between zN and for example for G1 is given by Step-4: Compute the sum of square (SS) for three centers are as follows , [0.0274,0.0792], [0.0134,0.1337] Step-5: The neutrosophic values of k3Nϵ[k3L, k3U] and k4Nϵ[k4L, k4U] for three centers are given as k3Nϵ<[0.3623, −0.0755], [1.000,0.0323], [−0.6113, −0.3565]> k4Nϵ<[−1.3797, −2.5762], [−0.7911, −2.6052], [−0.9277, −2.7399]> Step-6: The calculated values of statistic JBNϵ[JBL, JBU] are given as JBN<[0.4047,1.1099], [0.7711,1.1319], [0.3926,1.3359]> The value of the proposed JBN test statistic is not much far away from zero, indicating that the gold mines data follow a normal probability distribution with an indeterminacy level. The same can be verified by using the Chi-square distribution table in Step-7. Step-7: The table value for the level of significance 0.05 is 7.815. We note that JBNϵ[JBL, JBU] are less than the tabulated value. Therefore, the null hypothesis is that data from three centers are not significantly different from the neutrosophic normal distribution, and this decision is the same as [47].

5 Comparative study

In this section, the performance of the proposed test will be compared with the JB test under classical statistics. The proposed JBNϵ[JBL, JBU] defined in Eq (12) is the extension of the existing JB test presented in Eq (7). The proposed test will be reduced to the JB test under classical statistic if JBN = JBL = 0. The neutrosophic form of the proposed JBNϵ[JBL, JBU] test for centers G1, G2 and G3 along with the measures of indeterminacy are shown in Table 2.
Table 2

The measures of indeterminacy associated with the test.

CentersJBNϵ[JBL, JBU]Measure of IndeterminacyNeutrosophic formExisting JB test
G1[0.4047,1.1099]INϵ[0,0.6353]JBN = 0.4047+1.1099IN0.4047
G2[0.7711,1.1319]INϵ[0,0.3187]JBN = 0.7711+1.1319IN0.7711
G3[0.3926,1.3359]INϵ[0,0.7061]JBN = 0.3926+1.3359IN0.3926
From Table 2, we note that the measure of indeterminacy is increased if the gap between JBNϵ[JBL, JBU] is increased. We also note that the proposed test provides the measures of indeterminacy, while the existing JB test under classical statistics cannot provide this kind of information. For example, when the level of significance is 5%, according to the proposed JB test under neutrosophic statistics, the probability that the null hypothesis is accepted is 0.95, and the null hypothesis is rejected with the probability of 0.05. Other than these probabilities, the chance that the decision-makers are uncertain about the acceptance or rejection of the null hypothesis is 0.6353. We note that the sum of the probabilities is larger for the proposed test, and this theory is the same as in [37]. The proposed test can be compared with the existing JB in terms of sensitivity. From Table 2, it can be seen that values of the classical test (determined part) fluctuate much as compared to the indeterminate part. For example, for centers G2 and G3, the values of the existing JB test moves from 0.4047 to 0.7711 when measure of indeterminacy changes from 0.6353 to 0.3187. On other hand, the values of the indeterminate part of the proposed JB test moves from 1.1099I to 1.1319I when measure of indeterminacy changes from 0.6353 to 0.3187. From the study, it can be seen that the proposed test is less sensitive than the existing JB test. We also note that the proposed test provides the results in indeterminacy intervals and makes it suitable and effective to be applied in the indeterminate environment. This theory is the same as in [38, 39].

6 Concluding remarks

The paper extends the concept of the Jarque–Bera test from classical statistics to neutrosophic statistics. The classical JB test is limited to perform on exact values data. In contrast, the proposed modified form of the JB statistic can be used to both exact and interval-valued data. The design and operational procedure for the newly developed JB test are presented under the fuzzy and neutrosophic logic. The application of the proposed JB test is carried on the real data set from the cleaner production of gold mines generated in a fuzzy environment. Moreover, a comparison of the proposed neutrosophic JB test is made with the existing JB test to assess the performance of the two tests. The findings of the numerical example suggested that the proposed JB test is effective, informative, and suitable to be applied under indeterminacy compared to the existing JB test. For generalized and better analysis of the data, the proposed test is recommended when the data is obtained from indeterminate and complex systems. The proposed methodology of the JB test can be extended to test the multivariate normality under indeterminacy. The proposed test for big data can be extended for future research. The development of new software to perform the proposed test is a fruitful area for future research.
  5 in total

1.  Cosine similarity measures of bipolar neutrosophic set for diagnosis of bipolar disorder diseases.

Authors:  Mohamed Abdel-Basset; Mai Mohamed; Mohamed Elhoseny; Le Hoang Son; Francisco Chiclana; Abd El-Nasser H Zaied
Journal:  Artif Intell Med       Date:  2019-10-05       Impact factor: 5.326

2.  Mineralogical and geochemical characterization of waste rocks from a gold mine in northeastern Thailand: application for environmental impact protection.

Authors:  Thitiphan Assawincharoenkij; Christoph Hauzenberger; Karl Ettinger; Chakkaphan Sutthirat
Journal:  Environ Sci Pollut Res Int       Date:  2017-11-20       Impact factor: 4.223

3.  New Diagnosis Test under the Neutrosophic Statistics: An Application to Diabetic Patients.

Authors:  Muhammad Aslam; Osama H Arif; Rehan Ahmad Khan Sherwani
Journal:  Biomed Res Int       Date:  2020-01-22       Impact factor: 3.411

4.  T-spherical fuzzy power aggregation operators and their applications in multi-attribute decision making.

Authors:  Harish Garg; Kifayat Ullah; Tahir Mahmood; Nasruddin Hassan; Naeem Jan
Journal:  J Ambient Intell Humaniz Comput       Date:  2021-01-22

5.  Introducing Kolmogorov-Smirnov Tests under Uncertainty: An Application to Radioactive Data.

Authors:  Muhammad Aslam
Journal:  ACS Omega       Date:  2019-12-31
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.