Literature DB >> 26858555

A new mathematical evaluation of smoking problem based of algebraic statistical method.

Maysaa J Mohammed1, Isamiddin S Rakhimov1, Mahendran Shitan2, Rabha W Ibrahim3, Nadia F Mohammed1.   

Abstract

Smoking problem is considered as one of the hot topics for many years. In spite of overpowering facts about the dangers, smoking is still a bad habit widely spread and socially accepted. Many people start smoking during their gymnasium period. The discovery of the dangers of smoking gave a warning sign of danger for individuals. There are different statistical methods used to analyze the dangers of smoking. In this study, we apply an algebraic statistical method to analyze and classify real data using Markov basis for the independent model on the contingency table. Results show that the Markov basis based classification is able to distinguish different date elements. Moreover, we check our proposed method via information theory by utilizing the Shannon formula to illustrate which one of these alternative tables is the best in term of independent.

Entities:  

Keywords:  Contingency tables; Entropy; Markov basis

Year:  2015        PMID: 26858555      PMCID: PMC4705293          DOI: 10.1016/j.sjbs.2015.08.015

Source DB:  PubMed          Journal:  Saudi J Biol Sci        ISSN: 1319-562X            Impact factor:   4.219


Introduction

US Department of Health and Human Services (2014) showed that smoking is responsible for roughly 440,000 deaths each year in the USA. Studies about the effects of smoking were done only on men in 1964 but later on women were included as well when some cigarette companies stated that smoking keeps girls and women thin. It can be noticed that health issues caused by smoking, no longer take into consideration, although most of people are familiar with the smoking consequences. Women who smoke significantly increase the danger of developing heart disease (the leading killer among women) and stroke. The danger increases with the number of cigarettes smoked and the length of time a woman has been smoking, but even people who smoke less than 5 cigarettes a day can have heart and blood vessel diseases. Although most of the women who die of heart disease are past menopause, smoking increases the danger additionally in younger women than in older women. Studies propose that smoking cigarettes increase the danger of heart disease even more among younger women who also take attractive birth control pills. Women who smoke, exclusively after going through menopause, have lower bone density (thinner bones). This means they have a higher danger of broken bones, counting hip fracture, than women who do not smoke. They could likewise be at higher danger of receiving rheumatoid arthritis and cataracts (clouding of the lenses of the eyes), as well as age-related macular degeneration, which can cause blindness. Tobacco habit can harm a woman’s reproductive health. Women who smoke are more likely to have trouble getting pregnant. Smokers incline to be younger at the start of menopause than non-smokers and may have more unpleasant symptoms while going through menopause. Nearly most of the people know how harmful it is being addictive smokers, however, few of them really recognize its risks. Some smokers convince themselves that they are social smokers or smoke only outdoors to be distinguished as non-smokers (Harris et al., 2002), and they can control smoking since smoking is not a habit for them. Stopping smoking is not that easy, yet being addictive does not mean a smoker does not have the ability to quit. The process of giving up smoking gets complicated, especially when people get older and older and the condition becomes sometimes irreversible. Some succeed from the early try while others give out more than one try; notwithstanding, some fail to quit smoking (Rollins et al., 2002). Moran et al. (2004) perceived that smokers start having a cigarette in occasional cases which means not daily and by the time they grow up, it gets irregular; then being involved in being motiveless to giving up; finally they reach a higher level (Moran et al., 2004). Due to several negative effects associated with this problem, many of the studies and research have emerged to study this problem in many respects, as in the above studies. This led to the attention of many researchers of the problem and the surrounding circumstances. In this paper, we impose the algebraic statistical method using Markov basis for the independent model to find alternative data models that simulate the original data models and maintain the same statistical and mathematical qualities of the original data. On the other hand, our method is to find alternative data tables depending on the original data by using Markov basis for independent model. Moreover, we choose the best table in term of independent that liaises the qualities and characteristics is the same of the original data and the value of information on this table is more than the original data by using (information theory) as a measure for the independent. Due to the importance of passive smoking and its impact on all groups of society, we have selected data related to clarify the relationship between sex and smoking status (see Table 1).
Table 1

Contingency table for relationship between sex and smoking status.

Smoking statusSex
ManWomanTotal
Non smoker10616
Light smoker8816
Heavy smoker4812
Total222244

Material and methods

In 2007, Hannelore (2007) used Breathing test model. The data disquiet the relations among smoking position and breathing test outcomes for employers (under 40 years old) in a certain industrial plant. His study can be extended by presenting a further variable, namely the age. Observed are not the only employers under 40 years, but likewise employers from the age group 40–59. There are no restraints on the rows and column totals, and a simple model is that the count in the (i, j) cell is an understanding of a Poisson variable with expectation μ. The consequential likelihood iswhere μ is the vector of the expectations. In 2013, Jolanta et al. (2013) imposed the relationship between Peripheral vascular and smoking, and that smoking contributes significantly to the increase of cardiovascular disease and may increase the risk of holding such kinds of diseases by using the prevalence ratio calculation for cross section study. The odds ratio (OR) is calculated as: In 2014, Suzan et al. (2014) found the relationship between the effect of smoking and disease, incontinence and the impact of smoking with the disease using hypothesis testing depending on the Fisher exact test. The study has shown that smoking has to do with the impact of a negative increase in case of this disease. The P-value calculated by formula of Fisher’s exact test as: In 2015, Anthony et al. (1999) studied that smoking during pregnancy has harmful effects and a large impact on the mother’s weight at birth and also has significant side effects in children which may cause a respiratory problem and this leads to the lungs not working well at birth. Thus, in their studies, they have selected a sample of mothers either directly during a hospital admission or within 24 h of birth. The sample was taken from 100 mothers of non-smokers and 100 mothers who smoked 1–10 cigarettes per day and 100 mothers who smoked an average of more than 10 cigarettes per day. The aim of the study is to investigate the effects of smoking on lung function in newborns by knowing whether smoking during pregnancy affects or has to do with the functions of the lung in newborns. Using test hypotheses in statistics depending on chi square test. The formula of chi square test is: where stands for observed frequencies, stands for expected frequencies, and i, runs from 1, 2, …, n where n is the number of the cells in the contingency table.

Proposed method

In this section, we illustrate our proposed method, which was based on Markov basis for the independent model. Step 1: Organize data as in Table 1, (Polit et al., 2013), thus we need a table to represent the relationship between smoking and sex. Step 2: We test the data for the independent model using hypothesis tests with the chi-square test (Minhaz, 2007) which are as: : suppose the relationship between smoking status and sex is independent. : The relationship between smoking status and sex is not independent. Degree of freedom , where r is the sum of rows and c is the sum of columns. Hence, Then, we use the formula of chi-square test as above in (4). Therefore, we use this method if the expected frequency is equal or more than 5 where the expected frequency . Moreover, if the expected frequency is less than 5, we use hypothesis test depending on loglinear model with odds ratio test (Carol, 2001). Hence, the general formula of the odds ratio is: , where and . According to this method, then, the hypothesis test as: : suppose the relationship between smoking and sex is independent if the odds ratio , then, . : If the , then, , hence the relationship between sex and smoking status is not independent. Step 3: According to Table 1, we construct Markov basis elements by usingwhere the integer matrix Z = Z ( Step 4: (see Satoshi et al., 2012, Mathias et al., 2009) such as: Step 5: In order to obtain alternative data, we add the moves in the fourth step to the original data in Table 1. Repeat this process with each alternate data and then stop, when contingency tables are equal. Fig. 2 shows all the construction of these tables.
Figure 2

All the elements of fiber with original data (CT construction).

Step 6: In order to investigate the original data in terms of the proposed method, we apply information theory by utilizing the Shannon formula (Hnizdo and Gilson, 2010)where the base (a) is a ten and the unit is hartley (Hnizdo and Gilson, 2010, Baez et al., 2011), . Satisfying . In addition, we consider that .

Results

The main objective of this study is to analyze the original data and assist researchers in finding alternative data bearing specifications and characteristics that are very close to the original data. Therefore, we started the selection of real data on smoking which was a choice of a sample consisting of 44 people, divided by gender and ability to smoke. Then, we test this sample, is it independent or not using the test hypotheses in chi-square test or loglinear model test which depends on the expected value, so if the expected value is more than or equal to 5 then we choose either one of these methods. If not, we choose the second one for loglinear model of statistical independence. Thus, here the expected value is more than 5. Therefore, we can test by any one of these methods. We see the result of and degree of freedom is 2, therefore , α = 0.05 (Minhaz, 2007). Because of , the relationship between sex and smoking is independent. After that, we installed fiber that has calculated the Markov basis elements. We add these items to the original data and the result we got 178 of alternative schedules of the original schedule start of . Consequently, we applied the Entropy theory of Shannon, to find which one of these alternative contingency tables (CT) is closest or better than the original data table in terms of independence (Tom, 2011). This study is the first one applying Markov basis for the independent model with Shannon information theory. Hence, we hope that it will help all the researchers to find the best alternative data table in terms of independent of their studying to work with it instead of the basic data.

Discussion

In this study, we applied the Entropy theory by Shannon for all elements of the fiber on the Markov basis (MB). We find that, for a random sample of MB, say {A2, …, A10}. We consider, the maximal entropy is in A7 (see Table 3).
Table 3

Contingency table A7 for the best relationship between sex and smoking status.

Smoking statusSex
ManWomanTotal
Non smoker9716
Light smoker8816
Heavy smoker5712
Total222244
Therefore, the seventh table is the alternative better table with the characteristics and qualities of information and assembly from the original table. Finally, we have found that there is more than one alternative, as shown in the Fig. 1. The information gathering where more than the original table or informational specifications are considered better than the original table.
Figure 1

The entropy values for alternative tables ( with original table .

Conclusion

We conclude that our method, which is based on algebraic statistical using Markov basis for the independence model is effective for analyzing and classifying the original data by using the elements of Markov basis with basic moves and fix fiber. Therefore, we chose the real data represented by the relationship between smoking and sex see Table 1. Consequently, we tested it is independent by using chi-square test. Then, we added the elements of Markov basis to get the elements of the fiber, that means we get the alternative tables A2, …, A10 . Moreover, we applied the Entropy theory of Shannon to find which one of these alternative data is closest or better than the original data table in terms of Independence see Table 2. Hence, this method is the first one apply a Markov basis for the independent model to the Entropy theory that helps researchers get the best result which is better than the original data table in terms of independent. We found that A7 is the best alternative table. Note that there are various approaches for evaluation based on optimization, differential equations and other statistical methods (see Qureshi et al., 2015, Kayani et al., 2014, Noor et al., 2014, Tabassum et al., 2014).
Table 2

Entropy values for some alternative tables ( with original matrix .

Table(CT)Entropy
A10.7627409102
A20.7551117264
A30.765254924
A40.7527180692
A50.7665592067
A60.7475626142
A70.7715469493
A80.7421311053
A90.7564160091
A100.7370474462
  6 in total

1.  Social smoking among US college students.

Authors:  Susan Moran; Henry Wechsler; Nancy A Rigotti
Journal:  Pediatrics       Date:  2004-10       Impact factor: 7.124

2.  Ethnobotanical uses of medicinal plants for respiratory disorders among the inhabitants of Gallies - Abbottabad, Northern Pakistan.

Authors:  Sadaf Kayani; Mushtaq Ahmad; Muhammad Zafar; Shazia Sultana; Muhammad Pukhtoon Zada Khan; Muhammad Aqeel Ashraf; Javid Hussain; Ghulam Yaseen
Journal:  J Ethnopharmacol       Date:  2014-08-19       Impact factor: 4.360

3.  Effects of smoking in pregnancy on neonatal lung function.

Authors:  A D Milner; M J Marsh; D M Ingram; G F Fox; C Susiva
Journal:  Arch Dis Child Fetal Neonatal Ed       Date:  1999-01       Impact factor: 5.747

4.  Estimation of Anticipated Performance Index and Air Pollution Tolerance Index and of vegetation around the marble industrial areas of Potwar region: bioindicators of plant pollution response.

Authors:  Mehwish Jamil Noor; Shazia Sultana; Sonia Fatima; Mushtaq Ahmad; Muhammad Zafar; Maliha Sarfraz; Masour A Balkhyour; Sher Zaman Safi; Muhammad Aqeel Ashraf
Journal:  Environ Geochem Health       Date:  2014-12-11       Impact factor: 4.609

5.  Chemodynamics of methyl parathion and ethyl parathion: adsorption models for sustainable agriculture.

Authors:  Noshabah Tabassum; Uzaira Rafique; Khaled S Balkhair; Muhammad Aqeel Ashraf
Journal:  Biomed Res Int       Date:  2014-02-06       Impact factor: 3.411

6.  Thermodynamic and Differential Entropy under a Change of Variables.

Authors:  Vladimir Hnizdo; Michael K Gilson
Journal:  Entropy (Basel)       Date:  2010-03-16       Impact factor: 2.524

  6 in total
  1 in total

1.  Gender differences in mortality among ST elevation myocardial infarction patients in Malaysia from 2006 to 2013.

Authors:  Nurliyana Juhan; Yong Z Zubairi; A S Zuhdi; Zarina Mohd Khalid; Wan Azman Wan
Journal:  Ann Saudi Med       Date:  2018 Jan-Feb       Impact factor: 1.526

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.