Sayed Asaduzzaman1,2, Fuyad Al Masud1, Touhid Bhuiyan1, Kawsar Ahmed2, Bikash Kumar Paul2, S A M Matiur Rahman1.
Abstract
This article presents a dataset on Type-1 Diabetes together with detailed data analysis results. Type-1 Diabetes is now a serious disease in Bangladesh. Data on 306 people in total (case group: 152; control group: 154) were collected in Dhaka using a specific questionnaire. The questionnaire covers 22 factors that were extracted from research studies. The association and significance level of the factors were determined using data mining and statistical approaches and are shown in the tables of this article. Moreover, parametric probabilities and a decision tree are provided to show the effectiveness of the data. The data can be used for future work such as risk prediction and specific functioning on Type-1 Diabetes.
Keywords: Analysis of data; Bangladesh perspective; Data of significant factors; Dataset on Type-1 Diabetes
Year: 2018 PMID: 30666315 PMCID: PMC6205358 DOI: 10.1016/j.dib.2018.10.018
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Data table on significance of factors according to Info Gain, Gain Ratio, Gini Index and χ²-test.

| No. | Factor | Info Gain | Gain Ratio | Gini Index | χ² |
|---|---|---|---|---|---|
| 1 | HbA1c | 0.520 | 0.522 | 0.284 | 111.447 |
| 2 | Hypoglycemia | 0.464 | 0.506 | 0.253 | 103.342 |
| 3 | Age | 0.286 | 0.154 | 0.179 | 92.146 |
| 4 | Pancreatic disease affected in child | 0.321 | 0.386 | 0.167 | 77.000 |
| 5 | Area of Residence | 0.210 | 0.136 | 0.136 | 45.003 |
| 6 | Education of Mother | 0.123 | 0.129 | 0.082 | 18.491 |
| 7 | Adequate Nutrition | 0.157 | 0.187 | 0.100 | 16.361 |
| 8 | Autoantibodies | 0.243 | 0.334 | 0.129 | 15.961 |
| 9 | Sex | 0.061 | 0.061 | 0.041 | 11.843 |
| 10 | Family History affected in Type-1 Diabetes | 0.031 | 0.035 | 0.021 | 9.081 |
| 11 | Family History affected in Type-2 Diabetes | 0.019 | 0.019 | 0.013 | 4.434 |
| 12 | Standardized growth rate infancy | 0.054 | 0.074 | 0.033 | 2.741 |
| 13 | Standardized birth weight | 0.096 | 0.122 | 0.052 | 0.517 |
| 14 | Impaired glucose metabolism | 0.001 | 0.001 | 0.000 | 0.226 |
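
The four scores in the table above can all be computed from a contingency table of factor level against group (case or control). The following is a minimal sketch using the textbook definitions, not necessarily the exact procedure or tool settings used for this dataset. The function name `rank_scores`, the reading of "Gini Index" as a decrease in Gini impurity, and the example counts are assumptions for illustration; only the group sizes (152 cases, 154 controls) come from the abstract.

```python
from math import log2


def entropy(counts):
    """Shannon entropy (in bits) of a list of counts."""
    total = sum(counts)
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)


def gini(counts):
    """Gini impurity of a list of counts."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)


def rank_scores(table):
    """Score one categorical factor against the class label.

    table: one row per factor level, one column per class,
           e.g. [[cases, controls], [cases, controls], ...].
    """
    n_classes = len(table[0])
    class_totals = [sum(row[j] for row in table) for j in range(n_classes)]
    row_totals = [sum(row) for row in table]
    n = sum(row_totals)
    rows = [(row, rt) for row, rt in zip(table, row_totals) if rt > 0]

    # Information gain: class entropy minus entropy after splitting on the factor.
    info_gain = entropy(class_totals) - sum(rt / n * entropy(row) for row, rt in rows)

    # Gain ratio: information gain normalised by the entropy of the split itself.
    split_info = entropy(row_totals)
    gain_ratio = info_gain / split_info if split_info > 0 else 0.0

    # Gini decrease: class impurity minus weighted impurity after the split (assumed reading).
    gini_decrease = gini(class_totals) - sum(rt / n * gini(row) for row, rt in rows)

    # Pearson chi-square statistic: sum of (observed - expected)^2 / expected.
    chi2 = 0.0
    for row, rt in rows:
        for observed, ct in zip(row, class_totals):
            expected = rt * ct / n
            if expected > 0:
                chi2 += (observed - expected) ** 2 / expected

    return {"info_gain": info_gain, "gain_ratio": gain_ratio,
            "gini_decrease": gini_decrease, "chi2": chi2}


# Hypothetical two-level factor; the per-level counts are invented,
# only the column totals (152 cases, 154 controls) match the abstract.
example = [[120, 30], [32, 124]]
for name, value in rank_scores(example).items():
    print(f"{name}: {value:.3f}")
```

A factor that separates the groups well scores high on all four measures, while a factor that is independent of the group scores close to zero on all of them.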
Data table on significance of factors according to Info Gain, Gain Ratio, Gini Index and χ²-test (family history in Type-1 Diabetes).

| Family member | Info Gain | Gain Ratio | Gini Index | χ² |
|---|---|---|---|---|
| Mother | 0.026 | 0.058 | 0.017 | 9.354 |
| Father's Heredity | 0.022 | 0.047 | 0.015 | 8.211 |
| Mother's Heredity | 0.006 | 0.012 | 0.004 | 2.309 |
| Father | 0.001 | 0.004 | 0.001 | 0.514 |
Data table on significance of factors according to Info Gain, Gain Ratio, Gini Index and χ²-test (family history in Type-2 Diabetes).

| Family member | Info Gain | Gain Ratio | Gini Index | χ² |
|---|---|---|---|---|
| Mother | 0.033 | 0.089 | 0.021 | 11.847 |
| Father's Heredity | 0.007 | 0.009 | 0.005 | 2.217 |
| Father | 0.003 | 0.005 | 0.002 | 1.027 |
| Mother's Heredity | 0.001 | 0.001 | 0.001 | 0.290 |
Data table on significance of factors according to Info Gain, Gain Ratio, Gini Index and χ²-test (different symptoms).

| Symptom | Info Gain | Gain Ratio | Gini Index | χ² |
|---|---|---|---|---|
| Frequent Urination | 0.668 | 0.681 | 0.364 | 129.684 |
| Increased thirst | 0.668 | 0.681 | 0.364 | 129.684 |
| Fatigue and Weakness | 0.573 | 0.597 | 0.314 | 118.539 |
| Unintended weight loss | 0.505 | 0.540 | 0.276 | 109.421 |
| Extreme Hunger | 0.445 | 0.490 | 0.242 | 100.303 |
Comparative result dataset of factors using different algorithms.
| HbA1c | Age |
| Hypoglycemia | Sex |
| pancreatic disease affected in child | Area of Residence |
| Age | HbA1c |
| Autoantibodies | Adequate Nutrition |
| Area of Residence | Standardized growth-rate in infancy |
| Adequate Nutrition | Autoantibodies |
| Education of Mother | Family History affected in Type 1 Diabetes |
| Standardized birth weight | Hypoglycemia |
| Sex | pancreatic disease affected in child |
| Standardized growth-rate in infancy | N/A |
| Family History affected in Type 1 Diabetes | N/A |
| Family History affected in Type 2 Diabetes | N/A |
| Impaired glucose metabolism | N/A |
Correlation data among factors using Apriori Algorithm.

| No. | Association rule |
|---|---|
| 1 | Standardized growth-rate in infancy (Middle quartiles pancreatic disease affected in child) |
| 2 | Autoantibodies pancreatic disease affected in child ==> Standardized birth weight Middle quartile |
| 3 | Adequate Nutrition (Yes)- Standardized growth-rate in infancy (Middle quartiles) ==> Standardized birth weight (Middle quartiles) |
| 4 | Pancreatic disease affected in child (No) [230] ==> Standardized birth weight (Middle quartiles) [217]; conf: 0.94, lift: 1.09, lev: 0.06 [18], conv: 2.25 |
| 5 | Adequate Nutrition (Yes) ==> Standardized birth weight (Middle quartiles) |
| 6 | Hypoglycemia (No) ==> Standardized birth weight (Middle quartiles) |
| 7 | Hypoglycemia (No) ==> pancreatic disease affected in child (No) |
| 8 | Standardized growth-rate in infancy (Middle quartiles) Autoantibodies (Yes) ==> Standardized birth weight (Middle quartiles) |
| 9 | Hypoglycemia ==> Autoantibodies |
| 10 | Standardized growth-rate in infancy (Middle quartiles) Impaired glucose metabolism==> Standardized birth weight (Middle quartiles) |
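
Rule 4 above is the only rule listed with its quality measures (confidence, lift, leverage). As a hedged illustration of how such measures are conventionally derived from counts, the sketch below uses the standard definitions. The antecedent count (230), the rule count (217) and the dataset size (306) are taken from this page; the consequent count is not reported here, so the value 264 is an assumed placeholder, and the function name `rule_metrics` is hypothetical.

```python
def rule_metrics(n_total, n_antecedent, n_consequent, n_both):
    """Standard association-rule measures for a rule A ==> B.

    n_total:      number of records in the dataset
    n_antecedent: records matching A
    n_consequent: records matching B
    n_both:       records matching both A and B
    """
    support = n_both / n_total
    confidence = n_both / n_antecedent
    # Lift: how much more often B occurs with A than expected under independence.
    lift = confidence / (n_consequent / n_total)
    # Leverage: difference between observed and expected joint support.
    leverage = support - (n_antecedent / n_total) * (n_consequent / n_total)
    return {"support": support, "confidence": confidence,
            "lift": lift, "leverage": leverage}


# 306, 230 and 217 come from the text; 264 is an ASSUMED consequent count.
metrics = rule_metrics(n_total=306, n_antecedent=230, n_consequent=264, n_both=217)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Only the confidence is fully determined by the counts given on this page; lift and leverage depend on the assumed consequent count and change if that value differs.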
P value and confidence interval of risk factors in Type-1 Diabetes dataset.

| Factor | P value | Confidence interval | |
|---|---|---|---|
| Age | 0.000 | 0.2633 | 0.4884 |
| Less than 5 | |||
| Less than 11 | |||
| Less than 15 | |||
| Greater than 15 | |||
| Sex | 0.000 | 0.1111 | 0.2235 |
| Male | |||
| Female | |||
| Area of Residence | 0.000 | 0.1489 | 0.3162 |
| Rural | |||
| Urban | |||
| Suburban | |||
| Height | 0.665 | 0.245 | 0.0384 |
| Weight | 0.996 | 1.88 | 0.1.89 |
| BMI | 0.996 | 0.70 | 0.70 |
| Adequate Nutrition | 0.008 | 0.0173 | 0.1163 |
| Yes | |||
| No | |||
| Education of Mother | 0.999 | 0.0544 | 0.0544 |
| Yes | |||
| No | |||
| Standardized growth-rate infancy | 0.999 | 0.251 | 0.251 |
| Lowest quartile | |||
| Middle quartile | |||
| Highest quartile | |||
| Family History in Type-1 Diabetes | 0.000 | 0.4522 | 0.5550 |
| Father | |||
| Mother | |||
| Father's Heredity | |||
| Mother's Heredity | |||
| Family History in Type-2 Diabetes | 0.000 | 0.1864 | 0.2986 |
| Father | |||
| Mother | |||
| Father's Heredity | |||
| Mother's Heredity | |||
Significant Factors
Data for probabilities and effectiveness of factors in Type-1 Diabetes.

| No. | Factor | Level | Probability | Effectiveness |
|---|---|---|---|---|
| 1 | Age | Greater than 15 | 0.88 | High |
| | | Less than 15 | 0.42 | Moderate |
| | | Less than 11 | 0.2 | Low |
| | | Less than 5 | 0.18 | Very Low |
| 2 | HbA1c | Less than 7.5 | 0.21 | Low |
| | | Greater than 7.5 | 0.72 | High |
| 3 | Hypoglycemia | Yes | 0.69 | High |
| | | No | 0.27 | Low |
| 4 | Pancreatic diseases diagnosed in affected children | Yes | 0.5 | Moderate |
| | | No | 0.31 | Low |
| 5 | Area of Residence | Rural | 0.82 | High |
| | | Suburban | 0.65 | Moderate |
| | | Urban | 0.22 | Low |
| 6 | Adequate Nutrition | No | 0.86 | High |
| | | Yes | 0.36 | Low |
| 7 | Autoantibodies | No | 0.4 | Moderate |
| | | Yes | 0.38 | Moderate |
| 8 | Sex | Female | 0.65 | High |
| | | Male | 0.36 | Low |
| 9 | Family History Type-1 Diabetes | Yes | 0.68 | High |
| | | No | 0.41 | Low |
| 10 | Family History Type-2 Diabetes | Yes | 0.59 | High |
| | | No | 0.44 | Low |
| 11 | Standardized growth rate | Lowest | 0.96 | High |
| | | Highest | 0.72 | Moderate |
| | | Middle | 0.45 | Low |

Fig. 1. Data on 2-D view of probability distribution of the age with respect to affected group.
Fig. 2. 3-D visualization of the analyzed dataset and data distribution for BMI, height and weight.
Fig. 3. Visualization of parameters and their outcomes in the dataset.
Fig. 4. Decision tree among the factors of Type-1 Diabetes.

| Subject area | |
|---|---|
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Data source location | |
| Data accessibility | |