| Literature DB >> 27909038 |
Azra Ramezankhani1, Esmaeil Hadavandi2,3, Omid Pournik4, Jamal Shahrabi2, Fereidoun Azizi5, Farzad Hadaegh1.
Abstract
OBJECTIVE: The current study was undertaken for use of the decision tree (DT) method for development of different prediction models for incidence of type 2 diabetes (T2D) and for exploring interactions between predictor variables in those models.Entities:
Keywords: Data Mining; Decision tree; Diabetes; Interaction; Prediction
Mesh:
Substances:
Year: 2016 PMID: 27909038 PMCID: PMC5168628 DOI: 10.1136/bmjopen-2016-013336
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Flow diagram for the selection of study participants in the Tehran Lipid and Glucose Study. 2h-PCPG, 2-hour postchallenge plasma glucose; FPG, fasting plasma glucose.
Figure 2Generation of training and validation data set diagram.
Baseline characteristics of men (TLGS 1999–2012)
| Variables | Diabetes | No diabetes | Total population | p Value* | Percentage of missing data† |
|---|---|---|---|---|---|
| Age (years) | 48.5 (13.3) | 41.1 (13.5) | 41.8 (13.7) | <0.001 | 0 |
| 2-hour postchallenge plasma glucose(mmol/L) | 7.3 (2.0) | 5.5 (1.5) | 5.7 (1.7) | <0.001 | 0 |
| Fasting plasma glucose (mmol/L) | 5.6 (0.6) | 5.0 (0.5) | 5.0 (0.5) | <0.001 | 0 |
| Wrist circumference (cm) | 18 (0.9) | 17.6 (0.9) | 17.6 (0.9) | <0.001 | 1.2% |
| Waist circumferences (cm) | 95.1 (11.0) | 88.3 (10.7) | 89.0 (10.9) | <0.001 | 1.2% |
| BMI (kg/m2) | 27.9 (4.1) | 25.5 (3.8) | 25.8 (3.9) | <0.001 | 1.2% |
| Waist-to-hip ratio | 0.95 (0.1) | 0.91 (0.1) | 0.90 (0.1) | <0.001 | 1.2% |
| Waist-to-height ratio | 0.56 (0.1) | 0.51 (0.1) | 0.50 (0.1) | <0.001 | 1.2% |
| Triglyceride (mmol/L) | 2.7 (2.1) | 1.9 (1.3) | 2.1 (1.4) | <0.001 | 0 |
| Total cholesterol (mmol/L) | 5.6 (1.1) | 5.2 (1.1) | 5.2 (1.1) | <0.001 | 0 |
| HDL cholesterol (mmol/L) | 0.9 (0.2) | 0.9 (0.2) | 0.9 (0.2) | 0.196 | 0 |
| Triglyceride-to-HDL ratio | 5.1 (4.3) | 7.4 (8.4) | 5.3 (4.9) | <0.001 | 0 |
| Cholesterol-to-HDL ratio | 6.1 (1.9) | 5.5 (1.6) | 5.5 (1.7) | <0.001 | 0.1% |
| Estimated glomerular filtration rate (mL/min/1.73 m2) | 63.2 (11.2) | 67.6 (11.0) | 110.9 (67.1) | <0.001 | 0 |
| Systolic blood pressure (mm Hg) | 126.2 (17.5) | 117.4 (15.7) | 118.3 (16.1) | <0.001 | 1.4% |
| Diastolic blood pressure (mm Hg) | 81.6 (11.6) | 76.4 (10.3) | 76.9 (10.5) | <0.001 | 1.4% |
| Pulse pressure (mm Hg) | 44.6 (12.3) | 40.9 (11.8) | 41.3 (11.9) | <0.001 | 1.4% |
| Mean arterial blood pressure (mm Hg) | 96.3 (12.4) | 90.0 (10.9) | 90.7 (11.3) | <0.001 | 1.5% |
| Total length of stay in the city (years) | 40.7 (14.1) | 34.3 (12.6) | 35.0 (12.9) | <0.001 | 0.3% |
| Goitre size | |||||
| Grade 0 | 226 (74.8) | 1872 (72.5) | 2098 (72.7) | 0.511 | 0 |
| Grade 1 | 55 (18.2) | 483 (18.7) | 538 (18.6) | ||
| Grade 2 | 21 (7.0) | 228 (8.8) | 249 (8.6) | ||
| History of hospitalisation until now | 165 (54.6) | 1260 (48.7) | 1425 (49.4) | 0.059 | 0 |
| Family history of diabetes in first-degree relatives | 108 (36.1) | 611 (23.9) | 719 (25.1) | <0.001 | 0.9% |
| Family history of premature cardiovascular diseases in male relatives | 24 (7.9) | 212 (8.2) | 236 (8.2) | 0.876 | 0.9% |
| Family history of premature cardiovascular diseases in female relatives | 34 (11.3) | 179 (6.9) | 213 (7.4) | 0.006 | 0.9% |
| Former cigarette smoking | 55 (18.2) | 385 (14.9) | 440 (15.2) | 0.220 | 0 |
| Current cigarette smoking | 76 (25.1) | 652 (25.2) | 728 (25.2) | 0.989 | 0 |
| Exposed to secondhand smoke at home or at work | 83 (27.5) | 827 (32.0) | 910 (31.5) | 0.1 | 0 |
| Education | 0 | ||||
| Level 1 (illiterate) | 13 (4.3) | 80 (3.1) | 93 (3.2) | <0.001 | |
| Level 2 (≤5 years) | 90 (29.8) | 409 (15.8) | 499 (17.3) | ||
| Level 3 (6–12 years) | 164 (54.3) | 1573 (60.9) | 1737 (60.2) | ||
| Level 4 (13–16 years) | 28 (9.3) | 455 (17.6) | 483 (16.7) | ||
| Level 5 (>16 years) | 7 (2.3) | 66 (2.6) | 73 (2.5) | ||
| Marital status | 0 | ||||
| Single | 17 (5.6) | 488 (18.9) | 505 (17.5) | <0.001 | |
| Married | 282 (93.4) | 2081 (80.6) | 2363 (81.9) | ||
| Divorced | 0 (0) | 8 (0.3) | 8 (0.3) | ||
| Widowed | 3 (1.0) | 6 (0.2) | 9 (0.3) | ||
| Physical activity levels | |||||
| Inactive‡ | 215 (74.7) | 1744 (71.2) | 1959 (71.5) | 0.217 | 5% |
| Use of the ACE inhibitors | 6 (1.9) | 24 (0.9) | 30 (1.1) | 0.134 | 0 |
| Use of blood lipid-lowering drugs | 11 (3.6) | 37 (1.4) | 48 (1.7) | 0.005 | 0 |
| Use of antihypertensive drugs | 21 (6.9) | 76 (2.9) | 97 (3.4) | <0.001 | 0 |
| Use of aspirin | 55 (19.0) | 257 (11.2) | 312 (11.1) | <0.001 | 2.6% |
| Participating in the lifestyle intervention group | 100 (33.1) | 821 (31.8) | 921 (31.9) | 0.777 | 0 |
Figures are either mean (SD) or N (%) for continuously and categorically distributed variables, respectively, in the data set with no missing values (after imputation of missing values). Data were collected from the TLGS study between 1999 and 2012.
*Continuous and categorical variables were compared by Student's t-test and χ2, respectively.
†Per cent of missing data in the original data set.
‡Doing exercise or labour less than three times a week or performing activities achieving a lower than 600 MET.
BMI, body mass index; HDL, high-density lipoprotein; TLGS, Tehran Lipid and Glucose Study.
Baseline characteristics of women (TLGS 1999–2012)
| Variables | Diabetes | No diabetes | Total population | p Value* | Percentage of missing data† |
|---|---|---|---|---|---|
| Age (years) | 47.1 (11.7) | 38.7 (12.1) | 39.6 (12.3) | <0.001 | 0 |
| 2-hour postchallenge plasma glucose(mmol/L) | 7.7 (1.8) | 5.8 (1.4) | 6.0 (1.5) | <0.001 | 0 |
| Fasting plasma glucose (mmol/L) | 5.5 (0.6) | 4.9 (0.5) | 4.9 (0.5) | <0.001 | 0 |
| Wrist circumference (cm) | 16.5 (1.0) | 15.9 (0.99) | 15.9 (1.0) | <0.001 | 2.5% |
| Waist circumferences (cm) | 95.9 (11.2) | 85.5 (11.6) | 86.6 (12.0) | <0.001 | 3.8% |
| BMI (kg/m2) | 30.5 (4.9) | 26.9 (4.5) | 27.3 (4.7) | <0.001 | 3.2% |
| Waist-to-hip ratio | 0.88 (0.1) | 0.82 (0.1) | 0.83 (0.1) | <0.001 | 3.8% |
| Waist-to-height ratio | 0.61 (0.1) | 0.54 (0.1) | 0.5 (0.1) | <0.001 | 3.8% |
| Triglyceride (mmol/L) | 2.3 (1.3) | 1.6 (0.9) | 1.7 (1.0) | <0.001 | 0 |
| Total cholesterol (mmol/L) | 5.8 (1.3) | 5.3 (1.2) | 5.3 (1.2) | <0.001 | 0 |
| HDL cholesterol (mmol/L) | 1.1 (0.3) | 1.2 (0.3) | 1.1 (0.3) | <0.001 | 0 |
| Triglyceride-to-HDL ratio | 5.2 (3.7) | 3.5 (2.7) | 3.7 (2.9) | <0.001 | 0.1% |
| Cholesterol-to-HDL ratio | 5.5 (1.7) | 4.7 (1.5) | 4.8 (1.6) | <0.001 | 0.1% |
| Estimated glomerular filtration rate (mL/min/1.73 m2) | 59.4 (9.9) | 63.8 (10.9) | 63.3 (10.8) | <0.001 | 0.1% |
| Systolic blood pressure (mm Hg) | 126.4 (19.8) | 114.1 (16.4) | 115.4 (17.2) | <0.001 | 1.2% |
| Diastolic blood pressure (mm Hg) | 81.7 (10.5) | 75.4 (10.1) | 76.1 (10.3) | <0.001 | 1.2% |
| Pulse pressure (mm Hg) | 44.6 (14.9) | 38.6 (11.5) | 39.2 (12.1) | <0.001 | 1.2% |
| Mean arterial blood pressure (mm Hg) | 96.6 (12.6) | 88.3 (11.3) | 89.2 (11.7) | <0.001 | 1.2% |
| Total length of stay in the city (years) | 47.1 (11.7) | 38.7 (12.1) | 33.1 (12.3) | <0.001 | 0.3% |
| Goiter size | 0 | ||||
| Grade 0 | 342 (80.1) | 2443 (73.3) | 2785 (74) | 0.009 | |
| Grade 1 | 53 (12.4) | 530 (15.9) | 583 (15.5) | ||
| Grade 2 | 394 (10.5) | 362 (10.9) | 32 (7.5) | ||
| History of hospitalisation until now | 373 (87.4) | 2768 (83.0) | 3141 (83.5) | 0.022 | 0 |
| Family history of diabetes in first-degree relatives | 174 (40.7) | 838 (25.1) | 1012 (26.9) | <0.001 | 1% |
| Family history of premature cardiovascular diseases in male relatives | 27 (6.4) | 260 (7.9) | 287 (7.7) | 0.284 | 0 |
| Family history of premature cardiovascular diseases in female relatives | 55 (13.0) | 298 (9.0) | 353 (9.5) | 0.008 | 0 |
| Former cigarette smoking | 6 (1.4) | 43 (1.3) | 49 (1.3) | 0.747 | 3.8% |
| Current cigarette smoking | 11 (2.6) | 91 (2.7) | 102 (2.7) | 0.075 | 1.1% |
| Exposed to secondhand smoke at home or at work | 81 (19.0) | 699 (21.0) | 780 (20.7) | 0.340 | 0 |
| Education | 0 | ||||
| Level 1 (illiterate) | 68 (15.9) | 236 (7.1) | 304 (8.1) | <0.001 | |
| Level 2 (≤5 years) | 168 (39.3) | 717 (21.5) | 885 (23.5) | ||
| Level 3 (6–12 years) | 159 (37.2) | 1980 (59.4) | 2139 (56.9) | ||
| Level 4 (13–16 years) | 30 (7.0) | 385 (11.5) | 415 (11.0) | ||
| Level 5 (>16 years) | 2 (0.5) | 17 (0.5) | 19 (0.5) | ||
| Marital status | 0 | ||||
| Single | 11 (2.6) | 353 (10.6) | 364 (9.7) | <0.001 | |
| Married | 372 (87.1) | 2777 (83.3) | 3149 (83.7) | ||
| Divorced | 8 (1.9) | 40 (1.2) | 48 (1.3) | ||
| Widowed | 36 (8.4) | 165 (4.9) | 201 (5.3) | ||
| Physical activity levels | 6% | ||||
| Inactive‡ | 311 (72.8) | 2387 (71.6) | 2698 (71.7) | 0.608 | |
| Use of the ACE inhibitors | 12 (2.8) | 29 (0.9) | 41 (1.1) | <0.001 | 0 |
| Use of blood lipid-lowering drugs | 26 (6.1) | 75 (2.3) | 101 (2.7) | <0.001 | 0 |
| Use of antihypertensive drugs | 61 (14.3) | 166 (5.0%) | 227 (6.1) | <0.001 | 0 |
| Use of aspirin | 66 (15.5) | 265 (7.9) | 331 (8.8) | <0.001 | 2.5% |
| Participating in the lifestyle intervention group | 183 (42.9) | 1472 (44.1) | 1655 (44.0) | 0.036 | 0 |
| Menstruation status | 0 | ||||
| Normal menstrual cycle | 235 (55.0) | 2578 (77.3) | 2813 (74.8) | <0.001 | |
| Normal menopause | 120 (28.1) | 478 (14.3) | 598 (15.9) | ||
| Early menopause | 72 (16.9) | 279 (8.3) | 351 (9.3) | ||
| Pregnancy prevention methods | 0 | ||||
| Use of hormonal contraceptive drugs | 19 (4.4) | 243 (7.3) | 262 (7.0) | <0.001 | |
| IUDs | 6 (1.4) | 197 (5.9) | 203 (5.4) | ||
| Using condoms | 10 (2.3) | 181 (5.4) | 191 (5.1) | ||
| Withdrawal method/tubectomy/vasectomy | 381 (89.2) | 2361 (70.8) | 2742 (72.9) | ||
| Not applicable | 11 (2.6) | 353 (10.6) | 364 (9.7) | ||
| Having a birth history | 388 (90.9) | 2720 (81.6) | 3108 (82.6) | <0.001 | |
| Current status of pregnancy | 2 (0.5) | 48 (1.4) | 50 (1.3) | <0.001 | 0 |
| History of hypertension in pregnancy | 35 (8.2) | 189 (5.7) | 224 (6.0) | <0.001 | 0 |
| History of hyperglycaemia in pregnancy | 9 (2.1) | 19 (0.6) | 28 (0.7) | <0.001 | 0 |
Figures are either mean (SD) or N (%) for continuously and categorically distributed variables, respectively, in the data set with no missing values (after imputation of missing values). Data were collected from the TLGS study between 1999 and 2012.
*Continuous and categorical variables were compared by Student's t-test and χ2, respectively.
†Per cent of missing data in the original data set.
‡Doing exercise or labour less than three times a week or performing activities achieving a lower than 600 MET.
BMI, body mass index; HDL, high-density lipoprotein; IUDs, intrauterine devices; TLGS, Tehran Lipid and Glucose Study.
Baseline characteristics of followed up and non-followed up men (TLGS 1999–2012)
| Selected variables | Followed up (n=2885) | Non-followed up (n=1592) | p Value* |
|---|---|---|---|
| Age (year) | 41.8 (13.7)† | 43.4 (16.2) | 0.001 |
| 2-hour postchallenge plasma glucose (mmol/L) | 5.7 (1.7) | 5.6 (1.7) | 0.42 |
| Fasting plasma glucose (mmol/L) | 5.0 (0.5) | 5.0 (0.5) | 0.10 |
| Wrist circumference (cm) | 17.6 (0.9) | 17.6 (0.9) | 0.06 |
| Waist-to-hip ratio | 0.91 (0.07) | 0.91 (0.07) | 0.19 |
| Waist-to-height ratio | 0.52 (0.06) | 0.52 (0.07) | 0.24 |
| Cholesterol-to-HDL ratio | 5.5 (1.7) | 5.4 (1.7) | 0.014 |
| Mean arterial blood pressure (mm Hg) | 90.7 (11.3) | 91.4 (12.7) | 0.07 |
| History of hospitalisation until now | 1425 (49.4) | 813 (51.1) | 0.28 |
| Family history of diabetes in first-degree relatives | 724 (25.1) | 360 (22.6) | 0.07 |
| Exposed to secondhand smoke at home or at work | 910 (31.5) | 461 (29.0) | 0.07 |
| Goitre size | |||
| Grade 0 | 2085 (72.3) | 1155 (72.6) | 0.75 |
| Grade 1 | 545 (18.9) | 307 (19.3) | |
| Grade 2 | 255 (8.8) | 128 (8.1) | |
| Use of aspirin during the last month | 321 (11.1) | 194 (12.1) | 0.33 |
| Education | |||
| Level 1 (illiterate) | 93 (3.2) | 103 (6.5) | 0.001 |
| Level 2 (≤5 years) | 498 (17.3) | 314 (19.7) | |
| Level 3 (6-12 years) | 1738 (60.2) | 884 (55.5) | |
| Level 4 (13-16 years) | 483 (16.7) | 255 (16.0) | |
| Level 5 (>16 years) | 73 (2.5) | 36 (2.3) | |
| Participating in the lifestyle intervention group | 1230 (42.6) | 740 (46.5) | 0.01 |
*Continuous and categorical variables were compared by Student's t-test and χ2, respectively.
†Figures are either mean (SD) or N (%) for continuously and categorically distributed variables, respectively, in the data set with no missing values (after imputation of missing values). Data were collected from the TLGS study between 1999 and 2012.
HDL, high-density lipoprotein; TLGS, Tehran Lipid and Glucose Study.
Baseline characteristics of followed up and non-followed up women (TLGS 1999–2012)
| Selected variables | Followed up (n=3762) | Non-followed up (n=2071) | p Value* |
|---|---|---|---|
| Age (years) | 39.6 (12.3)† | 40.5 (15.1) | 0.03 |
| 2-hour postchallenge plasma glucose (mmol/L) | 6.0 (1.5) | 6.0 (1.6) | 0.87 |
| Fasting plasma glucose (mmol/L) | 4.9 (0.5) | 4.9 (0.5) | 0.16 |
| Wrist circumference (cm) | 15.9 (1.03) | 15.9 (1.07) | 0.56 |
| BMI (kg/m2) | 27.4 (4.7) | 27.4 (5.3) | 0.84 |
| Waist-to-hip ratio | 0.8 (0.08) | 0.8 (0.08) | 0.06 |
| Waist-to-height ratio | 0.5 (0.08) | 0.5 (0.08) | 0.09 |
| Cholesterol-to-HDL ratio | 4.8 (1.6) | 4.8 (1.6) | 0.93 |
| Triglyceride-to-HDL ratio | 3.7 (2.9) | 3.7 (3.7) | 0.47 |
| Pulse pressure (bpm) | 39.3 (12.1) | 40.2 (13.7) | 0.01 |
| Mean arterial blood pressure(mm Hg) | 89.2 (11.8) | 89.9 (12.7) | 0.04 |
| Glomerular filtration rate (mL/min/1.73 m2) | 63.3 (10.9) | 63.2 (12.1) | 0.9 |
| Total length of stay in the city (years) | 33.1 (13.2) | 33.4 (15.5) | 0.48 |
| Goitre size | |||
| Grade 0 | 2785 (74.0) | 1551 (74.9) | 0.74 |
| Grade 1 | 583 (15.5) | 316 (15.2) | |
| Grade 2 | 394 (10.5) | 204 (9.8) | |
| Use of the ACE inhibitors | 33 (0.9) | 29 (1.4) | 0.06 |
| Current status of pregnancy | 50 (1.3) | 23 (1.1) | |
| Use of aspirin | 331 (8.8) | 178 (8.6) | |
| Educational level | |||
| Level 1 (illiterate) | 304 (8.1) | 273 (13.2) | 0.001 |
| Level 2 (≤5 years) | 885 (23.5) | 456 (22.0) | |
| Level 3 (6–12 years) | 2139 (56.9) | 1099 (53.1) | |
| Level 4 (13–16 years) | 415 (11.0) | 236 (11.4) | |
| Level 5 (>16 years) | 19 (0.5) | 7 (0.3) | |
| Family history of premature cardiovascular diseases in male relatives | 353 (9.4) | 201 (9.7) | 0.63 |
| Family history of diabetes in first-degree relatives | 1012 (26.9) | 533 (25.8) | 0.06 |
*Continuous and categorical variables were compared by Student's t-test and χ2, respectively.
†Figures are either mean (SD) or N (%) for continuously and categorically distributed variables, respectively, in the data set with no missing values (after imputation of missing values). Data were collected from the TLGS study between 1999 and 2012.
BMI, body mass index; HDL, high-density lipoprotein; TLGS, Tehran Lipid and Glucose Study.
Performances of the decision tree models for men (Tehran Lipid and Glucose Study 1999–2012)
| Types of decision tree models | Performance measures | C5.0 | QUEST | CART |
|---|---|---|---|---|
| Models (1) | Sensitivity | 72% | 78% | 74% |
| Specificity | 70% | 72% | 72% | |
| PPV | 23% | 25% | 24% | |
| NPV | 95% | 96% | 96% | |
| Accuracy | 71% | 73% | 72% | |
| F-Measure | 0.35 | 0.38 | 0.36 | |
| G-Mean | 0.71 | 0.75 | 0.73 | |
| AUC | 0.80 | 0.78 | 0.79 | |
| Models (2) | Sensitivity | 67% | 68% | 64% |
| Specificity | 73% | 78% | 74% | |
| PPV | 23% | 27% | 23% | |
| NPV | 95% | 95% | 94% | |
| Accuracy | 73% | 77% | 73% | |
| F-Measure | 0.34 | 0.39 | 0.34 | |
| G-Mean | 0.70 | 0.73 | 0.69 | |
| AUC | 0.74 | 0.77 | 0.78 |
Model (1) was developed based on 15 variables which included 2h-PCPG.
Model (2) was developed based on 14 variables (2h-PCPG was excluded).
F-Measure: Harmonic mean between PPV and sensitivity, F-Measure=2 (sensitivity×PPV)/(sensitivity+PPV).
G-Mean of sensitivity and specificity, G=√sensitivity×specificity.
2h-PCPG, 2-hour postchallenge plasma glucose; AUC, area under the curve; CART, Classification and Regression Tree; G-Mean, geometric mean; NPV, negative predictive value; PPV, positive predictive value; QUEST, Quick Unbiased Efficient Statistical Tree.
Performances of the decision tree models for women (Tehran Lipid and Glucose Study 1999–2012)
| Types of decision tree models | Performance measures | C5.0 | QUEST | CART |
|---|---|---|---|---|
| Models (1) | Sensitivity | 75% | 78% | 70% |
| Specificity | 78% | 78% | 79% | |
| PPV | 29% | 30% | 29% | |
| NPV | 96% | 97% | 96% | |
| Accuracy | 78% | 78% | 78% | |
| F-Measure | 0.42 | 0.43 | 0.41 | |
| G-Mean | 0.76 | 0.78 | 0.74 | |
| AUC | 0.81 | 0.81 | 0.81 | |
| Models (2) | Sensitivity | 73% | 73% | 66% |
| Specificity | 77% | 78% | 78% | |
| PPV | 28% | 29% | 27% | |
| NPV | 96% | 96% | 95% | |
| Accuracy | 77% | 78% | 77% | |
| F-Measure | 0.40 | 0.42 | 0.38 | |
| G-Mean | 0.75 | 0.75 | 0.72 | |
| AUC | 0.80 | 0.81 | 0.81 |
Model (1) was developed based on 20 variables which included 2h-PCPG.
Model (2) was developed based on 19 variables (2h-PCPG was excluded).
F-Measure: Harmonic mean between PPV and sensitivity, F-Measure=2 (sensitivity×PPV)/(sensitivity+PPV).
G-Mean of sensitivity and specificity, G=√sensitivity×specificity.
2h-PCPG, 2-hour postchallenge plasma glucose; AUC, area under the curve; CART, Classification and Regression Tree; G-Mean, geometric mean; NPV, negative predictive value; PPV, positive predictive value; QUEST, Quick Unbiased Efficient Statistical Tree.
Figure 3Decision tree for model 1 in men. Performance measures: sensitivity: 78%, specificity: 72%, G-Mean: 0.75, AUC: 0.78. 2h-PCPG, 2-hour postchallenge plasma glucose; AUC, area under the curve; FPG, fasting plasma glucose; G-Mean, geometric mean; WHtR, waist-to-height ratio.
Groups identified by decision tree models for men (Tehran Lipid and Glucose Study 1999–2012)
| Models | Groups | Definition (rules) | Probability* | Predicted class† |
|---|---|---|---|---|
| Model (1) | 1 | FPG≤4.9 and 2h-PCPG≤7.7 | 0.90 | Non-diabetic |
| 2 | 4.9<FPG≤5.3 and 2h-PCPG≤7.7 and WHtR≤0.6 | 0.72 | Non-diabetic | |
| 3 | FPG>5.3 and 2h-PCPG≤4.4 and age ≤43 | 0.67 | Non-diabetic | |
| 4 | 4.9<FPG≤5.3 and 2h-PCPG≤7.7 and WHtR>0.6 | 0.53 | Diabetic | |
| 5 | FPG≤5.3 and 2h-PCPG>7.7 | 0.70 | Diabetic | |
| 6 | FPG>5.3 and 2h-PCPG≤4.4 and age >43 | 0.68 | Diabetic | |
| 7 | FPG>5.3 and 2h-PCPG>4.4 | 0.79 | Diabetic | |
| Model (2) | 1 | FPG≤4.9 | 0.86 | Non-diabetic |
| 2 | 4.9<FPG≤5.3 and WHtR≤0.56 | 0.70 | Non-diabetic | |
| 3 | 4.9<FPG≤5.3 and WHtR>0.56 and FHD=‘no’ | 0.58 | Non-diabetic | |
| 4 | FPG>5.3 and 0.4<WHtR≤0.49and MAP≤92 | 0.75 | Non-diabetic | |
| 5 | 4.9<FPG≤5.3 and WHtR>0.56 and FHD=‘yes’ | 0.78 | Diabetic | |
| 6 | FPG>5.3 and WHtR≤0.45 | 0.56 | Diabetic | |
| 7 | FPG>5.3 and 0.45<WHtR≤0.49 and MAP>92 | 0.67 | Diabetic | |
| 8 | FPG>5.3 and≤0.49≤WHtR<0.56 | 0.74 | Diabetic | |
| 9 | FPG>5.3 and WHtR>0.56 | 0.84 | Diabetic |
Model (1) was developed based on 15 variables which included 2h-PCPG.
Model (2) was developed based on 14 variables (2h-PCPG was excluded).
*The percentage of population in the defined subgroup, which can be interpreted as probability of an outcome.
†Predicted outcome for men who belong to the defined subgroup.
2h-PCPG, 2-hour postchallenge plasma glucose (mmol/L); FHD, family history of diabetes; FPG, fasting plasma glucose (mmol/L); MAP, mean arterial blood pressure (mm Hg); WHtR, waist-to-height ratio.
Figure 4Decision tree for model 1 in women. Performance measures: sensitivity: 78%, specificity: 78%, G-Mean: 0.78, AUC: 0.81. 2h-PCPG, 2-hour postchallenge plasma glucose; AUC, area under the curve; FPG, fasting plasma glucose; G-Mean, geometric mean; WHtR, waist-to-height ratio.
Groups identified by decision tree models for women (Tehran Lipid and Glucose Study 1999–2012)
| Models | Groups | Definition (rules) | Probability* | Predicted class† |
|---|---|---|---|---|
| Model (1) | 1 | FPG≤5.2 and WHtR≤0.55 | 0.88 | Non-diabetic |
| 2 | FPG≤5.2 and 0.55<WHtR≤0.66 and 2h-PCPG≤7.4 | 0.72 | Non-diabetic | |
| 3 | FPG≤5.2 and WHtR>0.66 and 2h-PCPG≤6.9 | 0.57 | Non-diabetic | |
| 4 | FPG>5.2 and WHtR≤0.52 | 0.74 | Non-diabetic | |
| 5 | FPG≤5.2 and 0.55<WHtR≤0.66 and 2h-PCPG>7.4 | 0.69 | Diabetic | |
| 6 | FPG≤5.2 and WHtR>0.66 and 2h-PCPG>6.9 | 0.75 | Diabetic | |
| 7 | FPG>5.2 and WHtR>0.52 | 0.81 | Diabetic | |
| Model (2) | 1 | FPG≤5.2 and WHtR≤0.55 | 0.88 | Non-diabetic |
| 2 | FPG≤4.9 and 0.55<WHtR≤0.66 | 0.73 | Non-diabetic | |
| 3 | 4.9<FPG≤5.2 and 0.55<WHtR≤0.66 and MAP≤97 | 0.64 | Non-diabetic | |
| 4 | FPG≤5.2 and WHtR>0.66 and MAP≤99 | 0.59 | Non-diabetic | |
| 5 | FPG>5.2 and WHtR≤0.52 | 0.74 | Non-diabetic | |
| 6 | 4.9<FPG≤5.2 and 0.55<WHtR≤0.66 and MAP>97 | 0.67 | Diabetic | |
| 7 | FPG≤5.2 and WHtR>0.66 and MAP>99 | 0.66 | Diabetic | |
| 8 | FPG>5.2 and WHtR>0.52 | 0.81 | Diabetic | |
| 9 | FPG>5.2 and WHtR>0.56 | 0.84 | Diabetic |
Model (1) was developed based on 20 variables which included 2h-PCPG.
Model (2) was developed based on 19 variables (2h-PCPG was excluded).
*The percentage of population in the defined subgroup, which can be interpreted as probability of an outcome.
†Predicted outcome for women who belong to the defined subgroup.
2h-PCPG, 2-hour postchallenge plasma glucose (mmol/L); FPG, fasting plasma glucose (mmol/L); MAP, mean arterial blood pressure (mm Hg); WHtR, waist-to-height ratio.