| Literature DB >> 33075216 |
Mostafa Abbas1,2, Raghvendra Mall1, Khaoula Errafii3,4, Abdelkader Lattab1, Ehsan Ullah1, Halima Bensmail1, Abdelilah Arredouani3,4.
Abstract
AIMS/Entities:
Keywords: Prediabetes; Qatar Biobank; Risk score
Mesh:
Year: 2020 PMID: 33075216 PMCID: PMC8169357 DOI: 10.1111/jdi.13445
Source DB: PubMed Journal: J Diabetes Investig ISSN: 2040-1116 Impact factor: 4.232
Baseline characteristics of participants in training and validation datasets
| Training dataset | Validation dataset | |||||
|---|---|---|---|---|---|---|
| Controls (3,912) | Cases ( |
| Controls ( | Cases ( |
| |
| HbA1c% | 5.3 ± 0.3 | 6.2 ± 0.2 | <0.001 | 5.4 ± 0.3 | 6.4 ± 0.2 | <0.001 |
| HbA1c (mmol/mol) | 30.5 ± 5.2 | 40.4 ± 6.3 | <0.001 | 31.6 ± 4.2 | 42.6 ± 7.3 | <0.001 |
| Age (years) | 37 ± 11 | 48 ± 12 | <0.001 | 37 ± 10 | 49 ± 12 | <0.001 |
| Waist (cm) | 87 ± 13 | 97 ± 13 | <0.001 | 87 ± 13 | 96 ± 12 | <0.001 |
| Hip (cm) | 106 ± 11 | 111 ± 11 | <0.001 | 107 ± 11 | 110 ± 11 | <0.001 |
| SBP (mmHg) | 112 ± 13 | 122 ± 16 | <0.001 | 112 ± 13 | 122 ± 16 | <0.001 |
| DBP (mmHg) | 68 ± 10 | 73 ± 11 | <0.001 | 68 ± 10 | 73 ± 11 | <0.001 |
| Pulse (b.p.m.) | 70 ± 10 | 70 ± 10 | 0.18 | 70 ± 10 | 71 ± 10 | 0.06 |
| BMI (kg/m2) | 29 ± 6 | 32 ± 6 | <0.001 | 29 ± 6 | 32 ± 6 | <0.001 |
| Sex (% men) | 47.37 | 49.58 | 0.12 | 46.08 | 48.2 | 0.48 |
| Prediabetes (%) | 32 | 32 | ||||
Data shown as the mean ± standard deviation or proportion. Student’s t‐test was used to compare continuous variables, and the χ2‐test to compare proportions.
BMI, body mass index; HBA1c, hemoglobin A1C; DBP, diastolic blood pressure; SBP, systolic blood pressure.
Multivariate logistic regression model and assigned score for each variable category
| β |
| OR | Assigned score | |
|---|---|---|---|---|
| Age (years) | ||||
| 18–36 | Reference | 0 | ||
| 36–54 | 1.22 | <0.001 | 3.39 | 12 |
| ≥55 | 2.2 | <0.001 | 9 | 22 |
| BMI | ||||
| Normal | Reference | 0 | ||
| Overweight | 0.18 | 0.12 | 1.19 | 2 |
| Obese | 0.47 | <0.001 | 1.6 | 5 |
| Blood pressure | ||||
| Normal | Reference | 0 | ||
| Hypertension | 0.61 | <0.001 | 1.85 | 6 |
| Sex | ||||
| Female | Reference | 0 | ||
| Male | 0.3 | <0.001 | 1.36 | 3 |
| Waist circumference | ||||
| Level 1 | Reference | 0 | ||
| Level 2 | 0.27 | 0.01 | 1.31 | 3 |
| Level 3 | 0.86 | <0.001 | 2.36 | 9 |
| Cutoff point | 16 | |||
A score is attributed to each variable category by multiplying the β coefficient by 10 and rounding to the nearest integer. The odds ratio (OR) was obtained by exponentiating the b coefficients (OR exp[b]).
BMI, body mass index.
The cut‐off point is the score that gives the best balanced accuracy, sensitivity and specificity.
Prediabetes prevalence in training and testing populations based on risk levels defined by the risk score
| Risk level | Score range | Control | Case | Total | Percentage of prediabetes |
|---|---|---|---|---|---|
| Training dataset | |||||
| Low | 0–16 | 2,648 | 450 | 3,098 | 14.53 |
| Moderate | 17–27 | 943 | 781 | 1,724 | 45.3 |
| High | >27 | 321 | 671 | 992 | 67.64 |
| Testing dataset | |||||
| Low | 0–16 | 677 | 107 | 784 | 13.65 |
| Moderate | 17–27 | 226 | 194 | 420 | 46.19 |
| High | >27 | 80 | 170 | 250 | 68 |
Baseline characteristics of the low, medium and high‐risk groups in the training and validation sets, and clusters obtained with K‐means clustering analysis based on age, body mass index, systolic blood pressure, diastolic blood pressure, waist size and sex
| Training data set | Validation dataset | |||||||
|---|---|---|---|---|---|---|---|---|
| Low ( | Moderate ( | High ( |
| Low ( | Moderate ( | High ( |
| |
| Basic characteristics | ||||||||
| Age (years) | 30.71 ± 6.91 | 43.08 ± 8.01 | 54.07 ± 9.55 | <0.0001 | 31.61 ± 7.43 | 42.87 ± 7.74 | 54.61 ± 9.86 | <0.0001 |
| SBP (mmHg) | 106.90 ± 9.93 | 115.78 ± 2.06 | 129.87 ± 15.10 | <0.0001 | 106.42 ± 9.20 | 115.81 ± 12.35 | 130.65 ± 14.99 | <0.0001 |
| DBP (mmHg) | 64.79 ± 8.63 | 70.63 ± 10.28 | 75.97 ± 11.45 | <0.0001 | 64.36 ± 8.39 | 71.40 ± 10.48 | 75.64 ± 11.50 | <0.0001 |
| BMI (kg/m2) | 26.53 ± 4.80 | 31.04 ± 5.16 | 33.77 ± 5.24 | <0.0001 | 26.40 ± 4.49 | 31.37 ± 5.18 | 33.53 ± 5.71 | <0.0001 |
| Waist (cm) | 81.01 ± 10.44 | 93.07 ± 10.79 | 102.28 ± 11 | <0.0001 | 80.66 ± 9.69 | 93.61 ± 10.79 | 101.77 ± 11.43 | <0.0001 |
| Sex | ||||||||
| Women | 1,399 (57.38%) | 1,041 (50.31%) | 578 (44.22%) | <0.0001 | 377 (59.46%) | 253 (51.84%) | 144 (43.37%) | <0.0001 |
| Sex (% men) | 1,039 (42.62%) | 1,028 (49.69%) | 729 (55.78%) | <0.0001 | 257 (40.54%) | 235 (48.16%) | 188 (56.63%) | <0.0001 |
| Clustering | ||||||||
| Cluster 1 | 1,462 (59.97%) | 203 (9.81%) | 1 (0.08%) | <0.0001 | 382 (60.25%) | 48 (9.84%) | 1 (0.30%) | <0.0001 |
| Cluster 2 | 959 (39.34%) | 1,249 (60.37%) | 143 (10.94%) | <0.0001 | 250 (39.43%) | 291 (59.63%) | 30 (9.04%) | <0.0001 |
| Cluster 3 | 17 (0.70%) | 617 (29.82%) | 1,163 (88.98%) | <0.0001 | 2 (0.32%) | 149 (30.53%) | 301 (90.66%) | <0.0001 |
The P‐value shows significance among the three groups based on anova test. Data are mean ± standard deviation or proportions.
BMI, body mass index; DBP, diastolic blood pressure; SBP, systolic blood pressure.
Baseline characteristics for the three clusters in training and validation datasets
| Training dataset | Validation dataset | |||||||
|---|---|---|---|---|---|---|---|---|
| Cluster 1 ( | Cluster 2 ( | Cluster 3 ( |
| Cluster 1 ( | Cluster 2 ( | Cluster 3 ( |
| |
| Age (years) | 31.58 ± 8 | 38.14 ± 9.16 | 51.42 ± 10.34 | <0.0001 | 32.10 ± 8.07 | 38.27 ± 8.94 | 51.78 ± 10.64 | <0.0001 |
| SBP (mmHg) | 101.16 ± 6.71 | 113.19 ± 7.49 | 130.93 ± 12.92 | <0.0001 | 101.23 ± 6.51 | 112.80 ± 7.32 | 131.24 ± 3.15 | <0.0001 |
| DBP (mmHg) | 60.28 ± 6.84 | 69.59 ± 7.65 | 77.55 ± 10.79 | <0.0001 | 60.45 ± 7.21 | 69.80 ± 7.57 | 77.10 ± 11.18 | <0.0001 |
| BMI (kg/m2) | 25.49 ± 3.95 | 30.16 ± 4.89 | 33.20 ± 5.91 | <0.0001 | 25.48 ± 3.79 | 30.23 ± 5.06 | 33.06 ± 5.97 | <0.0001 |
| Waist (cm) | 76.68 ± 7.64 | 91.51 ± 9.45 | 100.65 ± 12.20 | <0.0001 | 76.66 ± 7.27 | 91.31 ± 9.32 | 100.51 ± 2.20 | <0.0001 |
| Women | 1,230 (73.83%) | 1,013 (43.09%) | 775 (43.13%) | <0.0001 | 322 (74.71%) | 255 (44.66%) | 197 (43.58%) | <0.0001 |
| Men | 436 (26.17%) | 1,338 (56.91%) | 1,022 (56.87%) | <0.0001 | 109 (25.29%) | 316 (55.34%) | 255 (56.42%) | <0.0001 |
Data are the mean ± standard deviation or proportion. P for difference across clusters for the different variables.
BMI, body mass index; DBP, diastolic blood pressure; SBP, systolic blood pressure.
Specificity, sensitivity, balanced accuracy, accuracy, positive predictive value, negative predictive value and area under the curve for selected risk point scores of Prediabetes Risk Score in Qatar in the training and testing datasets using the logistic regression model
| Cut‐off point | Specificity % (95% CI) | Sensitivity % (95% CI) | BACC % (95% CI) | ACC% (95% CI) | PPV% (95% CI) | NPV % (95% CI) | AUC % (95% CI) |
|---|---|---|---|---|---|---|---|
| Training data | |||||||
| 1 | 9.71 (8.8–10.68) | 98.79 (98.19–99.23) | 54.25 (53.5–54.96) | 38.85 (37.6–40.12) | 34.73 (33.46–36.01) | 94.29 (91.56–96.35) | 0.79 (0.77–0.80) |
| 5 | 33.03 (31.55–34.53) | 95.43 (94.39–96.32) | 64.2 (62.97–65.42) | 53.4 (52.15–54.73) | 40.9 (39.47–42.39) | 93.6 (92.28–94.92) | |
| 10 | 39.19 (37.65–40.74) | 93.9 (92.73–94.93) | 66.5 (65.19–67.84) | 57.0 (55.8–58.36) | 42.8 (41.37–44.4) | 92.9 (91.62–94.15) | |
| 15 | 55.34 (53.77–56.91) | 85.65 (83.99–87.19) | 70.4 (68.88–72.05) | 65.2 (64.02–66.48) | 48.2 (46.55–49.95) | 88. (87.48–90.03) | |
| 16 | 55.34 (53.77–56.91) | 85.65 (83.99–87.19) | 70.4 (68.88–72.05) | 65.2 (64.02–66.48) | 48.2 (46.55–49.95) | 88. (87.48–90.03) | |
| 17 | 67.38 (65.89–68.85) | 76.66 (74.69–78.54) | 72.0 (70.29–73.7) | 70.4 (69.22–71.59) | 53.3 (51.44–55.21) | 85.5 (84.29–86.81) | |
| 18 | 67.69 (66.2–69.15) | 76.34 (74.36–78.24) | 72.0 (70.28–73.69) | 70.5 (69.33–71.69) | 53.4 (51.56–55.35) | 85.4 (84.18–86.7) | |
| 19 | 67.69 (66.2–69.15) | 76.34 (74.36–78.24) | 72.0 (70.28–73.69) | 70.5 (69.33–71.69) | 53.4 (51.56–55.35) | 85.4 (84.18–86.7) | |
| 20 | 73.29 (71.87–74.67) | 70.4 (68.29–72.44) | 71.8 (70.08–73.56) | 72.3 (71.17–73.49) | 56.1 (54.15–58.17) | 83.5 (82.3–84.81) | |
| 25 | 79.65 (78.36–80.9) | 61.72 (59.5–63.92) | 70.6 (68.93–72.41) | 73.7 (72.64–74.91) | 59.5 (57.39–61.77) | 81.0 (79.79–82.29) | |
| 30 | 91.79 (90.89–92.64) | 35.28 (33.13–37.47) | 63.5 (62.01–65.05) | 73.3 (72.15–74.44) | 67.6 (64.63–70.55) | 74.4 (73.22–75.7) | |
| 35 | 96.09 (95.43–96.67) | 19.98 (18.2–21.85) | 58.0 (56.82–59.26) | 71.1 (70.01–72.35) | 71.2 (67.25–75.1) | 71.1 (69.94–72.4) | |
| 40 | 98.34 (97.89–98.72) | 9.1 (7.84–10.48) | 53.7 (52.86–54.6) | 69.1 (67.94–70.33) | 72.6 (66.56–78.25) | 68.9 (67.76–70.2) | |
| 45 | 100 (99.91–100) | 0 (0–0.19) | 5 (49.95–50.1) | 67.2 (66.06–68.49) | NaN(0–100) | 67.2 (66.06–68.49) | |
| Testing data | |||||||
| 1 | 9.46 (7.7–11.46) | 99.15 (97.84–99.77) | 54.3 (52.77–55.62) | 38.5 (36–41.07) | 34.4 (31.89–37.01) | 95.8 (89.78–98.87) | 0.80 (0.78–0.83) |
| 5 | 33.06 (30.12–36.1) | 97.03 (95.06–98.37) | 65.0 (62.59–67.23) | 53.7 (51.18–56.37) | 40.9 (38.08–43.94) | 95.8 (93.17–97.72) | |
| 10 | 38.56 (35.5–41.68) | 94.69 (92.26–96.54) | 66.6 (63.88–69.11) | 56.7 (54.15–59.31) | 42.4 (39.46–45.53) | 93.8 (91–95.96) | |
| 15 | 57.88 (54.73–60.99) | 86.2 (82.75–89.19) | 72.0 (68.74–75.09) | 67.0 (64.57–69.47) | 49.5 (46.04–52.99) | 89.7 (87.12–92) | |
| 16 | 57.88 (54.73–60.99) | 86.2 (82.75–89.19) | 72.0 (68.74–75.09) | 67.0 (64.57–69.47) | 49.5 (46.04–52.99) | 89.7 (87.12–92) | |
| 17 | 68.57 (65.56–71.46) | 77.28 (73.23–80.99) | 72.9 (69.39–76.23) | 71.3 (68.99–73.7) | 54.0 (50.24–57.9) | 86. (83.69–88.63) | |
| 18 | 68.87 (65.87–71.76) | 77.28 (73.23–80.99) | 73.0 (69.55–76.37) | 71. (69.2–73.9) | 54.3 (50.47–58.15) | 86.3 (83.75–88.68) | |
| 19 | 68.87 (65.87–71.76) | 77.28 (73.23–80.99) | 73.0 (69.55–76.37) | 71. (69.2–73.9) | 54.3 (50.47–58.15) | 86.3 (83.75–88.68) | |
| 20 | 75.28 (72.46–77.95) | 71.34 (67.02–75.38) | 73.3 (69.74–76.67) | 7 (71.67–76.24) | 58.0 (53.89–62.09) | 84.5 (82.01–86.9) | |
| 25 | 80.16 (77.53–82.61) | 62.42 (57.87–66.81) | 71.2 (67.7–74.71) | 74.4 (72.09–76.64) | 60.1 (55.63–64.49) | 81.6 (79.07–84.05) | |
| 30 | 91.86 (89.97–93.49) | 36.09 (31.75–40.61) | 63.9 (60.86–67.05) | 73. (71.46–76.04) | 6 (61.83–73.74) | 7 (72.45–77.42) | |
| 35 | 97.05 (95.79–98.02) | 23.14 (19.41–27.22) | 60. (57.6–62.62) | 73.1 (70.75–75.37) | 78.9 (71.23–85.45) | 72.4 (69.99–74.89) | |
| 40 | 99.39 (98.68–99.78) | 10.83 (8.17–13.99) | 55.1 (53.42–56.88) | 70. (68.29–73.03) | 89.4 (78.48–96.04) | 69.9 (67.46–72.33) | |
| 45 | 100 (99.63–100) | 0 (0–0.78) | 5 (49.81–50.39) | 67.6 (65.13–70.01) | NaN (0–100) | 67.6 (65.13–70.01) | |
ACC, accuracy; AUC, area under the curve; BACC, balanced accuracy; CI, confidence interval; NPV, negative; predictive value; PPV, positive predictive value.
Figure 1Receiver operating characteristic curves for diagnosing prediabetes using either the (a) logistic regression model or (b) more complex machine learning models (Qatar Biobank cohort). AUC, area under the curve; DL, deep learning; GBM, gradient boosting machine; RF, random forest; XGB, XgBoost.