| Literature DB >> 33842630 |
Xiaohan Tang1,2,3, Rui Tang4, Xingzhi Sun4, Xiang Yan1,2,3, Gan Huang1,2,3, Houde Zhou1,3,5, Guotong Xie4, Xia Li1,2,3, Zhiguang Zhou1,2,3.
Abstract
BACKGROUND: Accurate classification of type 1 diabetes (T1DM) and type 2 diabetes (T2DM) in the early phase is crucial for individual precision treatment. This study aimed to develop a classification model having fewer and easier to access clinical variables to distinguish T1DM in newly diagnosed diabetes in adults.Entities:
Keywords: Type 1 diabetes (T1DM); diagnostic model; eXtreme Gradient Boosting algorithm (XGBoost algorithm); type 2 diabetes (T2DM)
Year: 2021 PMID: 33842630 PMCID: PMC8033361 DOI: 10.21037/atm-20-7115
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839
Figure 1Flow diagram of the study design. GADA, glutamic acid decarboxylase autoantibody; T1DM, type 1 diabetes; T2DM, type 2 diabetes.
Characteristics of patients diagnosed with T1DM and T2DM
| Variables | T1DM | T2DM | P value | |||
|---|---|---|---|---|---|---|
| N | Value | N | Value | |||
| Age of onset (years) | 1,465 | 43.7±15.1 | 13,741 | 51.0±12.8 | <0.001 | |
| Sex | 0.227 | |||||
| Female (n, %) | 567 | 38.7% | 5,542 | 40.3% | ||
| Male (n, %) | 898 | 61.30% | 8,199 | 59.67% | ||
| BMI (kg/m2) | 1,416 | 21.5±3.5 | 13,385 | 25.1±3.5 | <0.001 | |
| FPG (mmol/L) | 1,419 | 9.33 (6.60, 13.43) | 13,603 | 7.87 (6.52, 10.31) | <0.001 | |
| PPG (mmol/L) | 1,397 | 17.0±7.1 | 13,462 | 15.1±5.5 | <0.001 | |
| HbA1c (%) | 1,417 | 11.6 (9.0, 13.6) | 13,491 | 8.6 (6.9, 10.9) | <0.001 | |
| SBP (mmHg) | 1,415 | 120.5±15.6 | 13,139 | 128.2±16.2 | <0.001 | |
| DBP (mmHg) | 1,417 | 76.2±10.4 | 13,139 | 80.4±10.5 | <0.001 | |
| Waist (cm) | 1,320 | 80.6±10.3 | 12,478 | 89.1±10.5 | <0.001 | |
| TG (mmol/L) | 1,419 | 1.2 (0.8, 1.8) | 13,262 | 1.77 (1.22, 2.75) | <0.001 | |
| TC (mmol/L) | 1,422 | 4.6±1.4 | 13,366 | 4.8±1.3 | <0.001 | |
| LDL-C (mmol/L) | 1,421 | 2.7±1.0 | 13,327 | 2.9±1.0 | <0.001 | |
| HDL-C (mmol/L) | 1,413 | 1.2 (1.0, 1.5) | 13,205 | 1.11 (0.94, 1.32) | <0.001 | |
| FCP (nmol/L) | 1,456 | 0.14 (0.06, 0.24) | 13,576 | 0.62 (0.43, 0.85) | <0.001 | |
| PCP (nmol/L) | 1,465 | 0.22 (0.12, 0.34) | 13,741 | 1.60 (1.09, 2.40) | <0.001 | |
| Current smoking (n, %) | 448 | 31.0% | 4,084 | 30.1% | 0.485 | |
| Current drinking (n, %) | 212 | 14.8% | 2,447 | 18.1% | 0.002 | |
| Diet treatment (n, %) | 687 | 53.5% | 6,240 | 62.3% | <0.001 | |
| Physical activity (n, %) | 568 | 44.2% | 5,357 | 53.5% | <0.001 | |
% reported for all categorical variables. Data are presented as mean ± SD or median with upper and lower quartiles based on evaluation of normal distribution. Differences between T1DM and T2DM were compared using t-test or Mann-Whitney test for continuous variables where appropriate and chi-square test for categorical variables. T1DM, type 1 diabetes; T2DM, type 2 diabetes; BMI, body mass index; FPG, fasting plasma glucose; PPG, postprandial plasma glucose; HbA1c, hemoglobin A1c; SBP, systolic blood pressure; DBP, diastolic blood pressure; TG, triglyceride; TC, total cholesterol; LDL-C, low-density lipoprotein cholesterol; HDL-C, high-density lipoprotein cholesterol; FCP, fasting C-peptide; PCP, postprandial C-peptide.
Figure 2Feature importance scores of clinical variables. BMI, body mass index; HbA1c, hemoglobin A1c; TG, triglyceride; FPG, fasting plasma glucose; PPG, postprandial plasma glucose; SBP, systolic blood pressure; DBP, diastolic blood pressure; HDL-C, high-density lipoprotein cholesterol; TC, total cholesterol; LDL-C, low-density lipoprotein cholesterol.
Figure 3Impact of (A) age at onset, (B) BMI, (C) HbA1c, (D) waist, (E) TG, (F) TC, (G) LDL-C, (H) HDL-C on SHAP value for test dataset only. SHAP value represents the impact of each variable on the model output (diagnosis of T1DM in this model). SHAP, Shapley Additive exPlanations; BMI, body mass index; HbA1c, hemoglobin A1c; TG, triglyceride; TC, total cholesterol; LDL-C, low-density lipoprotein cholesterol; HDL-C, high-density lipoprotein cholesterol; T1DM, type 1 diabetes.
Performance of models with different combinations of variables
| Features | ROC AUC (95% CI) | Cut-off (%) | Youden’s index | Sensitivity | Specificity | PPV | NPV |
|---|---|---|---|---|---|---|---|
| All 13 features | 0.86 (0.83, 0.88) | 7 | 0.56 | 0.78 | 0.78 | 0.27 | 0.97 |
| BMI + age of onset + HbA1c | 0.83 (0.80, 0.85) | 8 | 0.52 | 0.77 | 0.76 | 0.25 | 0.97 |
| BMI + age of onset + FPG | 0.82 (0.78, 0.84) | 9 | 0.49 | 0.71 | 0.77 | 0.25 | 0.96 |
| BMI + age of onset + TG | 0.81 (0.78, 0.84) | 8 | 0.48 | 0.73 | 0.75 | 0.23 | 0.96 |
| BMI + age of onset + PPG | 0.81 (0.78, 0.84) | 10 | 0.49 | 0.69 | 0.80 | 0.27 | 0.96 |
| BMI + age of onset + men | 0.80 (0.77, 0.83) | 14 | 0.48 | 0.60 | 0.88 | 0.34 | 0.95 |
| BMI + age of onset + TC | 0.80 (0.77, 0.83) | 10 | 0.48 | 0.67 | 0.81 | 0.27 | 0.96 |
| BMI + age of onset + HDL-C | 0.80 (0.77, 0.83) | 12 | 0.46 | 0.61 | 0.85 | 0.30 | 0.95 |
| BMI + age of onset + DBP | 0.80 (0.77, 0.83) | 13 | 0.46 | 0.61 | 0.86 | 0.31 | 0.95 |
| BMI + age of onset + LDL-C | 0.80 (0.77, 0.83) | 14 | 0.47 | 0.60 | 0.87 | 0.33 | 0.95 |
| BMI + age of onset + SBP | 0.80 (0.77, 0.83) | 9 | 0.47 | 0.69 | 0.77 | 0.24 | 0.96 |
| BMI + age of onset | 0.80 (0.77, 0.83) | 13 | 0.47 | 0.61 | 0.86 | 0.32 | 0.95 |
| BMI + age of onset + waist | 0.79 (0.76, 0.82) | 10 | 0.46 | 0.67 | 0.79 | 0.25 | 0.96 |
ROC AUC, area under the receiver operating characteristic curve; PPV, positive predictive value; NPV, negative predictive value; BMI, body mass index; HbA1c, hemoglobin A1c; FPG, fasting plasma glucose; TG, triglyceride; PPG, postprandial plasma glucose; TC, total cholesterol; HDL-C, high-density lipoprotein cholesterol; DBP, diastolic blood pressure; LDL-C, low-density lipoprotein cholesterol; SBP, systolic blood pressure.
Figure 4Calibration curve of the model composed of BMI, age of onset and HbA1c. BMI, body mass index; HbA1c, hemoglobin A1c.