
Establishment of a diagnostic model of coronary heart disease in elderly patients with diabetes mellitus based on machine learning algorithms.

Hu Xu1,2,3, Wen-Zhe Cao1,4, Yong-Yi Bai2, Jing Dong5, He-Bin Che5, Po Bai6, Jian-Dong Wang1,3, Feng Cao1,2, Li Fan1,2.   

Abstract

OBJECTIVE: To establish a prediction model of coronary heart disease (CHD) in elderly patients with diabetes mellitus (DM) based on machine learning (ML) algorithms.
METHODS: Based on the Medical Big Data Research Centre of Chinese PLA General Hospital in Beijing, China, we identified a cohort of elderly inpatients (≥ 60 years), including 10,533 patients with DM complicated with CHD and 12,634 patients with DM without CHD, from January 2008 to December 2017. We collected demographic characteristics and clinical data. After selecting the important features, we established five ML models, including extreme gradient boosting (XGBoost), random forest (RF), decision tree (DT), adaptive boosting (Adaboost) and logistic regression (LR). We compared the receiver operating characteristic curves, area under the curve (AUC) and other relevant parameters of different models and determined the optimal classification model. The model was then applied to 7447 elderly patients with DM admitted from January 2018 to December 2019 to further validate the performance of the model.
RESULTS: Fifteen features were selected and included in the ML models. The classification precision in the test set of the XGBoost, RF, DT, Adaboost and LR models was 0.778, 0.789, 0.753, 0.750 and 0.689, respectively, and the corresponding AUCs were 0.851, 0.845, 0.823, 0.833 and 0.731. Applying the XGBoost model, which had the best performance, to a newly recruited dataset for validation, the diagnostic sensitivity, specificity, precision, and AUC were 0.792, 0.808, 0.748 and 0.880, respectively.
CONCLUSIONS: The XGBoost model established in the present study has predictive value for identifying CHD in elderly patients with DM.
© 2022 JGC All rights reserved; www.jgc301.com.


Year:  2022        PMID: 35845157      PMCID: PMC9248279          DOI: 10.11909/j.issn.1671-5411.2022.06.006

Source DB:  PubMed          Journal:  J Geriatr Cardiol        ISSN: 1671-5411            Impact factor:   3.189


With the population in China ageing at an increasing rate, comorbidity among the elderly has become increasingly prominent. Comorbidity is the coexistence of two or more chronic diseases or conditions.[ The prevalence of comorbidity in individuals aged 65 years or older is 76.6%.[ Comorbidity places a heavy burden on individuals, families and society. In the elderly, the combination of coronary heart disease (CHD) and diabetes mellitus (DM) is among the most common. There are 114 million patients with DM in China, more than in any other country in the world.[ Cardiovascular disease, of which CHD is representative, is the leading cause of death among urban and rural residents, and an estimated 11.39 million patients in China have CHD.[ In patients with DM, the risk of CHD increases significantly.[ DM is an independent risk factor for CHD.[ CHD is one of the most common complications of DM.[ The life expectancy of patients with DM complicated with cardiovascular disease is significantly decreased.[ In elderly patients with comorbid CHD and DM, the symptoms of myocardial ischaemia are relatively mild or absent, making CHD difficult to recognize and diagnose.[ Irreversible pathological damage is often present by the time the diagnosis of CHD is confirmed, which seriously affects patients' quality of life.[ Coronary angiography is the gold standard for diagnosing CHD; however, the procedure is complex, invasive and expensive, and is not part of routine examinations. Therefore, early screening for CHD in elderly patients with DM is of great significance.
In recent years, many experts and researchers have begun to explore new models of DM diagnosis, making full use of medical big data and machine learning (ML) algorithms, and have achieved good results in the prediction and diagnosis of CHD.[ For different datasets, researchers have used different algorithms, such as sequential minimal optimization,[ artificial neural networks[ and neural network models,[ to build CHD diagnosis models incorporating a variety of features, such as the electrocardiogram (ECG),[ heart rate variability[ and even facial photos.[ Currently, there is no specific model to predict the risk of CHD in elderly patients with DM. This study aimed to establish a prediction model of CHD in elderly patients with DM using ML algorithms, and to provide a new auxiliary screening method from the perspective of medicine and big data science.

METHODS

Study Population

Based on the Medical Big Data Research Centre of Chinese PLA General Hospital in Beijing, China, we identified a total of 28,059 elderly inpatients (≥ 60 years) who were diagnosed with DM from January 2008 to December 2017. The inclusion criteria were inpatients aged 60 years or older whose discharge diagnosis included DM according to the corresponding diagnostic criteria. The exclusion criteria were malignant tumor, severe hepatic and renal dysfunction, respiratory failure, sepsis, shock, severe autoimmune disease, severe hematological disease, acute rheumatic fever, chronic rheumatic valvular disease, myocarditis, and myocardiopathy. The diagnostic criteria for CHD were based on the European Society of Cardiology guidelines for the diagnosis and management of chronic coronary syndromes.[ This study was approved by the Ethics Committee of Chinese PLA General Hospital (No. S2018-269-02). According to the inclusion and exclusion criteria, 4892 patients were excluded, leaving a total of 23,167 elderly patients with DM in the present study. Among them, 10,533 patients with CHD formed the research group and 12,634 patients without CHD formed the control group. To further validate the performance of the models, we collected information on 7447 elderly patients with DM admitted from January 2018 to December 2019: 3116 patients with CHD and 4331 without.

Candidate Variables

All demographic characteristics, laboratory examinations, history of complications and other clinical data were used to establish the prediction models. Any variable with > 30% missing data was removed; the remaining missing values were imputed with the nonparametric missForest algorithm. Recursive feature elimination (RFE) is a feature-selection method that can be combined with various ML models to eliminate redundant information and thus identify the most influential features for each model. In our study, RFE was used to rank the importance of all 67 feature variables, and the top 15 were selected to construct the models. The importance of each variable in each model was also evaluated.
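The feature-selection step above can be sketched with scikit-learn's RFE implementation. The data below are synthetic and the estimator and hyperparameters are illustrative assumptions, not the authors' exact configuration; the paper's setup ranked 67 clinical variables and retained the top 15.

```python
# Hedged sketch of RFE-based feature selection, assuming scikit-learn.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import RFE

# Synthetic stand-in for the 67 candidate clinical variables.
X, y = make_classification(n_samples=500, n_features=67,
                           n_informative=20, random_state=0)

# RFE repeatedly fits the estimator and drops the weakest feature(s),
# as judged by feature importances, until the requested number remains.
selector = RFE(estimator=RandomForestClassifier(n_estimators=50, random_state=0),
               n_features_to_select=15, step=5)
selector.fit(X, y)

selected = np.flatnonzero(selector.support_)  # indices of the 15 retained features
```

`selector.ranking_` gives the elimination order for every variable, which is how an importance ranking over all 67 candidates can be reported.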

Model Development

To develop the ML models, random split validation was used. We selected 80% of the research group samples and 80% of the control group samples as the training dataset; the remaining subset (20%) was reserved as the testing dataset. Five commonly used ML algorithms were used to train the models and diagnose CHD in elderly patients with DM. Extreme gradient boosting (XGBoost) is an ensemble ML algorithm based on decision trees. Random forest (RF) is a combined classifier composed of multiple decision trees and an excellent ensemble learning method. The decision tree (DT) is an ML algorithm that successively introduces features to reduce the uncertainty of the original random variables. Adaptive boosting (Adaboost) integrates weak classifiers into a strong classifier with excellent performance. Logistic regression (LR) is a probabilistic nonlinear regression model, a multivariate analysis method for binary classification.[ Various Python packages were used to conduct this analysis.
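A minimal sketch of the split-and-compare workflow follows, on synthetic data. scikit-learn stands in for all five algorithms; `GradientBoostingClassifier` substitutes for XGBoost here (the separate `xgboost` package is not assumed to be installed), and all hyperparameters are illustrative, not the study's.

```python
# Hedged sketch: stratified 80/20 split and AUC comparison of five classifiers.
from sklearn.datasets import make_classification
from sklearn.ensemble import (AdaBoostClassifier, GradientBoostingClassifier,
                              RandomForestClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the 15 selected clinical features.
X, y = make_classification(n_samples=1000, n_features=15, random_state=0)

# Stratified split keeps the CHD / non-CHD proportions in both subsets,
# mirroring the 80%/80% sampling from research and control groups.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2,
                                          stratify=y, random_state=0)

models = {
    "XGBoost (stand-in)": GradientBoostingClassifier(random_state=0),
    "RF": RandomForestClassifier(n_estimators=100, random_state=0),
    "DT": DecisionTreeClassifier(random_state=0),
    "Adaboost": AdaBoostClassifier(random_state=0),
    "LR": LogisticRegression(max_iter=1000),
}
aucs = {}
for name, model in models.items():
    model.fit(X_tr, y_tr)
    # AUC on held-out data is the headline comparison metric in the paper.
    aucs[name] = roc_auc_score(y_te, model.predict_proba(X_te)[:, 1])
```

The model with the largest held-out AUC would then be carried forward to external validation, as the paper does with XGBoost.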

Model Performance

The performance of the models was evaluated in the standard format, namely the confusion matrix.[ The evaluation indicators included sensitivity, specificity, accuracy, precision, F1-score, receiver operating characteristic curves, and area under the curve (AUC). By comparing these parameters across models, the optimal classification model was identified.
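The indicators listed above all derive from the four cells of the 2x2 confusion matrix. A minimal sketch, with made-up labels and predictions:

```python
# Evaluation indicators from a 2x2 confusion matrix
# (sklearn layout: rows = true class, columns = predicted class).
from sklearn.metrics import confusion_matrix

y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]  # 1 = CHD, 0 = no CHD (toy data)
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 1]

tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
sensitivity = tp / (tp + fn)                    # recall of CHD cases
specificity = tn / (tn + fp)                    # recall of non-CHD cases
precision   = tp / (tp + fp)                    # positive predictive value
accuracy    = (tp + tn) / (tp + tn + fp + fn)
f1 = 2 * precision * sensitivity / (precision + sensitivity)
```

AUC, by contrast, is computed from the continuous predicted probabilities rather than from a single thresholded confusion matrix.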

Statistical Analysis

Continuous variables are presented as mean ± SD or median (interquartile range), and categorical variables are presented as counts (percentages). The Mann-Whitney U test and Pearson's chi-squared test were used to test differences in variables between the research group and the control group. Different Python packages were used to establish the five ML models. The DeLong method, implemented in MedCalc 11.4.2.0 (MedCalc Software, Ostend, Belgium), was used to compare AUCs between models. Two-sided P-values < 0.05 were considered statistically significant. Statistical analysis was performed with SPSS 24.0 (SPSS Inc., IBM, Armonk, NY, USA) and Python 3.7.0 (https://www.python.org/downloads/release/python-370/).
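The DeLong comparison of correlated AUCs can also be sketched directly. This is a plain-NumPy implementation of the standard structural-components formulation of DeLong's test, written for illustration; it is not MedCalc's code, and the toy scores are made up.

```python
# Hedged sketch of DeLong's test for comparing two correlated AUCs.
import math
import numpy as np

def delong_test(y, s1, s2):
    """Two-sided DeLong p-value for the AUC difference between score
    vectors s1 and s2 evaluated on the same binary labels y."""
    y = np.asarray(y)
    pos, neg = np.flatnonzero(y == 1), np.flatnonzero(y == 0)

    def components(s):
        s = np.asarray(s, dtype=float)
        # Mann-Whitney kernel: 1 if pos score > neg score, 0.5 if tied, else 0.
        diff = s[pos][:, None] - s[neg][None, :]
        psi = (diff > 0) + 0.5 * (diff == 0)
        return psi.mean(axis=1), psi.mean(axis=0)  # V10, V01 components

    comps = [components(s) for s in (s1, s2)]
    v10 = np.vstack([c[0] for c in comps])   # one row per model, per positive
    v01 = np.vstack([c[1] for c in comps])   # one row per model, per negative
    aucs = v10.mean(axis=1)                  # AUC = mean of V10 components
    s10, s01 = np.cov(v10), np.cov(v01)
    var = ((s10[0, 0] + s10[1, 1] - 2 * s10[0, 1]) / len(pos)
           + (s01[0, 0] + s01[1, 1] - 2 * s01[0, 1]) / len(neg))
    z = (aucs[0] - aucs[1]) / math.sqrt(var)
    p = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))  # normal tail
    return aucs, p

aucs, p = delong_test([1, 1, 0, 0], [0.9, 0.8, 0.1, 0.4], [0.3, 0.8, 0.2, 0.4])
```

Applied to two models' predicted probabilities on the same test set, this yields the paired AUC comparison the paper reports from MedCalc.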

RESULTS

Clinical Characteristics

A cohort of 23,167 elderly patients with DM was enrolled in our study, including 10,533 patients in the research group and 12,634 patients in the control group. We collected demographic characteristics, laboratory examinations, and history of complications. The clinical characteristics of all patients are summarized in Table 1.
Table 1

Clinical characteristics of all patients.

Characteristics | Research group (n = 10,533) | Control group (n = 12,634) | P-value
Data are presented as n (%). *Presented as median (interquartile range).
Demographics
 Male | 5537 (52.6%) | 6168 (48.8%) | < 0.001
 Age, yrs | 69 (64–74)* | 68 (63–73)* | < 0.001
 Body mass index, kg/m² | 25.7 (23.6–28.0)* | 25.3 (23.1–27.6)* | < 0.001
 Systolic blood pressure, mmHg | 138 (126–150)* | 138 (126–150)* | 0.606
 Diastolic blood pressure, mmHg | 75 (68–82)* | 80 (70–85)* | < 0.001
Blood electrolyte
 Ca²⁺, mmol/L | 2.25 (2.16–2.32)* | 2.22 (2.11–2.31)* | < 0.001
 P, mmol/L | 1.09 (0.93–1.24)* | 1.10 (0.87–1.25)* | 0.002
 Mg²⁺, mmol/L | 0.86 (0.80–0.92)* | 0.85 (0.76–0.91)* | < 0.001
 K⁺, mmol/L | 3.91 (3.63–4.17)* | 3.90 (3.54–4.18)* | < 0.001
 Na⁺, mmol/L | 141.5 (139.1–143.5)* | 141.7 (138.6–143.7)* | 0.032
 Cl⁻, mmol/L | 103.8 (101.2–106.1)* | 103.2 (99.8–105.7)* | < 0.001
 CO₂, mmol/L | 25.9 (23.6–27.9)* | 25.7 (22.9–27.8)* | < 0.001
Glycemic control
 Hemoglobin A1c, % | 6.30 (3.77–7.60)* | 3.78 (3.75–6.80)* | < 0.001
 Glycated serum protein, μmol/L | 171.0 (120.1–230.3)* | 136.2 (120.1–218.1)* | < 0.001
Myocardial enzyme
 Troponin T, ng/mL | 0.006 (0.003–0.016)* | 0.003 (0.003–0.003)* | < 0.001
 Creatine kinase, U/L | 66.3 (45.2–101.3)* | 57.8 (31.6–89.7)* | < 0.001
 Creatine kinase-MB, U/L | 7.2 (0.6–13.9)* | 9.9 (0.5–14.9)* | < 0.001
 Lactate dehydrogenase, U/L | 156.4 (135.4–183.7)* | 150.0 (123.4–175.1)* | < 0.001
 Pro-B-type natriuretic peptide, pg/mL | 59.5 (5.0–315.9)* | 5.0 (5.0–5.1)* | < 0.001
Blood coagulation
 Thrombin time, s | 16.2 (15.4–17.0)* | 16.0 (12.7–16.9)* | < 0.001
 Activated partial thromboplastin time, s | 22.5 (22.3–35.7)* | 22.4 (22.2–34.5)* | < 0.001
 Prothrombin time, s | 13.3 (12.8–14.1)* | 13.3 (12.7–14.3)* | 0.988
 Prothrombin activity, % | 96 (86–105)* | 96 (82–106)* | < 0.001
 International normalized ratio | 1.03 (0.97–1.07)* | 1.03 (0.97–1.07)* | 0.041
 Fibrinogen, g/L | 3.33 (2.81–3.98)* | 3.19 (2.53–3.88)* | < 0.001
 D-dimer, μg/mL | 0.30 (0.01–0.52)* | 0.01 (0.01–0.37)* | < 0.001
Routine blood + C-reactive protein
 Hemoglobin, g/L | 132 (120–143)* | 131 (116–142)* | < 0.001
 Red blood cell count, × 10¹²/L | 4.35 (3.97–4.70)* | 4.30 (3.84–4.67)* | < 0.001
 White blood cell count, × 10⁹/L | 6.42 (5.27–7.73)* | 6.03 (4.77–7.46)* | < 0.001
 Neutrophil, % | 0.61 (0.54–0.68)* | 0.58 (0.50–0.66)* | < 0.001
 Lymphocyte, % | 0.28 (0.21–0.35)* | 0.29 (0.19–0.36)* | 0.225
 Monocyte, % | 0.06 (0.05–0.07)* | 0.06 (0.04–0.07)* | < 0.001
 Eosinophil, % | 0.02 (0.01–0.03)* | 0.02 (0.01–0.03)* | < 0.001
 Basophil, % | 0.003 (0.002–0.005)* | 0.003 (0.002–0.005)* | < 0.001
 Hematocrit, % | 0.40 (0.37–0.43)* | 0.40 (0.37–0.44)* | < 0.001
 Mean corpuscular volume, fL | 90.0 (87.1–92.9)* | 89.4 (85.8–92.4)* | < 0.001
 Mean corpuscular hemoglobin, pg | 30.4 (29.3–31.4)* | 30.2 (28.9–31.3)* | < 0.001
 Mean corpuscular hemoglobin concentration, g/L | 337 (329–344)* | 336 (326–344)* | < 0.001
 Red cell distribution width, % | 12.8 (12.3–13.4)* | 12.8 (12.2–13.3)* | < 0.001
 Platelet count, × 10⁹/L | 198 (162–240)* | 193 (150–237)* | < 0.001
 Mean platelet volume, fL | 10.6 (9.9–11.2)* | 10.4 (9.7–11.2)* | < 0.001
 C-reactive protein, mg/dL | 0.1 (0–0.348)* | 0.0 (0–0.328)* | < 0.001
Routine urine
 Specific gravity | 1.014 (1.010–1.019)* | 1.013 (1.005–1.019)* | < 0.001
 pH | 5.5 (5.0–6.5)* | 5.5 (5.0–6.5)* | < 0.001
Blood biochemical
 Alanine aminotransferase, U/L | 17.2 (12.3–25.6)* | 15.1 (10.1–22.0)* | < 0.001
 Aspartate aminotransferase, U/L | 16.6 (13.4–21.8)* | 15.3 (11.9–19.7)* | < 0.001
 Total protein, g/L | 67.7 (63.5–71.9)* | 65.7 (60.4–70.2)* | < 0.001
 Albumin, g/L | 40.5 (37.7–42.9)* | 39.7 (35.3–42.4)* | < 0.001
 Total bilirubin, μmol/L | 10.1 (7.4–13.5)* | 9.4 (6.4–13.0)* | < 0.001
 Direct bilirubin, μmol/L | 3.1 (2.2–4.3)* | 2.8 (1.8–4.1)* | < 0.001
 Total bile acid, μmol/L | 1.4 (0.002–3.7)* | 2.4 (0.003–4.6)* | < 0.001
 Homocysteine, μmol/L | 0.82 (0.76–13.20)* | 1.10 (0.77–13.10)* | < 0.001
 Alkaline phosphatase, U/L | 65.6 (52.9–79.9)* | 64.7 (48.9–80.7)* | < 0.001
 Glutamyl transpeptidase, U/L | 22.6 (15.8–34.6)* | 19.9 (12.5–32.1)* | < 0.001
 Glucose, mmol/L | 7.02 (5.57–9.46)* | 6.37 (4.97–8.32)* | < 0.001
 Blood urea nitrogen, mmol/L | 5.56 (4.46–6.91)* | 5.29 (4.04–6.59)* | < 0.001
 Creatinine, μmol/L | 70.7 (58.5–84.5)* | 64.1 (51.4–78.0)* | < 0.001
 Uric acid, μmol/L | 296.3 (239.2–359.0)* | 269.0 (196.1–333.7)* | < 0.001
 Total cholesterol, mmol/L | 3.80 (2.99–4.65)* | 3.92 (1.24–4.85)* | 0.290
 Triglyceride, mmol/L | 1.27 (0.85–1.81)* | 1.11 (0.23–1.69)* | < 0.001
 Apolipoprotein A1, g/L | 0.95 (0.23–1.23)* | 0.77 (0.23–1.22)* | < 0.001
 Apolipoprotein B, g/L | 0.61 (0.19–0.88)* | 0.50 (0.18–0.91)* | < 0.001
 High-density lipoprotein cholesterol, mmol/L | 0.99 (0.79–1.19)* | 0.94 (0.08–1.20)* | < 0.001
 Low-density lipoprotein cholesterol, mmol/L | 2.18 (1.54–2.90)* | 2.26 (0.09–3.04)* | < 0.001
 Lipoprotein(a), mg/dL | 0.00 (0–16.82)* | 0.00 (0–15.51)* | 0.543
 Apolipoprotein A1/Apolipoprotein B | 1.28 (1.19–1.51)* | 1.28 (1.18–1.39)* | < 0.001
 Estimated glomerular filtration rate, mL/min per 1.73 m² | 98.2 (80.3–117.9)* | 108.1 (88.2–135.5)* | < 0.001

Predictor Variables

RFE was used to rank the importance of all 67 feature variables, and the top 15 feature variables were selected to construct models. The top 15 feature variables were pro-B-type natriuretic peptide, hemoglobin A1c, troponin T, high-density lipoprotein cholesterol, total bile acid, D-dimer, glycated serum protein, activated partial thromboplastin time, triglyceride, low-density lipoprotein cholesterol, total cholesterol, total protein, creatine kinase, creatine kinase-MB and lymphocyte.

Performance of the Predictive Models on the Test Set

The classification precision in the test set of the XGBoost, RF, DT, Adaboost and LR models was 0.778, 0.789, 0.753, 0.750 and 0.689, respectively, and the corresponding AUCs were 0.851, 0.845, 0.823, 0.833 and 0.731 (Table 2, Figure 1). Among the five models, XGBoost performed best, with the highest AUC of 0.851 (95% CI: 0.841–0.861). The importance scores of the 15 features of XGBoost are shown in Figure 2.
Table 2

Performance of the five models on the testing sets.

Test set | AUC | 95% CI | Precision | Accuracy | F1-score | Sensitivity | Specificity
Adaboost: adaptive boosting; AUC: area under the curve; DT: decision tree; LR: logistic regression; RF: random forest; XGBoost: extreme gradient boosting.
XGBoost | 0.851 | 0.841–0.861 | 0.778 | 0.773 | 0.732 | 0.690 | 0.841
RF | 0.845 | 0.834–0.855 | 0.789 | 0.778 | 0.735 | 0.688 | 0.851
DT | 0.823 | 0.812–0.834 | 0.753 | 0.757 | 0.714 | 0.679 | 0.820
Adaboost | 0.833 | 0.822–0.844 | 0.750 | 0.757 | 0.715 | 0.683 | 0.816
LR | 0.731 | 0.718–0.744 | 0.689 | 0.686 | 0.607 | 0.542 | 0.803
Figure 1

The receiver operating characteristic curves and AUCs of the five prediction models on the testing dataset. Adaboost: adaptive boosting; AUC: area under the curve; DT: decision tree; LR: logistic regression; RF: random forest; XGBoost: extreme gradient boosting.

Figure 2

Ranking of the relative importance of the XGBoost model features. XGBoost: extreme gradient boosting.

Performance of the XGBoost Model on a Newly Recruited Independent Set

The clinical characteristics of the newly recruited subjects are shown in Table 3. When the XGBoost model, which had the best performance, was applied to this newly recruited independent dataset for validation, the diagnostic sensitivity, specificity, precision, and AUC were 0.792, 0.808, 0.748 and 0.880, respectively (Table 4, Figure 3). The XGBoost model thus performed well in diagnosing CHD in elderly patients with DM.
Table 3

Clinical characteristics of newly recruited subjects.

Characteristics | Research group (n = 3116) | Control group (n = 4331) | P-value
Data are presented as n (%). *Presented as median (interquartile range).
Demographics
 Male | 1816 (58.3%) | 2321 (53.6%) | < 0.001
 Age, yrs | 68 (64–74)* | 67 (64–72)* | < 0.001
 Body mass index, kg/m² | 25.6 (23.5–27.7)* | 25.3 (23.2–27.4)* | < 0.001
 Systolic blood pressure, mmHg | 138 (124–150)* | 138 (126–150)* | 0.082
 Diastolic blood pressure, mmHg | 75 (67–82)* | 78 (70–85)* | < 0.001
Blood electrolyte
 Ca²⁺, mmol/L | 2.25 (2.17–2.33)* | 2.22 (2.11–2.3)* | < 0.001
 P, mmol/L | 1.08 (0.93–1.21)* | 1.10 (0.90–1.24)* | 0.209
 Mg²⁺, mmol/L | 0.85 (0.79–0.91)* | 0.83 (0.75–0.89)* | < 0.001
 K⁺, mmol/L | 3.84 (3.58–4.11)* | 3.88 (3.48–4.19)* | 0.326
 Na⁺, mmol/L | 141.1 (139.1–142.7)* | 141.1 (138.4–143.0)* | 0.238
 Cl⁻, mmol/L | 102.8 (100.4–104.8)* | 102.0 (98.7–104.5)* | < 0.001
 CO₂, mmol/L | 26.6 (24.6–28.6)* | 26 (23.3–27.9)* | < 0.001
Glycemic control
 Hemoglobin A1c, % | 6.8 (3.8–7.9)* | 3.8 (3.8–6.5)* | < 0.001
 Glycated serum protein, μmol/L | 180.4 (129.9–221.2)* | 172.8 (139.9–201.5)* | < 0.001
Myocardial enzyme
 Troponin T, ng/mL | 0.009 (0.004–0.019)* | 0.003 (0.003–0.008)* | < 0.001
 Creatine kinase, U/L | 67.1 (45.9–99.6)* | 61.1 (35.5–93.8)* | < 0.001
 Creatine kinase-MB, U/L | 6.0 (0.5–11.7)* | 10.2 (0.5–15.8)* | < 0.001
 Lactate dehydrogenase, U/L | 159.7 (137.7–187.0)* | 152.9 (126.4–177.9)* | < 0.001
 Pro-B-type natriuretic peptide, pg/mL | 103.9 (26.0–355.7)* | 5.0 (5.0–76.0)* | < 0.001
Blood coagulation
 Thrombin time, s | 15.8 (15.1–16.5)* | 15.7 (14.7–16.5)* | < 0.001
 Activated partial thromboplastin time, s | 22.28 (22.16–22.40)* | 22.22 (22.06–22.34)* | < 0.001
 Prothrombin time, s | 13.3 (12.8–13.9)* | 13.3 (12.7–14.4)* | 0.048
 Prothrombin activity, % | 97 (88–105)* | 96 (82–107)* | 0.006
 International normalized ratio | 1.02 (0.98–1.07)* | 1.03 (0.97–1.07)* | 0.854
 Fibrinogen, g/L | 3.26 (2.79–3.85)* | 3.00 (2.34–3.61)* | < 0.001
 D-dimer, μg/mL | 0.33 (0.23–0.55)* | 0.26 (0.01–0.47)* | < 0.001
Routine blood + C-reactive protein
 Hemoglobin, g/L | 132 (121–143)* | 129 (114–141)* | < 0.001
 Red blood cell count, × 10¹²/L | 4.34 (3.97–4.68)* | 4.25 (3.75–4.61)* | < 0.001
 White blood cell count, × 10⁹/L | 6.40 (5.18–7.65)* | 5.92 (4.61–7.37)* | < 0.001
 Neutrophil, % | 0.61 (0.54–0.67)* | 0.58 (0.49–0.66)* | < 0.001
 Lymphocyte, % | 0.28 (0.21–0.34)* | 0.28 (0.17–0.35)* | 0.008
 Monocyte, % | 0.06 (0.05–0.08)* | 0.06 (0.05–0.07)* | < 0.001
 Eosinophil, % | 0.02 (0.01–0.03)* | 0.02 (0.01–0.03)* | < 0.001
 Basophil, % | 0.004 (0.003–0.006)* | 0.004 (0.002–0.006)* | < 0.001
 Hematocrit, % | 0.39 (0.36–0.42)* | 0.40 (0.36–0.43)* | < 0.001
 Mean corpuscular volume, fL | 88.0 (85.4–90.9)* | 87.7 (84.1–90.8)* | < 0.001
 Mean corpuscular hemoglobin, pg | 30.4 (29.3–31.5)* | 30.3 (28.8–31.4)* | < 0.001
 Mean corpuscular hemoglobin concentration, g/L | 344 (336–351)* | 342 (332–351)* | < 0.001
 Red cell distribution width, % | 12.6 (12.1–13.1)* | 12.4 (11.9–13.0)* | < 0.001
 Platelet count, × 10⁹/L | 202 (164–245)* | 197 (149–239)* | < 0.001
 Mean platelet volume, fL | 10.5 (9.9–11.2)* | 10.3 (9.5–11.1)* | < 0.001
 C-reactive protein, mg/dL | 0.097 (0.050–0.183)* | 0.05 (0–0.117)* | < 0.001
Routine urine
 Specific gravity | 1.014 (1.010–1.019)* | 1.012 (1.000–1.019)* | < 0.001
 pH | 6.0 (5.5–6.5)* | 5.5 (5.0–6.0)* | < 0.001
Blood biochemical
 Alanine aminotransferase, U/L | 16.3 (11.7–23.7)* | 14.3 (9.5–20.7)* | < 0.001
 Aspartate aminotransferase, U/L | 15.9 (13.0–20.6)* | 14.8 (11.3–18.9)* | < 0.001
 Total protein, g/L | 67.8 (64.0–71.8)* | 65.4 (60.1–69.5)* | < 0.001
 Albumin, g/L | 40.7 (38.0–43.1)* | 40.1 (35.4–42.8)* | < 0.001
 Total bilirubin, μmol/L | 10.4 (7.5–13.9)* | 9.8 (6.6–13.5)* | < 0.001
 Direct bilirubin, μmol/L | 3.0 (2.0–4.2)* | 2.8 (1.7–4.0)* | < 0.001
 Total bile acid, μmol/L | 1.2 (0.001–1.7)* | 1.8 (0.002–4.1)* | < 0.001
 Homocysteine, μmol/L | 0.79 (0.75–11.70)* | 0.83 (0.76–12.40)* | < 0.001
 Alkaline phosphatase, U/L | 62.9 (50.6–76.8)* | 61.0 (46.3–76.1)* | < 0.001
 Glutamyl transpeptidase, U/L | 22.0 (15.3–32.5)* | 18.9 (11.8–29.6)* | < 0.001
 Glucose, mmol/L | 7.07 (5.66–9.57)* | 6.37 (4.85–8.09)* | < 0.001
 Blood urea nitrogen, mmol/L | 5.44 (4.45–6.79)* | 5.07 (3.78–6.37)* | < 0.001
 Creatinine, μmol/L | 72.6 (60.3–86.4)* | 65.7 (51.8–79.6)* | < 0.001
 Uric acid, μmol/L | 303.3 (245.4–367.7)* | 275.6 (203.1–341.6)* | < 0.001
 Total cholesterol, mmol/L | 3.45 (2.69–4.25)* | 3.28 (1.22–4.34)* | < 0.001
 Triglyceride, mmol/L | 1.25 (0.85–1.77)* | 0.95 (0.22–1.56)* | < 0.001
 Apolipoprotein A1, g/L | 0.89 (0.23–1.21)* | 0.24 (0.23–1.22)* | 0.066
 Apolipoprotein B, g/L | 0.45 (0.19–0.79)* | 0.21 (0.18–0.84)* | 0.120
 High-density lipoprotein cholesterol, mmol/L | 0.98 (0.77–1.18)* | 0.87 (0.07–1.15)* | < 0.001
 Low-density lipoprotein cholesterol, mmol/L | 2.02 (1.38–2.76)* | 1.83 (0.08–2.83)* | < 0.001
 Lipoprotein(a), mg/dL | 0.00 (0–0.65)* | 0.00 (0–8.92)* | < 0.001
 Apolipoprotein A1/Apolipoprotein B | 1.28 (1.19–1.57)* | 1.28 (1.18–1.43)* | < 0.001
 Estimated glomerular filtration rate, mL/min per 1.73 m² | 95.9 (78.9–115.1)* | 105.4 (87.1–134.3)* | < 0.001
Table 4

Performance of the five models on the validation sets.

Model | AUC | 95% CI | Precision | Accuracy | F1-score | Sensitivity | Specificity
Adaboost: adaptive boosting; AUC: area under the curve; DT: decision tree; LR: logistic regression; RF: random forest; XGBoost: extreme gradient boosting.
XGBoost | 0.880 | 0.872–0.887 | 0.748 | 0.801 | 0.769 | 0.792 | 0.808
RF | 0.877 | 0.869–0.884 | 0.771 | 0.811 | 0.776 | 0.781 | 0.833
DT | 0.858 | 0.849–0.865 | 0.770 | 0.802 | 0.760 | 0.750 | 0.839
Adaboost | 0.869 | 0.862–0.877 | 0.692 | 0.772 | 0.751 | 0.820 | 0.737
LR | 0.797 | 0.788–0.807 | 0.682 | 0.741 | 0.698 | 0.715 | 0.761
Figure 3

The receiver operating characteristic curves and AUCs of the five prediction models on the newly recruited dataset. Adaboost: adaptive boosting; AUC: area under the curve; DT: decision tree; LR: logistic regression; RF: random forest; XGBoost: extreme gradient boosting.

DISCUSSION

Based on data from the Medical Big Data Research Centre of Chinese PLA General Hospital in Beijing, China, the present study collected hospitalization information for tens of thousands of elderly patients with DM over the past ten years. Five ML algorithms were used to build a diagnostic model of CHD in elderly patients with DM, and the XGBoost model was ultimately found to best distinguish patients with CHD, with good performance. Researchers have explored the diagnosis of CHD based on ML algorithms.[ A review published in 2019 examined 149 research articles related to ML-based CHD detection.[ The 67 relevant datasets came from 18 countries and regions across three continents; sample sizes ranged from 20 to 240,000, with a median of approximately 350. Similar to our study, Fan, et al.[ trained an artificial intelligence model using RF to predict CHD risk among patients with DM; 1273 patients were enrolled in that study. The researchers selected the top eight features (age, heart rate, diastolic blood pressure, blood platelet, low-density lipoprotein cholesterol, total cholesterol level, course of hypertension and course of DM) from a total of 50 to develop the model, which achieved an AUC of 0.77 on the training dataset and 0.80 on the testing dataset. In our study, a dataset of 28,059 elderly patients with DM was constructed; our sample size was larger, and our model performed better. Results from different studies cannot be generalized because of differences in the analyzed datasets, sample sizes, features, data collection areas, performance metrics, and applied ML algorithms.
Feature selection (selecting subsets of relevant features for model development) has a significant impact on model performance.[ Many feature-selection methods have been used for model creation, including information gain, correlation, principal component analysis, the Gini index and the genetic algorithm.[ In our study, we applied RFE to filter out irrelevant features. RFE trains a model and removes the weakest feature (or features) until a specified number of features remains; the features are ranked by the model's feature_importances_ attribute. In contrast to other methods, RFE takes the final desired number of features as input and then recursively reduces the feature set, attempting to eliminate dependencies and collinearity that may exist in the model. A variety of feature categories have been used to detect CHD. Some datasets contain only ECG features,[ while others contain demographic, laboratory, symptom and examination, fluoroscopy and echo features.[ In most CHD datasets, demographic features such as age and sex are widely used, whereas other features, such as genetic features, are seldom used. Most datasets record only whether the patients have CHD; few contain information on the stenosis severity of the three main coronary arteries, which is the main limitation of these datasets. In our study, the variables included in the screening were mostly demographic characteristics and laboratory examinations, which are much easier to collect than other categories of features. Unfortunately, with the ML method, we did not find a new specific index that can reliably identify CHD in elderly patients with DM.
However, we still believe that an auxiliary diagnosis system based on such research will effectively serve the clinic in the future and play a positive role in the screening, early warning and early diagnosis of CHD in elderly patients with DM. Among the five models, RF, Adaboost and XGBoost showed better prediction performance. The RF algorithm performs random sampling, and the trained model has small variance and strong generalization ability, but on sample sets with relatively large noise, the RF model is prone to overfitting. Adaboost has high accuracy and fully considers the weight of each classifier; however, when the data are imbalanced, the classification accuracy of the model decreases. XGBoost adds a regularization term to the objective function to control the complexity of the model and keep the learned model simple. XGBoost also draws on the practice of RF and supports column sampling, which not only reduces overfitting but also reduces the computational workload. The three ensemble algorithms used in this study minimized error, improved prediction accuracy, and identified latent variables that are not easily observed, but the "black box" character of these models makes them difficult to explain: they cannot describe the inherent complexity of how risk-factor variables interact or their independent effects on outcomes. In the testing set, the sensitivity of the XGBoost model was not ideal (0.690); however, in the validation set, the sensitivity was 0.792, reflecting the good transportability and generalizability of the XGBoost model. In the future, we will incorporate richer features and try more algorithms to further improve the performance of the model.

STRENGTHS AND LIMITATIONS

The strengths of this study are its specific study population, large sample size, and multiple ML models. We collected the hospitalization information of tens of thousands of elderly patients with DM from the past ten years, used five ML algorithms to establish models, and assessed their performance with external validation in thousands of patients, obtaining a prediction model for CHD in elderly patients with DM. However, some limitations must be noted. Firstly, the variables included in the screening were mostly laboratory examinations; including symptoms, new biomarkers, environmental factors, ECGs, cardiac ultrasounds and other data may further improve the predictive efficiency of the model. Secondly, doctors may miss diagnoses when writing medical records, a limitation inherent to any study based on medical records. Last but not least, the datasets for training and validation were from the same hospital; if the research were conducted in multiple centres, the models would be more substantial and robust. Using larger datasets, more features and more ML approaches may achieve better results.

CONCLUSIONS

In summary, this study established an ML model to predict CHD in elderly patients with DM, which may provide a reference for the early detection and intervention of CHD in this population.
References (30 in total)

1.  A new machine learning technique for an accurate diagnosis of coronary artery disease.

Authors:  Moloud Abdar; Wojciech Książek; U Rajendra Acharya; Ru-San Tan; Vladimir Makarenkov; Paweł Pławiak
Journal:  Comput Methods Programs Biomed       Date:  2019-07-24       Impact factor: 5.428

2.  Computer aided decision making for heart disease detection using hybrid neural network-Genetic algorithm.

Authors:  Zeinab Arabasadi; Roohallah Alizadehsani; Mohamad Roshanzamir; Hossein Moosaei; Ali Asghar Yarifard
Journal:  Comput Methods Programs Biomed       Date:  2017-01-18       Impact factor: 5.428

Review 3.  Global aetiology and epidemiology of type 2 diabetes mellitus and its complications.

Authors:  Yan Zheng; Sylvia H Ley; Frank B Hu
Journal:  Nat Rev Endocrinol       Date:  2017-12-08       Impact factor: 43.330

4.  Diabetes mellitus, fasting blood glucose concentration, and risk of vascular disease: a collaborative meta-analysis of 102 prospective studies.

Authors:  N Sarwar; P Gao; S R Kondapally Seshasai; R Gobin; S Kaptoge; E Di Angelantonio; E Ingelsson; D A Lawlor; E Selvin; M Stampfer; C D A Stehouwer; S Lewington; L Pennells; A Thompson; N Sattar; I R White; K K Ray; J Danesh
Journal:  Lancet       Date:  2010-06-26       Impact factor: 202.731

5.  Comprehensive electrocardiographic diagnosis based on deep learning.

Authors:  Oh Shu Lih; V Jahmunah; Tan Ru San; Edward J Ciaccio; Toshitaka Yamakawa; Masayuki Tanabe; Makiko Kobayashi; Oliver Faust; U Rajendra Acharya
Journal:  Artif Intell Med       Date:  2020-01-20       Impact factor: 5.326

Review 6.  10. Cardiovascular Disease and Risk Management: Standards of Medical Care in Diabetes-2020.

Authors: 
Journal:  Diabetes Care       Date:  2020-01       Impact factor: 19.112

7.  Machine Learning to Predict the Risk of Incident Heart Failure Hospitalization Among Patients With Diabetes: The WATCH-DM Risk Score.

Authors:  Matthew W Segar; Muthiah Vaduganathan; Kershaw V Patel; Darren K McGuire; Javed Butler; Gregg C Fonarow; Mujeeb Basit; Vaishnavi Kannan; Justin L Grodin; Brendan Everett; Duwayne Willett; Jarett Berry; Ambarish Pandey
Journal:  Diabetes Care       Date:  2019-09-13       Impact factor: 19.112

8.  Association Between Diabetes and Cause-Specific Mortality in Rural and Urban Areas of China.

Authors:  Fiona Bragg; Michael V Holmes; Andri Iona; Yu Guo; Huaidong Du; Yiping Chen; Zheng Bian; Ling Yang; William Herrington; Derrick Bennett; Iain Turnbull; Yongmei Liu; Shixian Feng; Junshi Chen; Robert Clarke; Rory Collins; Richard Peto; Liming Li; Zhengming Chen
Journal:  JAMA       Date:  2017-01-17       Impact factor: 56.272

9.  Multimorbidity, health care utilization and costs in an elderly community-dwelling population: a claims data based observational study.

Authors:  Caroline Bähler; Carola A Huber; Beat Brüngger; Oliver Reich
Journal:  BMC Health Serv Res       Date:  2015-01-22       Impact factor: 2.655

10.  AI-based prediction for the risk of coronary heart disease among patients with type 2 diabetes mellitus.

Authors:  Rui Fan; Ning Zhang; Longyan Yang; Jing Ke; Dong Zhao; Qinghua Cui
Journal:  Sci Rep       Date:  2020-09-02       Impact factor: 4.379

