| Literature DB >> 35309227 |
Qian Xu1,2,3, Yan Peng4, Juntao Tan5, Wenlong Zhao1,2, Meijie Yang1, Jie Tian2,6,7.
Abstract
Background: The objective of this study was to use machine learning algorithms to construct predictive models for atrial fibrillation (AF) in elderly patients with coronary heart disease (CHD) and type 2 diabetes mellitus (T2DM).Entities:
Keywords: atrial fibrillation; coronary heart disease; machine learning; prediction models; type 2 diabetes mellitus
Mesh:
Year: 2022 PMID: 35309227 PMCID: PMC8931193 DOI: 10.3389/fpubh.2022.842104
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Flow of inclusions and exclusions.
Comparison of continuous variables before and after interpolation.
|
|
|
|
|
|---|---|---|---|
| DBP (IQR, mmHg) | 77.00 (69.00, 85.00) | 77.00 (69.00, 85.00) | 0.894 |
| GGT (IQR, IU/L) | 26.00 (18.00, 44.00) | 26.00 (18.00, 44.67) | 0.760 |
| WBC (IQR, × 109/L) | 6.78 (5.52, 8.56) | 6.78 (5.52, 8.56) | 1.000 |
| NLR (IQR) | 3.33 (2.32, 5.51) | 3.33 (2.32, 5.52) | 0.969 |
| LMR (IQR) | 3.63 (2.37, 5.23) | 3.63 (2.37, 5.23) | 0.999 |
| Blood creatinine | 73.35 (57.70, 97.73) | 73.35 (57.70, 97.78) | 0.981 |
| TB (IQR, umol/l) | 10.30 (7.60, 14.00) | 10.30 (7.60, 14.00) | 0.944 |
| UA (IQR, umol/L) | 339.30 (273.48, 417.03) | 338.90 (273.30, 417.08) | 0.962 |
| UN (IQR, mmol/L) | 6.54 (5.20, 8.58) | 6.54 (5.20, 8.57) | 0.973 |
| LDL (IQR, mmol/L) | 2.23 (1.67, 2.91) | 2.24 (1.67, 2.91) | 0.871 |
| Hemoglobin (IQR, g/L) | 127.00 (115.00, 138.00) | 127.00 (115.00, 138.00) | 1.000 |
| PLR (IQR) | 128.43 (94.69, 178.29) | 127.98 (94.40, 177.71) | 0.776 |
| ALT (IQR, IU/L) | 18.16 (13.00, 27.14) | 18.12 (13.00, 27.58) | 0.987 |
| ALB (IQR, g/L) | 39.50 (36.50, 42.60) | 39.50 (36.50, 42.60) | 0.966 |
| TGs (IQR, mmol/L) | 1.41 (1.02, 2.02) | 1.42 (1.02, 2.03) | 0.774 |
| TC (IQR, mmol/L) | 4.10 (3.37, 4.96) | 4.09 (3.37, 4.96) | 0.958 |
| HDL (IQR, mmol/L) | 1.10 (0.91, 1.33) | 1.09 (0.91, 1.32) | 0.587 |
| HbA1c (IQR, %) | 7.41 (6.60, 9.04) | 7.44 (6.60, 9.10) | 0.743 |
| ALP (IQR, IU/L) | 74.00 (61.00, 92.00) | 74.00 (61.00, 92.00) | 0.878 |
| AST (IQR, IU/L) | 20.20 (16.00, 27.00) | 20.20 (16.01, 27.00) | 0.968 |
| Blood potassium (IQR, mmol/L) | 4.02 (3.72, 4.34) | 4.02 (3.71, 4.33) | 0.639 |
| Blood calcium (IQR, mmol/L) | 2.23 (2.13, 2.33) | 2.23 (2.13, 2.33) | 0.921 |
| Blood phosphorus (IQR, mmol/L) | 1.09 (0.95, 1.24) | 1.09 (0.94, 1.23) | 0.160 |
DBP, diastolic blood pressure; GGT, γ-glutamyltransferase; WBC, white blood cell; NLR, neutrophil-to-lymphocyte ratio; LMR, lymphocyte-to-monocyte ratio; TB, total bilirubin; UA, uric acid; UN, urea nitrogen; LDL, low-density lipoprotein; PLR, platelet-lymphocyte ratio; ALT, alanine aminotransferase; ALB, albumin; TGs, triglycerides; TC, total cholesterol; HDL, high-density lipoprotein; HbA1c, glycated hemoglobin; ALP, alkaline phosphatase; AST, aspartate aminotransferase; IQR, interquartile range.
Univariate analyses of variables associated with AF.
|
|
|
|
|
|---|---|---|---|
| Gender ( | 119 (44.07%) | 924 (43.90%) | 0.999 |
| Smoking status ( | 78 (28.89%) | 574 (27.27%) | 0.625 |
| Drinking status ( | 55 (20.37%) | 411 (19.52%) | 0.804 |
| Hypertension ( | 206 (76.30%) | 1,670 (79.33%) | 0.283 |
| Age (IQR, years) | 78.00 (73.00, 83.00) | 75.00 (70.00, 81.00) | <0.001 |
| DBP (IQR, mmHg) | 79.00 (71.00, 88.00) | 77.00 (69.00, 85.00) | 0.005 |
| GGT (IQR, IU/L) | 36.00 (22.55, 69.00) | 25.00 (17.60, 43.00) | <0.001 |
| WBC (IQR, × 109/L) | 6.82 (5.57, 8.39) | 6.73 (5.53, 8.50) | 0.848 |
| NLR (IQR) | 4.44 (2.74, 6.36) | 3.23 (2.28, 5.29) | <0.001 |
| LMR (IQR) | 3.18 (2.02, 4.69) | 3.81 (2.46, 5.44) | <0.001 |
| Blood creatinine | 80.30 (62.78, 105.18) | 72.40 (56.90, 97.20) | 0.001 |
| TB (IQR, umol/l) | 12.30 (9.20, 17.55) | 10.00 (7.40, 13.70) | <0.001 |
| UA (IQR, umol/L) | 370.40 (296.43, 469.68) | 332.10 (270.40, 410.00) | <0.001 |
| UN (IQR, mmol/L) | 7.09 (5.41, 9.46) | 6.51 (5.23, 8.39) | 0.001 |
| LDL (IQR, mmol/L) | 1.92 (1.47, 2.62) | 2.27 (1.69, 2.95) | <0.001 |
| Hemoglobin (IQR, g/L) | 125.00 (112.25, 137.75) | 127.00 (115.00, 139.00) | 0.377 |
| PLR (IQR) | 129.96 (95.24, 180.14) | 127.95 (94.27, 175.86) | 0.356 |
| ALT (IQR, IU/L) | 19.00 (12.02, 28.00) | 18.00 (13.00, 27.00) | 0.722 |
| ALB (IQR, g/L) | 38.40 (35.79, 41.00) | 39.60 (36.60, 42.70) | <0.001 |
| TGs (IQR, mmol/L) | 1.17 (0.83, 1.72) | 1.42 (1.04, 2.03) | <0.001 |
| TC (IQR, mmol/L) | 3.64 (3.02, 4.54) | 4.16 (3.41, 4.97) | <0.001 |
| HDL (IQR, mmol/L) | 1.05 (0.85, 1.27) | 1.10 (0.92, 1.32) | 0.007 |
| HbA1c (IQR, %) | 7.35 (6.50, 8.50) | 7.40 (6.60, 9.00) | 0.129 |
| ALP (IQR, IU/L) | 74.80 (58.43, 96.08) | 74.00 (61.00, 91.90) | 0.579 |
| AST (IQR, IU/L) | 22.17 (17.25, 31.23) | 20.00 (16.00, 27.00) | 0.001 |
| Blood potassium (IQR, mmol/L) | 3.97 (3.69, 4.36) | 4.02 (3.72, 4.32) | 0.694 |
| Blood calcium (IQR, mmol/L) | 2.21 (2.11, 2.29) | 2.24 (2.14, 2.34) | 0.001 |
| Blood phosphorus (IQR, mmol/L) | 1.09 (0.96, 1.25) | 1.10 (0.95, 1.24) | 0.942 |
AF, atrial fibrillation; DBP, diastolic blood pressure; GGT, γ-glutamyltransferase; WBC, white blood cell; NLR, neutrophil-to-lymphocyte ratio; LMR, lymphocyte-to-monocyte ratio; TB, total bilirubin; UA, uric acid; UN, urea nitrogen; LDL, low-density lipoprotein; PLR, platelet-lymphocyte ratio; ALT, alanine aminotransferase; ALB, albumin; TGs, triglycerides; TC, total cholesterol; HDL, high-density lipoprotein; HbA1c, glycated hemoglobin; ALP, alkaline phosphatase; AST, aspartate aminotransferase; IQR, interquartile range.
Figure 2Receiver operating characteristic (ROC) curves of five different models in internal validation cohort (A) and external validation cohort (B).
Detailed performance metrics for the five models in internal validation.
|
|
|
|
|
|
|---|---|---|---|---|
| LR | 0.602 | 0.726 | 0.712 | 0.684 (0.629–0.739) |
| LR+LASSO | 0.694 | 0.627 | 0.633 | 0.712 (0.659–0.765) |
| CART | 0.676 | 0.626 | 0.631 | 0.686 (0.632–0.739) |
| RF | 0.611 | 0.753 | 0.735 | 0.733 (0.683–0.783) |
| XGBoost | 0.833 | 0.562 | 0.587 | 0.743 (0.693–0.792) |
LR, logistic regression; CART, Classification and Regression Tree; RF, random forest; AUC, area under the receiver operating characteristic curve; CI, confidence interval.
Detailed performance metrics for the five models in external validation.
|
|
|
|
|
|
|---|---|---|---|---|
| LR | 0.529 | 0.746 | 0.722 | 0.621 (0.532–0.711) |
| LR+LASSO | 0.353 | 0.889 | 0.828 | 0.630 (0.659–0.765) |
| CART | 0.157 | 0.925 | 0.841 | 0.523 (0.444–0.601) |
| RF | 0.686 | 0.743 | 0.733 | 0.726 (0.655–0.797) |
| XGBoost | 0.627 | 0.770 | 0.754 | 0.705 (0.628–0.781) |
LR, logistic regression; CART, Classification and Regression Tree; RF, random forest; AUC, area under the receiver operating characteristic curve; CI, confidence interval.
Ranks of feature importance in RF, XGBoost, and LR+LASSO for predicting AF.
|
|
|
|
|
|---|---|---|---|
| 1 | TB | TGs | TC |
| 2 | GGT | UA | TGs |
| 3 | TGs | TC | Age |
| 4 | UA | GGT | TB |
| 5 | DBP | TB | UA |
LR, logistic regression; RF, random forest; DBP, diastolic blood pressure; GGT, γ-glutamyltransferase; TB, total bilirubin; UA, uric acid; TGs, triglycerides; TC, total cholesterol.
Figure 3Features selection by LASSO. (A) LASSO coefficients profiles (y-axis) of the 16 features. The upper x-axis is the average numbers of predictors and the lower x-axis is the log (λ). (B) Five-fold cross-validation for tuning parameter selection in the LASSO model.
Figure 4Importance analysis of indexes in RF model.
Figure 5Importance analysis of indexes in XGBoost model.