| Literature DB >> 35330437 |
Chao Li1, Guanhua Dou2, Yipu Ding3, Ran Xin3, Jing Wang2, Jun Guo4, Yundai Chen4, Junjie Yang4.
Abstract
BACKGROUND: Transesophageal echocardiography (TEE) is the first technique of choice for evaluating the left atrial appendage flow velocity (LAAV) in clinical practice, which may cause some complications. Therefore, clinicians require a simple applicable method to screen patients with decreased LAAV. Therefore, we investigated the feasibility and accuracy of a machine learning (ML) model to predict LAAV.Entities:
Keywords: atrial fibrillation; flow velocity; left atrial appendage; machine learning
Year: 2022 PMID: 35330437 PMCID: PMC8954392 DOI: 10.3390/jpm12030437
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1Flow diagram. TEE: transesophageal echocardiography; LAAV: left atrial appendage flow velocity.
Baseline characteristics of the records of the enrolled patients with decreased LAAV and normal LAAV.
| ALL | Normal LAAV | Decreased LAAV | ||
|---|---|---|---|---|
| Age (years) | 62 (54–69) | 62 (54–68) | 64 (54–71.5) | 0.022 |
| Gender (male) | 757 (72.9%) | 663 (72.5%) | 94 (75.2%) | 0.530 |
| BMI (kg/m2) | 26 (23–28) | 25.8 (23.8–28.0) | 26.9 (24.4–29.3) | 0.010 |
| Persistent AF | 336 (32.3%) | 253 (27.7%) | 83 (66.4%) | <0.001 |
| Antithrombotic therapy | <0.001 | |||
| No antithrombotic therapy | 528 (50.8%) | 478 (52.3%) | 50 (40.0%) | |
| Aspirin | 130 (12.5%) | 113 (12.4%) | 17 (13.6%) | |
| Clopidogrel | 11 (1.1%) | 8 (0.9%) | 3 (2.4%) | |
| Dual antiplatelet | 21 (2.0%) | 18 (2.0%) | 3 (2.4%) | |
| Warfarin | 95 (9.1%) | 76 (8.3%) | 19 (15.2%) | |
| NOAC | 254 (24.5%) | 221 (24.2%) | 33 (26.4%) | |
| Heart failure | 108 (10.4%) | 67 (7.3%) | 41 (32.8%) | <0.001 |
| Hypertension | 553 (53.2%) | 474 (51.9%) | 79 (63.2%) | 0.017 |
| Diabetes | 239 (23.0%) | 200 (21.9%) | 39 (31.2%) | 0.020 |
| Stroke | 154 (14.8%) | 127 (13.9%) | 27 (21.6%) | 0.023 |
| Vascular disease | 92 (8.9%) | 83 (9.1%) | 9 (7.2%) | 0.487 |
| NT-pro BNP (pg/mL) | 319 (116–770) | 266 (103–635) | 997 (590.5–1862) | <0.001 |
| LA diameter (mm) | 39 (35–43) | 38 (35–42) | 43 (40–47.5) | <0.001 |
NOAC: New oral anticoagulants, include dabigatran and rivaroxaban; LA diameter: left atrial diameter; BMI: Body Mass Index.
The characteristics of training set and testing set.
| Feature | Training Set | Testing Set | |
|---|---|---|---|
| Age (years) | 62 (54–69) | 61 (54–68.5) | 0.357 |
| Gender(male) | 592 (71.0%) | 165 (80.5%) | 0.006 |
| BMI (kg/m2) | 26 (23.9–28.1) | 26 (24.2–28.2) | 0.772 |
| Persistent AF | 264 (31.7%) | 72 (35.1%) | 0.342 |
| Antithrombotic therapy | 0.68 | ||
| No antithrombotic therapy | 426 (51.2%) | 102 (49.8%) | |
| Aspirin | 104 (12.5%) | 26 (12.7%) | |
| Clopidogrel | 10 (1.2%) | 1 (0.5%) | |
| Dual antiplatelet | 19 (2.3%) | 2 (1%) | |
| Warfarin | 72 (8.6%) | 23 (11.2%) | |
| NOAC | 203 (24.3%) | 51 (24.8%) | |
| Heart failure | 82 (9.8%) | 26 (12.7%) | 0.231 |
| Hypertension | 449 (53.8%) | 104 (50.7%) | 0.425 |
| Diabetes | 191 (22.9%) | 48 (23.4%) | 0.876 |
| Stroke | 124 (14.9%) | 30 (14.6%) | 0.933 |
| Vascular disease | 78 (9.4%) | 14 (6.8%) | 0.255 |
| NT-pro BNP (pg/mL) | 309 (111–725) | 401 (150–885) | 0.061 |
| LA size (mm) | 39 (35–43) | 39 (36–42) | 0.446 |
| LAAV (cm/s) | 46 (33–60.25) | 46 (36–42) | 0.727 |
| Decreased LAAV | 97 (11.6%) | 28 (13.7%) | 0.424 |
Figure 2Development of KNN Model and Random Forest Model. (A) Development of KNN model; (B) Development of Random Forest model.
Comparison of the predictive performance for three models in the training set.
| Model | Accuracy (%) | F1 Score | AUC | Cut-off | R2 | RMSE (cm/s) | MAE (cm/s) |
|---|---|---|---|---|---|---|---|
| KNN | 76 | 0.44 | 0.81 (0.76–0.85) | 41.3 | 0.26 | 17.29 | 13.80 |
| RF | 92 | 0.77 | 0.98 (0.97–0.99) | 33.8 | 0.85 | 7.85 | 5.97 |
| SVM | 84 | 0.58 | 0.91 (0.87–0.94) | 37.1 | 0.23 | 17.66 | 13.68 |
KNN, k-nearest neighbor; RF, random forest; SVM, support vector machine; AUC, area under the receiver operating characteristic curve; R2, R Squared; RMSE, root-mean-square error; MAE, mean absolute deviation.
Comparison of the predictive performance for three models in the testing set.
| Model | Accuracy | F1 Score | AUC | Specificity | Sensitivity | R2 | RMSE (cm/s) | MAE (cm/s) |
|---|---|---|---|---|---|---|---|---|
| KNN | 63 | 0.43 | 0.81 (0.73–0.89) | 58 | 93.5 | 0.24 | 17.51 | 13.40 |
| RF | 85 | 0.62 | 0.89 (0.83–0.95) | 86 | 81 | 0.31 | 16.65 | 13.04 |
| SVM | 67 | 0.48 | 0.87 (0.82–0.93) | 62 | 100 | 0.23 | 17.66 | 13.68 |
KNN, k-nearest neighbor; RF, random forest; SVM, support vector machine; AUC, area under the receiver operating characteristic curve; R2, R Squared; RMSE, root-mean-square error; MAE, mean absolute deviation.
Figure 3Calibration plots for the KNN(A), RF(B), and SVM(C) models. (KNN, k-Nearest Neighbor; RF, random forest; SVM, support vector machine). All models indicated appropriate calibration (p > 0.05). (A) Calibration plot for the KNN model; (B) Calibration plot for the RF model; (C) Calibration plot for the SVM model.
Comparison of the predictive performance for three models (fivefold cross-validation).
| Model | AUCROC | R2 | RMSE (cm/s) | MAE (cm/s) |
|---|---|---|---|---|
| KNN | 0.806 | 0.24 | 17.43 | 13.40 |
| RF | 0.854 | 0.735 | 10.28 | 7.44 |
| SVM | 0.84 | 0.39 | 15.76 | 11.92 |
Figure 4The importance of variables in Random Forest model. BMI: body mass index; DM: diabetes mellitus.