| Literature DB >> 33148619 |
Yan Li1, Matthew Sperrin1, Darren M Ashcroft2,3, Tjeerd Pieter van Staa1,4,5.
Abstract
OBJECTIVE: To assess the consistency of machine learning and statistical techniques in predicting individual level and population level risks of cardiovascular disease and the effects of censoring on risk predictions.Entities:
Mesh:
Year: 2020 PMID: 33148619 PMCID: PMC7610202 DOI: 10.1136/bmj.m3919
Source DB: PubMed Journal: BMJ ISSN: 0959-8138
Baseline characteristics of two study populations (patients aged 25-84 years without history of cardiovascular disease (CVD) or previous statin use). Values are numbers (percentages) unless stated otherwise
| Characteristics | Overall cohort | Cohort without censoring | |||
|---|---|---|---|---|---|
| Derivation cohort (n=2 746 453) | Validation cohort (n=915 479) | Derivation cohort (n=335 632) | Validation cohort (n=111 868) | ||
| CVD cases | 86 769 (3.2) | 28 828 (3.1) | 78 826 (23.5) | 26 168 (23.4) | |
| Patients censored within 10 years | 2 410 516 (87.8) | 803 916 (87.8) | NA | NA | |
|
| |||||
| Female sex | 1 406 796 (51.2) | 469 098 (51.2) | 173 691 (51.8) | 58 169 (52.0) | |
| Mean (SD) age, years | 44.7 (15.6) | 44.7 (15.7) | 53.3 (16.2) | 53.4 (16.2) | |
| Mean (SD) body mass index | 26.7 (5.0) | 26.7 (5.0) | 27.2 (4.8) | 27.1 (4.8) | |
| Mean (SD) total cholesterol/high density lipoprotein cholesterol ratio | 3.9 (1.3) | 3.9 (1.3) | 4.1 (1.3) | 4.1 (1.3) | |
| Atypical antipsychotic drug | 12 306 (0.4) | 4030 (0.4) | 932 (0.3) | 316 (0.3) | |
| Antihypertensive treatment | 183 964 (6.7) | 61 962 (6.8) | 42 704 (12.7) | 14 245 (12.7) | |
| Regular steroid tablets | 2059 (0.1) | 694 (0.1) | 289 (0.1) | 100 (0.1) | |
| History of systemic lupus erythematosus | 1840 (0.1) | 606 (0.1) | 257 (0.1) | 74 (0.1) | |
| History of angina or heart attack in first degree relative <60 years | 98 455 (3.6) | 32 619 (3.6) | 7950 (2.4) | 2669 (2.4) | |
| History of atrial fibrillation | 20 778 (0.8) | 6965 (0.8) | 5213 (1.6) | 1757 (1.6) | |
| History of chronic kidney disease (stage 3, 4, or 5) | 30 133 (1.1) | 10 240 (1.1) | 4364 (1.3) | 1514 (1.4) | |
| History of erectile dysfunction | 39 651 (1.4) | 13 110 (1.4) | 3867 (1.2) | 1287 (1.2) | |
| History of migraines | 177 439 (6.5) | 59 106 (6.5) | 19 629 (5.8) | 6593 (5.9) | |
| History of rheumatoid arthritis | 16 167 (0.6) | 5459 (0.6) | 3043 (0.9) | 1030 (0.9) | |
| History of severe mental illness | 219 861 (8.0) | 72 832 (8.0) | 32 190 (9.6) | 10 673 (9.5) | |
| History of type 1 diabetes | 5899 (0.2) | 2097 (0.2) | 820 (0.2) | 251 (0.2) | |
| History of type 2 diabetes | 35 569 (1.3) | 11 826 (1.3) | 8134 (2.4) | 2641 (2.4) | |
| Mean (SD) systolic blood pressure | 126.9 (16.7) | 126.9 (16.7) | 133.1 (18.4) | 133.1 (18.4) | |
| Mean (SD) standard deviation of each individual patient’s systolic blood pressure | 9.9 (5.6) | 9.9 (5.6) | 10.7 (5.9) | 10.7 (5.9) | |
|
| |||||
| Other ethnicity | 173 271 (6.3) | 58 124 (6.3) | 6900 (2.1) | 2273 (2.0) | |
| White or not recorded | 2 573 182 (93.7) | 857 355 (93.7) | 328 732 (97.9) | 109 595 (98.0) | |
|
| |||||
| Ex-smoker | 629 500 (22.9) | 209 186 (22.8) | 76 060 (22.7) | 25 429 (22.7) | |
| Current smoker | 806 978 (29.4) | 269 717 (29.5) | 94 082 (28.0) | 31 335 (28.0) | |
| Never smoker | 1 309 975 (47.7) | 436 576 (47.7) | 165 490 (49.3) | 55 104 (49.3) | |
|
| |||||
| Score 1—Least deprived | 600 392 (21.9) | 199 937 (21.8) | 86 596 (25.8) | 29 156 (26.1) | |
| Score 2 | 594 739 (21.7) | 197 677 (21.6) | 82 211 (24.5) | 27 266 (24.4) | |
| Score 3 | 572 903 (20.9) | 191 045 (20.9) | 69 897 (20.8) | 23 326 (20.9) | |
| Score 4 | 568 006 (20.7) | 189 520 (20.7) | 60 424 (18.0) | 20 140 (18.0) | |
| Score 5—Most deprived | 410 413 (14.9) | 137 300 (15.0) | 36 504 (10.9) | 11 980 (10.7) | |
Performance indicators of machine learning and statistical models in overall cohort
| Model | Model performance | Average absolute change in |
|---|---|---|
| Logistic (Caret) | 0.879 (0.879 to 0.879) | 0.00 (−0.03 to 0.04) |
| Random forest (Caret) | 0.869 (0.867 to 0.869) | −1.20 (−1.33 to −1.10%) |
| Neural network (Caret) | 0.878 (0.867 to 0.880) | −0.15 (−1.35 to 0.06) |
| Statistic logistic model | 0.879 (0.879 to 0.879) | 0.01 (−0.02 to 0.04) |
| QRISK3 | 0.879 | Reference model |
| Framingham | 0.865 | −1.66 (−1.66 to −1.66) |
| Local Cox model | 0.877 (0.877 to 0.878) | −0.22 (−0.28 to −0.17) |
| Parametric survival model (Weibull) | 0.877 (0.876 to 0.877) | −0.29 (−0.35 to −0.24) |
| Parametric survival model (Gaussian) | 0.876 (0.876 to 0.877) | −0.33 (−0.39 to −0.29) |
| Parametric survival model (Logistic) | 0.876 (0.875 to 0.876) | −0.36 (−0.43 to −0.31) |
| Logistic (Sklearn) | 0.879 (0.879 to 0.879) | 0.00 (−0.05 to 0.03) |
| Random forest (Sklearn) | 0.872 (0.871 to 0.873) | −0.80 (−0.89 to −0.71) |
| Neural network (Sklearn) | 0.872 (0.832 to 0.879) | −0.85 (−5.39 to −0.03) |
| Gradient boosting classifier (Sklearn) | 0.878 (0.877 to 0.878) | −0.17 (−0.29 to −0.08) |
| extra-trees (Sklearn) | 0.863 (0.861 to 0.864) | −1.89 (−2.05 to −1.76) |
| Logistic (h2o) | 0.879 (0.878 to 0.879) | −0.06 (−0.10 to −0.02) |
| Random forest (h2o) | 0.877 (0.877 to 0.878) | −0.22 (−0.29 to −0.17) |
| Neural network (h2o) | 0.875 (0.870 to 0.879) | −0.45 (−1.09 to −0.04) |
| autoML (h2o) | 0.879 (0.879 to 0.880) | −0.00 (−0.07 to 0.06) |
Model performance was calculated in binary framework. Threshold 7.5% was used to calculate precision and recall for all models.
95% range (2.5-97.5%) of model performance derived from 100 random samples.
Fig 1Distribution of individual risk predictions with machine learning and statistical models in overall cohort for patients with predicted cardiovascular disease risks of 9.5-10.5% in QRISK3
Fig 2Calibration slope of machine learning models and statistical models in overall cohort in survival framework (observed events consider censoring). CVD=cardiovascular disease
Fig 395% range of individual risk predictions with machine learning and statistical models stratified by deciles of predicted cardiovascular disease (CVD) risks with QRISK3 in overall cohort
Fig 4Bland-Altman analysis comparing QRISK3 with neural network model
Reclassification of individual risk predictions with machine learning and statistical models
| Model | Reclassification in overall testing cohort: No (%) | |
|---|---|---|
| Reclassified | Not reclassified | |
|
| ||
| QRISK3 10 year risk prediction (reference model): | ||
| ≤7.5% threshold | 94 186 (13.6) | 597 478 (86.4) |
| >7.5% threshold | 129 348 (57.8) | 94 467 (42.2) |
| Logistic model (Caret) 10 year risk prediction (reference model): | ||
| ≤7.5% threshold | 209 221 (25.9) | 597 478 (74.1) |
| >7.5% threshold | 14 313 (13.2) | 94 467 (86.8) |
|
| ||
| QRISK3 10 year risk prediction (reference model): | ||
| ≤7.5% threshold | 34 607 (54.6) | 28 779 (45.4) |
| >7.5% threshold | 1248 (2.6) | 47 234 (97.4) |
| Logistic model (Caret) 10 year risk prediction (reference model): | ||
| ≤7.5% threshold | 6004 (17.3) | 28 779 (82.7) |
| >7.5% threshold | 29 851 (38.7) | 47 234 (61.3) |
Patients were reclassified if they had a risk prediction in any model that crossed the threshold compared with the prediction of the reference model.