| Literature DB >> 33969418 |
Matthew Chun1,2, Robert Clarke1, Benjamin J Cairns1,3, David Clifton2,4, Derrick Bennett1, Yiping Chen1,3, Yu Guo5, Pei Pei5, Jun Lv6, Canqing Yu6, Ling Yang1, Liming Li6, Zhengming Chen3, Tingting Zhu2.
Abstract
OBJECTIVE: To compare Cox models, machine learning (ML), and ensemble models combining both approaches, for prediction of stroke risk in a prospective study of Chinese adults.Entities:
Keywords: China; cardiovascular diseases; machine learning; risk assessment; stroke
Mesh:
Year: 2021 PMID: 33969418 PMCID: PMC8324240 DOI: 10.1093/jamia/ocab068
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Cox and machine learning (ML) model development and validation. All analyses were performed separately for men and women. Included individuals were divided into a training set (85%), validation set (12.75%), and test set (2.25%). Risk prediction models were developed in the training set and assessed in the validation set, with a best ML model selected. The traditional Cox model and best ML model were then used for screening high-risk individuals in the validation set using a 10% predicted risk threshold. A second training set was created from a subset of the validation set wherein the Cox model and best ML model disagreed on risk classification, and a decision tree was trained to predict which model would yield a better risk classification for each individual. Screening approaches, including a (i) Cox-only approach, (ii) best ML-only approach, and (iii) an ensemble approach, were assessed and compared using the held-out test set.
Discrimination and calibration performance for prediction of 9-year risk of stroke. Comparisons included the 2017 Framingham Stroke Risk Profile (FSRP), a recalibrated and refitted FSRP, Cox, random survival forest (RSF), logistic regression (LR), support vector machine (SVM), gradient boosted tree (GBT), and multilayer perceptron (MLP) models
| Men | Women | |||
|---|---|---|---|---|
| Model Type | Discrimination | Calibration | Discrimination | Calibration |
| AUROCs | χ2 | AUROCs | χ2 | |
| [95%CI] | [95%CI] | [95%CI] | [95%CI] | |
| FSRP | 0.781 | 5541 | 0.772 | 19402 |
| [0.772-0.790] | [4996-6107] | [0.764-0.780] | [17784-21019] | |
| Recalibrated and refitted | 0.824 | 138 | 0.825 | 140 |
| FSRP | [0.816-0.831] | [96-185] | [0.819-0.833] | [97-186] |
| Cox | 0.829 | 122 | 0.831 | 129 |
| [0.822-0.837] | [83-166] | [0.824-0.838] | [89-172] | |
| RSF | 0.826 | 61 | 0.832 | 62 |
| [0.818-0.834] | [36-90] | [0.824-0.839] | [36-95] | |
| LR | 0.831 | 56 | 0.832 | 57 |
| [0.823-0.838] | [31-86] | [0.825-0.838] | [34-85] | |
| SVM | 0.830 | 712 | 0.831 | 24 |
| [0.823-0.838] | [582-852] | [0.824-0.838] | [11-41] | |
| GBT | 0.833 | 44 | 0.836 | 47 |
| [0.825-0.840] | [24-67] | [0.829-0.843] | [30-69] | |
| MLP | 0.831 | 515 | 0.833 | 19 |
| [0.824-0.839] | [410-627] | [0.826-0.841] | [8-35] | |
Distribution of established risk factors for stroke in men and women by presence or absence of stroke during follow-up
| Men | Women | |||
|---|---|---|---|---|
| Risk factors in 2017 Framingham Stroke Risk Profile | No Stroke (n = 185 706) | Stroke (n = 19 587) | No Stroke (n = 274 902) | Stroke (n = 23 647) |
| Age, mean, year | 51.8 | 60.7 | 50.6 | 59.6 |
| Current smoking, % | 68.5 | 59.9 | 3.1 | 4.8 |
| Coronary heart disease, % | 2.1 | 6.4 | 2.5 | 9.4 |
| Age 65 yrs+, % | 13.8 | 39.9 | 10.7 | 34.0 |
| Diabetes at age <65 yrs, % | 3.7 | 6.5 | 4.0 | 8.1 |
| Diabetes at age 65+ yrs % | 1.1 | 4.8 | 1.3 | 6.0 |
| BP-lowering treatment, % | 8.6 | 22.3 | 10.1 | 26.3 |
| SBP-untreated, mean, mmHg | 130 | 142 | 126 | 138 |
| SBP- treated, mean, mmHg | 148 | 153 | 150 | 155 |
Note: “No Stroke” column includes individuals who remained stroke-free until being censored, even if lost to follow-up before 9 years.
Atrial fibrillation is a part of the Framingham Stroke Risk Profile but was not recorded in CKB.
Figure 2.Calibration plots for the 2017 Framingham Stroke Risk Profile (FSRP), a recalibrated and refitted FSRP, Cox, random survival forest (RSF), logistic regression (LR), support vector machine (SVM), gradient boosted tree (GBT), and multilayer perceptron (MLP) models in (A) men and (B) women. Each point represents a decile of predicted risk.
Figure 3.Discrimination (subplots A and B) and calibration (subplots C and D) performance in men and women, respectively, for risk prediction of stroke at various time scales (0–3 years, 3–6 years, 6–9 years after baseline). Comparisons made between the 2017 Framingham Stroke Risk Profile (FSRP), a recalibrated and refitted FSRP, Cox, random survival forest (RSF), logistic regression (LR), support vector machine (SVM), gradient boosted tree (GBT), and multilayer perceptron (MLP) models.
Figure 4.t-Distributed Stochastic Neighbor Embedding (t-SNE) visualizations of CKB individuals in validation set and test set. Individuals are colored by agreement between Cox and GBT risk prediction models for screening of high-risk individuals. High-risk individuals were defined as individuals with >10% predicted 9-yr risk of stroke. t-SNE plots were created using Rtsne package version 0.15 with perplexity = 50, theta = 0.5, and max iterations = 3000.
Summary metrics from screening of high-risk individuals in test set using (i) a Cox-only approach, (ii) a GBT-only approach, and (iii) an ensemble approach in which a decision tree selects between Cox and GBT in cases of disagreement over an individual’s risk classification
| Men | Women | |||||
|---|---|---|---|---|---|---|
| Metric | Cox-Only | GBT-Only | Ensemble | Cox-Only | GBT-Only | Ensemble |
| [95% CI] | [95% CI] | [95% CI] | [95% CI] | [95% CI] | [95% CI] | |
| Sensitivity | 76% | 80% | 76% | 68% | 74% | 67% |
| [72%–80%] | [76%–84%] | [72%–80%] | [64%–72%] | [70%–78%] | [64%–71%] | |
| Specificity | 75% | 74% | 76% | 80% | 78% | 81% |
| [74%–77%] | [73%–76%] | [75%–78%] | [79%–81%] | [77%–79%] | [80%–82%] | |
| PPV | 25% | 25% | 26% | 24% | 23% | 24% |
| [23%–27%] | [23%–28%] | [23%–28%] | [22%–26%] | [21%–25%] | [22%–26%] | |
| NPV | 97% | 97% | 97% | 97% | 97% | 97% |
| [96%–97%] | [96%–98%] | [96%–97%] | [96%–97%] | [97%–98%] | [96%–97%] | |
| Accuracy | 75% | 75% | 76% | 79% | 77% | 80% |
| [74%–77%] | [74%–76%] | [75%–77%] | [78%–80%] | [76%–78%] | [79%–81%] | |