| Literature DB >> 32437368 |
Agni Orfanoudaki1, Emma Chesley1, Christian Cadisch1, Barry Stein2, Amre Nouh3, Mark J Alberts3, Dimitris Bertsimas4.
Abstract
Current stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Revised Framingham Stroke Risk Score and design an interactive Non-Linear Stroke Risk Score. Leveraging machine learning algorithms, our work aimed at increasing the accuracy of event prediction and uncovering new relationships in an interpretable fashion. A two-phase approach was used to create our stroke risk prediction score. First, clinical examinations of the Framingham offspring cohort were utilized as the training dataset for the predictive model. Optimal Classification Trees were used to develop a tree-based model to predict 10-year risk of stroke. Unlike classical methods, this algorithm adaptively changes the splits on the independent variables, introducing non-linear interactions among them. Second, the model was validated with a multi-ethnicity cohort from the Boston Medical Center. Our stroke risk score suggests a key dichotomy between patients with history of cardiovascular disease and the rest of the population. While it agrees with known findings, it also identified 23 unique stroke risk profiles and highlighted new non-linear relationships; such as the role of T-wave abnormality on electrocardiography and hematocrit levels in a patient's risk profile. Our results suggested that the non-linear approach significantly improves upon the baseline in the c-statistic (training 87.43% (CI 0.85-0.90) vs. 73.74% (CI 0.70-0.76); validation 75.29% (CI 0.74-0.76) vs 65.93% (CI 0.64-0.67), even in multi-ethnicity populations. The clinical implications of the new risk score include prioritization of risk factor modification and personalized care at the patient level with improved targeting of interventions for stroke prevention.Entities:
Year: 2020 PMID: 32437368 PMCID: PMC7241753 DOI: 10.1371/journal.pone.0232414
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Baseline characteristics of the derivation and validation populations.
| Dataset Name | Parameter | Value |
|---|---|---|
| Sample size | 18,793 | |
| Number of participants | 4,385 | |
| Number of stroke cases | 1,013 | |
| Number of distinct participants with stroke | 460 | |
| Proportion of female population | 53.97% | |
| Sample size | 2,989 | |
| Number of stroke cases | 221 | |
| Proportion of female population | 54.26% | |
| Sample size | 9,029 | |
| Number of stroke cases | 909 | |
| Proportion of female population | 58.63% | |
| Sample size | 2,862 | |
| Number of stroke cases | 230 | |
| Proportion of female population | 58.97% | |
| Sample size | 5,636 | |
| Number of stroke cases | 406 | |
| Proportion of female population | 50.19% |
Stroke risk factors considered in the N-SRS model.
| Category | Variable |
|---|---|
| Age | |
| Gender | |
| Current cigarette smoking | |
| Presence of Cardiovascular disease | |
| Presence of Atrial Fibrillation | |
| History of Transient Ischemic Attacks | |
| History of Myocardial Infarctions | |
| Diabetes mellitus | |
| Blood Pressure Category | |
| Antihypertensive medication | |
| Statins | |
| Nitrates | |
| Diuretics | |
| CABG | |
| PCI | |
| X-ray Enlargement | |
| Left Ventricular Hypertrophy | |
| Presence of T-Wave abnormality | |
| Intraventricular Block | |
| Atrioventricular Block | |
| ST-Segment abnormality | |
| U-Wave abnormality | |
| Premature beats | |
| SBP | |
| HDL | |
| BMI | |
| Hematocrit | |
| Fasting plasma glucose level |
Fig 1A visualization of the N-SRS tree-based model.
Comparison of the N-SRS, the R-FSRS, and other machine learning methods performance on the testing set of the Framingham datasets.
Reported metrics include sensitivity, specificity, precision, negative predictive value (NPV), and positive predictive value (PPV) at the probability threshold of 0.5. The Table also presents the overall c-statistic (AUC) and calibration χ2 results.
| A) Framingham Dataset 1 (FD1) | ||||||||
| 0.9142 | 0.8510 | 0.8461 | 0.8554 | 0.8933 | 0.8802 | 0.9175 | 0.9167 | |
| 0.7238 | 0.6902 | 0.6890 | 0.7043 | 0.7102 | 0.7099 | 0.7161 | 0.7354 | |
| 0.9408 | 0.9620 | 0.9353 | 0.9758 | 0.9701 | 0.9736 | 0.9605 | 0.9412 | |
| 0.0592 | 0.0380 | 0.0647 | 0.0242 | 0.0423 | 0.0380 | 0.0863 | 0.0588 | |
| 0.9408 | 0.9620 | 0.9353 | 0.9758 | 0.9621 | 0.9736 | 0.9137 | 0.9412 | |
| 0.8743 | 0.7374 | 0.7188 | 0.7552 | 0.8065 | 0.7981 | 0.8829 | 0.8846 | |
| 0.8569–0.9014 | 0.6976–0.7619 | 0.6765–0.7636 | 0.7081–0.8102 | 0.772–0.8351 | 0.7676–0.8287 | 0.8578–0.9081 | 0.8643–0.9048 | |
| 1.96 | 8.05 | 11.98 | 5.44 | 2.88 | 3.04 | 1.43 | 1.58 | |
| 0.8948 | 0.8533 | 0.8605 | 0.8487 | 0.8763 | 0.8504 | 0.8938 | 0.8934 | |
| 0.5097 | 0.4217 | 0.4066 | 0.4800 | 0.4867 | 0.2505 | 0.4994 | 0.5110 | |
| 0.9693 | 0.9617 | 0.9531 | 0.9712 | 0.9688 | 0.9393 | 0.9816 | 0.9700 | |
| 0.3973 | 0.2233 | 0.2321 | 0.1933 | 0.2576 | 0.1704 | 0.3804 | 0.4053 | |
| 0.9693 | 0.9617 | 0.9531 | 0.9712 | 0.9401 | 0.9486 | 0.9535 | 0.9700 | |
| 0.8238 | 0.7488 | 0.7281 | 0.7677 | 0.7754 | 0.6884 | 0.8216 | 0.8260 | |
| 0.791–0.8558 | 0.7145–0.7831 | 0.6775–0.7788 | 0.7149–0.8204 | 0.738–0.8119 | 0.6435–0.7333 | 0.7881–0.8536 | 0.7938–0.8567 | |
| 2.75 | 7.3 | 12.1 | 4.1 | 6.5 | 20.34 | 2.81 | 2.7 | |
Fig 2Calibration plots for all models on the Derivation cohort.
Fig 2A refers to the testing population of FD1 and Fig 2B to FD2. The plots show the relation between the true class of the samples and the predicted probabilities. Samples were binned to their class probabilities generated by the model. The following intervals were defined: [0,10%], (10,20%], (20,30%], … (90,100%]. The event rate for each bin was subsequently identified. For example, if 4 out of 5 samples falling into the last bin are actual events, then the event rate for that bin would be 80%. The calibration plot displays the bin mid-points on the x-axis and the event rate on the y-axis. Ideally, the event rate should be reflected as a 45° line.
Comparison of the N-SRS, the R-FSRS, and other machine learning methods performance on the Validation cohort.
Reported metrics include sensitivity, specificity, precision, negative predictive value (NPV), and positive predictive value (PPV) at the probability threshold of 0.5. The overall c-statistic (AUC) and calibration χ2 results are also presented. The results refer to the aggregated population.
| N-SRS | R-FSRS (both genders) | R-FSRS (men) | R-FSRS (women) | Log. Reg | CART | Random Forest | XGBoost | |
|---|---|---|---|---|---|---|---|---|
| 0.8986 | 0.8403 | 0.8411 | 0.8396 | 0.8576 | 0.8402 | 0.9055 | 0.9076 | |
| 0.4019 | 0.3663 | 0.3786 | 0.3565 | 0.3733 | 0.3599 | 0.4078 | 0.4092 | |
| 0.9395 | 0.9320 | 0.9329 | 0.9313 | 0.9349 | 0.9348 | 0.9407 | 0.9455 | |
| 0.2771 | 0.1815 | 0.1882 | 0.1762 | 0.2026 | 0.1805 | 0.2811 | 0.2818 | |
| 0.9395 | 0.9320 | 0.9329 | 0.9313 | 0.9345 | 0.9317 | 0.9421 | 0.9446 | |
| 0.7403 | 0.6491 | 0.6246 | 0.6735 | 0.7065 | 0.6829 | 0.7482 | 0.7501 | |
| 0.7149–0.771 | 0.6266–0.6716 | 0.5931–0.6555 | 0.6411–0.7058 | 0.6772–0.7558 | 0.6484–0.7175 | 0.7198–0.7801 | 0.7202–0.7856 | |
| 7.12 | 36.66 | 37.42 | 35.98 | 25.03 | 35.76 | 6.67 | 6.52 |
Comparison of the N-SRS, and the R-FSRS performance on the Validation population using the c-statistic.
Detailed results are shown for the main ethnicity groups.
| BMC–White | BMC–Black | BMC–Hispanic | ||||
|---|---|---|---|---|---|---|
| Model | AUC | 95% CI | AUC | 95% CI | AUC | 95% CI |
| 74.30% | 0.7149–0.771 | 75.80% | 0.7345–0.767 | 72.79% | 0.6889–0.7671 | |
| 64.91% | 0.6266–0.6716 | 64.85% | 0.6304–0.6666 | 61.04% | 0.5601–0.6587 | |
| 67.35% | 0.6411–0.7058 | 65.22% | 0.628–0.6764 | 61.06% | 0.5548–0.6663 | |
| 62.46% | 0.5931–0.6555 | 64.49% | 0.6181–0.6717 | 61.01% | 0.5621–0.6587 | |
| 71.55% | 0.6823–0.7402 | 69.77% | 0.6823–0.7402 | 70.46% | 0.6765–0.7359 | |
| 69.01% | 0.6627–0.7134 | 66.41% | 0.6272–0.6609 | 66.10% | 0.6286–0.6934 | |
| 75.08% | 0.7162–0.7855 | 73.14% | 0.7139–0.749 | 70.80% | 0.6807–0.7354 | |
| 77.32% | 0.7582–0.7881 | 74.88% | 0.7133–0.7842 | 74.27% | 0.7187–0.7667 | |
Fig 3Deep-dives in insightful risk profiles of the N-SRS model.
Fig 4An example illustrating the user-friendly interface of N-SRS.
Due to its interactive nature the answer to a question dictates the next question. In this specific example, whether the provider answer yes to no to the question regarding CVD takes the algorithm and questions in a different direction.