| Literature DB >> 33178123 |
Lucas A Ramos1,2, Manon Kappelhof3, Hendrikus J A van Os4, Vicky Chalos5,6,7, Katinka Van Kranendonk3, Nyika D Kruyt4, Yvo B W E M Roos8, Aad van der Lugt7, Wim H van Zwam9, Irene C van der Schaaf10, Aeilko H Zwinderman2, Gustav J Strijkers1,3, Marianne A A van Walderveen11, Mariekke J H Wermer4, Silvia D Olabarriaga2, Charles B L M Majoie3, Henk A Marquering1,3.
Abstract
Background: Although endovascular treatment (EVT) has greatly improved outcomes in acute ischemic stroke, still one third of patients die or remain severely disabled after stroke. If we could select patients with poor clinical outcome despite EVT, we could prevent futile treatment, avoid treatment complications, and further improve stroke care. We aimed to determine the accuracy of poor functional outcome prediction, defined as 90-day modified Rankin Scale (mRS) score ≥5, despite EVT treatment.Entities:
Keywords: MRS; endovascular treatment (EVT); functional outcome; ischemic stroke; machine learning; poor outcome; prediction modeling
Year: 2020 PMID: 33178123 PMCID: PMC7593486 DOI: 10.3389/fneur.2020.580957
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Baseline characteristics; overall compared with mRS 5–6 vs. 0–4.
| Age (years)—median (IQR) | 71 (60–79) | 77 (69–84) | 67 (55–75) |
| Male sex— | 809 (53.0) | 245 (51.0) | 502 (54.5) |
| Diabetes— | 145 (13.9) | 117 (24.4) | 262 (17.2) |
| Pre-stroke mRS— | 1,327 (86.9) | 370 (77.1) | 957 (91.5) |
| 3–5 | 172 (11.3) | 95 (19.8) | 77 (7.4) |
| NIHSS at baseline—median (IQR) | 14 (9–18) | 16 (12–20) | 13 (8–16) |
| Systolic blood pressure (mmHg)—mean (SD) | 150 (24.6) | 154 (25.8) | 147 (23.7) |
| Glucose level before EVT median (IQR) | 6.7 (8.0–5.9) | 7.2 (8.8–6.1) | 6.6 (7.8–5.8) |
| Intravenous alteplase—n (%) | 1,170 (76.7) | 327 (68.1) | 743 (80.7) |
| Onset to groin puncture time (min)—median (IQR) | 210 (160–270) | 219 (170–273) | 200 (155–266) |
| Hyperdense artery sign— | 773 (50.7) | 248 (51.7) | 459 (49.8) |
| ASPECTS subgroups— | 95 (6.2) | 39 (8.13) | 51 (5.5) |
| 5–7 | 351 (23.0) | 120 (25.0) | 198 (21.5) |
| 8–10 | 1,013 (66.4) | 292 (60.8) | 639 (69.4) |
| Occlusion location— | 322 (21.1) | 128 (26.7) | 194 (18.6) |
| M1 | 842 (55.2) | 242 (50.4) | 600 (57.4) |
| M2 | 181 (11.9) | 52 (10.8) | 129 (12.3) |
| Intracranial ICA | 85 (5.6) | 21 (4.4) | 64 (6.2) |
| Other (M3 or anterior) | 19 (1.3) | 6 (1.3) | 13 (1.2) |
| Clot Burden Score—median (IQR) | 6 (4–8) | 6 (4–8) | 6 (4–8) |
| Collateral score— | 98 (6.4) | 57 (11.9) | 35 (3.8) |
| 1 | 467 (30.6) | 188 (39.2) | 246 (26.7) |
| 2 | 547 (35.8) | 135 (28.1) | 361 (39.2) |
| 3 | 305 (20.0) | 61 (12.7) | 218 (23.7) |
EVT, endovascular treatment; ICA, internal carotid artery; IQR, interquartile range; mRS, modified Rankin Scale; NIHSS, National Institutes of Health stroke scale; ASPECTS, Alberta Stroke Programme Early CT Score.
Evaluation measures in validation data for all poor outcome prediction models, trained to maximize the AUC.
| RFC | 0.84 (0.81–0.86) | 0.56 (0.51–0.62) | 0.62 (0.56–0.68) | 0.80 (0.78–0.83) | 0.80 (0.77–0.82) | 0.66 (0.61–0.72) |
| SVM | 0.67 (0.61–0.72) | 0.78 (0.75–0.81) | 0.53 (0.48–0.57) | 0.87 (0.84–0.89) | 0.77 (0.74–0.76) | 0.69 (0.65–0.74) |
| NN | 0.89 (0.87–0.90) | 0.53 (0.49–0.57) | 0.69 (0.65–0.74) | 0.80 (0.78–0.83) | 0.81 (0.79–0.83) | 0.68 (0.64–0.73) |
| XGB | 0.79 (0.76–0.83) | 0.63 (0.60–0.67) | 0.59 (0.54–0.65) | 0.82 (0.80–0.84) | 0.78 (0.76–0.81) | 0.64 (0.59–0.69) |
| LR | 0.75 (0.73–0.78) | 0.71 (0.68–0.73) | 0.57 (0.53–0.62) | 0.85 (0.83–0.86) | 0.80 (0.78–0.82) | 0.68 (0.63–0.74) |
The average of 10 cross-validation iterations is presented. RFC, random forest classifier; SVM, support vector machine; LR, logistic regression; XGB, gradient boosting; NN, neural networks; AUC, area under the curve; AUPRC, area under the precision recall curve; NPV, negative predictive value; PPV, positive predictive value.
Figure 1Distribution of mRS for the predictions of each model as poor vs. non-poor outcome with 95% specificity threshold. Along the y-axis, the various ML methods are presented including the number of patients who were classified as poor and non-poor outcome. Along the x-axis, the percentage of patients per mRS value is presented. In each graph, the black bar separates mRS 0–4 from 5 to 6. RFC, random forest classifier; SVM, support vector machine; LR, logistic regression; NN, neural network; XGB, gradient boosting; mRS, modified Rankin Scale. Numbers in bars represent absolute number of patients.
Number of false positives (mRS 0–4 classified as poor) and true positives (mRS 5–6 classified as poor) per specificity threshold for each ML method.
| RFC | 95% | 145 (30.2%) | 52 (5.6%) |
| 98% | 91 (19.0%) | 20 (2.2%) | |
| 100% | 39 (8.1%) | 8 (0.9%) | |
| SVM | 95% | 136 (28.3%) | 61 (6.2%) |
| 98% | 62 (12.9%) | 28 (3.0%) | |
| 100% | 33 (6.9%) | 10 (1.1%) | |
| NN | 95% | 163 (34.0%) | 41 (4.5%) |
| 98% | 92 (19.2%) | 10 (1.1%) | |
| 100% | 21 (4.4%) | 2 (0.2%) | |
| XGB | 95% | 99 (20.6%) | 27 (2.8%) |
| 98% | 63 (13.1%) | 12 (1.3%) | |
| 100% | 21 (4.4%) | 6 (0.7%) | |
| LR | 95% | 147 (30.6%) | 35 (3.8%) |
| 98% | 92 (19.2%) | 10 (1.1%) | |
| 100% | 23 (4.8%) | 1 (0.1%) |
RFC, random forest classifier; SVM, support vector machine; LR, logistic regression; NN, neural network; XGB, gradient boosting; mRS, modified Rankin Scale.
Figure 2Distribution of mRS for the predictions of each model as poor vs. non-poor outcome with 98% specificity threshold. Along the y-axis, the various ML methods are presented including the number of patients who were classified as poor and non-poor outcome. Along the x-axis, the percentage of patients per mRS value is presented. In each graph, the black bar separates mRS 0–4 from 5 to 6. RFC, random forest classifier; SVM, support vector machine; LR, logistic regression; mRS, modified Rankin Scale; NN, neural network; XGB, gradient boosting. Numbers in bars represent absolute number of patients.
Figure 3Distribution of mRS for the predictions of each model as poor vs. non-poor outcome with 100% specificity threshold. Along the y-axis, the various ML methods are presented including the number of patients who were classified as poor and non-poor outcome. Along the x-axis, the percentage of patients per mRS value is presented. In each graph, the black bar separates mRS 0–4 from 5 to 6. RFC, random forest classifier; SVM, support vector machine; LR, logistic regression; NN, neural network; XGB, gradient boosting; mRS, modified Ranking Scale. Numbers in bars represent absolute number of patients.
Odds ratio of each variable included in the logistic regression model.
| Age (years) | 1.05 (1.04–1.06) |
| Pre-stroke mRS | 1.35 (1.21–1.50) |
| Atrial fibrillation | 1.37 (1.01–1.85) |
| NIHSS at baseline | 1.06 (1.03–1.09) |
| Glucose level | 1.16 (1.10–1.22) |
| Glasgow Coma Scale | 0.90 (0.84–0.97) |
| Time: onset to groin puncture | 1.00 (1.00–1.01) |
| 50% or more atherosclerotic stenosis at symptomatic carotid bifurcation on CTA | 0.61 (0.38–0.99) |
| ASPECTS on baseline NCCT | 0.94 (0.88–1.01) |
| Leukoaraiosis | 1.69 (1.28–2.24) |
| Collaterals | 0.60 (0.51–0.70) |
CI, confidence interval; CTA, CT angiography; mRS, modified Rankin Scale; NIHSS, National Institutes of Health stroke scale; NCCT, non-contrast CT; ASPECTS, Alberta Stroke Programme Early CT Score.
Figure 4Permutation feature importance for the Neural Network models. Average impact on the AUC. *50% or more atherosclerotic stenosis at symptomatic carotid bifurcation on CTA baseline. ASPECTS, Alberta Stroke Programme Early CT Score; CRP, C reactive protein; mRS, modified Rankin Scale; NIHSS, National Institutes of Health stroke scale.
Figure 5Permutation feature importance for the Logistic Regression models. Average impact on the AUC. mRS, modified Rankin Scale; NIHSS, National Institutes of Health stroke scale; RR, blood pressure (Riva-Rocci).