| Literature DB >> 34220671 |
Lai Wei1, Yidi Cao2,3, Kangwei Zhang1, Yun Xu1, Xiang Zhou1, Jinxi Meng1, Aijun Shen1, Jiong Ni1, Jing Yao1, Lei Shi2,3, Qi Zhang2,3,4, Peijun Wang1.
Abstract
Purpose: Accurate prediction of the progression to severe stroke in initially diagnosed nonsevere patients with acute-subacute anterior circulation nonlacuna ischemic infarction (ASACNLII) is important in making clinical decision. This study aimed to apply a machine learning method to predict if the initially diagnosed nonsevere patients with ASACNLII would progress to severe stroke by using diffusion-weighted images and clinical information on admission.Entities:
Keywords: artificial intelligence; ischemic infarction; machine learning; severe stroke prediction; stroke volume
Year: 2021 PMID: 34220671 PMCID: PMC8249916 DOI: 10.3389/fneur.2021.652757
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Figure 1Flow chart illustrating patients selection.
Figure 2The architecture of the proposed U-net model.
Basic patient information.
| Age, median (IQR) | 71 (63, 82.5) | 75 (66, 84) | 0.093 |
| Male (percentile: %) | 179 (66.1%) | 25 (53.4%) | 0.064 |
| NIHSS on admission | 6 (3, 12) | 8 (4.5, 12) | 0.062 |
| GCS on admission | 15 (13, 15) | 15 (14, 15) | 0.338 |
| Alcohol | 91 (34.2%) | 15 (20.55%) | 0.036 |
| Smoking | 132 (49.6%) | 21 (28.8%) | 0.002 |
| Myocardial infarction | 9 (3.4%) | 1 (0.0%) | 0.610 |
| Coronary atherosclerosis | 54 (20.3%) | 15 (20.5%) | 0.906 |
| Atrial fibrillation | 43 (16.2%) | 17 (23.3%) | 0.215 |
| Hypertension | 188 (70.7%) | 52 (71.2%) | 0.958 |
| Stroke | 73 (27.4%) | 22 (30.1%) | 0.759 |
| Diabetes | 95 (35.7%) | 23 (31.5%) | 0.596 |
| Prothrombin time | 11.0 (10.6, 11.6) | 11.1 (10.7, 11.6) | 0.090 |
| Fibrinogen | 2.83 (2.43, 3.56) | 2.75 (2.35, 3.76) | 0.278 |
| D-dimer | 0.66 (0.34, 1.67) | 0.75 (0.33, 1.81) | 0.494 |
| Serum troponin I | 0.01 (0.01, 0.03) | 0.01 (0.01, 0.03) | 0.222 |
| Blood sugar | 6.46 (5.41, 9.01) | 7.65 (5.54, 10.11) | 0.051 |
| Blood lipids | 1.25 (0.94, 1.75) | 1.27 (0.92, 1.62) | 0.346 |
| Brain natriuretic peptide | 83.7 (38.1, 231.8) | 131.6 (49.9, 286.9) | 0.070 |
| Progress to severe stroke (percentile: %) | 89 (33.0%) | 19 (26.0%) | 0.322 |
IQR, interquartile range; NIHSS, national institute of health stroke scale; GCS, glasgow coma scale.
Figure 3Comparison between artificial intelligence (AI)-based segmentation and manual segmentation in three cases of ASACNLII. (A) DWI images. (B) Segmentation results, in which red represents manual labeling results, green the AI output results, and yellow the consistent areas.
Figure 4The correlation between AI-derived volumes and the manually segmented volumes. The squared correlation coefficients R2 were calculated for all patients (A), patients with lesion sizes <100,000 mm3 (B), and those with lesion sizes <30,000 mm3 (C).
Comparison between different feature sets.
| AUC | 0.7686 (0.6474–0.8717) | 0.6929 (0.5434–0.8262) | 0.8358 (0.7321–0.9269) |
| Sensitivity | 0.6022 (0.3607–0.9630) | 0.8735 (0.7636–1.0) | 0.7695 (0.6102–0.9074) |
| Specificity | 0.8891 (0.5217–1.0) | 0.5612 (0.3000–0.7857) | 0.8686 (0.6923–1.0) |
| Youden's index | 0.4913 (0.3494–0.6522) | 0.4347 (0.1897–0.6628) | 0.6380 (0.4475–0.8182) |
| Accuracy | 0.6386 (0.5753–0.7945) | 0.7846 (0.7808–0.7945) | 0.7780 (0.7397–0.7945) |
AUC, area under the receiver operating characteristic curve.
Figure 5Performances of machine learning models for the prediction of progression to severe stroke: receiver operating characteristic (ROC) curves of three feature sets when using the random forest classifier.
Comparison between different classifiers.
| AUC | 0.8104 (0.6952–0.9113) | 0.8165 (0.6854–0.9344) | 0.8358 (0.7321–0.9269) |
| Sensitivity | 0.7226 (0.4074–0.9815) | 0.8631 (0.6471–0.9608) | 0.7695 (0.6102–0.9074) |
| Specificity | 0.8200 (0.5000–1.0) | 0.8013 (0.6000–1.0) | 0.8686 (0.6923–1.0) |
| Youden's index | 0.5426 (0.3680–0.7363) | 0.6644 (0.4657–0.8644) | 0.6380 (0.4475–0.8182) |
| Accuracy | 0.7103 (0.5616–0.8219) | 0.8334 (0.6849–0.8493) | 0.7780 (0.7397–0.7945) |
AUC, area under the receiver operating characteristic curve.
Figure 6Shapley additive explanation (SHAP) diagram of variable contributions for the optimal predictive model, i.e., the random forest classifier with volume + clinical data. (A) The relative contributions of AI-derived volumes and clinical variables for progression prediction. Features on the right of the risk explanation bar pushed the risk higher, and features on the left pushed the risk lower: a patient with a larger volume, higher NIHSS, and lower GCS is at a higher risk. (B) The relative contributions of variables for progression prediction quantified with the mean of the absolute SHAP values.
Figure 7Shapley additive explanation (SHAP) values to show interpretability of the effects of AI-derived volumes and clinical variables as the input risk factors for the prediction of progression to severe stroke. (A) SHAP values for all 73 patients in the test set. Samples from left to right are ordered by the sum of the SHAP values from all variables, and the bottom bar shows the true labels of each sample, namely, red for the positive group (progression to severe stroke) and blue for the negative group (non-progression to severe stroke). The 27 samples on the left are predicted as positive samples by the random forest model. (B, C) SHAP values of two typical patients from the positive group (B) and the negative group (C), illustrated with their most important variables.