| Literature DB >> 35892499 |
Manon L Tolhuisen1,2, Jan W Hoving2, Miou S Koopman2, Manon Kappelhof2, Henk van Voorst1, Agnetha E Bruggeman2, Adam M Demchuck3, Diederik W J Dippel4, Bart J Emmer2, Serge Bracard5, Francis Guillemin6, Robert J van Oostenbrugge7,8, Peter J Mitchell9, Wim H van Zwam10, Michael D Hill11, Yvo B W E M Roos12, Tudor G Jovin13, Olvert A Berkhemer2,4,14, Bruce C V Campbell9,15, Jeffrey Saver16, Phil White17,18, Keith W Muir19, Mayank Goyal11, Henk A Marquering1,2, Charles B Majoie2, Matthan W A Caan1.
Abstract
Infarct volume (FIV) on follow-up diffusion-weighted imaging (FU-DWI) is only moderately associated with functional outcome in acute ischemic stroke patients. However, FU-DWI may contain other imaging biomarkers that could aid in improving outcome prediction models for acute ischemic stroke. We included FU-DWI data from the HERMES, ISLES, and MR CLEAN-NO IV databases. Lesions were segmented using a deep learning model trained on the HERMES and ISLES datasets. We assessed the performance of three classifiers in predicting functional independence for the MR CLEAN-NO IV trial cohort based on: (1) FIV alone, (2) the most important features obtained from a trained convolutional autoencoder (CAE), and (3) radiomics. Furthermore, we investigated feature importance in the radiomic-feature-based model. For outcome prediction, we included 206 patients: 144 scans were included in the training set, 21 in the validation set, and 41 in the test set. The classifiers that included the CAE and the radiomic features showed AUC values of 0.88 and 0.81, respectively, while the model based on FIV had an AUC of 0.79. This difference was not found to be statistically significant. Feature importance results showed that lesion intensity heterogeneity received more weight than lesion volume in outcome prediction. This study suggests that predictions of functional outcome should not be based on FIV alone and that FU-DWI images capture additional prognostic information.Entities:
Keywords: acute ischemic stroke; follow-up DWI; functional independence; infarct core image features; infarct core segmentation; support vector machine
Year: 2022 PMID: 35892499 PMCID: PMC9331690 DOI: 10.3390/diagnostics12081786
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Study workflow for functional outcome prediction. Three different feature sets were extracted: follow-up infarct volume, features extracted by a convolutional autoencoder, and radiomic features. Each feature set was split into a training (80%) set and a test (20%) set. A support vector machine (SVM) was trained on the training set to classify favorable outcome. The SVMs were tested on the test set. The results were evaluated for each SVM.
Figure 2The convolutional autoencoder architecture. The dimensions of the input image were 64 × 80 × 64. The encoder consisted of four 4 × 4 × 4 convolutional layers with stride 2 and rectified linear unit activation. For each subsequent convolutional layer, the number of filters was doubled, starting at 16. Each convolutional layer was followed by group normalization. After the final convolutional layer of the encoder, the feature space was flattened, and a dense layer was added. The decoder contained the same components as the encoder in the opposite direction, except that the feature space was upsampled first by a factor of 2, and the stride of the convolutional layers was kept at 1. After the fourth convolutional layer, three additional convolutional layers reduced the fourth dimension of feature space to 1, resulting in the original image dimensions.
Figure 3Illustration of the three radiomic feature classes. Radiomic features consist of shape, texture, and first-order statistics features. Shape features describe the 2D and 3D size and shape of the lesion. Textural features describe the intensity distribution and relations between neighboring voxels. First-order statistics describe the intensity distributions of the lesion.
Figure 4Example of imaging reconstruction using a trained convolutional autoencoder. (Left) An axial slice of the original validation image. (Middle) The corresponding slice of the predicted image. (Right) The absolute difference between the normalized original and predicted images.
Training and testing accuracy, AUC, precision, and recall for the best-performing SVM classifiers based on FIV, the autoencoder features, and the radiomic features. The p-values resulting from deLong’s tests against the radiomic features are presented in the last column.
| Feature Extraction Method | Training Accuracy ( | Testing Accuracy ( | AUC | Precision | Recall | deLong’s Test |
|---|---|---|---|---|---|---|
| FIV only * | 0.73 | 0.74 | 0.79 | 0.78 | 0.73 | 0.15 |
| Autoencoder ** | 0.76 | 0.71 | 0.81 | 0.70 | 0.71 | 0.37 |
| Radiomics *** | 0.75 | 0.71 | 0.88 | 0.80 | 0.65 |
* SVM parameters: {C: 1000, gamma: 0.01, kernel: rbf}, ** SVM parameters: {C: 0.1, gamma: 0.01, kernel: linear}, *** SVM parameters: {C: 1, gamma: 0.001, kernel: sigmoid}.
Figure 5Receiver operating curves for the best-performing support vector machine model based on three different inputs: features extracted by a convolutional autoencoder, radiomic features, and follow-up infarct volume.
Figure 6SHAP summary plot showing the top 15 radiomic features (and their feature classes) in terms of impact on the classification based on the SHAP values. Negative and positive SHAP values represent unfavorable and favorable outcome classifications, respectively. The feature values are represented by a color map, ranging from blue (low value) to red (high value). Abbreviations of second-order radiomic feature classes in gray-level matrices: size zone (glszm), dependence (gldm), and run length (glrlm).