| Literature DB >> 30921550 |
Rafael Ortiz-Ramón1, Maria Del C Valdés Hernández2, Victor González-Castro3, Stephen Makin4, Paul A Armitage5, Benjamin S Aribisala6, Mark E Bastin7, Ian J Deary8, Joanna M Wardlaw7, David Moratal1.
Abstract
BACKGROUND: The differential quantification of brain atrophy, white matter hyperintensities (WMH) and stroke lesions is important in studies of stroke and dementia. However, the presence of stroke lesions is usually overlooked by automatic neuroimage processing methods and the-state-of-the-art deep learning schemes, which lack sufficient annotated data. We explore the use of radiomics in identifying whether a brain magnetic resonance imaging (MRI) scan belongs to an individual that had a stroke or not.Entities:
Keywords: Radiomics; Small vessel disease; Stroke; Texture analysis; White matter hyperintensities
Mesh:
Year: 2019 PMID: 30921550 PMCID: PMC6553681 DOI: 10.1016/j.compmedimag.2019.02.006
Source DB: PubMed Journal: Comput Med Imaging Graph ISSN: 0895-6111 Impact factor: 4.790
Fig. 1Set of images obtained in each patient. In this representative case, T1-weighted (T1W), T2-weighted (T2W) and fluid-attenuated inversion recovery (FLAIR) brain images of normal appearing white matter (NAWM), white matter hyperintensities (WMH) and subcortical structures (SS) of a lacunar stroke patient are presented.
Fig. 2Process followed to extract the 3D features of a FLAIR image of the NAWM. The same process is applied to all the images of each MRI modality (FLAIR, T1W and T2W) and of each tissue/structure (NAWM, WMH and SS).
Texture features extracted.
| Method | Textural Features | Number of Features |
|---|---|---|
| GLCM | Energy, Contrast, Correlation, Variance, Homogeneity, Sum average, Sum variance, Sum entropy, Entropy, Difference variance, Difference entropy, First information measure of correlation (FIMC), Second information measure of correlation (SIMC) | 13 |
| GLRLM | Short Run Emphasis (SRE), Long Run Emphasis (LRE), Grey-level Non-uniformity (GLN), Run-Length Non-uniformity (RLN), Run Percentage (RP), Low Grey-level Run Emphasis (LGRE), High Grey-level Run Emphasis (HGRE), Short Run Low Grey-level Emphasis (SRLGE), Short Run High Grey-level Emphasis (SRHGE), Long Run Low Grey-level Emphasis (LRLGE), Long Run High Grey-level Emphasis (LRHGE) | 11 |
| LBP | LBP histogram bins: LBP1, LBP2, LBP3, …, LBP36 | 40 |
| LBP image statistics: LBP_Median, LBP_Variance, LBP_Skewness, LBP_Kurtosis | ||
| WSF | Mean_OI, SD_OI (OI: Original image) | 26 |
| Mean_LL | ||
| SD_LL | ||
| WCF | Energy_LL1, Contrast_LL1, Correlation_LL1, Homogeneity_LL1, Entropy_LL1, Variance_LL1 | 24 |
| Energy_LH1, Contrast_LH1, Correlation_LH1, Homogeneity_LH1, Entropy_LH1, Variance_LH1 | ||
| Energy_HL1, Contrast_HL1, Correlation_HL1, Homogeneity_HL1, Entropy_HL1, Variance_HL1 | ||
| Energy_HH1, Contrast_HH1, Correlation_HH1, Homogeneity_HH1, Entropy_HH1, Variance_HH1 |
Fig. 3First DWT level of decomposition of a FLAIR image of the white matter tissue (NAWM and WMH) of a single brain slice.
Fig. 4Cross-validation structure used to evaluate the 90 texture datasets. All the samples of each texture dataset were randomly separated R = 10 times in F = 5 folds to evaluate the model with the averaged AUC. This process was repeated for the two models studied (SVM with linear kernel and RF) and for all the chosen tuning parameters. The feature selection process was only applied to those sets which provided the best results to examine the influence of the number of features used to train the model.
Number of significant features (p < 0.05) for discriminating cortical vs. lacunar stroke patients before (numerator) and after (denominator) Holm-Bonferroni correction for multiple comparisons per MRI sequence and brain tissue/structure.
| MRI Sequence | Tissue or Structure | ||
|---|---|---|---|
| Normal-appearing white matter | Subcortical structures | White matter hyperintensities | |
| FLAIR | 0 / 0 | 16 / 0 | 1 / 0 |
| T2W | 3 / 0 | 9 / 0 | 1 / 0 |
| T1W | 11 / 0 | 19 / 2 | 1 / 0 |
Results of the classification analysis for cortical and lacunar stroke patients. The AUC values computed by averaging the results of the validation data (mean ± standard deviation) are shown for the two models (SVM with linear kernel and RF) and for all the MRI sequences and brain tissues/structures when using the texture features extracted from the 5 texture analysis methods. The presented values are obtained for the best tuning parameter in each case.
| AUC: Mean (SD) | GLRLM | GLCM | LBP | WCF | WSF | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NAWM | SS | WMH | NAWM | SS | WMH | NAWM | SS | WMH | NAWM | SS | WMH | NAWM | SS | WMH | ||
| FLAIR | RF | <0.6 | <0.6 | <0.6 | <0.6 | <0.6 | <0.6 | <0.5 | <0.6 | <0.6 | <0.5 | 0.600 (0.118) | <0.5 | <0.5 | <0.5 | <0.5 |
| SVM | <0.6 | <0.6 | <0.5 | <0.5 | <0.6 | <0.5 | <0.6 | 0.604 (0.121) | <0.6 | <0.6 | <0.5 | <0.5 | <0.6 | <0.5 | <0.5 | |
| T2W | RF | <0.6 | <0.6 | <0.6 | <0.6 | 0.611 (0.121) | <0.5 | <0.5 | <0.5 | 0.616 (0.107) | <0.5 | <0.6 | <0.6 | <0.4 | 0.605 (0.117) | <0.4 |
| SVM | <0.5 | 0.622 (0.125) | <0.6 | 0.667 (0.117) | <0.5 | <0.5 | <0.6 | <0.5 | <0.6 | <0.6 | 0.604 (0.129) | <0.6 | 0.621 (0.140) | <0.6 | 0.661 (0.132) | |
| T1W | RF | <0.5 | <0.5 | <0.5 | <0.6 | <0.6 | <0.6 | <0.6 | <0.6 | <0.5 | <0.6 | <0.6 | <0.5 | 0.649 (0.114) | <0.6 | <0.5 |
| SVM | <0.5 | <0.5 | <0.6 | <0.6 | 0.637 (0.140) | <0.6 | <0.6 | <0.6 | 0.616 (0.092) | <0.6 | <0.5 | <0.5 | 0.618 (0.128) | <0.5 | <0.6 | |
AUC: area under the curve, RF: random forest classifier, SVM: support vector machine classifier, GLRM: grey–level run length matrix features, GLCM: grey-level co-occurrence matrix features, LBP: local binary patterns features, WCF: wavelet co-occurrence features, WSF: wavelet statistical features, NAWM: normal-appearing white matter, SS: subcortical structures, WMH: white matter hyperintensities.
Number of significant features (p < 0.05) for discriminating “no stroke” and “old stroke” individuals before (numerator) and after (denominator) Holm-Bonferoni correction for multiple comparisons per MRI sequence and brain tissue/structure.
| MRI Sequence | Tissue or Structure | ||
|---|---|---|---|
| Normal-appearing white matter | Subcortical structures | White matter hyperintensities | |
| FLAIR | 5 / 1 | 79 / 71 | 30 / 9 |
| T2W | 1 / 0 | 79 / 72 | 34 / 3 |
| T1W | 20 / 4 | 71 / 66 | 30 / 9 |
Results of the classification analysis for “old stroke” and “no-stroke” individuals. The AUC values computed by averaging the results of the validation data (mean ± SD) are shown for the two models (SVM with linear kernel and RF) and for all the MRI sequences and brain tissues/structures when using the texture features extracted from the 5 texture analysis methods. The presented values are obtained for the best tuning parameter in each case.
| GLRLM | GLCM | LBP | WCF | WSF | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NAWM | SS | WMH | NAWM | SS | WMH | NAWM | SS | WMH | NAWM | SS | WMH | NAWM | SS | WMH | ||
| FLAIR | RF | <0.6 | 0.691 (0.109) | 0.674 (0.108) | <0.6 | 0.612 (0.099) | 0.608 (0.111) | <0.5 | <0.6 | 0.647 (0.099) | 0.669 (0.114) | 0.635 (0.094) | <0.6 | |||
| SVM | <0.6 | 0.676 (0.097) | <0.5 | 0.666 (0.090) | 0.614 (0.124) | <0.5 | 0.682 (0.136) | 0.637 (0.121) | <0.6 | <0.6 | 0.637 (0.137) | <0.6 | <0.6 | |||
| T2W | RF | <0.5 | 0.643 (0.099) | <0.6 | <0.5 | 0.641 (0.107) | 0.617 (0.102) | <0.5 | 0.680 (0.112) | 0.608 (0.116) | <0.5 | 0.680 (0.097) | <0.5 | 0.665 (0.103) | ||
| SVM | 0.665 (0.084) | 0.646 (0.128) | 0.601 (0.138) | 0.644 (0.111) | <0.5 | <0.6 | 0.671 (0.122) | <0.5 | 0.608 (0.157) | <0.5 | <0.6 | 0.677 (0.123) | ||||
| T1W | RF | 0.609 (0.104) | 0.654 (0.112) | <0.6 | <0.6 | 0.662 (0.091) | 0.659 (0.113) | 0.667 (0.125) | 0.649 (0.120) | 0.611 (0.140) | 0.624 (0.105) | 0.682 (0.109) | 0.664 (0.115) | <0.6 | <0.5 | <0.6 |
| SVM | <0.6 | 0.662 (0.104) | <0.5 | 0.642 (0.126) | <0.6 | <0.5 | <0.6 | 0.676 (0.122) | 0.630 (0.126) | 0.628 (0.143) | <0.6 | <0.6 | <0.6 | <0.5 | <0.6 | |
*Values in bold indicate the best AUC results (AUC > 0.7).
AUC: area under the curve, RF: random forest classifier, SVM: support vector machine classifier, GLRM: grey–level run length matrix features, GLCM: grey-level co-occurrence matrix features, LBP: local binary patterns features, WCF: wavelet co-occurrence features, WSF: wavelet statistical features, NAWM: normal-appearing white matter, SS: subcortical structures, WMH: white matter hyperintensities.
Values of AUC obtained when analysing the best texture datasets with and without applying feature selection, i.e. using all the features of the dataset (All) and reducing the number of features based on two metrics: the p-value (p-val) and the maximal information coefficient (MIC).
| GLRLM FLAIR – WMH | LBP FLAIR – SS | LBP T2W – SS | WCF FLAIR – NAWM | WCF T2W – WMH | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All | MIC | All | MIC | All | MIC | All | MIC | All | MIC | ||||||
| RF | 0.674 (0.108) | = | = | 0.742 (0.100) | = | 0.744 (0.104) | 0.680 (0.112) | 0.693 (0.101) | 0.714 (0.113) | 0.761 (0.097) | 0.766 (0.099) | 0.766 (0.086) | 0.752 (0.097) | = | = |
| SVM | 0.770 (0.089) | 0.773 (0.089) | 0.773 (0.093) | 0.751 (0.103) | = | 0.759 (0.103) | 0.763 (0.116) | 0.774 (0.099) | 0.828 (0.075) | 0.637 (0.121) | 0.713 (0.125) | 0.712 (0.112) | <0.5 | <0.6 | = |
AUC: area under the curve, RF: random forest classifier, SVM: support vector machine classifier, GLRM: grey–level run length matrix features, GLCM: grey-level co-occurrence matrix features, LBP: local binary patterns features, WCF: wavelet co-occurrence features, WSF: wavelet statistical features, NAWM: normal-appearing white matter, SS: subcortical structures, WMH: white matter hyperintensities, MIC: maximal information coefficient.
The symbol “=” is used when no improvement is obtained by reducing the number of features.
Fig. 5Results of applying the feature selection method based on MIC to the texture dataset of LBP features extracted from T2W images of subcortical structures (SS) when training the SVM model with cost C = 8. The profile of AUC values obtained for all possible subsets of features according to the MIC ranking is illustrated in (a). The ROC curves provided by the model when using all the features (14 features) and when using the optimal number of features (7 features) is shown in (b).
Values of AUC obtained when analysing the best texture datasets (without feature selection) with and without including the textures extracted from the additional older patients.
| GLRLM FLAIR – WMH | LBP FLAIR – SS | LBP T2W – SS | WCF FLAIR – NAWM | WCF T2W – WMH | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Without older patients | With older patients | Without older patients | With older patients | Without older patients | With older patients | Without older patients | With older patients | Without older patients | With older patients | |
| RF | 0.674 (0.108) | 0.682 (0.089) | 0.742 (0.100) | 0.655 (0.098) | 0.680 (0.112) | 0.623 (0.078) | 0.761 (0.097) | 0.645 (0.086) | 0.752 (0.097) | 0.678 (0.074) |
| SVM | 0.770 (0.089) | 0.736 (0.084) | 0.751 (0.103) | 0.644 (0.106) | 0.763 (0.116) | 0.670 (0.083) | 0.637 (0.121) | 0.580 (0.106) | <0.5 | <0.5 |
AUC: area under the curve, RF: random forest classifier, SVM: support vector machine classifier, GLRM: grey–level run length matrix features, GLCM: grey-level co-occurrence matrix features, LBP: local binary patterns features, WCF: wavelet co-occurrence features, WSF: wavelet statistical features, NAWM: normal-appearing white matter, SS: subcortical structures, WMH: white matter hyperintensities.
Exception where the AUC increased after adding older patients.
Fig. 6Pattern of the classification performance of the best models (i.e. for which the accuracy was above 80%) per stroke subtype (i.e. no stroke, large cortical, small cortical or lacunar) (left) and per stroke occurrence (i.e. had stroke or not) (right).