| Literature DB >> 34556732 |
Dixiang Song1, Yixuan Zhai1, Xiaogang Tao1, Chao Zhao1, Minkai Wang1, Xinting Wei2.
Abstract
This study attempts to explore the radiomics-based features of multi-parametric magnetic resonance imaging (MRI) and construct a machine-learning model to predict the blood supply in vestibular schwannoma preoperatively. By retrospectively collecting the preoperative MRI data of patients with vestibular schwannoma, patients were divided into poor and rich blood supply groups according to the intraoperative recording. Patients were divided into training and test cohorts (2:1), randomly. Stable features were retained by intra-group correlation coefficients (ICCs). Four feature selection methods and four classification methods were evaluated to construct favorable radiomics classifiers. The mean area under the curve (AUC) obtained in the test set for different combinations of feature selecting methods and classifiers was calculated separately to compare the performance of the models. Obtain and compare the best combination results with the performance of differentiation through visual observation in clinical diagnosis. 191 patients were included in this study. 3918 stable features were extracted from each patient. Least absolute shrinkage and selection operator (LASSO) and logistic regression model was selected as the optimal combinations after comparing the AUC calculated by models, which predicted the blood supply of vestibular schwannoma by K-Fold cross-validation method with a mean AUC = 0.88 and F1-score = 0.83. Radiomics machine-learning classifiers can accurately predict the blood supply of vestibular schwannoma by preoperative MRI data.Entities:
Mesh:
Year: 2021 PMID: 34556732 PMCID: PMC8460834 DOI: 10.1038/s41598-021-97865-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The radiomics workflow in this study. The patient's MRI data was acquired. After uniform preprocessing, two neurosurgeons drew the ROI separately for feature extraction. Samples were grouped by intraoperative recording, the stable features screened by ICC entered the machine learning process to select the best model, which was used to compare the performance with the neurosurgeon's visual observation.
Sex, age and intraoperative blood loss distribution between the poor blood supply group and the rich blood supply group.
| Characteristics | Groups | Total | p-value | |
|---|---|---|---|---|
| Poor blood supply | Rich blood supply | |||
| Male | 26 | 56 | 72 | 0.09 |
| Female | 46 | 63 | 109 | |
| Age (y) | 52.7 ± 11.8 | 48.4 ± 12.0 | 50.0 ± 12.1 | 0.73 |
| Intraoperative blood loss (ml) | 165.3 ± 156.6 | 251.9 ± 217.9 | 219.0 ± 201.2 | 0.01 |
Mean ± standard deviation.
Figure 2The distribution of intraoperative blood loss in the poor blood supply group and rich blood supply group.
Number of features extracted by different feature extraction methods and the average AUC.
| Feature_Selection_Methods | AUC of classification methods | Number of selected Voxel | ||||
|---|---|---|---|---|---|---|
| RF | MLR | SVM | DT | Average | ||
| t_test | 0.65 | 0.74 | 0.70 | 0.63 | 0.68 | 10 |
| LASSO | 0.70 | 0.88 | 0.84 | 0.62 | 0.76 | 12 |
| ANOVA | 0.60 | 0.61 | 0.50 | 0.61 | 0.58 | 14 |
| t_test + LASSO | 0.73 | 0.80 | 0.77 | 0.65 | 0.74 | 12 |
| Average | 0.67 | 0.76 | 0.70 | 0.63 | 0.69 | |
Figure 3The building of machine-learning models. (A) Heatmap of AUC for different feature selection methods and classifier combinations. (B) Learning curve of the best combination (LASSO + MLR). (C) ROC curves for each of the 15 validations of the LASSO + MLR combination and the average ROC curve for all validation results. (D) ROC curves for combinations of LASSO and different classifier.
Selected features and their coefficient weights in LASSO feature selection method.
| Sequences | Features | Coefficient |
|---|---|---|
| T1-CE | T1c_square_glcm_InverseVariance | − 0.050471655 |
| T1c_square_glcm_JointEnergy | 0.033219662 | |
| T1c_squareroot_glszm_SizeZoneNonUniformityNormalized | − 0.051365988 | |
| T1c_wavelet-HHL_firstorder_Kurtosis | 0.097581273 | |
| T1c_wavelet-HLL_gldm_DependenceVariance | − 0.055243396 | |
| T1WI | T1_wavelet-HLL_glrlm_GrayLevelNonUniformityNormalized | − 0.029010829 |
| T2WI | T2_exponential_glcm_ClusterProminence | 0.036160982 |
| T2_exponential_gldm_DependenceVariance | 0.050859759 | |
| T2_exponential_glrlm_ShortRunLowGrayLevelEmphasis | 0.064971769 | |
| T2-FLAIR | T2_Flair_logarithm_glcm_ClusterShade | 0.035088135 |
| T2_Flair_log-sigma-3-0-mm-3D_firstorder_InterquartileRange | − 0.02945977 | |
| T2_log-sigma-3-0-mm-3D_glrlm_LongRunHighGrayLevelEmphasis | − 0.031169 |
Comparison of the performance results between the neurosurgeons in visual observation and machine-learning model.
| Reader | Precision | Sensitivity | F1 score | Accuracy |
|---|---|---|---|---|
| Model | 0.87 ± 0.03 | 0.86 ± 0.02 | 0.83 ± 0.02 | 0.83 ± 0.01 |
| Doctor A | 0.65 | 0.61 | 0.62 | 0.61 |
| Doctor B | 0.68 | 0.64 | 0.65 | 0.64 |
| Average | 0.67 | 0.63 | 0.64 | 0.63 |
F1 score F-score or F-measure is a measure of a test’s accuracy which is calculated from the precision and recall of the test.
Mean ± standard deviation.
Figure 4Pseudo-color map generated from ROI of T1-CE sequences. (A) T1-CE sequence after imaging preprocessing. (B) Pseudo-color map reflects the difference in the details of the grayscale within the ROI, which makes some details more visible.