| Literature DB >> 35805036 |
Erika Yamazawa1, Satoshi Takahashi2,3, Masahiro Shin1,4, Shota Tanaka1, Wataru Takahashi5, Takahiro Nakamoto5,6, Yuichi Suzuki5, Hirokazu Takami1, Nobuhito Saito1.
Abstract
Chordoma and chondrosarcoma share common radiographic characteristics yet are distinct clinically. A radiomic machine learning model differentiating these tumors preoperatively would help plan surgery. MR images were acquired from 57 consecutive patients with chordoma (N = 32) or chondrosarcoma (N = 25) treated at the University of Tokyo Hospital between September 2012 and February 2020. Preoperative T1-weighted images with gadolinium enhancement (GdT1) and T2-weighted images were analyzed. Datasets from the first 47 cases were used for model creation, and those from the subsequent 10 cases were used for validation. Feature extraction was performed semi-automatically, and 2438 features were obtained per image sequence. Machine learning models with logistic regression and a support vector machine were created. The model with the highest accuracy incorporated seven features extracted from GdT1 in the logistic regression. The average area under the curve was 0.93 ± 0.06, and accuracy was 0.90 (9/10) in the validation dataset. The same validation dataset was assessed by 20 board-certified neurosurgeons. Diagnostic accuracy ranged from 0.50 to 0.80 (median 0.60, 95% confidence interval 0.60 ± 0.06%), which was inferior to that of the machine learning model (p = 0.03), although there are some limitations, such as the risk of overfitting and the lack of an extramural cohort for truly independent final validation. In summary, we created a novel MRI-based machine learning model to differentiate skull base chordoma and chondrosarcoma from multiparametric signatures.Entities:
Keywords: MRI; chondrosarcoma; chordoma; diagnostic performance; machine learning model; radiomics
Year: 2022 PMID: 35805036 PMCID: PMC9265125 DOI: 10.3390/cancers14133264
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Patient and tumor characteristics.
| Training Dataset | Final Validation Dataset | |||
|---|---|---|---|---|
| Chordoma | Chondorosarcoma | Chordoma | Chondrosarcoma | |
| Case number | 27 | 20 | 5 | 5 |
| Median age | 51.0 | 40.5 | 57.5 | 64.0 |
| Sex | ||||
| Male | 15 | 11 | 4 | 0 |
| Female | 12 | 9 | 1 | 5 |
| Median volume (cm3) | 12.7 | 9.5 | 6.1 | 12.0 |
Figure 1Diagram of features extraction. A total of 2438 features were extracted from each MRI sequence (GdT1 and T2). Abbreviations: quantization levels—QLs; gray-level co-occurrence matrix—GLCM; gray-level run-length matrix—GLRLM; gray-level size-zone matrix—GLSZM; neighboring gray-level dependence matrix—NGLDM; neighborhood gray-tone difference matrix—NGTDM.
Selected features for machine learning models.
| MRI Sequence | Wavelet | Quantization | Type | Name | Frequency |
|---|---|---|---|---|---|
| GdT1 | HLL | - | Intensity | Mean | 2 |
| HHL | 4-bit (16) | GLCM | Correlation | 2 | |
| HLH | 4-bit (16) | GLSZM | High gray-level zone emphasis (HGZE) | 1 | |
| LLL | 4-bit (16) | GLSZM | Zone size variance (ZSV) | 1 | |
| LLL | 5-bit (32) | GLSZM | Low gray-level zone emphasis (LGZE) | 1 | |
| LLH | 7-bit (128) | GLRLM | Gray-level variance (GLV) | 1 | |
| LHH | - | Intensity | Skewness | 1 | |
| T2 | LHL | 4-bit (16) | GLSZM | Gray-level non-uniformity (GLN) | 2 |
| HHH | 4-bit (16) | GLRLM | Gray-level non-uniformity (GLN) | 1 |
Figure 2Workflow. The workflow is composed of 3 main phases, “Feature selection Phase”, “Model selection Phase”, and “Test Phase”. ” Feature selection Phase” includes determination of appropriate feature number and feature selection. “Model selection Phase” includes parameter determination with nested validation and evaluation of the learning model. “Test Phase” is a final test of models with 10 independent cases for final validation. * Test cases. X: 7 for GdT1, 2 for T2. Abbreviations: recursive feature elimination—RFE; area under the curve—AUC; logistic regression—LR; support vector machine—SVM.
Figure 3Diagnostic accuracy of the machine learning models in the training dataset (refer to Figure 2). (A) The AUC of the model with T2 logistic regression. (B) The AUC of the model with GdT1 logistic regression. (C) The AUC of the model with T2 and GdT1 logistic regression. (D) The AUC of the model with T2 SVM. (E) The AUC of the model with GdT1 SVM. (F) The AUC of the model with T2 and GdT1 SVM. The number of each mean ROC represents mean +/− standard deviation of the 6 AUCs. Abbreviations: receiver operating characteristics—ROC; area under the curve—AUC; T2 weighted image—T2; post-gadolinium T1-weighted images—GdT1; support vector machine—SVM.
AUC and diagnostic accuracy of machine learning models.
| Logistic Regression | Support Vector Machine | |
|---|---|---|
| T2 | 0.87 ± 0.07 (0.4) | 0.86 ± 0.05 (0.4) |
| GdT1 | 0.93 ± 0.06 (0.9) * | 0.89 ± 0.10 (0.7) |
| T2 + GdT1 | 0.95 ± 0.07 (0.7) | 0.92 ± 0.07 (0.7) |
The numbers in each cell represent AUC (mean ± standard deviation) and diagnostic accuracy in parentheses. Abbreviations: area under the curve—AUC; T2-weighted image—T2; post-gadolinium T1-weighted image—GdT1. * Highest accuracy.
Figure 4Representative cases of chordomas and chondrosarcomas. (A–C) GdT1, T2, and fsGdT1 coronal images of a typical chordoma case. The machine learning model correctly diagnosed the tumor, and high diagnostic accuracy (80%) by neurosurgeons was noted. (D–F) GdT1, T2, and fsGdT1 coronal images of a typical chondrosarcoma case. The machine learning model correctly diagnosed the tumor, and the highest diagnostic accuracy by the neurosurgeons (85%) was noted. (G–I) GdT1, T2, and fsGdT1 coronal images of the chordoma case with the correct diagnosis by the machine learning model but with low diagnostic accuracy by the neurosurgeons (40%). (J–L) GdT1, T2, and fsGdT1 coronal images of the chondrosarcoma case with the incorrect diagnosis by the best machine learning model and the lowest diagnostic accuracy by the neurosurgeons (20%). Abbreviations: T2 weighted image—T2; post-gadolinium T1-weighted images—GdT1; fat suppression post-gadolinium T1-weighted images—fsGdT1. Red dotted lines indicates tumor.