| Literature DB >> 26697407 |
Chintan Parmar1, Patrick Grossmann2, Derek Rietveld3, Michelle M Rietbergen4, Philippe Lambin5, Hugo J W L Aerts2.
Abstract
INTRODUCTION: "Radiomics" extracts and mines a large number of medical imaging features in a non-invasive and cost-effective way. The underlying assumption of radiomics is that these imaging features quantify phenotypic characteristics of an entire tumor. In order to enhance applicability of radiomics in clinical oncology, highly accurate and reliable machine-learning approaches are required. In this radiomic study, 13 feature selection methods and 11 machine-learning classification methods were evaluated in terms of their performance and stability for predicting overall survival in head and neck cancer patients.Entities:
Keywords: cancer; computational science; machine learning; quantitative imaging; radiology; radiomics
Year: 2015 PMID: 26697407 PMCID: PMC4668290 DOI: 10.3389/fonc.2015.00272
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Table defining the acronyms related to the used feature selection and classification methods.
| Classification method acronym | Classification method name | Feature Selection method acronym | Feature selection method name |
|---|---|---|---|
| Nnet | Neural network | RELF | Relief |
| DT | Decision tree | FSCR | Fisher score |
| BST | Boosting | GINI | Gini index |
| BY | Bayesian | JMI | Joint mutual information |
| BAG | Bagging | CIFE | Conditional infomax feature extraction |
| RF | Random forset | DISR | Double input symmetric relevance |
| MARS | Multi adaptive regression splines | MIM | Mutual information maximization |
| SVM | Support vector machines | CMIM | Conditional mutual information maximization |
| NN | Neirest neighbor | ICAP | Interaction capping |
| GLM | Generalized linear models | TSCR | |
| PLSR | Partial least squares and prinicipal component regression | MRMR | Minimum redundancy maximum relevance |
| – | – | MIFS | Mutual information feature selection |
| – | – | WLCX | Wilcoxon |
Table describing the representative AUC and stability of feature selection methods.
| Feature selection method | AUC (HNSCC) | AUC (NSCLC) | Stability (HNSCC) | Stability (NSCLC) |
|---|---|---|---|---|
| RELF | 0.62 ± 0.09 (High) | 0.61 ± 0.04 (High) | 0.63 ± 0.12 (Low) | 0.91 ± 0.05 (High) |
| FSCR | 0.63 ± 0.08 (High) | 0.62 ± 0.04 (High) | 0.51 ± 0.13 (Low) | 0.78 ± 0.08 (High) |
| GINI | 0.58 ± 0.07 (Low) | 0.62 ± 0.04 (High) | 0.66 ± 0.11 (High) | 0.68 ± 0.10 (Low) |
| JMI | 0.59 ± 0.07 (Low) | 0.61 ± 0.04 (High) | 0.67 ± 0.05 (High) | 0.68 ± 0.05 (Low) |
| CIFE | 0.68 ± 0.08 (High) | 0.60 ± 0.03 (Low) | 0.7 ± 0.04 (High) | 0.69 ± 0.05 (Low) |
| DISR | 0.56 ± 0.06 (Low) | 0.62 ± 0.05 (High) | 0.65 ± 0.08 (Low) | 0.69 ± 0.05 (Low) |
| MIM | 0.61 ± 0.08 (High) | 0.61 ± 0.04 (High) | 0.64 ± 0.1 (Low) | 0.94 ± 0.02 (High) |
| CMIM | 0.6 ± 0.07 (Low) | 0.62 ± 0.04 (High) | 0.71 ± 0.04 (High) | 0.73 ± 0.04 (Low) |
| ICAP | 0.59 ± 0.07 (Low) | 0.61 ± 0.03 (High) | 0.71 ± 0.03 (High) | 0.72 ± 0.04 (Low) |
| TSCR | 0.62 ± 0.07 (High) | 0.61 ± 0.02 (High) | 0.54 ± 0.01 (Low) | 0.78 ± 0.12 (High) |
| MRMR | 0.69 ± 0.07 (High) | 0.63 ± 0.06 (High) | 0.66 ± 0.03 (High) | 0.74 ± 0.03 (High) |
| MIFS | 0.66 ± 0.07 (High) | 0.63 ± 0.06 (High) | 0.69 ± 0.04 (High) | 0.8 ± 0.03 (High) |
| WLCX | 0.55 ± 0.06 (Low) | 0.65 ± 0.02 (High) | 0.71 ± 0.06 (High) | 0.84 ± 0.05 (High) |
HNSCC thresholds: AUC = 0.61, stability = 0.66; NSCLC thresholds: AUC = 0.61, stability = 0.74.
Table describing the representative AUC and stability of classification methods.
| Classification method | AUC (HNSCC) | AUC (NSCLC) | RSD% (HNSCC) | RSD% (NSCLC) |
|---|---|---|---|---|
| Nnet | 0.59 ± 0.07 (Low) | 0.57 ± 0.04 (Low) | 11.54 (Low) | 6.41 (Low) |
| DT | 0.56 ± 0.05 (Low) | 0.54 ± 0.04 (Low) | 11.4 (High) | 7.89 (Low) |
| BST | 0.56 ± 0.07 (Low) | 0.58 ± 0.04 (Low) | 11.28 (High) | 8.23 (Low) |
| BY | 0.67 ± 0.06 (High) | 0.64 ± 0.05 (High) | 11.28 (High) | 0.86 (High) |
| BAG | 0.55 ± 0.06 (Low) | 0.64 ± 0.03 (High) | 9.27 (High) | 5.56 (High) |
| RF | 0.61 ± 0.06 (High) | 0.66 ± 0.03 (High) | 7.36 (High) | 3.52 (High) |
| MARS | 0.58 ± 0.05 (Low) | 0.61 ± 0.03 (High) | 12.47 (Low) | 6.98 (Low) |
| SVM | 0.64 ± 0.09 (High) | 0.61 ± 0.03 (High) | 12.69 (Low) | 6.39 (Low) |
| NN | 0.62 ± 0.05 (High) | 0.61 ± 0.02 (High) | 10.52 (High) | 4.08 (High) |
| GLM | 0.72 ± 0.08 (High) | 0.63 ± 0.02 (High) | 11.78 (Low) | 2.19 (High) |
| PLSR | 0.73 ± 0.07 (High) | 0.63 ± 0.02 (High) | 12.75 (Low) | 2.24 (High) |
HNSCC thresholds: AUC = 0.61, RSD% = 11.4; NSCLC thresholds: AUC = 0.61, RSD% = 5.56.
Figure 1In total, 196 HNSCC patients were considered. Four hundred forty radiomic features were extracted from the segmented tumor regions of the CT images. Feature selection and classification training were done using the training cohort HN1 (n = 101), whereas HN2 (n = 95) cohort was used as a validation cohort.
Figure 2Heatmap depicting the prognostic performance (AUC) of feature selection (in rows) and classification (in columns) methods. It can be observed that PLSR and GLM classification methods and feature selection methods MRMR and MIFS shows relatively high prognostic performance in many cases.
Figure 3Scatterplots of stability and prognostic performance (AUC) for feature selection (Left) and classification methods (right). Feature selection methods having stability ≥0.66 (median stability) and AUC ≥ 0.61 (median AUC) are considered as highly reliable and prognostic methods. Similarly, classification methods having RSD ≤ 11.4 (median RSD) and AUC ≥ 0.61 (median AUC) are considered as highly reliable and accurate ones. Highly reliable and prognostic methods are displayed in a gray square region.
Figure 4Variation of AUC explained by the experimental factors and their interactions. It can be observed that classification method was the most dominant source of variation in prediction score. Size of the selected (representative) feature subset shared the least of the total variance.