| Literature DB >> 32391171 |
Barbara Burtness1, Seyedmehdi Payabvash2, Stefan P Haider2,3, Wendell G Yarbrough4.
Abstract
Recent advancements in computational power, machine learning, and artificial intelligence technology have enabled automated evaluation of medical images to generate quantitative diagnostic and prognostic biomarkers. Such objective biomarkers are readily available and have the potential to improve personalized treatment, precision medicine, and patient selection for clinical trials. In this article, we explore the merits of the most recent addition to the "-omics" concept for the broader field of head and neck cancer - "Radiomics". This review discusses radiomics studies focused on (molecular) characterization, classification, prognostication and treatment guidance for head and neck squamous cell carcinomas (HNSCC). We review the underlying hypothesis, general concept and typical workflow of radiomic analysis, and elaborate on current and future challenges to be addressed before routine clinical application.Entities:
Keywords: Machine learning; Neck; Oropharynx; Radiomics; Squamous cell carcinoma
Year: 2020 PMID: 32391171 PMCID: PMC7197186 DOI: 10.1186/s41199-020-00053-7
Source DB: PubMed Journal: Cancers Head Neck ISSN: 2059-7347
Fig. 1Typical radiomics workflow pipeline
Prediction of HPV status based on radiomics features of HNSCC tumors
| Authors, year | Sample size, cancer type | Ground truth | Imaging modality | ML classifier | Metric: maximum performance |
|---|---|---|---|---|---|
Train: 93, HNSCC Test: 56, HNSCC | p16 | Contrast CT | Logistic regression | Test-AUC: 0.78 | |
| Total: 40, OPSCC | Not reported | Contrast CT | n/a b | n/a b | |
| Total: 46: non-OPSCC | Not reported | Contrast CT | n/a b | n/a b | |
Train: 113, HNSCC Test: 53, HNSCC | Train: HPV RNA c Test: p16 | Contrast CT | LASSO-regularized logistic regression | Nested CV-AUC: 0.73 Test-AUC: 0.76 | |
Train: 628, OPSCC Test:150, OPSCC | p16 | Contrast CT | LASSO-regularized logistic regression | Test-AUC: 0.70–0.80 d | |
| Total: 50, OPSCC | Not reported | Contrast CT | Logistic regression | n/a e | |
Train: 136, OPSCC and LSCC Test:95, OPSCC | Not reported | Contrast CT | Logistic regression | Test-AUC: 0.60 | |
| Total: 107, OPSCC | HPV DNA-ISH | Contrast CT | Diagonal quadratic discriminant analysis | LOOCV-AUC: 0.80 | |
Train: 150, OPSCC Test:165, OPSCC | p16 | Contrast CT | Logistic regression | CV-AUC: 0.75 test-AUC 1 f: 0.87 test-AUC 2 f: 0.92 | |
| Total: 126, HNSCC | Not reported | Contrast CT | Random forest | CV-AUC: 0.71 |
a The reported performance pertains to pure imaging feature-based HPV classification (i.e. models with clinical features were not considered)
b A t-test was used to evaluate differences in texture parameters between HPV-positive and HPV-negative cases
c The VirusSeq-software was used to detect strain-specific HPV RNA sequences in whole-transcriptome sequencing data [51, 58]
d This study evaluated the impact of CT artifacts on the HPV classification performance. A test set AUC performance of 0.8 was achieved after exclusion of all artifact-affected cases from both the training- and test set. The test AUC ranged between 0.70 and 0.80 for all evaluated dataset combinations, including those with artifacts, and was not significantly different for all tested models
e The logistic regression model was trained and tested on the same dataset without feature selection or cross validation, which is prone to overfitting, and overestimation of classification accuracy
f Study reports results of winning submission of radiomics competition, wherein 165 test cases were split into two test sets
AUC Area under the receiver operating characteristics curve, CV Cross validation (of total set or training dataset), DNA-ISH DNA in situ hybridization, HNSCC Head and neck SCC, LOOCV Leave one out cross validation of total set, LSCC Laryngeal SCC, OPSCC Oropharyngeal SCC, Test Independent test dataset, Total Only one dataset used, Train Training dataset
Prediction of locoregional recurrence, treatment response, and survival
| Author, year | Dataset: sample size, HNSCC type | Imaging modality | Therapy | Outcome, model, analysis type | (Endpoint:) variables: metric: maximum performance |
|---|---|---|---|---|---|
Test: 231, HNSCC (HNSCC cohort used for validation only, training on 422 lung cancer primaries) | Pre-treatment contrast CT | RT + chemotherapy or RT + chemotherapy + surgery or RT alone | OS, multivariable Cox regression, regression | Radiom: test-CI 1:0.69 test-CI 2: 0.69 Radiom+Clin: test-CI 1: 0.70 test-CI 2: 0.69 Clin: test-CI 1: 0.69 test-CI 2: 0.66 | |
Train: 128, HNSCC Test: 50, HNSCC | 3-month post treatment FDG-PET | RT + cisplatin / cetuximab or RT + cisplatin + cetuximab | LC, multivariable Cox regression, regression | Radiom: CV-CI: 0.74–0.76 test-CI: 0.71–0.73 (study evaluated reproducibility of identical features using two different software - performance range is reported) | |
Train: 121, HNSCC Test: 51, HNSCC | Pre-treatment FDG-PET, contrast CT | RT + cisplatin / cetuximab or RT + cisplatin + cetuximab | LC, 3 different regression methods, regression | Radiom: CV-CI: 0.77 test-CI: 0.73 | |
Train: 93, HNSCC Test: 56, HNSCC | Pre-treatment contrast CT | RT + cisplatin / cetuximab | LC, multivariable Cox regression, regression | Radiom: train-CI: 0.75 test-CI: 0.78 Clin: train-CI: 0.79 test-CI: 0.73 Radiom + Clin: train-CI: 0.80 test-CI: 0.76 | |
Train: 77, HNSCC Test: 51, HNSCC | Pre-treatment contrast CT | RT + cisplatin / cetuximab or RT + cisplatin + cetuximab | LC and LRC, multivariable Cox regression, regression | LC: Radiom: CV-CI: 0.81 test-CI: 0.70 LRC: Radiom: CV-CI: 0.75 test-CI: 0.67 (addition of lymph node to primary tumor radiomics features was investigated –the best performance reported) | |
| Total: 45, HNSCC | Pre-treatment perfusion CT | IMRT + cisplatin / cetuximab or IMRT + cisplatin + cetuximab | LC, multivariable Cox regression, regression | Radiom: CV-CI: 0.79 Clin: CV-CI: 0.66 | |
| Total: 70, OPSCC | Pre-treatment FDG-PET | RT + platinum-based chemotherapy / cetuximab or RT alone | DSS, OS and PFS, multivariate Cox regression, regression | DSS: Radiom: HR: 0.28 ( OS: Radiom: HR: 0.46 ( PFS: Radiom: HR: 0.32 (p = 0.001) (Only 1 radiomics feature was tested in multivariate Cox regression, along with clinicopathological and FDG-PET variables) | |
| Total: 88, OPSCC | Pre-treatment FDG-PET | RT + chemotherapy / biotherapy or RT alone | PFS and DSS, multivariate Cox regression, regression | PFS: Radiom: HR: 4.38 ( DSS: Radiom: HR: 4.24 ( (single radiomics features were tested in multivariate Cox regression, along with clinicopathological and FDG-PET variables) | |
Train: 70, HNSCC Test: 40, HNSCC | Pre-treatment contrast CT | RT + chemotherapy or RT + chemotherapy + induction-chemotherapy | OS, PFS and LC, multivariable Cox regression, regression | OS: Radiom: train-CI: 0.88 test-CI: 0.90 PFS: Radiom: train-CI: 0.72 test-CI: 0.80 LC: Radiom: train-CI: 0.72 test-CI: 0.80 | |
| Total: 90, HNSCC | Pre-treatment FDG-PET | IMRT + platinum-based chemotherapy w/ or w/o adjuvant / neoadjuvant chemotherapy | PFS and OS, multivariate Cox regression, regression | PFS: Radiom + Clin: CV-CI: 0.76 Clin: CV-CI: 0.65 OS: Radiom + Clin: CV-CI: 0.76 Clin: CV-CI: 0.73 | |
Train: 174, OPSCC Test: 65, OPSCC | Pre-treatment FDG-PET | RT + platinum-based chemotherapy / cetuximab / multidrug regimens | ACM, LF and DM, multivariable logistic regression, classification | ACM: Radiom + Clin: CV-AUC: 0.65 test-AUC: 0.60 LF: Radiom + Clin: CV-AUC: 0.73 test-AUC: 0.68 DM: Radiom + Clin: CV-AUC: 0.66 test-AUC: 0.65 | |
Train 1: 377, HNSCC (CT) Train 2: 345, HNSCC (PET) Test 1: 349, HNSCC (CT) Test 2: 341, HNSCC (PET) | Pre-treatment contrast CT, pre-treatment FDG-PET (separately analyzed) | Not reported (definitive RT as part of treatment was inclusion criterion) | OS, multivariable Cox regression, regression | Radiom: test-AUC 1: 0.72 (CT) test-AUC 2: 0.59 (PET) Clin: test-AUC 1: 0.73 (CT) (AUC was calculated at 3 years post treatment, with patients with risk prediction > median assigned to the high-risk group) | |
| Total: 62, HNSCC | Pre-treatment contrast CT | IMRT + chemotherapy w/ or w/o induction chemotherapy or IMRT alone | LF, multivariate Cox regression, regression | Radiom: HR: 3.75–8.61 (8 features were significant after adjusting for clinical variables; the HR range is reported above) | |
Train: 213, HNSCC Test: 80, HNSCC | Pre-treatment non-contrast CT | RT + chemotherapy | LRC and OS, 11 different ML algorithms, regression | LRC: Radiom: test-CI: 0.71 OS: Radiom: test-CI: 0.64 | |
Test: 542, OPSCC (validation of radiomics signature by Aerts et al. [ | Pre-treatment contrast CT | IMRT + chemotherapy or IMRT alone | OS, multivariable Cox regression, regression | Radiom: test-CI: 0.63 | |
Train: 42, NPC Test: 11, NPC | Pre-treatment T2, contrast-enhanced T1 MRI, diffusion weighted MRI | RT + cisplatin | Therapy response (complete/partial response vs. stable / progressive disease), k-nearest neighbors, neural network, classification | Radiom: CV-acc: 0.95 CV-sens: 0.97 CV-spec: 091 test-acc: 0.91 test-sens: 0.88 test-spec: 1 | |
Total: 296, HNSCC (various partitions in train/test were evaluated) | Pre-treatment non-contrast CT, FDG-PET | RT + chemotherapy or RT alone | RFS, MFS and OS, multivariate Cox regression, regression | RFS: Radiom: mean test-CI: 0.61 Radiom + Clin: mean test-CI: 0.60 Clin: mean test-CI: 0.58 MFS: Radiom: mean test-CI: 0.70 Radiom + Clin: mean test-CI: 0.71 Clin: mean test-CI: 0.61 OS: Radiom: mean test-CI: 0.62 Radiom + Clin: mean test-CI: 0.65 Clin: mean test-CI: 0.62 (the mean was calculated across all test partitions) | |
Train: 85, NPC Test: 43, NPC | Pre-treatment CT, FDG-PET | IMRT + cisplatin or IMRT alone | PFS, multivariate Cox regression, regression | Radiom: train-CI: 0.76 test-CI: 0.62 Radiom + Clin: train-CI: 0.75 test-CI: 0.75 Clin: train-CI: 0.71 test-CI: 0.75 | |
Train: 255, OPSCC Tune: 165, OPSCC Test: 45, OPSCC | Pre-treatment contrast CT | One or combinations of: IMRT / chemotherapy / induction chemotherapy / neck dissection | LC, multivariate Cox regression, regression | Overall performance evaluation of Cox models not reported | |
Train: 80, HYSCC Test: 33, HYSCC | Pre-treatment non-contrast CT and contrast-CT | Laryngeal-preservation treatments (RT, chemotherapy, induction-chemotherapy, neck dissection) | PFS, multivariable Cox regression, regression | Radiom: train-CI: 0.79 test-CI: 0.76 Radiom + Clin: train-CI: 0.80 test-CI: 0.76 Clin: train-CI: 0.63 test-CI: 0.54 | |
| Total: 120, HNSCC | Pre-treatment CT | CRT / IMRT + cisplatin / cetuximab | OS and PFS, multivariable Cox regression, regression | OS: Radiom: HR: 0.3 ( PFS: Radiom: HR: 0.3 (p = 0.01) | |
Train: 70, NPC Test: 30, NPC | Pre-treatment T2, contrast-enhanced T1 MRI | Not reported | PFS, multivariable Cox regression, regression | Radiom: train-HR: 5.14 ( test-HR: 7.28 ( | |
Train: 101, HNSCC Test: 95, HNSCC | Pre-treatment contrast CT | RT + chemotherapy or RT + chemotherapy + surgery or RT alone | OS, 12 different ML classifiers, classification | Radiom: test-AUC: 0.79 | |
Train: 136, HNSCC Test: 95, HNSCC | Pre-treatment contrast CT | RT + chemotherapy or RT + chemotherapy + surgery or RT alone | OS, multivariable Cox regression, regression | Radiom: test-CI: 0.63 | |
| Total: 30, OPSCC and LSCC | Pre-treatment 18F-fluorothymidine PET | RT + platinum-based chemotherapy | PFS, univariate Cox regression, regression | Radiom: HR: 4.10 ( | |
Train: 194, HNSCC Test: 106, HNSCC | Pre-treatment FDG-PET, non-contrast CT | RT + platinum-based chemotherapy / cetuximab or RT alone | LR, DM and OS, logistic regression, random forests, classification (regression analysis was performed for a subset of models; see publication) | LR: Radiom: test-AUC: 0.64 Radiom + Clin: test-AUC: 0.69 DM: Radiom: test-AUC: 0.86 Radiom + Clin: test-AUC: 0.86 OS: Radiom: test-AUC: 0.62 Radiom + Clin: test-AUC: 0.74 | |
| Total: 120, NPC | Pre-treatment T2, contrast-enhanced T1 MRI | Induction-chemotherapy (cisplatin + 5-fluorouracil + docetaxel) | Early response to induction chemotherapy, “Rad-score”, classification | Radiom: train-AUC: 0.82 internally bootstrap-validated train-AUC: 0.82 | |
Train: 240, HNSCC Test: 204, HNSCC | Pre-treatment contrast CT | RT + chemotherapy / cetuximab or RT alone | LC, RC, MFS and DFS, multivariate Cox regression, regression | LC: Radiom: train-CI: 0.62 test-CI: 0.62 Radiom + Clin: train-CI: 0.66 test-CI: 0.64 Clin: train-CI: 0.64 test-CI: 0.62 RC: Radiom: train-CI: 0.78 test-CI: 0.80 Radiom + Clin: train-CI: 0.78 test-CI: 0.80 Clin: train-CI: 0.74 test-CI: 0.76 MFS: Radiom: train-CI: 0.73 test-CI: 0.68 Radiom + Clin: train-CI: 0.72 test-CI: 0.71 Clin: train-CI: 0.71 test-CI: 0.70 DFS: Radiom: train-CI: 0.66 test-CI: 0.65 Radiom + Clin: train-CI: 0.69 test-CI: 0.70 Clin: train-CI: 0.66 test-CI: 0.66 | |
Train: 80, NPC Test: 33, NPC | Pre-treatment T2, contrast-enhanced T1 MRI | Not reported | PFS, “Rad-score”, classification | Radiom: train-AUC: 0.89 test-AUC: 0.82 | |
Train: 70, NPC Test: 40, NPC | Pre-treatment T2, contrast-enhanced T1 MRI | Not reported | LF and DF, 9 different ML classifiers, classification | LF and DF: Radiom: test-AUC: 0.85 | |
Train: 88, NPC Test: 30, NPC | Pre-treatment T2, contrast-enhanced T1 MRI | Not reported | PFS, univariate / multivariable Cox regression, regression | Radiom: train-CI: 0.76 test-CI: 0.74 Clin: train-CI: 0.65 test-CI: 0.63 Radiom + Clin: train-CI: 0.78 test-CI: 0.72 |
a The reported performance pertains to the maximum observed performance among all models of each respective category (i.e. we are reporting the highest achieved performance, in case different radiomics features / models / signatures or clinical predictors / models were tested). For radiomics-based models, the performance of the purest imaging feature-based model is reported. (i.e. the model with fewest or no other predictors)
acc Accuracy, ACM All-cause mortality, AUC Area under the receiver operating characteristics curve, CI Concordance index, Clin Non-radiomic predictor(s) or model(s) (“clinical”), CRT Conformal radiotherapy, DF/DM Distant failure/metastasis, DFS Disease-free survival, DSS Disease-specific survival, HNSCC Head and neck SCC, HR Hazard ratio, HYSCC Hypopharyngeal SCC, IMRT Intensity-modulated radiotherapy, LC/LF Local tumor control/failure, LR Locoregional recurrence, LRC Locoregional control, LSCC Laryngeal SCC, MFS Metastasis-free survival, NPC Nasopharyngeal carcinoma, OPSCC Oropharyngeal SCC, OS Overall survival, PFS Progression-free survival, Radiom Radiomics model, radiomic feature(s) or feature combinations (“signature”, “Rad-score”), RC Regional control, RFS Recurrence-free survival, RT Radiotherapy, sens Sensitivity, spec Specificity, test Independent test dataset, total Only one dataset used, train Training dataset, tune Validation set used for hyperparameter tuning
Prediction of post-radiation xerostomia based on salivary gland radiomics features
| Authors, year | Dataset: sample size, cancer type | Time of xerostomia assessment, endpoint/scale | Imaging modality | VOI | Classifier / regression model(s) | Metric: maximum performance |
|---|---|---|---|---|---|---|
Train:216, HNSCC Test:50, HNSCC | 3-month post-RT, CTCAE v4.0 b grade ≥ 2 vs. grade 0/1 | Pre-treatment CT, T1-weighted MRI | Parotid and submandibular glands (bilateral) | Multivariable logistic regression | CV-AUC: 0.75 test-AUC: 0.70 | |
Train:35, NPC Test:4, NPC | day of 10th and 30th RT, saliva amount (ml) over 5 min (a regression analysis) | CT at start and day of 10th RT fraction | Parotid glands (bilateral) | 8 different regression models | CV-MSE: 0.9042 (10th fraction), 0.0569 (30th fraction) test-MSE: 0.0233 (30th fraction) | |
Train:68, HNSCC Test:25, HNSCC | 12 moth post-RT, patient-rated moderate-to-severe xerostomia present vs. not present | Pre-treatment T1-weighted MRI | Parotid glands, (bilateral) | Multivariable logistic regression | n/a c | |
| Total: 249, HNSCC | 12 moth post-RT, EORTC QLQ-H, N35 questionnaire d moderate-to-severe xerostomia vs. not present | Pre-treatment contrast CT | Parotid and submandibular glands (bilateral) | Multivariable logistic regression | n/a c | |
| Total: 161, HNSCC | 12-month post-RT, EORTC QLQ-H questionnaire d moderate-to-severe xerostomia present vs. not present | Pre-treatment FDG PET | Contralateral parotid gland | Multivariable logistic regression | n/a c |
a The reported performance pertains to the maximum observed performance among the purest imaging feature-based models reported (i.e. the best model with fewest or no other predictors is reported)
b Common Terminology Criteria for Adverse Events Version 4.0 [115]
c “Pure” radiomics models were not built. Instead, the contribution of individual radiomics features to baseline models was investigated in terms of performance (gains)
d European Organization for Research and Treatment of Cancer questionnaire module for quality of life assessments in head and neck cancer patients [116]
AUC Area under the receiver operating characteristics curve, CV Cross validation (of total set or training data set), HNSCC Head and neck SCC, MSE Mean squared error, NPC Nasopharyngeal carcinoma, RT Radiotherapy, Test Independent test data set, Total Only one data set used, Train Training data set