| Literature DB >> 28860628 |
Martin Vallières1, Emily Kay-Rivest2, Léo Jean Perrin3, Xavier Liem4, Christophe Furstoss5, Hugo J W L Aerts6, Nader Khaouam5, Phuc Felix Nguyen-Tan4, Chang-Shu Wang3, Khalil Sultanem2, Jan Seuntjens7, Issam El Naqa8.
Abstract
Quantitative extraction of high-dimensional mineable data from medical images is a process known as radiomics. Radiomics is foreseen as an essential prognostic tool for cancer risk assessment and the quantification of intratumoural heterogeneity. In this work, 1615 radiomic features (quantifying tumour image intensity, shape, texture) extracted from pre-treatment FDG-PET and CT images of 300 patients from four different cohorts were analyzed for the risk assessment of locoregional recurrences (LR) and distant metastases (DM) in head-and-neck cancer. Prediction models combining radiomic and clinical variables were constructed via random forests and imbalance-adjustment strategies using two of the four cohorts. Independent validation of the prediction and prognostic performance of the models was carried out on the other two cohorts (LR: AUC = 0.69 and CI = 0.67; DM: AUC = 0.86 and CI = 0.88). Furthermore, the results obtained via Kaplan-Meier analysis demonstrated the potential of radiomics for assessing the risk of specific tumour outcomes using multiple stratification groups. This could have important clinical impact, notably by allowing for a better personalization of chemo-radiation treatments for head-and-neck cancer patients from different risk groups.Entities:
Mesh:
Year: 2017 PMID: 28860628 PMCID: PMC5579274 DOI: 10.1038/s41598-017-10371-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1From radiomics analysis to treatment personalization. (a) Example of diagnostic FDG-PET and CT images of two head-and-neck cancer patients with tumour contours. The patient that did not respond well to treatment (right) has a more heterogeneous intratumoural intensity distribution in both FDG-PET and CT images than the patient that responded well to treatment (left). (b) The radiomics analysis strategy involves the extraction of features differentiating responders from non-responders to treatment. Features are extracted from the FDG-PET and CT tumour contours and quantify tumour shape, intensity, and texture. (c) Advanced machine learning combines radiomic features and patient clinical information via a random forest algorithm. The classifier is trained to differentiate between responders and non-responders to treatment (prediction model). (d) The output probability of the random forest classifier computed on new patients can be used to assess the risk of non-response to treatment via probabilities of occurrence of outcome events and time estimates. Eventually, accurate risk assessment of specific tumour outcomes via radiomics analysis could help to better personalize cancer treatments.
Figure 2Models construction strategy and analysis workflow. Four different cohorts were used to demonstrate the utility of radiomics analysis for the pre-treatment assessment of the risk of locoregional recurrence and distant metastases in head-and-neck cancer. The H&N1 and H&N2 cohorts were combined and used as a single training set (n = 194), whereas the H&N3 and H&N4 cohorts were combined and used as a single testing set (n = 106). The best combinations of radiomic features were selected in the training set using imbalance-adjusted logistic regression learning and bootstrapping validations. These radiomic features were combined with selected clinical variables in the training set using imbalance-adjusted random forest learning and stratified random sub-sampling validations. Independent prediction analysis was performed in the testing set for all classifiers fully constructed in the training set. Independent prognosis analysis and Kaplan-Meier risk stratification was carried out in the testing set using the output probability of occurrence of events of random forests fully constructed in the training set.
Figure 3Prediction performance of selected models. All prediction models were selected and built using the training set (H&N1 and H&N2; n = 194) for three initial radiomic feature sets: I) PET radiomic features (PET); II) CT radiomic features (CT); and III) PET and CT radiomic features (PETCT). The prediction performance is evaluated here in terms of the area under the receiver operating characteristic curve (AUC) for patients of the testing set (H&N3 and H&N4; n = 106), for two types of prediction models: I) Radiomic models constructed using logistic regression (Radiomics); and II) Radiomic models combined with clinical variables via random forests (Radiomics + clinical). Significant increase in AUC from Radiomics to Radiomics + clinical models is identified with an asterisk (*), and non-significant increase is identified by n.s. The radiomic feature sets providing the prediction models with highest performance in this study are identified with an arrow for each outcome.
Comparison of prediction/prognostic performance of models constructed in this work with other combinations of variables.
| Outcome | Variables | Prediction | Prognosis | ||||
|---|---|---|---|---|---|---|---|
| AUCa | Sensitivitya | Specificitya | Accuracya | CIb |
| ||
| Locoregional | RadiomicsPETCT | 0.64 | 0.56 | 0.67 | 0.65 | 0.63 | 0.28 |
| Volume | 0.43 | 0.31 | 0.58 | 0.54 | 0.40 | 0.80 | |
| Clinical | 0.72 | 0.50 | 0.76 | 0.72 | 0.69 | 0.05 | |
|
|
|
|
|
|
|
| |
| Volume + Clinical | 0.71 | 0.50 | 0.76 | 0.72 | 0.68 | 0.06 | |
| Distant | RadiomicsCT | 0.86 | 0.79 | 0.77 | 0.77 | 0.88 | 0.0001 |
| Volume | 0.80 | 0.86 | 0.65 | 0.68 | 0.83 | 0.10 | |
| Clinical | 0.55 | 0.64 | 0.46 | 0.48 | 0.60 | 0.61 | |
|
|
|
|
|
|
|
| |
| Volume + Clinical | 0.78 | 1 | 0.50 | 0.57 | 0.80 | 0.0004 | |
| Survival | RadiomicsPET | 0.62 | 0.58 | 0.66 | 0.64 | 0.60 | 0.03 |
| Volume | 0.68 | 0.67 | 0.57 | 0.59 | 0.67 | 0.29 | |
|
|
|
|
|
|
|
| |
| RadiomicsPET + Clinical | 0.74 | 0.79 | 0.57 | 0.62 | 0.71 | 0.002 | |
| Volume + Clinical | 0.79 | 0.88 | 0.52 | 0.60 | 0.76 | 0.0006 | |
| Survivald | RadiomicsCTcompleteSign e | — | — | — | — | 0.66 | 0.70 |
| RadiomicsCTsign f | 0.68 | 0.71 | 0.50 | 0.55 | 0.66 | 0.05 | |
| RadiomicsCTsign g + Clinical | 0.80 | 0.96 | 0.38 | 0.51 | 0.75 | 0.001 | |
→ Performance is shown for models constructed in the training set (H&N1 and H&N2; n = 194) and independently evaluated in the testing set (H&N3 and H&N4; n = 106).
→ Models involving Radiomic variables only or the Volume variable only were optimized using logistic/cox regression. All models involving Clinical variables were optimized using random forests.
→ The best predictive/prognostic and balanced models for each outcome (final models) are identified in italic and are fully described in Supplementary Table S4.
aBinary prediction of outcomes using logistic regression/random forest output responses.
bConcordance-index between cox regression/random forest output responses and time to events.
cLog-rank test from Kaplan-Meier curves with a risk stratification into two groups (thresholds: median hazard ratio for cox regression, output probability of 0.5 for random forests).
dRadiomic signature variables as defined in the study of Aerts & Velazquez et al.[21]
eUsing the original definition of the radiomic signature variables, and the original cox regression coefficients and median hazard ratio trained from the Lung1 cohort in the study of Aerts & Velazquez et al.[21]
fUsing a revised version of the radiomic signature variables (Supplementary Methods section 2.6.2) and new cox/logistic regression coefficients trained using the current training set of this work.
gUsing a revised version of the radiomic signature variables (Supplementary Methods section 2.6.2) and a random forest classifer trained using the current training set of this work.
Figure 4Risk assessment of tumour outcomes. (a) Probability of occurrence of events (locoregional recurrence, distant metastases, death) for each patient of the testing set (H&N3 and H&N4; n = 106) as determined by the random forest classifiers built using the training set (H&N1 and H&N2; n = 194). The output probability of occurrence of events of random forests allows for risk stratification; for example, three risk groups can be defined (low, medium, high) using probability thresholds of and . (b) Kaplan-Meier curves of the testing set using a risk stratification into two groups as defined by a random forest output probability threshold of 0.5. All curves have significant prognostic performance, thus demonstrating the possibility of outcome-specific risk assessment in head-and-neck cancer. (c) Kaplan-Meier curves of the testing set using a risk stratification into three groups as defined by random forest output probability thresholds of and . Some pair of curves have significant prognostic performance, thus demonstrating the possibility of risk stratification into multiple groups for treatment escalation/personalization in head-and-neck cancer.