| Literature DB >> 36202941 |
Asier Rabasco Meneghetti1,2, Alex Zwanenburg1,3,4, Annett Linge1,3,4,5, Fabian Lohaus1,3,4,5, Marianne Grosser6, Gustavo B Baretton3,4,5,6,7, Goda Kalinauskaite8,9, Ingeborg Tinhofer8,9, Maja Guberina10,11, Martin Stuschke10,11, Panagiotis Balermpas12,13, Jens von der Grün12,13, Ute Ganswindt14,15,16,17, Claus Belka14,15,16, Jan C Peeken14,18,19, Stephanie E Combs14,18,19, Simon Böke20,21, Daniel Zips20,21, Esther G C Troost1,2,3,4,5, Mechthild Krause1,2,3,4,5, Michael Baumann1,22, Steffen Löck23,24,25,26.
Abstract
Patients with locally advanced head and neck squamous cell carcinoma (HNSCC) may benefit from personalised treatment, requiring biomarkers that characterize the tumour and predict treatment response. We integrate pre-treatment CT radiomics and whole-transcriptome data from a multicentre retrospective cohort of 206 patients with locally advanced HNSCC treated with primary radiochemotherapy to classify tumour molecular subtypes based on radiomics, develop surrogate radiomics signatures for gene-based signatures related to different biological tumour characteristics and evaluate the potential of combining radiomics features with full-transcriptome data for the prediction of loco-regional control (LRC). Using end-to-end machine-learning, we developed and validated a model to classify tumours of the atypical subtype (AUC [95% confidence interval] 0.69 [0.53-0.83]) based on CT imaging, observed that CT-based radiomics models have limited value as surrogates for six selected gene signatures (AUC < 0.60), and showed that combining a radiomics signature with a transcriptomics signature consisting of two metagenes representing the hedgehog pathway and E2F transcriptional targets improves the prognostic value for LRC compared to both individual sources (validation C-index [95% confidence interval], combined: 0.63 [0.55-0.73] vs radiomics: 0.60 [0.50-0.71] and transcriptomics: 0.59 [0.49-0.69]). These results underline the potential of multi-omics analyses to generate reliable biomarkers for future application in personalized oncology.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36202941 PMCID: PMC9537286 DOI: 10.1038/s41598-022-21159-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Overview of the study design. Pre-treatment CT and transcriptome data from patients with HNSCC are first pre-processed to obtain stable and filtered radiomics features and normalised gene expressions. For molecular subtype classification, molecular subtypes are assigned based on centroid data. Then radiomics features are selected through a machine learning pipeline to train OVA models for each subtype. For gene signature surrogates, binary cluster assignment is performed based on a k-means algorithm. Radiomics models are then trained to classify the gene signatures. For signature combination, a radiomics signature is developed and relevant metagenes are derived using GSVA. The identified features are combined to predict LRC. Subsequently, all models are independently validated.
Characteristics of clinical features for the discovery cohort (left) and validation cohort (right) along with p values for homogeneity tests between the cohorts.
| Variables | Discovery cohort | Validation cohort | p value |
|---|---|---|---|
| Median (range) | Median (range) | ||
| GTV (cm3) | 27.0 (4.4–175.8), missing: 0 | 40.5 (2.7–238.8) missing: 0 | |
| Age (years) | 59.5 (39.2–80.2), missing: 0 | 56.0 (41.0–82.1) missing: 0 | 0.16 |
| Total dose (Gy) | 72.0 (69.2–74.0), missing: 0 | 72.0 (69.0–72.0) missing: 0 | |
| 0 (male) | 99 (81.1) | 68 (80.9) | 1.0 |
| 1 (female) | 23 (18.9) | 16 (19.1) | |
| Oropharynx | 61 (50.0) | 22 (26.2) | |
| Hypopharynx | 39 (32.0) | 24 (28.6) | |
| Larynx | 0 (0.0) | 9 (10.7) | |
| Oral cavity | 22 (18.0) | 29 (34.5) | |
| 2 | 0 (0.0) | 1 (1.2) | 0.39 |
| 3 | 11 (9.1) | 9 (10.7) | |
| 4 | 111 (90.9) | 72 (85.7) | |
| Missing | 0 (0.0) | 2(2.4) | |
| 2 | 15 (12.3) | 4 (4.8) | 0.18 |
| 3 | 36 (29.5) | 22 (26.2) | |
| 4 | 71 (58.2) | 58 (69.0) | |
| 0 | 26 (21.3) | 12 (14.3) | 0.39 |
| 1 | 4 (3.3) | 6 (7.1) | |
| 2 | 86 (70.5) | 61 (72.6) | |
| 3 | 6 (4.9) | 5 (6.0) | |
| 1 | 4 (3.3) | 1 (1.2) | |
| 2 | 85 (69.7) | 34 (40.5) | |
| 3 | 30 (24.6) | 30 (35.7) | |
| Missing | 3 (2.4) | 19 (22.6) | |
| 0 (negative) | 109 (88.5) | 70 (83.3) | 0.18 |
| 1 (positive) | 12 (9.8) | 8 (9.5) | |
| Missing | 1 (0.7) | 6 (7.2) | |
| 0 (no) | 50 (41.0) | 26 (31.0) | 0.28 |
| 1 (regular) | 70 (57.4) | 35 (41.6) | |
| Missing | 2 (1.6) | 23 (27.4) | |
| 0 (negative) | 17 (13.9) | 17 (20.2) | 1.00 |
| 1 (positive) | 105 (86.1) | 67 (79.8) | |
| 3D-conformal | 36 (29.5) | 30 (33.7) | 0.48 |
| IMRT | 84 (68.9) | 54 (64.3) | |
| Missing | 2 (1.6) | 0 (0.0) | |
| Normofractionated | 41 (33.6) | 3 (3.6) | |
| Hyperfractionated accelerated | 66 (54.1) | 81 (96.4) | |
| Missing | 15 (12.3) | 0 (0.0) | |
Abbreviations: GTV: gross tumour volume, UICC: Union for International Cancer Control, HPV16: human papillomavirus type 16, DNA: deoxyribonucleic acid, IMRT: intensity-modulated radiotherapy.
Figure 2Outcome stratified by molecular subtypes and their classification using radiomics. Kaplan–Meier survival curves show that atypical tumours have the best prognosis for both OS (A) and LRC (B) with the other three subtypes having similar outcomes. The receiver-operating characteristics (ROC) curves with false positive rate (FPR) and true positive rate (TPR) show good AUC point-estimates in the discovery cohort (C). For the atypical subtype and to a lesser extent for the basal subtype, this performance is translated to the validation cohort (D). The associations of molecular subtypes with the clinical characteristics T stage, N stage, HPV16 status and gross tumour volume (GTV) are shown in (E) for the discovery cohort. Atypical tumours are smaller, have a lower T stage, and are more often HPV+.
Model performance for molecular subtype classification: median area under the curve (AUC) and f1 score in discovery (disc) and validation (val) cohorts with 95% confidence intervals (CI) alongside the p value for calibration using the Hosmer–Lemeshow (HL) test in validation.
| Positive class | AUC disc [95% CI] | AUC val [95% CI] | f1 disc [95% CI] | f1 val [95% CI] | HL val p value |
|---|---|---|---|---|---|
| Atypical | 0.68 [0.52–0.79] | 0.69 [0.53–0.83] | 0.47 [0.43–0.51] | 0.53 [0.44–0.62] | 0.37 |
| Basal | 0.76 [0.62–0.88] | 0.63 [0.44–0.80] | 0.67 [0.52–0.80] | 0.55 [0.40–0.68] | 0.10 |
| Classical | 0.69 [0.51–0.84] | 0.62 [0.28–0.81] | 0.33 [0.31–0.36] | 0.21 [0.11–0.28] | 0.18 |
| Mesenchymal | 0.68 [0.51–0.84] | 0.33 [0.12–0.58] | 0.42 [0.35–0.48] | 0.26 [0.11–0.38] | < 0.001 |
| Atypical (HPV−) | 0.70 [0.53–0.84] | 0.74 [0.56–0.89] | 0.42 [0.40–0.46] | 0.54 [0.48–0.62] | 0.15 |
Abbreviations: HPV: human papillomavirus.
Information on features and intercept in the final logistic regression model for the atypical subtype: coefficients with 95% confidence intervals (CI) along with model p values, Yeo–Johnson (λ) and z-transform (z-shift and z-scale) parameters.
| Feature | Coefficient [95% CI] | p value | λ | z-shift | z-scale |
|---|---|---|---|---|---|
| morph_vol_dens_aabb | 0.114 [− 0.468 to 0.696] | 0.69 | − 0.5 | 0.246 | 0.046 |
| szm_glnu | − 0.545 [− 1.166 to 0.076] | 0.087 | 0 | 4.190 | 0.669 |
| Intercept | − 0.968 [− 1.523 to − 0.413] | < 0.001 | NA | NA | NA |
Figure 3Representative pre-treatment CT slices (primary tumour in red) with high and low values of the two features from the radiomics model for the atypical subtype and accumulated local effects plot (ALE) in the discovery cohort. CT slices of patients with tumours that are characterised by low and high values of the texture feature grey level non-uniformity (siye zone matrix) are presented in panels (A,B), classified as atypical and non-atypical, respectively. A lower value of this feature represents a more spatially homogeneous tumour. The patient presented in panel (D) suffered from a non-atypical tumour with a low expression of the morphological feature volume density (axis-aligned bounding box). It showed a less regular shape than the atypical tumour of the patient in panel (E) with high feature expression. ALE plots (C,F) show difference in probability of being classified as atypical compared to an average patient for each feature value.
Figure 4Stratification (A–C) and calibration (D–F) of the radiomics (left), metagene (centre), and combined model (left) for the prognostic value of loco-regional control (LRC) in the validation cohort. The radiomics model has a non-significant stratification. The metagene model achieves significant stratification and good calibration. When combining the radiomics signature and metagenes, a well-calibrated model with the best stratification in the validation cohort is obtained.
Information on features of the multivariable Cox regression model integrating CT radiomics features and the two selected metagenes: hazard ratio (HR) (95% CI) along with model p values, Yeo–Johnson parameter (λ) and transformation parameters for z-transform (z-shift and z-scale).
| Feature | HR [95% CI] | p value | λ | z-shift | z-scale |
|---|---|---|---|---|---|
| GTV (cm3) | 1.196 [0.897–1.594] | 0.22 | 0 | 10.170 | 0.824 |
| Log_stat_p90 | 1.393 [1.036–1.875] | 0.028 | 0 | 1.256 | 0.499 |
| E2F_targets | 0.774 [0.580–1.033] | 0.081 | 1 | − 0.014 | 0.416 |
| Hedgehog_signalling | 1.329 [0.990–1.785] | 0.058 | 0 | − 0.055 | 0.280 |