| Literature DB >> 34234160 |
Harsh Patel1, David M Vock2, G Elisabeta Marai3, Clifton D Fuller4, Abdallah S R Mohamed4, Guadalupe Canahuate5.
Abstract
To improve risk prediction for oropharyngeal cancer (OPC) patients using cluster analysis on the radiomic features extracted from pre-treatment Computed Tomography (CT) scans. 553 OPC Patients randomly split into training (80%) and validation (20%), were classified into 2 or 3 risk groups by applying hierarchical clustering over the co-occurrence matrix obtained from a random survival forest (RSF) trained over 301 radiomic features. The cluster label was included together with other clinical data to train an ensemble model using five predictive models (Cox, random forest, RSF, logistic regression, and logistic-elastic net). Ensemble performance was evaluated over the independent test set for both recurrence free survival (RFS) and overall survival (OS). The Kaplan-Meier curves for OS stratified by cluster label show significant differences for both training and testing (p val < 0.0001). When compared to the models trained using clinical data only, the inclusion of the cluster label improves AUC test performance from .62 to .79 and from .66 to .80 for OS and RFS, respectively. The extraction of a single feature, namely a cluster label, to represent the high-dimensional radiomic feature space reduces the dimensionality and sparsity of the data. Moreover, inclusion of the cluster label improves model performance compared to clinical data only and offers comparable performance to the models including raw radiomic features.Entities:
Mesh:
Year: 2021 PMID: 34234160 PMCID: PMC8263609 DOI: 10.1038/s41598-021-92072-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Processing pipeline overview. The data is split into disjoint training and validation (test) sets. Initially the data is preprocessed (remove radiomics with zero variance and highly correlated features, normalization, and clinical data categorization) and then the patients are clustered using Random Survival Forest (RSF) clustering. A regression model is trained using the cluster label as dependent variable and later used to assign test patients into a cluster. The ensemble model is trained using clinical covariates and the cluster labels and evaluated over the test data using the discrimination metrics C-Index and AUC.
Data demographics.
| Name | Train (442) | Test (111) |
|---|---|---|
| | ||
| Male | 388 (87.8%) | 97 (87.4%) |
| Female | 54 (12.2%) | 14 (12.6%) |
| Age at diagnosis (years) | 58.2 (52.5–65.8) | 56.6 (52.5–65.8) |
| T1/T2 | 277 (62.7%) | 69 (62.2%) |
| T3/T4 | 165 (37.3%) | 42 (37.8%) |
| N0/N1 | 226 (51.1%) | 59 (53.2%) |
| N2/N3 | 216 (48.8%) | 52 (46.8%) |
| I | 153 (34.6%) | 42 (37.8%) |
| II | 82 (18.6%) | 17 (15.3%) |
| III | 57 (12.9%) | 9 (8.1%) |
| IV | 150 (33.9%) | 43 (38.8%) |
| Former | 158 (35.8%) | 44 (39.7%) |
| Current | 92 (20.8%) | 26 (23.4%) |
| Never | 192 (43.4%) | 41 (36.9%) |
| | ||
| CC | 228 (51.6%) | 68 (61.3%) |
| IC + CC | 119 (26.9%) | 26 (23.4%) |
| IC + radiation alone | 44 (10.0%) | 10 (9.0%) |
| Radiation alone | 51 (11.5%) | 7 (6.3%) |
| Positive | 270 (61%) | 64 (60%) |
| Negative | 41 (9%) | 9 (8%) |
| Unknown | 131 (29%) | 39 (35%) |
| Alive | 355 (80.3%) | 89 (80.2%) |
| Deceased | 87 (19.7%) | 22 (19.8%) |
| Survival time in months | 65.4 (45.9–98.7) | 75.3 (48.3–98.1) |
| | ||
| Alive | 363 (82.1%) | 91 (82.0%) |
| Deceased | 79 (17.9%) | 20 (18.0%) |
| Survival time in months | 61.0 (40.6–96.4) | 69.4 (39.3–94.8) |
The table shows the demographics for the clinical covariates used in this study. The dataset (533 patients) was randomly split into training and testing disjoint sets using a 80–20 split. As expected, the same distributions can be observed for the train (442 patients) and test (111 patients) datasets. Within the cells in the table, the reported number is either: count (frequency %) for categorical/discrete covariates, or median (25th–75th percentiles) for continuous covariates.
Figure 2Kaplan–Meier (KM) Curves for Overall Survival (OS). The figure shows the KM curves for OS outcome stratified by the cluster label over (a) training and (b) test data. For the training, the patients were grouped using Hierarchical Clustering over the co-occurrence matrix from the Random Survival Forest. For the testing, the patients were assigned to a cluster by applying the regression model trained for predicting the cluster labels using the top radiomic features identified by the random survival forest. For both training and testing, the KM curves are significantly different which indicates that the proposed clustering is effective in identifying a risk stratification and can be effectively used as a predictive covariate.
Figure 3Kaplan–Meier (KM) Curves for Recurrence Free Survival (RFS). The figure shows the KM curves for RFS outcome stratified by the cluster label over (a) training and (b) test data. For the training, the patients were grouped using Hierarchical Clustering over the co-occurrence matrix from the Random Survival Forest. For the testing, the patients were assigned to a cluster by applying the regression model trained for predicting the cluster labels using the top radiomic features identified by the random survival forest. For both training and testing, the KM curves show two consistent risk groups which indicates that the proposed clustering can be effectively used as a predictive covariate within a risk prediction model.
Covariates used in the ensemble model.
| Name | Count | Covariates |
|---|---|---|
| Clinical | 7 | Age, HPV status (positive | negative | unknown), Smoking Status (never | former | current), T.category ([T1-T2],[T3-T4]), N.category ([N0-N1],[N2-N3]), Therapeutic Combination (RT alone, Concurrent Chemotherapy (CC), Induction + RT, Induction + CC), AJCC Stage (8th edition) |
| RSF (OS) | Up to 10 | F4.GrayLevelRunLengthMatrix25..90ShortRunLowGrayLevelEmpha,F48.GrayLevelCooccurenceMatrix25180.2ClusterProminence,F48.GrayLevelCooccurenceMatrix25270.1Contrast,F48.GrayLevelCooccurenceMatrix25225.7ClusterShade,F29.IntensityDirectLocalRangeMax,F2.GrayLevelCooccurenceMatrix25270.1Contrast,F2.GrayLevelCooccurenceMatrix25.333.4Correlation,F2.GrayLevelCooccurenceMatrix25180.6MaxProbability,F4.GrayLevelRunLengthMatrix25..90RunLengthNonuniformity,F4.GrayLevelRunLengthMatrix25.333ShortRunEmphasis |
| RSF (RFS) | Up to 10 | F48.GrayLevelCooccurenceMatrix25180.2ClusterProminence,F48.GrayLevelCooccurenceMatrix25315.6ClusterProminence,F8.IntensityDirectKurtosis, F9.IntensityDirectSkewness,F11.IntensityDirectKurtosis, F13.IntensityDirectEnergy,F48.GrayLevelCooccurenceMatrix25180.1InverseDiffNorm,F2.GrayLevelCooccurenceMatrix25180.5ClusterProminence,F2.GrayLevelCooccurenceMatrix25180.5ClusterShade,F14.IntensityDirectEnergy |
| COX (OS) | 5 | F25.ShapeVolume, F29.IntensityDirectLocalRangeMax,F4.GrayLevelRunLengthMatrix25..90RunLengthNonuniformity,F6.IntensityDirectSkewness,F48.GrayLevelCooccurenceMatrix25225.7AutoCorrelation |
| COX (RFS) | 8 | F5.IntensityDirectGlobalMax, F13.IntensityDirectGlobalMax,F14.IntensityDirectGlobalMax, F25.ShapeVolume,F29.IntensityDirectLocalRangeMax,F4.GrayLevelRunLengthMatrix25..90RunLengthNonuniformity,F4.GrayLevelRunLengthMatrix25..90ShortRunLowGrayLevelEmpha,F48.GrayLevelCooccurenceMatrix25225.7AutoCorrelation |
| Cluster | 1 | Cluster label with 2, 3, or 4 values |
The clinical covariates are used independently of the outcome being evaluated. Since Random Survival Forests (RSF) and Coxnet (COX) can be used as supervised feature selection methods, the radiomic features selected depend on the outcome used. The top covariates from RSF are selected for each outcome. For COX, the features selected depend on the number of non-zero weights learned by the regularization coefficient. COX selected 5 and 8 radiomics features for OS and RFS, respectively. Cluster refers to the cluster label extracted using Random Survival Forest Clustering.
Figure 4Top Radiomic Features identified by the Random Survival Forest (RSF) for Overall Survival (OS). Boxplots of top 6 features selected using the variable importance from the Random Survival Forest (RSF) over the training data and their distribution within the two clusters identified for Overall Survival (OS). The difference in distribution suggests that these variables can be used in a model to assign cluster labels to test patients. Radiomic features names have been abbreviated to fit in the figure: GL = GrayLevel, CoM = CoocurrenceMatrix, RL = RunLength.
Figure 5Ensemble model performance over test data. The ensemble model discrimination was evaluated using the AUC metric over the test data for two survival outcomes: (a) Overall Survival (OS) and (b) Recurrence Free Survival (RFS). Comparison is done between a Clinical baseline model using seven clinical covariates: age, hpv status, smoking status, T-category, N-category, therapeutic combination, AJCC staging, and the models including additional model covariates: selected radiomic features (Clinical + rsf/ + cox), and the proposed cluster labels (Clinical + N Clusters). In all cases, the inclusion of the cluster labels outperforms the Clinical model. The models including the cluster labels show comparable performance to the models including a subset of radiomic features while being considerably more parsimonious models.
Ensemble discrimination performance over training and testing data.
| Covariates used in model | Overall survival (OS) | Recurrence free survival (RFS) | ||||||
|---|---|---|---|---|---|---|---|---|
| C-index | AUC | C-index | AUC | |||||
| Train | Test | Train | Test | Train | Test | Train | Test | |
| Clinical | .66 | .62 | .66 | .62 | .63 | .64 | .70 | .66 |
| Clinical + rsf (top 3) | .72 | .75 | .65 | .73 | .72 | .77 | ||
| Clinical + rsf (top 5) | .73 | .71 | .76 | .75 | .65 | .75 | .71 | .80 |
| Clinical + rsf (top 10) | .76 | .71 | .80 | .76 | .66 | .75 | .72 | .80 |
| Clinical + cox | .73 | .67 | .76 | .70 | .67 | .73 | .75 | .79 |
| Clinical + 2 Clusters | .81 | .75 | .85 | .79 | .73 | .66 | .82 | .70 |
| Clinical + 3 Clusters | .81 | .72 | .86 | .75 | .79 | .72 | .88 | .77 |
| Clinical + 4 Clusters | .87 | .74 | .92 | .75 | .91 | .95 | ||
Comparison of ensemble performance over Train and Test data using C-Index and AUC for both OS and RFS outcomes. Each row in the table corresponds to the ensemble model using different covariates. The Clinical baseline is the model where only clinical covariates are included. The subsequent rows include additional covariates into the baseline model: top n selected radiomic features using rsf (+ rsf (top n)), selected radiomic features using coxnet (+ cox), and the proposed cluster labels (Clinical + N Clusters). The best test results are highlighted in bold. The best test results for OS are obtained by the Clinical + rfs (top 3) (C-Index: .75, AUC: .80) while the best test results for RFS are obtained by Clinical + 4 Clusters (C-Index: .75, AUC: .80).