| Literature DB >> 35158971 |
José Marcio Luna1,2,3, Andrew R Barsky4, Russell T Shinohara1,5, Leonid Roshkovan2, Michelle Hershman2, Alexandra D Dreyfuss4, Hannah Horng6, Carolyn Lou5, Peter B Noël2, Keith A Cengel4, Sharyn Katz2, Eric S Diffenderfer4, Despina Kontos1,2.
Abstract
We evaluate radiomic phenotypes derived from CT scans as early predictors of overall survival (OS) after chemoradiation in stage III primary lung adenocarcinoma. We retrospectively analyzed 110 thoracic CT scans acquired between April 2012-October 2018. Patients received a median radiation dose of 66.6 Gy at 1.8 Gy/fraction delivered with proton (55.5%) and photon (44.5%) beam treatment, as well as concurrent chemotherapy (89%) with carboplatin-based (55.5%) and cisplatin-based (36.4%) doublets. A total of 56 death events were recorded. Using manual tumor segmentations, 107 radiomic features were extracted. Feature harmonization using ComBat was performed to mitigate image heterogeneity due to the presence or lack of intravenous contrast material and variability in CT scanner vendors. A binary radiomic phenotype to predict OS was derived through the unsupervised hierarchical clustering of the first principal components explaining 85% of the variance of the radiomic features. C-scores and likelihood ratio tests (LRT) were used to compare the performance of a baseline Cox model based on ECOG status and age, with a model integrating the radiomic phenotype with such clinical predictors. The model integrating the radiomic phenotype (C-score = 0.69, 95% CI = (0.62, 0.77)) significantly improved (p<0.005) upon the baseline model (C-score = 0.65, CI = (0.57, 0.73)). Our results suggest that harmonized radiomic phenotypes can significantly improve OS prediction in stage III NSCLC after chemoradiation.Entities:
Keywords: ComBat; computed tomography; non-small cell lung cancer; overall survival; radiomics
Year: 2022 PMID: 35158971 PMCID: PMC8833400 DOI: 10.3390/cancers14030700
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Figure 1Pipeline for multivariate survival analysis. Notice the ComBat-based harmonization block right after the feature extraction that accounts for batch effects due to differences in CT scan vendors and the presence or absence of intravenous contrast.
Categorial patient characteristics. Description of clinical characteristics of the cohort with their respective categorization and percentages.
| Categorical Features | Classes | No. of Patients | (%) |
|---|---|---|---|
| Contrast Enhancement | Non-Contrast Enhanced | 82 | 74.5 |
| Contrast Enhanced | 28 | 25.5 | |
| CT Scanner Manufacturer | Philips Healthcare | 67 | 60.9 |
| Siemens Healthineers | 36 | 32.7 | |
| GE Healthcare | 7 | 6.4 | |
| Sex | Female | 68 | 61.8 |
| Male | 42 | 38.2 | |
| Race | White | 80 | 72.7 |
| African American | 22 | 20.0 | |
| Asian | 3 | 2.7 | |
| Native American | 1 | 0.9 | |
| Other | 4 | 3.6 | |
| Marital Status | Married | 66 | 60.0 |
| Single | 24 | 21.8 | |
| Divorced | 8 | 7.3 | |
| Widowed | 8 | 7.3 | |
| Separated | 4 | 3.6 | |
| Radiation Modality | Proton | 61 | 55.5 |
| Linac | 49 | 44.5 | |
| ECOG Status | 0 | 50 | 45.5 |
| 1 | 48 | 43.6 | |
| 2 | 10 | 9.1 | |
| Unknown | 2 | 1.8 | |
| Tobacco Use | Former Smoker | 91 | 82.7 |
| Current Smoker | 13 | 11.8 | |
| Never Smoker | 6 | 5.5 | |
| Histology | Adenocarcinoma | 110 | 100.0 |
| Chemotherapy Agents | Carboplatin-based Doublet | 61 | 55.5 |
| Cisplatin-based Doublet | 40 | 36.4 | |
| Platinum-based Triplet | 2 | 1.8 | |
| Unknown | 7 | 6.4 | |
| Chemotherapy | Concurrent | 89 | 80.9 |
| Sequential | 14 | 12.7 | |
| Unknown | 7 | 6.4 |
Continuous patient characteristics. Description of continuous characteristics of the cohort with their respective median and interquartile ranges.
| Continuous Features | Median | Range * |
|---|---|---|
| Age (yr.) | 66 | (60–71) |
| Radiation Dose Delivered (Gy) | 66.6 | (60.0–66.7) |
| Dose per Fraction (Gy) | 1.8 | (1.8–1.8) |
| BMI (kg/m2) | 26.5 | (23.8–29.9) |
| Pack per year (smokers only) | 35.0 | (20.0–50.0) |
* Interquartile range.
Figure 2Heatmap representing the imaging phenotype calculated through unsupervised hierarchical clustering. The imaging phenotype obtained from harmonized features using ComBat is able to predict the event of death () and ECOG status ( ).
Univariate analysis results. ECOG status is the only clinical variable showing good predictive performance. Additionally, Phenotype (ComBat) is able to predict survival differences in our cohort.
| Predictor | C-Score | 95% CI |
|---|---|---|
| ECOG Status | 0.62 | (0.55, 0.69) |
| Phenotype (ComBat) | 0.61 | (0.54, 0.67) |
| Age at Diagnosis | 0.58 | (0.50, 0.66) |
| Sex | 0.56 | (0.49, 0.63) |
| BMI | 0.55 | (0.44, 0.64) |
| Pack per year | 0.55 | (0.49, 0.62) |
| Phenotype (non-ComBat) | 0.48 | (0.47, 0.59) |
Multivariate analysis results. A baseline model based on ECOG + Age significantly improves ( ) when the clinical variables of the baseline model are integrated with the imaging phenotype obtained from harmonized features using ComBat, i.e., Phenotype (ComBat) + ECOG + Age.
| Predictor | C-Score | 95% CI | |
|---|---|---|---|
| Phenotype (ComBat) + ECOG + Age | 0.69 | (0.62, 0.77) | 0.003 |
| Phenotype (non-ComBat) + ECOG + Age | 0.66 | (0.58, 0.74) | 0.15 |
| ECOG + Age (Baseline) | 0.65 | (0.57, 0.73) | -- |
| 2 PCs + ECOG + Age | 0.65 | (0.60, 0.75) | 0.27 |
* p-value of LRT with respect to the baseline model (ECOG + Age).
Figure 3Sample of high- and low-risk patients classified by our predictive pipeline. Notice that the tumors in CT scans belonging to patients with a high risk of death tend to be larger than the ones in patients with a low risk of death.
Figure 4Details of the tumors classified by our predictive pipeline as high and low risk of death. Notice that the tumors in CT scans belonging to patients with a high risk of death tend to be more heterogeneous than the ones in patients with a low risk of death.