| Literature DB >> 34409252 |
Ilke Tunali1, Yan Tan2, Jhanelle E Gray3, Evangelia Katsoulakis4, Steven A Eschrich5, James Saller6, Hugo J W L Aerts7, Theresa Boyle6, Jin Qi1, Albert Guvenis8, Robert J Gillies1, Matthew B Schabath3.
Abstract
Background: Immunotherapy yields survival benefit for some advanced stage non-small cell lung cancer (NSCLC) patients. Because highly predictive biomarkers of immunotherapy response are an unmet clinical need, we used pretreatment radiomics and clinical data to train and validate a parsimonious model associated with survival outcomes among NSCLC patients treated with immunotherapy.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34409252 PMCID: PMC8363765 DOI: 10.1093/jncics/pkab048
Source DB: PubMed Journal: JNCI Cancer Spectr ISSN: 2515-5091
Figure 1.The radiomics pipeline. After pretreatment (ie, baseline), imaging, and patient data are obtained, radiomic features are extracted from standard-of-care imaging studies (yellow). Radiologists mark target the lesions, and lesions are automatically (or semi-automatically) segmented. Radiomic features are then extracted from the region of interest (purple). Unstable, nonreproducible and correlated radiomic features are removed. The remaining features are combined with the pretreatment clinical covariates (green), and model building approaches are applied, which can be used for patient stratification and/or treatment selection.
Patient characteristics of the training and 2 validation cohorts
| Characteristics | Training | Validation |
| Validation |
|
|---|---|---|---|---|---|
| MCC1 cohort | MCC2 cohort | VA cohort | |||
| (n = 180) | (n = 90) | (n = 62) | |||
| Age at initiation of treatment, No. (%) | |||||
| Dichotomized | |||||
| <65 y | 68 (37.8) | 37 (41.1) | 15 (24.2) | ||
| ≥65 y | 112 (62.2) | 53 (58.9) | .60 | 47 (75.8) | .06 |
| Median (95% CI), y | 67 (65 to 68) | 67 (64 to 69) | .78 | 68 (67 to 71) | .03 |
| Sex, No. (%) | |||||
| Female | 95 (52.8) | 43 (47.8) | 3 (4.8) | ||
| Male | 85 (47.2) | 47 (52.2) | .44 | 59 (95.2) | <.001 |
| Smoking status, No. (%) | |||||
| Never smoker | 30 (16.7) | 16 (17.8) | 2 (3.2) | ||
| Ever smoker | 146 (81.1) | 74 (82.2) | .87 | 60 (96.8) | .004 |
| Unknown/Missing | 4 (2.2) | 0 (0) | 0 (0) | ||
| Stage, No. (%) | |||||
| III | 6 (3.3) | 4 (4.4) | 13 (21.0) | ||
| IV | 174 (96.7) | 86 (95.6) | .74 | 49 (79.0) | <.001 |
| Histology, No. (%) | |||||
| Adenocarcinoma/others | 137 (76.1) | 71 (78.9) | 43 (69.3) | ||
| Squamous cell carcinoma | 43 (23.9) | 19 (21.1) | .65 | 19 (30.7) | .14 |
| Checkpoint inhibitors, No. (%) | |||||
| Anti–PD-L1 | 48 (26.6) | 18 (20.0) | 8 (12.9) | ||
| Anti–PD-1 | 57 (31.7) | 69 (76.7) | 54 (87.1) | ||
| Doublet | 75 (41.7) | 3 (3.3) | <.001 | 0 (0) | <.001 |
| ECOG performance status, No. (%) | |||||
| 0 | 39 (21.7) | 10 (11.1) | 12 (19.4) | ||
| 1 | 141 (78.3) | 67 (74.4) | 39 (62.9) | ||
| 2 | 0 (0) | 13 (14.4) | <.001 | 11 (17.7) | <.001 |
| Previous lines of therapy on current diagnosis, No. (%) | |||||
| None | 70 (43.9) | 21 (23.3) | N/A | ||
| 1 | 48 (26.7) | 47 (52.2) | N/A | ||
| ≥2 | 62 (34.4) | 22 (24.4) | <.001 | N/A | — |
| Number of metastatic sites, No. (%) | |||||
| 1 | 82 (46.6) | 51 (56.7) | 25 (40.3) | ||
| ≥2 | 98 (54.4) | 39 (43.3) | .09 | 37 (59.7) | .55 |
| Not detected | 107 (59.4) | 37 (41.1) | N/A | ||
| Detected | 25 (13.9) | 5 (5.6) | .36 | N/A | — |
| Missing/Inconclusive | 48 (26.7) | 48 (53.3) | N/A | ||
| Not detected | 61 (33.9) | 20 (22.2) | N/A | ||
| Detected | 29 (16.1) | 12 (13.3) | .66 | N/A | — |
| Missing/Inconclusive | 90 (50.0) | 58 (64.4) | N/A | ||
| Hematology, median (95% CI) | |||||
| Serum albumin, g/dL | 4.0 (2.8 to 4.9) | 3.8 (2.3 to 4.7) | < 001 | 3.9 (2.9 to 4.6) | .09 |
| Lymphocytes, 1x109/L | 1.3 (0.3 to 3.6) | 1.0 (0.3 to 3.4) | <.001 | 1.0 (0.2 to 3.2) | .01 |
| WBC, 1x109/L | 7.1 (3.2 to 61.5) | 7.7 (1.4 to 45.1) | .25 | 7.5 (1.8 to 19.4) | .38 |
| Neutrophils, 1x109/L | 4.8 (1.6 to 31.7) | 5.3 (0.4 to 40.2) | .13 | 5.6 (1.1 to 15.1) | .33 |
| Ratio of neutrophils: Lymphocytes | 3.7 (1.1 to 30.4) | 5.2 (0.5 to 53.1) | .002 | 5.3 (0.8 to 34.3) | .004 |
The majority of the MCC1 (95.3%) and the MCC2 cohorts (86.7%) were self-reported White race and majority of MCC1 cohort (97.0%) and the MCC2 cohort (88.9%) were self-reported non-Hispanic. For the VA cohort, racial and ethnicity data were not available. CI = confidence interval; ECOG = Eastern Cooperative Oncology Group; EGFR = estimated glomerular filtration rate; MCC = Moffitt Cancer Center; PD-L1 = programmed cell death ligand-1; VA = Veterans Health Administration; WBC = white blood cell; N/A = not available (i.e., these covariates were not available in the VA validation cohort).
P values for continuous variables were calculated using Mann-Whitney test and Fisher exact test for categorical variables. All P values are 2-sided. P values were not calculated for N/A cells and an em dash was denoted.
P values were calculated comparing MCC1 and MCC2 cohorts.
P values were calculated comparing MCC1 and VA cohorts.
P values for smoking status, EGFR mutational status, and KRAS mutational status were calculated excluding missing or inconclusive data.
Figure 2.The heat map of concordance correlation coefficients, the correlation matrix for the “avatar” feature, and the Classification and Regression Tree (CART). A) The heat map plots the concordance correlation coefficients (CCC) of the radiomic features acquired by different segmentations and image acquisitions. Each column in the heat map represents a radiomic feature from the indicated feature group and region of interest (eg, intratumoral or peritumoral). The features are compared between different segmentation algorithms (ALG), different initial parameters (IP), and test-retest scans (RIDER). The green boxes represent higher (CCC > 0.95), blue boxes represent moderate (CCC ≥ 0.75 and CCC ≤ 0.95), and red boxes represent lower (CCC < 0.75) CCCs. B) The correlation matrix plots the radiomic features that were statistically significantly associated with overall survival in the univariable analysis. The most informative radiomic feature (gray-level co-occurrence matrix [GLCM] inverse difference) was correlated with 7 other features. C) CART analysis was used to identify patient risk groups based on a decision tree containing 1 radiomic feature and 2 clinical features. Patients were grouped from low-risk to very high-risk based on overall survival outcomes.
Figure 3.Kaplan-Meier survival curves for the 4 risk groups identified by CART analysis. Overall survival is presented for (A) MCC1 cohort, (B) MCC2 cohort, and (C) VA cohort. Progressive-free survival (PFS) is presented for (D) MCC1 cohort and (E) MCC2 cohort. PFS was not available for the VA cohort. Risk groups 2 and 3 were combined, and risk groups 4 and 5 were combined (as shown in Supplementary Figure 2, A, available online) to create the moderate-risk and the high-risk groups, respectively. The log-rank test was used to calculate 2-sided P values. MCC = Moffitt Cancer Center; VA = Veterans Health Administration.
Univariable and multivariable Cox regression analysis for overall survival and progression-free survival
| Characteristics | MCC1 cohort (n = 180) | MCC2 cohort (n = 90) | |||
|---|---|---|---|---|---|
| Univariable modela | Multivariable modelb | Multivariable modelc | Univariable modela | Multivariable modelb | |
| HR (95% CI) | HR (95% CI) | HR (95% CI) | HR (95% CI) | HR (95% CI) | |
| Overall survival | |||||
| Risk groupd | |||||
| Low risk | 1.00 (Referent) | 1.00 (Referent) | 1.00 (Referent) | 1.00 (Referent) | 1.00 (Referent) |
| Moderate risk | 3.79 (1.13 to 12.68)e | 3.08 (0.89 to 10.66) | 3.56 (1.02 to 12.48)e | 1.70 (0.75 to 3.87) | 1.51 (0.66 to 3.51) |
| High risk | 8.02 (2.47 to 26.09)e | 7.87 (2.38 to 25.97)e | 6.98 (2.10 to 23.18)e | 2.73 (1.33 to 5.63)e | 3.33 (1.57 to 7.05)e |
| Very high risk | 19.32 (5.80 to 64.32)e | 17.33 (5.11 to 58.72)e | 17.24 (5.09 to 58.36)e | 10.52 (4.58 to 24.17)e | 5.35 (2.14 to 13.36)e |
| ECOG | — | 1.22 (0.70 to 2.11) | 1.20 (0.69 to 2.07) | — | 2.63 (1.47 to 4.68)e |
| Pr. treatment | — | — | 1.36 (1.01 to 1.81)e | — | — |
| Lymphocytes | — | 1.04 (0.74 to 1.46) | . | — | 0.73 (0.45 to 1.17) |
| WBC | — | — | 0.98 (0.88 to 1.09) | — | — |
| Neutrophils | — | — | 1.10 (0.89 to 1.34) | — | — |
| NLR | — | 1.01 (0.97 to 1.06) | 0.98 (0.92 to 1.05) | — | 1.05 (1.02 to 1.08)e |
| Progression-free survival | |||||
| Risk groupd | |||||
| Low risk | 1.00 (Referent) | 1.00 (Referent) | 1.00 (Referent) | 1.00 (Referent) | 1.00 (Referent) |
| Moderate risk | 2.02 (0.89 to 4.64) | 2.05 (0.88 to 4.76) | 2.36 (1.00 to 5.58)e | 2.96 (1.43 to 6.14)e | 2.80 (1.34 to 5.85)e |
| High risk | 5.15 (2.33 to 11.36)e | 5.55 (2.46 to 12.49)e | 4.89 (2.15 to 11.14)e | 2.58 (1.29 to 5.14)e | 3.05 (1.50 to 6.18)e |
| Very high risk | 9.62 (4.12 to 22.44)e | 9.03 (3.77 to 21.63)e | 8.79 (3.66 to 21.11)e | 7.13 (3.31 to 15.35)e | 3.95 (1.56 to 8.54)e |
| ECOG | — | 1.09 (0.68 to 1.74) | 1.05 (0.66 to 1.68) | — | 2.33 (1.35 to 4.03)e |
| Pr. treatment | — | — | 1.32 (1.04 to 1.67)e | — | — |
| Lymphocytes | — | 0.83 (0.63 to 1.09) | — | — | 0.88 (0.59 to 1.33) |
| WBC | — | — | 1.00 (0.90 to 1.11) | — | — |
| Neutrophils | — | — | 1.04 (0.86 to 1.26) | — | — |
| NLR | — | 1.04 (0.99 to 1.09) | 1.01 (0.95 to 1.07) | — | 1.05 (1.02 to 1.08)e |
The main effects for each risk group with the low-risk group as the referent category (ie, HR = 1.00). Hazard ratios were not calculated for the cells with the em dash. CI = confidence interval; ECOG = Eastern Cooperative Oncology Group; HR = hazard ratio; MCC = Moffitt Cancer Center; NLR = neutrophils to lymphocytes ratio; PFS = progression-free survival; Pr. treatment = previous lines of treatments at current diagnosis; WBC = white blood cell.
These models included the clinical covariates that were found to be statistically significant different between the MCC1 and MCC2 cohorts (Table 1). The low risk group was the referent category.
These models included the clinical covariates that were found to be statistically significantly different between the Classification and Regression Tree risk groups in Supplementary Table 5 (available online).
Low-risk group refers to patients who have low gray level co-occurrence matrix (GLCM) inverse difference (≤0.43) and lower number of metastatic sites (n = 1). The moderate risk group refers to patients who either have low GLCM inverse difference (≤0.43) and higher number of metastatic sites (≥2), or patients who have higher GLCM inverse difference (>0.43), higher serum albumin (≥3.9), and lower number of metastatic sites (1). The high-risk group refers to either patients who have higher GLCM inverse difference (>0.43), higher serum albumin (≥3.9), and higher number of metastatic sites (≥2), or patients who have higher GLCM inverse difference (>0.43), lower serum albumin (<3.9), and lower number of metastatic sites (n = 1). The very high-risk group refers to patients who have higher GLCM inverse difference (>0.43), lower serum albumin (<3.9), and lower number of metastatic sites (≥2).eHazard ratios are statistically significant.
Figure 4.Gray level co-occurrence matrix (GLCM) inverse difference radiomic feature and CAIX expression. A-D) The association between GLCM inverse difference and CAIX expression based off 2 different probesets: merck2-DQ892208_at is presented in panels A and B, and merck-NM_001216_at is presented in panels C and D. E) The association between the automated pathology IHC scoring for CAIX and GLCM inverse difference. F) The correlation between the automated pathology IHC scoring for CAIX and GLCM inverse difference. In panels A, C, and E, the Mann-Whitney U test was used to calculate 2-sided P values, and the error bars depict Tukey whiskers (fences). In panels B, D, and F, Pearson correlation coefficient was used to calculate 2-sided P values.
Figure 5.Kaplan-Meier survival curves (overall survival) for the dichotomized radiomics feature (gray level co-occurrence matrix [GLCM] inverse difference). The same dichotomized cut-point found in the MCC1 training cohort was used for the survival analyses (A-C) in the training and 2 validation cohorts, respectively, and (D-G) the 4 prognostic datasets (ie, gene-expression dataset, National Lung Screening Trial (NLST) dataset, Moffitt adenocarcinoma dataset, and Maastricht Radiation Oncology Clinic (MAASTRO) adenocarcinoma dataset, respectively). The log-rank test was used to calculate 2-sided P values.