| Literature DB >> 32601340 |
Jaileene Pérez-Morales1, Ilke Tunali2,3, Olya Stringfield2, Steven A Eschrich4, Yoganand Balagurunathan4, Robert J Gillies2, Matthew B Schabath5,6.
Abstract
The National Lung Screening Trial (NLST) demonstrated that screening with low-dose computed tomography (LDCT) is associated with a 20% reduction in lung cancer mortality. One potential limitation of LDCT screening is overdiagnosis of slow growing and indolent cancers. In this study, peritumoral and intratumoral radiomics was used to identify a vulnerable subset of lung patients associated with poor survival outcomes. Incident lung cancer patients from the NLST were split into training and test cohorts and an external cohort of non-screen detected adenocarcinomas was used for further validation. After removing redundant and non-reproducible radiomics features, backward elimination analyses identified a single model which was subjected to Classification and Regression Tree to stratify patients into three risk-groups based on two radiomics features (NGTDM Busyness and Statistical Root Mean Square [RMS]). The final model was validated in the test cohort and the cohort of non-screen detected adenocarcinomas. Using a radio-genomics dataset, Statistical RMS was significantly associated with FOXF2 gene by both correlation and two-group analyses. Our rigorous approach generated a novel radiomics model that identified a vulnerable high-risk group of early stage patients associated with poor outcomes. These patients may require aggressive follow-up and/or adjuvant therapy to mitigate their poor outcomes.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32601340 PMCID: PMC7324394 DOI: 10.1038/s41598-020-67378-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Patient characteristics in the training and test cohorts.
| Characteristics | Total (N = 234) | Training Cohort (N = 161) | Test Cohort (N = 73) | Validation Cohort (N = 62) | |
|---|---|---|---|---|---|
| Age, mean (sd) | 63.8 (5.1) | 63.9(5.1) | 63.5 (5.3) | 0.63 | 67.2 (9.47) |
| By sex | |||||
| Female | 36.0 (5.2) | 63.2 (5.2) | 62.8 (5.2) | 67.73 (8.92) | |
| Male | 64.3 (5.0) | 64.3 (4.9) | 64.3 (5.3) | 66.58 (10.10) | |
| Sex, N(%) | 0.06 | ||||
| Female | 101 (43%) | 63 (39%) | 38 (52%) | 30 (48.4%) | |
| Male | 133 (57%) | 98 (61%) | 35 (48%) | 31 (50%) | |
| Missing | 1 (1.6%) | ||||
| Smoking status, N(%) | 0.82 | ||||
| Former | 100 (43%) | 68 (42%) | 32 (44%) | 59 (95.2%) | |
| Current | 134 (57%) | 93 (58%) | 41 (56%) | 0(0%) | |
| Never | 2 (3.2%) | ||||
| Missing | 1 (1.6%) | ||||
| No. pack-years, mean (SD) | 64.7 (23.9) | 64.5 (23.6) | 65.1 (24.9) | 0.86 | n/a |
| FH of lung cancer | 0.74 | ||||
| No | 170 (73%) | 118 (73%) | 52 (71%) | n/a | |
| Yes | 64 (27%) | 43 (27%) | 21 (29%) | n/a | |
| Self-reported history of COPD | 0.02 | ||||
| No | 211 (90%) | 150 (93%) | 61 (84%) | n/a | |
| Yes | 23 (10%) | 11 (7%) | 12 (16%) | n/a | |
| Histology | 0.23 | ||||
| Adenocarcinoma-BAC4 | 130 (56%) | 96 (60%) | 34 (47%) | 55 (88.7%) | |
| Squamous | 44 (19%) | 28 (17%) | 16 (22%) | 0 (0) | |
| Small Cell | 10 (4%) | 5 (3%) | 5 (7%) | 0 (0) | |
| Other NOS | 50 (21%) | 32 (20%) | 18 (25%) | 0 (0) | |
| Missing | 7 (11.3%) | ||||
| Treatment | 0.22 | ||||
| Surgical resection | 182 (78%) | 129 (80%) | 53 (73%) | n/a | |
| Chemotherapy/Other | 21 (9%) | 11 (7%) | 10 (14%) | n/a | |
| Radiation therapy | 31 (13%) | 21 (13%) | 10 (14%) | n/a | |
| Staging | 0.70 | ||||
| I and II | 179 (76%) | 122 (76%) | 57 (78%) | 45 (72.6%) | |
| III and IV | 55 (24%) | 39 (24%) | 16 (22%) | 17 (27.4%) | |
| Baseline Screening | 0.19 | ||||
| Positive (T0+) | 158 (68%) | 113 (70%) | 45 (62%) | n/a | |
| Negative (T0−) | 76 (32%) | 48 (30%) | 28 (38%) | n/a | |
| 5-year overall survival rate | 64% | 62.60% | 56.20% | 0.44 | 64% |
Abbreviations: SD = standard deviation; FH = family history; Pack-years = packs smoked/day x years smoked; COPD = chronic obstructive pulmonary disease; NOS = not otherwise specified;
1 P value obtained from Chi-squared for categorical variables.
2 P value obtained from T-test for continuous variables.
3 P value obtained from Log-rank for survival variables.
4 BAC and adenocarcinoma were combined into one group.
Figure 1Identification of risk groups based on peritumoral and intratumoral features (A) Statistical analysis pipeline for radiomics feature selection. (B) The tree structure of the classification and regression tree analysis (CART) which identified four risk groups based on two radiomics features, in which we combined the two intermediate risk groups.
Figure 2Risk-groups associated with overall survival for training and test cohorts and among early stage patients. Across the training and test cohort as well as in early-stage the high-risk group had a significantly worse outcome in OS (A) and PFS (B).
Patient characteristics of the three risk groups in the total data set (N = 234).
| Characteristics | Low risk group (N = 153) | Intermediate risk group (N = 69) | High risk (N = 12) | |
|---|---|---|---|---|
| Age, mean (sd) | 63.1 (4.9) | 65.2 (5.4) | 63.7 (3.4) | 0.07 |
| Female | 63.1 (4.9) | 64.7 (5.6) | 64 | |
| Male | 63.8 (4.9) | 65.6 (5.4) | 63.6 (3.6) | |
| Sex, N (%) | ||||
| Female | 71 (46%) | 29 (42%) | 1 (8%) | |
| Male | 82 (54%) | 40 (58%) | 11 (92%) | |
| Smoking, N (%) | 0.43 | |||
| No | 66 (43%) | 31 (45%) | 3 (25%) | |
| Yes | 87 (57%) | 38 (55%) | 9 (75%) | |
| Pack-years, mean (sd) | 63.2 (23.8) | 66.5 (23.9) | 74.0 (26.7) | 0.95 |
| FH of lung cancer | 0.49 | |||
| No | 115 (75%) | 47 (68%) | 8 (67%) | |
| Yes | 38 (25%) | 22 (32%) | 4 (33%) | |
| Self-reported history of COPD | 0.19 | |||
| No | 140 (92%) | 59 (86%) | 12 (100%) | |
| Yes | 13 (8%) | 10 (14%) | 0 (0%) | |
| Histology | 0.27 | |||
| Adenocarcinoma-BAC4 | 91 (59%) | 36 (52%) | 3 (25%) | |
| Squamous | 26 (17%) | 13 (19%) | 5 (42%) | |
| Small cell | 5 (3%) | 4 (6%) | 1 (8%) | |
| Other NOS | 31 (20%) | 16 (23%) | 3 (25%) | |
| Treatment | 0.13 | |||
| Surgical | 125 (82%) | 50 (72%) | 7 (58%) | |
| Chemotherapy/Other | 13 (8%) | 7 (10%) | 1 (8%) | |
| Radiation therapy | 15 (10%) | 12 (17%) | 4 (33%) | |
| Staging | ||||
| Early stage (I and II) | 123 (80%) | 52 (75%) | 4 (33%) | |
| Late stage (III and IV) | 30 (20%) | 17 (25%) | 8 (67%) | |
| Baseline screening | 0.06 | |||
| Positive (T0+) | 111 (73%) | 41 (59%) | 6 (50%) | |
| Negative (T0−) | 42 (27%) | 28 (41%) | 6 (50%) | |
| 2.5-year overall survival rate, % | ||||
| Training | 89% | 63% | 10% | |
| Test Cohort | 68% | 78% | 50% | |
| 5-year overall survival rate, % | ||||
| Training | 77% | 41% | n/a | |
| Test Cohort | 51% | 73% | n/a |
Abbreviations: sd = standard deviation; FH = family history; Pack-years = packs smoked/day x years smoked; COPD = chronic obstructive pulmonary disease; NOS = not otherwise specified;
1 P-value obtained from Chi-squared for categorical variables.
2 P-value obtained from Anova for continuous variables.
3 P-value obtained from Log-rank for survival variables.
4 BAC and adenocarcinoma were combined into one group.
Figure 3Overall survival for the risk patient risk groups among non-screen detected adenocarcinoma lung cancers (A) and for early-stage (B) for late-stage patients.
Multivariable Cox proportional hazards models for overall survival in the training and test cohorts.
| Characteristic | |||
|---|---|---|---|
| Risk group | |||
| Low | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Intermediate | 1.55 (0.95, 2.53) | 0.46 (0.15, 1.43) | |
| High | 0.93 (0.14, 6.33) | ||
| Sex | |||
| Female | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Male | 1.46 (0.88, 2.41) | 1.13 (0.46, 2.74) | |
| Smoking status | |||
| Former | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Current | 1.13 (0.72, 1.80) | 0.96 (0.53, 1.75) | 1.71 (0.68, 4.34) |
| Baseline screening (T0) | |||
| Positive (T0+) | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Negative (T0−) | 1.19 (0.72, 1.97) | 0.84 (0.44, 1.61) | 2.55 (0.94, 6.96) |
| Stage | |||
| I and II | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| III and IV | 1.68 (0.55, 5.11) | ||
| Histology | |||
| Adenocarcinoma-BAC1 | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Squamous | 1.31 (0.70, 2.46) | 1.33 (0.85, 3.03) | 1.19 (0.37, 3.84) |
| Small Cell | 1.80 (0.79, 4.11) | 2.21 (0.69, 7.08) | 2.32 (0.62, 8.64) |
| Other NOS | 1.18 (0.67, 2.06) | 1.49 (0.71, 3.10) | 2.16 (0.70, 6.73) |
| Treatment | |||
| Surgical | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Chemotherapy/Other | |||
| Radiation | 1.55 (0.64, 3.76) | ||
| Harrell's C index | 0.79 | 0.83 | 0.81 |
Abbreviations: HR: Hazard Ratios; NOS: Not otherwise specified;
Data in parentheses are 95% CIs.
1BAC and adenocarcinoma were combined into one group.
2 “All patients” combines the training and test sets into a single cohort.
Multivariable Cox proportional hazards models for progression free survival in the training and test cohorts.
| Characteristic | All Patients HR (95% CI) | Training Cohort HR (95% Cl) | Test Cohort HR (95% CI) |
|---|---|---|---|
| Risk Group | |||
| Low | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Intermediate | 1.42 (0.92, 2.22) | ||
| High | 2.02 (0.34, 11.99) | ||
| Sex | |||
| Female | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Male | 0.92 (0.60, 1.42) | 1.29 (0.72, 2.31) | 0.60 (0.27, 1.32) |
| Smoking Status | |||
| Former | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Current | 1.21 (0.79, 1.85) | 0.98 (0.57, 1.69) | 1.77 (0.74, 4.23) |
| Baseline Screening | |||
| Positive (T0+) | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Negative (T0−) | 1.47 (0.94, 2.31) | 1.02 (0.57, 1.83) | |
| Stage | |||
| I and II | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| III and IV | 1.42 (0.55, 3.69) | ||
| Histology | |||
| Adenocarcinoma-BAC1 | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Squamous | 1.03 (0.60, 1.87) | 1.06 (0.50, 2.26) | 1.12 (0.36, 3.49) |
| Small Cell | 1.49 (0.68, 3.25) | 2.31 (0.75, 7.13) | 1.49 (0.41, 5.37) |
| Other NOS | 1.05 (0.63, 1.74) | 0.95 (0.48, 1.89) | 2.31 (0.86, 6.21) |
| Treatment | |||
| Surgical | 1.00 (Reference) | 1.00 (Reference) | 1.00 (Reference) |
| Chemotherapy/Other | |||
| Radiation | 1.60 (0.69, 3.69) | ||
| Harrel’s C statistics | 0.79 | 0.81 | 0.80 |
Figure 4Time-dependent AUC plot of the multivariable model for overall survival for (A) all patients and (B) early-stage patients.
Figure 5The association between radiomics gene expression. (A) Correlation between intratumoral RMS radiomic feature and FOXF2. (B) FOXF2 expression by dichotomizing RMS at the median and FOXF2 expression by the three patient risk groups. (C) RABGAP1L expression by dichotomizing NGTDM busyness at the median and NGTDM busyness expression by the three patient risk groups.