| Literature DB >> 34932879 |
Xinyu Ji1, Lijuan Lin1, Juanjuan Fan1, Yi Li2, Yongyue Wei1,3,4, Sipeng Shen1, Li Su3, Andrea Shafer5, Maria Moksnes Bjaanaes6, Anna Karlsson7, Maria Planck7, Johan Staaf7, Åslaug Helland6,8, Manel Esteller9,10,11,12, Ruyang Zhang1,3,4, Feng Chen1,4,13,14, David C Christiani3,5.
Abstract
The interaction between DNA methylation of tripartite motif containing 27 (cg05293407TRIM27 ) and smoking has previously been identified to reveal histologically heterogeneous effects of TRIM27 DNA methylation on early-stage non-small-cell lung cancer (NSCLC) survival. However, to understand the complex mechanisms underlying NSCLC progression, we searched three-way interactions. A two-phase study was adopted to identify three-way interactions in the form of pack-year of smoking (number of cigarettes smoked per day × number of years smoked) × cg05293407TRIM27 × epigenome-wide DNA methylation CpG probe. Two CpG probes were identified with FDR-q ≤ 0.05 in the discovery phase and P ≤ 0.05 in the validation phase: cg00060500KIAA0226 and cg17479956EXT2 . Compared to a prediction model with only clinical information, the model added 42 significant three-way interactions using a looser criterion (discovery: FDR-q ≤ 0.10, validation: P ≤ 0.05) had substantially improved the area under the receiver operating characteristic curve (AUC) of the prognostic prediction model for both 3-year and 5-year survival. Our research identified the complex interaction effects among multiple environment and epigenetic factors, and provided therapeutic target for NSCLC patients.Entities:
Keywords: DNA methylation; non-small-cell lung cancer; overall survival; prognostic prediction; three-way interaction
Mesh:
Substances:
Year: 2022 PMID: 34932879 PMCID: PMC8807353 DOI: 10.1002/1878-0261.13167
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 7.449
Fig. 1Flow chart of study design and statistical analyses. Patients with lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) from the Harvard, Spain, Norway, and Sweden cohorts were used in the discovery phase for screening, whereas data from the Cancer Genome Atlas (TCGA) were used for validation. Subgroup analyses were based on quartiles of iTWINS.
Demographic and clinical descriptions for early‐stage NSCLC patients with DNA methylation data in five international study centers.
| Variable | Discovery | Validation | Combined | ||||
|---|---|---|---|---|---|---|---|
| USA ( | Spain | Norway ( | Sweden ( | All ( | TCGA ( | Overall ( | |
| Age (years) | 67.78 ± 9.92 | 65.77 ± 10.66 | 65.4 ± 9.28 | 72.25 ± 7.22 | 66.7 ± 10.04 | 66.52 ± 9.3 | 66.61 ± 9.69 |
| Pack‐year of smoking | 51.49 ± 40.62 | 43.24 ± 31.72 | 26.93 ± 17.62 | 22.64 ± 25.19 | 40.06 ± 33.00 | 47.05 ± 28.40 | 43.36 ± 31.10 |
| Sex | |||||||
| Female | 66 (44.30%) | 99 (47.83%) | 70 (53.03%) | 22 (61.11%) | 257 (49.05%) | 185 (39.53%) | 442 (44.56%) |
| Male | 83 (55.70%) | 108 (52.17%) | 62 (46.97%) | 14 (38.89%) | 267 (50.95%) | 283 (60.47%) | 550 (55.44%) |
| Smoking status | |||||||
| Never | 18 (12.08%) | 28 (13.53%) | 16 (12.12%) | 18 (50.00%) | 80 (15.27%) | 0 (0.00%) | 80 (8.06%) |
| Former | 81 (54.36%) | 113 (54.59%) | 74 (56.06%) | 11 (30.56%) | 279 (53.24%) | 323 (69.02%) | 602 (60.69%) |
| Current | 50 (33.56%) | 66 (31.88%) | 42 (31.82%) | 7 (19.44%) | 165 (31.49%) | 145 (30.98%) | 310 (31.25%) |
| Clinical stage | |||||||
| I | 102 (68.46%) | 167 (80.68%) | 92 (69.70%) | 34 (94.44%) | 395 (75.38%) | 301 (64.32%) | 696 (70.16%) |
| II | 47 (31.54%) | 40 (19.32%) | 40 (30.30%) | 2 (5.56%) | 129 (24.62%) | 167 (35.68%) | 296 (29.84%) |
| Histology | |||||||
| LUAD | 96 (64.43%) | 169 (81.64%) | 132 (100.00%) | 28 (77.78%) | 425 (81.11%) | 227 (48.50%) | 652 (65.73%) |
| LUSC | 53 (35.57%) | 38 (18.36%) | 0 (0.00%) | 8 (22.22%) | 99 (18.89%) | 241 (51.50%) | 340 (34.27%) |
| Chemotherapy | |||||||
| No | 140 (93.96%) | 166 (91.21%) | 101 (76.52%) | 25 (89.29%) | 432 (87.98%) | 151 (75.88%) | 583 (84.49%) |
| Yes | 9 (6.04%) | 16 (8.79%) | 31 (23.48%) | 3 (10.71%) | 59 (12.02%) | 48 (24.12%) | 107 (15.51%) |
| Unknown | 0 | 25 | 0 | 8 | 33 | 269 | 302 |
| Radiotherapy | |||||||
| No | 130 (87.25%) | 172 (94.51%) | 131 (99.24%) | 28 (100.00%) | 461 (93.89%) | 190 (95.48%) | 651 (94.35%) |
| Yes | 19 (12.75%) | 10 (5.49%) | 1 (0.76%) | 0 (0.00%) | 30 (6.11%) | 9 (4.52%) | 39 (5.65%) |
| Unknown | 0 | 25 | 0 | 8 | 33 | 269 | 302 |
| Adjuvant therapy | |||||||
| No | 125 (83.89%) | 157 (86.26%) | 100 (75.76%) | 25 (89.29%) | 407 (82.89%) | 146 (73.37%) | 553 (80.14%) |
| Yes | 24 (16.11%) | 25 (13.74%) | 32 (24.24%) | 3 (10.71%) | 84 (17.11%) | 53 (26.63%) | 137 (19.86%) |
| Unknown | 0 | 25 | 0 | 8 | 33 | 269 | 302 |
| Survival year | |||||||
| Median (95% CI) | 6.61 (5.14–7.49) | 3.83 (3.21–4.46) | 5.40 (5.16–5.77) | 3.25 (2.09–4.39) | 5.01 (4.53–5.25) | 0.58 (0.50–0.70) | 2.31 (2.00–2.64) |
| Censoring rate | 18.79% | 54.59% | 68.94% | 52.78% | 47.90% | 76.07% | 61.19% |
TCGA, The Cancer Genome Atlas; 95% CI, 95% confidence interval; LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma.
Spain center is a collaborative study center, containing samples from Spain, Italy, the UK, France, and the USA.
Adjuvant therapy included chemotherapy and/or radiotherapy.
Restricted mean survival time was given because median was not available; proportion of samples lost to follow‐up or alive at the end of study.
Association results of two three‐way interactions in the discovery phase, the validation phase, and the combined data.
| Discovery phase | Validation phase | Combined data | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HR | 95% CI |
| FDR‐ | HR | 95% CI |
| HR | 95% CI |
| ||||
| cg05293407 | 0.13 | 0.03 | 0.53 | 4.21E‐03 | 0.23 | 0.01 | 6.94 | 0.396 | 0.14 | 0.04 | 0.49 | 2.02E‐03 | |
| cg00060500 | 0.67 | 0.51 | 0.88 | 4.24E‐03 | 0.59 | 0.29 | 1.19 | 0.142 | 0.65 | 0.51 | 0.83 | 6.09E‐04 | |
| Pack‐year | 0.90 | 0.86 | 0.94 | 2.06E‐06 | 0.88 | 0.80 | 0.97 | 0.012 | 0.90 | 0.86 | 0.93 | 1.04E‐07 | |
| cg05293407 × cg00060500 | 1.26 | 1.08 | 1.48 | 3.91E‐03 | 1.25 | 0.84 | 1.85 | 0.271 | 1.26 | 1.09 | 1.45 | 1.30E‐03 | |
| cg05293407 × pack‐year | 1.07 | 1.04 | 1.10 | 2.32E‐06 | 1.07 | 1.01 | 1.14 | 0.016 | 1.07 | 1.04 | 1..09 | 1.14E‐07 | |
| cg00060500 × pack‐year | 1.01 | 1.01 | 1.02 | 4.23E‐06 | 1.01 | 1.00 | 1.03 | 0.019 | 1.01 | 1.01 | 1.02 | 6.62E‐07 | |
| cg05293407 × cg00060500 × pack‐year | 0.993 | 0.990 | 0.996 | 7.79E‐06 | 0.039 | 0.992 | 0.986 | 0.999 | 0.021 | 0.993 | 0.991 | 0.996 | 8.91E‐07 |
| cg05293407 | 0.22 | 0.10 | 0.46 | 4.97E‐05 | 0.20 | 0.02 | 1.76 | 0.146 | 0.32 | 0.17 | 0.61 | 5.23E‐04 | |
| cg17479956 | 0.73 | 0.64 | 0.83 | 2.57E‐06 | 0.63 | 0.41 | 0.97 | 0.034 | 0.77 | 0.68 | 0.87 | 4.12E‐05 | |
| pack‐year | 0.95 | 0.93 | 0.97 | 1.37E‐05 | 0.91 | 0.85 | 0.99 | 0.019 | 0.95 | 0.93 | 0.97 | 1.74E‐05 | |
| cg05293407 × cg17479956 | 1.18 | 1.10 | 1.27 | 1.94E‐06 | 1.32 | 1.03 | 1.70 | 0.031 | 1.14 | 1.07 | 1.22 | 4.18E‐05 | |
| cg05293407 × pack‐year | 1.03 | 1.02 | 1.04 | 3.15E‐06 | 1.06 | 1.01 | 1.11 | 0.015 | 1.03 | 1.02 | 1.04 | 4.69E‐06 | |
| cg17479956 × pack‐year | 1.005 | 1.003 | 1.008 | 2.46E‐05 | 1.011 | 1.002 | 1.019 | 0.017 | 1.005 | 1.003 | 1.007 | 8.99E‐05 | |
| cg05293407 × cg17479956 × pack‐year | 0.997 | 0.996 | 0.998 | 1.16E‐05 | 0.046 | 0.993 | 0.987 | 0.998 | 0.011 | 0.997 | 0.996 | 0.999 | 5.72E‐05 |
Fig. 2The three‐way interaction pattern between pack‐year of smoking, cg05293407 and cg00060500 . (A) The pattern illustrated by a 3D plot. (B‐E) Kaplan–Meier survival analysis of cg00060500 stratified by pack‐year of smoking and cg05293407 . The number of patients in each subgroup was 266, 282, 211 and 233. Hazard ratio (HR), 95% CI, and P‐values were derived from a Cox proportional hazards regression model adjusted for age, sex, clinical stage and study centers.
Fig. 3Estimated survival curves for patients among iTWINS subgroups. (A) Kaplan–Meier survival curves for patients grouped by iTWINS subgroups. Patients were categorized into four subgroups by using the quantile of iTWINS as the cutoffs. The number of patients in groups 1‐4 was 248. (B) Hazard ratio (HR) and P‐values were derived from the Cox proportional hazards model for patients with different quartile levels of the iTWINS.
Fig. 4Forest plots of results from stratification analysis of iTWINS. Hazard ratio (HR) with 95% CI of iTWINS on non‐small‐cell lung cancer (NSCLC) survival in various subgroups is stratified by clinical characteristics. LUAD, lung adenocarcinoma; LUSC, lung squamous cell carcinoma. Hazard ratio (HR), 95% CI, and P‐values were derived from a Cox proportional hazards regression model for patients in different subgroups.
Fig. 5The functional enrichment analyses of DEGs. (A) Top 36 significant KEGG pathways. (B) Top 36 significant biological process pathways. (C) Top 36 significant cellular component pathways and (D) top 36 significant molecular function pathways.
Fig. 6The association analysis between immune cells and iTWINS. (A) Comparisons of the abundances of 22 immune cells in iTWINS subgroups. ‘ns’ means P > 0.05, ∗ means P < 0.05, ∗∗∗ means P < 0.001 and ∗∗∗∗ means P < 0.0001. (B) Correlation analysis between immune cells and iTWINS. Yellow bars meant correlation coefficient > 0 and P ≤ 0.05, blue bars meant correlation coefficient< 0 and P ≤ 0.05, and gray bars meant P > 0.05. Correlation coefficients and hypothesis tests were based on Pearson correlation tests.
Fig. 7ROC curves for different predictive models using the covariates (clinical information), two‐way interaction (cg05293407 and pack‐year of smoking) and three‐way interactions (cg05293407 , pack‐year of smoking and another CpG probe) with FDR‐q ≤ 0.05 or FDR‐q ≤ 0.10. (A) Three‐year survival prediction. (B) Five‐year survival prediction. The AUC increase (%) was evaluated by comparing the model with that with only the covariates. P‐values and 95% CIs were calculated by using 1000 bootstrap samples and t‐tests. AUC, area under the receiver operating characteristic curve; ROC, receiver operating characteristic.