| Literature DB >> 35218396 |
Jing Zhu1, Jinxing Guan2, Xinyu Ji2, Yunjie Song2, Xiaoshuang Xu2, Qianqian Wang3, Quanan Zhang4, Renhua Guo5, Rui Wang6, Ruyang Zhang7,8.
Abstract
Long noncoding RNA (lncRNA) are involved in regulating physiological behaviors for various malignant tumors, including non-small-cell lung cancer (NSCLC). However, few studies comprehensively evaluated both lncRNA-lncRNA interaction effects and main effects of lncRNA on overall survival of NSCLC. Hence, we performed a two-phase designed study of lncRNA expression in tumor tissues using 604 NSCLC patients from The Cancer Genome Atlas as the discovery phase and 839 patients from Gene Expression Omnibus as the validation phase. In the discovery phase, we adopted a two-step strategy, Screening before Testing, for dimension reduction and signal detection. These candidate lncRNAs first screened out by the weighted random forest (Ranger), were then tested through the Cox proportional hazards model adjusted for covariates. Significant lncRNAs with either type of effects aforementioned were carried forward into the validation phase to confirm their significances again. As a result, in the discovery phase, 19 lncRNAs were identified by Ranger, among which five lncRNAs and one pair of lncRNA-lncRNA interaction exhibited significant effects (FDR-q ≤ 0.05) main and interaction effects on NSCLC survival, respectively, through Cox model. After the independent validation, we finally observed that one lncRNA (ENSG00000227403.1) with main effect was robustly associated with NSCLC prognosis (HRdiscovery = 0.90, P = 1.20 × 10-3; HRvalidation = 0.94, P = 4.11 × 10-3) and one pair of lncRNAs (ENSG00000267121.4 and ENSG00000272369.1) had significant interaction effect on NSCLC survival (HRdiscovery = 1.12, P = 3.07 × 10-4; HRvalidation = 1.11, P = 0.0397). Our comprehensive NSCLC prognostic study of lncRNA provided population-level evidence for further functional study.Entities:
Keywords: Interaction; Main effect; Non-small-cell lung cancer; Overall survival; lncRNA
Mesh:
Substances:
Year: 2022 PMID: 35218396 PMCID: PMC8960609 DOI: 10.1007/s00438-022-01869-3
Source DB: PubMed Journal: Mol Genet Genomics ISSN: 1617-4623 Impact factor: 3.291
Demographic and clinical descriptions of NSCLC patients in the discovery and validation phases
| Variable | Discovery phase ( | Validation phase ( |
|---|---|---|
| Age (years) | 66.02 ± 9.55 | 63.51 ± 9.34 |
| Gender, | ||
| Male | 355 (58.77%) | 411 (56.38%) |
| Female | 249 (41.23%) | 318 (43.62%) |
| Unknown | 0 | 110 |
| Race, | ||
| Asian | 8 (1.32%) | 0 |
| Black | 55 (9.11%) | 0 |
| White | 541 (89.57%) | 0 |
| Unknown | 0 | 839 |
| Smoke status, | ||
| Never | 0 | 26 (6%) |
| Former | 416 (68.87%) | 243 (56.12%) |
| Current | 188 (31.13%) | 164 (37.88%) |
| Unknown | 0 | 406 |
| Smoking packs every year | 47.38 ± 29.96 | – |
| Clinical stage, | ||
| I | 328 (54.30%) | 496 (68.04%) |
| II | 171 (28.31%) | 172 (23.59%) |
| III | 88 (14.57%) | 53 (7.27%) |
| IV | 17 (2.81%) | 8 (1.10%) |
| Unknown | 0 | 110 |
| Histology, | ||
| LUAD | 294 (48.68%) | 634 (75.57%) |
| LUSC | 310 (51.32%) | 205 (24.43%) |
| Survival year | ||
| Median (95% CI) | 3.91 (3.33–4.93) | 7.53 (6.30–8.92) |
| Censoring rate | 59.60% | 55.66% |
The validation phase consists of six datasets from GEO, including GSE3141 (N = 110), GSE37745 (N = 172), GSE30219 (N = 104), GSE50081 (N = 172), GSE31210 (N = 226) and GSE29013 (N = 55)
Fig. 1Statistical analysis work flow
Association results of one lncRNA and one pair of lncRNAs derived from Cox proportional hazards model adjusted for covariates in main effect and interaction effect analyses
| Type of analysis | LncRNA | Discovery phase | Validation phase | ||
|---|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | ||||
| Main effect | ENSG00000227403.1 | 0.90 (0.84, 0.96) | 1.20 × 10–3 | 0.94 (0.90, 0.98) | 4.11 × 10–3 |
| Interaction effect | ENSG00000272369.1 | 0.69 (0.57, 0.83) | 8.28 × 10–5 | 0.44 (0.24, 0.79) | 6.41 × 10–3 |
| ENSG00000267121.4 | 1.30 (1.12, 1.51) | 5.61 × 10–4 | 0.38 (0.17, 0.88) | 0.0228 | |
| Interaction term | 1.12 (1.05, 1.19) | 3.07 × 10–4 | 1.11 (1.01, 1.23) | 0.0397 | |
In the validation phase, the surrogates of ENSG00000227403.1, ENSG00000272369.1 and ENSG00000267121.4 were ENSG00000253738.1, ENSG00000265666.1 and ENSG00000227039.5, respectively
Fig. 2Kaplan–Meier survival curves for patients with low and high expression of ENSG00000227403.1 in the discovery and validation phases
Fig. 3Line and forest plots illustrating lncRNA–lncRNA interaction effect between ENSG00000267121.4 and ENSG0000027236.9 on NSCLC survival
Fig. 4Kaplan–Meier survival curves for patients having different expression levels of two lncRNAs