| Literature DB >> 26183581 |
Meng Zhou1,2, Maoni Guo3, Dongfeng He4, Xiaojun Wang5, Yinqiu Cui6, Haixiu Yang7, Dapeng Hao8, Jie Sun9.
Abstract
BACKGROUND: Accumulated evidence suggests that dysregulated expression of long non-coding RNAs (lncRNAs) may play a critical role in tumorigenesis and prognosis of cancer, indicating the potential utility of lncRNAs as cancer prognostic or diagnostic markers. However, the power of lncRNA signatures in predicting the survival of patients with non-small cell lung cancer (NSCLC) has not yet been investigated.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26183581 PMCID: PMC4504221 DOI: 10.1186/s12967-015-0556-3
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Clinical features of all 603 NSCLC patients included in this study
| Features | GSE37745 (n = 196) | GSE31210 (n = 226) | GSE50081 (n = 181) | |
|---|---|---|---|---|
| Age (years), no (%) | ||||
| ≤65 | 94 (48.0) | 176 (77.9) | 59 (32.6) | |
| >65 | 102 (52.0) | 50 (22.1) | 122 (67.4) | |
| Gender, no (%) | ||||
| Male | 107 (55.0) | 105 (46.5) | 98 (54.1) | |
| Female | 89 (45.0) | 121 (53.5) | 83 (45.9) | |
| Vital status (%) | ||||
| Alive | 52 (26.5) | 191 (84.5) | 106 (58.6) | |
| Dead | 144 (73.5) | 35 (15.5) | 75 (41.4) | |
| Disease stage, no (%) | ||||
| I | 130 (66.0) | 168 (74.3) | 127 (70.2) | |
| II | 35 (18.0) | 58 (25.7) | 54 (29.8) | |
| III | 27 (14.0) | |||
| IV | 4 (2.0) | |||
| Smoking status | ||||
| Never-smoker | 115 (50.9) | 24 (13.3) | ||
| Ever-smoker | 111 (49.1) | 79 (43.6) | ||
| Current | 54 (29.8) | |||
| Undetermined | 21 (11.6) | |||
| Histology | ||||
| Adenocarcinoma | 106 (54.0) | 226 | 128 (70.7) | |
| Large cell carcinoma | 24 (12.0) | 7 (3.9) | ||
| Squamous cell carcinoma | 66 (34.0) | 43 (23.9) | ||
Eight lncRNAs significantly associated with the overall survival of NSCLC patients in the training set (n = 196)
| Ensembl id | Gene symbol | Chromosomal position |
| Hazard ratioa | Coefficienta |
|---|---|---|---|---|---|
| ENSG00000261578.1 |
| Chr11: 76,800,364-76,804,555(+) | 2.57E−05 | 1.374 | 0.318 |
| ENSG00000261731.2 |
| Chr16: 31,709,113-31,711,984(−) | 2.22E−04 | 0.725 | −0.322 |
| ENSG00000264198.2 |
| Chr17: 39,757,715-39,763,836(−) | 2.88E−04 | 0.738 | −0.303 |
| ENSG00000233642.1 |
| Chr10: 25,158,072-25,176,276(−) | 4.52E−04 | 1.334 | 0.288 |
| ENSG00000244558.3 |
| Chr20: 44,694,892-44,746,021(−) | 1.27E−03 | 0.750 | −0.287 |
| ENSG00000225420.1 |
| Chr2: 88,538,720-88,575,610 (+) | 1.85E−03 | 0.760 | −0.274 |
| ENSG00000251230.3 |
| Chr4: 184,844,585-184,855,751(−) | 2.01E−03 | 1.301 | 0.263 |
| ENSG00000240207.4 |
| Chr3: 158,732,263-158,784,070(+) | 2.80E−03 | 1.433 | 0.360 |
aDerived from the univariable Cox regression analysis in the training set.
Figure 1The eight-lncRNA signature-focused risk score in prognosis of overall survival in the GSE37745 patient set. a Receiver operating characteristic (ROC) analysis of the risk scores for overall survival prediction in the training dataset. The area under the curve (AUC) was calculated for ROC curves, and sensitivity and specificity were calculated to assess score performance. b The Kaplan–Meier curve for overall survival of two patient groups with high- and low-risk scores in the GSE37745 training set (n = 196). The differences between the two curves were evaluated by the two-sided log-rank test. c The eight lncRNA-based risk score distribution, patients’ survival status and heatmap of the eight lncRNA expression profiles. The black dotted line represents the cutoff value of the risk score derived from the training set which separated patients into high- and low-risk groups.
Univariable and multivariable Cox regression analysis of the lncRNA signature and overall survival of NSCLC patients in the training and two independent cohorts
| Variables | Univariable analysis | Multivariable analysis | |||||
|---|---|---|---|---|---|---|---|
| HR | 95% CI of HR |
| HR | 95% CI of HR |
| ||
| GSE37745 training set, n = 196 | |||||||
| Eight-lncRNA risk score | |||||||
| Low risk/high risk | 2.641 | 1.887–3.697 | <0.001 | 2.761 | 1.934–3.942 | <0.001 | |
| Age | |||||||
| ≤65/>65 | 1.355 | 0.977–1.878 | 0.069 | 1.427 | 1.024–1.986 | 0.035 | |
| Gender | |||||||
| Female/male | 1.096 | 0.789–1.523 | 0.585 | 0.913 | 0.640–1.303 | 0.616 | |
| Stage | |||||||
| I | 1 (reference) | 1 (reference) | |||||
| II | 1.220 | 0.793–1.875 | 0.366 | 1.169 | 0.758–1.802 | 0.479 | |
| III | 1.864 | 1.187–2.928 | 0.007 | 1.656 | 1.052–2.608 | 0.029 | |
| IV | 1.313 | 0.415–4.152 | 0.643 | 1.566 | 0.488–5.022 | 0.451 | |
| Subtype | |||||||
| Adenocarcinoma | 1 (reference) | 1 (reference) | |||||
| Large cell carcinoma | 0.891 | 0.520–1.528 | 0.675 | 0.782 | 0.449–1.360 | 0.383 | |
| Squamous cell carcinoma | 1.257 | 0.883–1.791 | 0.205 | 0.920 | 0.623–1.359 | 0.676 | |
| GSE31210 testing set, n = 226 | |||||||
| Eight-lncRNA risk score | |||||||
| Low risk/High risk | 3.067 | 1.471–6.395 | 0.003 | 2.643 | 1.263–5.528 | 0.010 | |
| Age | |||||||
| ≤65/>65 | 2.584 | 1.313–5.083 | 0.006 | 3.685 | 1.800–7.544 | <0.001 | |
| Gender | |||||||
| Female/male | 1.519 | 0.780–2.955 | 0.219 | 1.143 | 0.402–3.246 | 0.802 | |
| Smoking status | |||||||
| No/Yes | 1.637 | 0.837–3.201 | 0.15 | 1.388 | 0.482–3.996 | 0.544 | |
| Stage | |||||||
| I/II | 4.232 | 2.175–8.236 | <0.001 | 4.363 | 2.161–8.811 | <0.001 | |
| GSE50081 testing set, n = 181 | |||||||
| Eight-lncRNA risk score | |||||||
| Low risk/high risk | 1.795 | 1.127–2.859 | 0.014 | 1.752 | 1.014–3.026 | 0.044 | |
| Age | |||||||
| ≤65/>65 | 1.559 | 0.932–2.608 | 0.090 | 1.316 | 0.752–2.303 | 0.336 | |
| Gender | |||||||
| Female/male | 1.934 | 1.190–3.143 | 0.008 | 1.743 | 1.011–3.005 | 0.046 | |
| Smoking status | |||||||
| No/Yes | 1.387 | 0.659–2.916 | 0.389 | 1.054 | 0.476–2.333 | 0.897 | |
| Stage | |||||||
| I/II | 1.689 | 1.049–2.718 | 0.031 | 2.359 | 1.379–4.034 | 0.002 | |
| Subtype | |||||||
| Adenocarcinoma | 1 (reference) | 1 (reference) | |||||
| Large cell carcinoma | 1.326 | 0.479–3.671 | 0.587 | 1.094 | 0.376–3.184 | 0.870 | |
| Squamous cell carcinoma | 0.791 | 0.456–1.371 | 0.403 | 0.479 | 0.241–0.952 | 0.036 | |
Figure 2The eight-lncRNA signature-focused risk score in prognosis of overall survival in additional validation datasets. a Kaplan–Meier survival curves were plotted for GSE31210 (n = 226). b The eight lncRNA-based risk score distribution, patients’ survival status and heatmap of the eight lncRNA expression profiles were analyzed in the GSE31210. c Kaplan–Meier survival curves were plotted for GSE50081 (n = 181). d The eight lncRNA-based risk score distribution, patients’ survival status and heatmap of the eight lncRNA expression profiles were analyzed in the GSE50081.
Figure 3Survival analyses of all patients with available age or tumor stage information using the eight-lncRNA signature. a Kaplan–Meier survival curves for younger patients with NSCLC (age ≤65, n = 337). b Kaplan–Meier survival curves for elder patients with NSCLC (age >65, n = 226). c Survival prediction in early stage patients: Kaplan–Meier survival curves for all patients with stage I and II (n = 165). d Survival prediction in late stage patients: Kaplan–Meier survival curves for all patients with stage III and IV (n = 31).
Figure 4Functional enrichment results of the co-expressed protein-coding genes with prognostic lncRNAs. a Significantly enriched GO terms of the co-expressed protein-coding genes with prognostic lncRNAs. b The functional enrichment map of GO terms. Each node represents a GO term, which are grouped and annotated by GO similarity. A link represents the overlap of shared genes between connecting GO terms. Node size represents the number of gene in the GO terms. Color intensity is proportional to enrichment significance.