| Literature DB >> 28978079 |
Lei Zhong1, Ge Lou2, Xinglu Zhou3, Youyou Qin1, Lin Liu1, Wenqian Jiang1.
Abstract
Dysregulated expression of lncRNAs has been observed in various human complex diseases (including cancers) by recent transcriptional profiling studies, highlighting potentials of lncRNAs as biomarkers for cancer diagnosis and prognosis. Despite some efforts have been made to search for novel lncRNA signature in breast cancer, the prognostic value of lncRNAs for ER-positive breast cancer patients still needs to be systematically investigated. In this study, we analyzed lncRNA expression profiles in a large of more than 600 breast cancer patients with ER-positive status from The Cancer Genome Atlas (TCGA) and identified six lncRNAs that are significantly associated with survival. Then a linear risk score model comprising six prognostic lncRNAs, termed six-lncRNA signature, was developed to identify high-risk patients from low-risk cases. The results of Kaplan-Meier analysis and ROC curves demonstrated the good sensitivity and specificity in survival prediction both in the training and testing datasets. Multivariate Cox regression analysis and stratified analysis showed that the six-lncRNA signature is an independent prognostic marker in survival prediction for ER-positive breast cancer patients. The GO enrichment analysis suggested that the six-lncRNA might involve with known breast cancer-related biological processes. With further experimental validation, these identified prognostic lncRNAs might have clinical implications for more personalized risk assessment for ER-positive breast cancer patients.Entities:
Keywords: breast cancer; estrogen receptor; long non-coding RNAs; prognosis
Year: 2017 PMID: 28978079 PMCID: PMC5620219 DOI: 10.18632/oncotarget.18919
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Detailed information of prognostic lncRNAs significantly associated with the overall survival in the training dataset
| Ensembl version | Gene name | Position | Hazarda | Coefficienta | |
|---|---|---|---|---|---|
| ENSG00000224189 | HAGLR | Chr 2: 176, 173, 195–176, 188, 958(−) | 2.088 | 0.736 | < 0.001 |
| ENSG00000227477 | STK4-AS1 | Chr 20: 44, 963, 794–44, 966, 402(−) | 1.389 | 0.329 | 0.002 |
| ENSG00000237152 | DLEU7-AS1 | Chr 13: 50, 807, 856–50, 849, 905(+) | 1.608 | 0.475 | 0.01 |
| ENSG00000235314 | LINC00957 | Chr 7: 44, 039, 171–44, 042, 306(+) | 1.711 | 0.537 | 0.025 |
| ENSG00000230838 | LINC01614 | Chr 2: 215, 718, 043–215, 719, 424(+) | 1.529 | 0.425 | 0.031 |
| ENSG00000231249 | ITPR1-AS1 | Chr 3: 4, 490, 891–4, 493, 163(−) | 1.499 | 0.405 | 0.039 |
aDerived from the univariate Cox proportional hazards regression analysis in the training dataset.
Figure 1The performance of the six-lncRNA signature for survival prediction in the training dataset
(A) Kaplan-Meier estimates of the overall survival between high-risk group and low-risk group in the training dataset. (B) ROC curves of the six-lncRNA signature at three and five years.
Univariate and multivariate Cox regression analyses in the training, testing and entire TCGA datasets
| Variables | Univariate analysis | Multivariate analysis | |||||
|---|---|---|---|---|---|---|---|
| HR | 95% CI of HR | HR | 95% CI of HR | ||||
| Six-lncRNA signature | High/Low | 1.543 | 1.341–1.776 | < 0.001 | 1.455 | 1.257–1.685 | < 0.001 |
| Age | >= 59/< 59 | 1.029 | 1.006–1.053 | 0.015 | 1.019 | 0.995–1.044 | 0.117 |
| Stage | (III/IV)/(I/II) | 2.109 | 1.067–4.171 | 0.032 | 1.739 | 0.821–3.684 | 0.148 |
| PR | +/− | 1.604 | 0.628–4.1 | 0.323 | 1.156 | 0.443–3.014 | 0.767 |
| Six-lncRNA signature | High/Low | 1.127 | 1.071–1.458 | 0.036 | 1.189 | 1.166–1.525 | 0.017 |
| Age | >= 59/< 59 | 1.053 | 1.027–1.08 | < 0.001 | 1.062 | 1.033–1.092 | < 0.001 |
| Stage | (III/IV)/(I/II) | 2.121 | 1.11–4.052 | 0.023 | 2.44 | 1.272–4.68 | 0.007 |
| PR | +/− | 0.356 | 0.165–0.766 | 0.008 | 0.421 | 0.182–0.971 | 0.042 |
| Six-lncRNA signature | High/Low | 1.474 | 1.305–1.666 | < 0.001 | 1.473 | 1.284–1.690 | < 0.001 |
| Age | >= 59/< 59 | 1.041 | 1.024–1.059 | < 0.001 | 1.046 | 1.028–1.065 | < 0.001 |
| Stage | (III/IV)/(I/II) | 2.222 | 1.402–3.522 | < 0.001 | 2.489 | 1.564–3.962 | < 0.001 |
| PR | +/− | 0.836 | 1.305–1.666 | 0.541 | 0.724 | 0.391–1.340 | 0.303 |
Figure 2Validation of the six-lncRNA signature for survival prediction in the testing dataset
(A) Kaplan-Meier estimates of the overall survival between high-risk group and low-risk group in the testing dataset. (B) ROC curves of the six-lncRNA signature at three and five years.
Figure 3Validation of the six-lncRNA signature for survival prediction in the entire TCGA dataset
(A) Kaplan-Meier estimates of the overall survival between high-risk group and low-risk group in the entire TCGA dataset. (B) ROC curves of the six-lncRNA signature at three and five years.
Figure 4Kaplan–Meier estimates of the overall survival of patients with different clinical features
Kaplan-Meier estimates of the overall survival between high-risk group and low-risk group for younger patients (A) and elder patients (B). Kaplan–Meier estimates of the overall survival between high-risk group and low-risk group for early-stage patients (C) and advanced-stage patients (D).
Figure 5GO-based functional enrichment analysis
Summary of clinical characteristics of patients with ER-positive breast cancer in each dataset
| Variables | Training dataset ( | Testing dataset ( | Entire TCGA dataset ( | |
|---|---|---|---|---|
| Age, years, | >= 59 | 150 (48.5) | 170 (55.2) | 320 (51.9) |
| < 59 | 159 (51.5) | 138 (44.8) | 297 (48.1) | |
| PR status, | Positive | 259 (83.8) | 265 (86.0) | 524 (84.9) |
| Negative | 48 (15.5) | 42 (13.6) | 90 (14.6) | |
| Unknown | 2 (0.6) | 1 (0.3) | 3 (0.5) | |
| Stage, | I/II | 226 (73.1) | 227 (73.7) | 453 (73.4) |
| III/IV | 76 (24.6) | 72 (23.4) | 148 (24.0) | |
| Unknown | 7 (2.3) | 9 (2.9) | 16 (2.6) | |
| Survival status, | Alive | 267 (86.4) | 264 (85.7) | 531 (86.1) |
| Dead | 42 (13.6) | 44 (14.3) | 86 (13.9) |