| Literature DB >> 35432441 |
Ze-Jun Zheng1, Yan-Shang Li1,2, Jun-De Zhu1, Hai-Ying Zou1, Wang-Kai Fang1, Yi-Yao Cui3, Jian-Jun Xie1.
Abstract
Esophageal squamous cell carcinoma (ESCC) is a common malignant gastrointestinal tumor threatening global human health. For patients diagnosed with ESCC, determining the prognosis is a huge challenge. Due to their important role in tumor progression, long non-coding RNAs (lncRNAs) may be putative molecular candidates in the survival prediction of ESCC patients. Here, we obtained three datasets of ESCC lncRNA expression profiles (GSE53624, GSE53622, and GSE53625) from the Gene Expression Omnibus (GEO) database. The method of statistics and machine learning including survival analysis and LASSO regression analysis were applied. We identified a six-lncRNA signature composed of AL445524.1, AC109439.2, LINC01273, AC015922.3, LINC00547, and PSPC1-AS2. Kaplan-Meier and Cox analyses were conducted, and the prognostic ability and predictive independence of the lncRNA signature were found in three ESCC datasets. In the entire set, time-dependent ROC curve analysis showed that the prediction accuracy of the lncRNA signature was remarkably greater than that of TNM stage. ROC and stratified analysis indicated that the combination of six-lncRNA signature with the TNM stage has the highest accuracy in subgrouping ESCC patients. Furthermore, experiments subsequently confirmed that one of the lncRNAs LINC01273 may play an oncogenic role in ESCC. This study suggested the six-lncRNA signature could be a valuable survival predictor for patients with ESCC and have potential to be an auxiliary biomarker of TNM stage to subdivide ESCC patients more accurately, which has important clinical significance.Entities:
Keywords: LASSO; LINC01273; esophageal squamous cell carcinoma; long non-coding RNAs; machine learning; prognosis
Year: 2022 PMID: 35432441 PMCID: PMC9008717 DOI: 10.3389/fgene.2022.839589
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Clinical features of the ESCC patients from GEO.
| Features | GSE53624 | GSE53622 | GSE53625 |
|---|---|---|---|
| Age (years) | |||
| ≤60 | 61 | 29 | 90 |
| >60 | 58 | 31 | 89 |
| Sex | |||
| Female | 21 | 12 | 33 |
| Male | 98 | 48 | 146 |
| Tumor grade | |||
| G1 | 23 | 9 | 32 |
| G2 | 64 | 34 | 98 |
| G3 | 32 | 17 | 49 |
| T stage | |||
| 1 | 8 | 4 | 12 |
| 2 | 20 | 7 | 27 |
| 3 | 62 | 48 | 110 |
| 4 | 29 | 1 | 30 |
| N stage | |||
| 0 | 54 | 29 | 83 |
| 1 | 42 | 20 | 62 |
| 2 | 13 | 9 | 22 |
| 3 | 10 | 2 | 12 |
| TNM stage | |||
| 1 | 6 | 4 | 10 |
| 2 | 47 | 30 | 77 |
| 3 | 66 | 26 | 92 |
| Survival status | 0 | ||
| Alive | 46 | 30 | 76 |
| Dead | 73 | 30 | 103 |
FIGURE 1Derivation and selection of the lncRNA signature in the training dataset. (A) Univariate Cox regression and KM analysis identified 209 prognosis-related lncRNAs in the training dataset. (B) LASSO coefficient profiles for the 209-lncRNA set in the training dataset. (C) Cross-validation error rates for selecting the tuning parameters. (D) Hazard ratio of the selected lncRNAs by LASSO. (E) The AUC values of 127 multi-lncRNA signatures were calculated by ROC curve analysis. (F) ROC curve analysis for the 127 combinations and selected six-lncRNA signature in the training dataset.
Prognostic significance of the six lncRNAs in the signature.
| Ensembl ID | Gene name | HR | 95% CI of HR |
| Chromosome location | |
|---|---|---|---|---|---|---|
| Lower | Upper | |||||
| ENSG00000233461 | AL445524.1 | 0.579 | 0.432 | 0.778 | <0.001 | 1:231520729-231528618: 1 |
| ENSG00000250284 | AC109439.2 | 0.781 | 0.664 | 0.918 | 0.003 | 5:136734830-136763409:1 |
| ENSG00000231742 | LINC01273 | 1.526 | 1.148 | 2.026 | 0.004 | 20:50171809-50176676:1 |
| ENSG00000276855 | AC015922.3 | 2.209 | 1.403 | 3.477 | 0.001 | 17:15789016-15789705:1 |
| ENSG00000275226 | LINC00547 | 2.223 | 1.523 | 3.243 | <0.001 | 13:37534940-37551536:1 |
| ENSG00000226352 | PSPC1-AS2 | 2.273 | 1.587 | 3.256 | <0.001 | 13:19674624-19675884:1 |
FIGURE 2Kaplan–Meier analysis of the six-lncRNA signature in the GSE53624 (A), GSE53622 (B), and GSE53625(C) datasets.
FIGURE 3Expression heatmap of the six lncRNAs, plot of six-lncRNA risk scores, and ESCC patient’s survival status in the GSE53624 (A), GSE53622 (B), and GSE53625 (C) datasets.
Cox regression analysis of the signature with ESCC survival.
| Univariable analysis | Multivariable analysis | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Variables | HR | 95% CI of HR |
| HR | 95% CI of HR |
| |||
| Lower | Upper | Lower | Upper | ||||||
| GSE53624 | |||||||||
| Age | >60 | 1.42 | 0.90 | 2.25 | 0.14 | 1.66 | 1.02 | 2.72 | 0.04 |
| Sex | Male | 0.83 | 0.47 | 1.46 | 0.51 | 1.29 | 0.70 | 2.38 | 0.42 |
| TNM stage | III | 1.90 | 1.23 | 2.95 | <0.001 | 1.80 | 1.15 | 2.83 | 0.01 |
| Signature | High risk | 4.50 | 2.71 | 7.46 | <0.001 | 4.97 | 2.94 | 8.42 | <0.001 |
| GSE53622 | |||||||||
| Age | >60 | 2.07 | 1.02 | 4.21 | 0.05 | 1.79 | 0.87 | 3.71 | 0.12 |
| Sex | Male | 0.71 | 0.31 | 1.64 | 0.42 | 0.54 | 0.22 | 1.34 | 0.18 |
| TNM stage | III | 2.12 | 1.15 | 3.91 | 0.02 | 2.37 | 1.24 | 4.53 | 0.01 |
| Signature | High risk | 2.26 | 1.11 | 4.61 | 0.02 | 2.26 | 1.11 | 4.60 | 0.03 |
| GSE53625 | |||||||||
| Age | >60 | 1.59 | 1.08 | 2.34 | 0.02 | 1.49 | 1.01 | 2.22 | 0.05 |
| Sex | Male | 0.78 | 0.49 | 1.25 | 0.31 | 0.80 | 0.49 | 1.30 | 0.37 |
| TNM stage | III | 1.99 | 1.40 | 2.85 | <0.001 | 1.95 | 1.35 | 2.80 | <0.001 |
| Signature | High risk | 2.12 | 1.43 | 3.14 | <0.001 | 2.11 | 1.42 | 3.13 | <0.001 |
FIGURE 4Comparison of TNM stage and the six-lncRNA signature and stratification analysis. (A) Time-dependent ROC curve analysis of the six-lncRNA signature and other clinical characters in the GSE53625 group. (B) Comparison of survival prediction performance of TNM stage and the six-lncRNA signature. The signature could further classify ESCC patients from TNM high (C)/low (D) stage into two groups according to markedly different survival.
FIGURE 5Oncogenic effect of LINC01273 on ESCC cells. (A) RT-qPCR analysis of LINC01273 expression in ESCC cell lines. (B) siRNA-mediated silencing of LINC01273 was evaluated by using RT-qPCR. (C,D) Results of the MTS assay (C) and colony formation assay (D) demonstrated that cell proliferation was inhibited after depletion of LINC01273 in KYSE410 and TE5 cells. (E,F) Transwell assays suggested that migration (E) and invasion (F) abilities were reduced after LINC01273 knockdown. All data are expressed as mean ± SD (*p < 0.05, **p < 0.01, ***p < 0.001).