| Literature DB >> 32831646 |
Lan Zhang1, Pan Li1, Enjie Liu1, Chenju Xing1, Di Zhu1, Jianying Zhang2, Weiwei Wang1, Guozhong Jiang1.
Abstract
BACKGROUND: The aim of this study was to identify prognostic long non-coding RNAs (lncRNAs) and develop a multi-lncRNA signature for suvival prediction in esophageal squamous cell carcinoma (ESCC).Entities:
Keywords: Esophageal squamous cell carcinoma; Overall survival; Prognosis; Signature; lncRNA
Year: 2020 PMID: 32831646 PMCID: PMC7419219 DOI: 10.1186/s12935-020-01480-9
Source DB: PubMed Journal: Cancer Cell Int ISSN: 1475-2867 Impact factor: 5.722
Summary of ESCC patients and clinical characteristics in this study
| Characteristic | GSE53624 | RNA seq | qPCR validation |
|---|---|---|---|
| Group | Training | Test | Validation |
| Age (years) | |||
| > 61 | 39 | 49 | 52 |
| ≤ 61 | 80 | 49 | 32 |
| Sex | |||
| Female | 21 | 32 | 36 |
| Male | 98 | 66 | 48 |
| Vital status | |||
| Living | 46 | 46 | 59 |
| Dead | 13 | 52 | 25 |
| TNM stage | |||
| Stage I | 6 | 3 | 4 |
| Stage II | 47 | 51 | 51 |
| Stage III | 66 | 40 | 26 |
| Stage IV | 0 | 4 | 3 |
Fig. 1Constructing the prognostic lncRNA model in the high throughput sequencing datasets. a Venn diagram for analyzing the prognostic lncRNAs. b Screening out the lncRNA signature with largest AUC from all 1023 signatures which were calculated by ROC for k = 1, 2…… 10. c The AUC of the screened lncRNA signature
The selected lncRNAs in the prognostic signature of ESCC
| Gene symbol | Coefficienta | Expression level association with prognosis | ||
|---|---|---|---|---|
| RP13-30A9.2 | − 0.635 | 0.008 | 0.007 | Low |
| RP11-488I20.9 | − 0.468 | 0.048 | 0.046 | Low |
| MORF4L2-AS1 | 0.582 | 0.014 | 0.013 | High |
| AC007179.1 | 0.470 | 0.047 | 0.045 | High |
| RP4-735C1.6 | 0.616 | 0.010 | 0.008 | High |
aDerived from the univariable Cox regression analysis in the GSE53624 set; KM Kaplan–Meier analysis
Fig. 2The lncRNA signature classification power for ESCC prognosis. Kaplan–Meier curves found ESCC patients were classified into two different risk groups based on the risk score of the signature in the GSE53624 (a), RNA-seq (b) and qPCR validation datasets (c)
Fig. 3Risk score distribution, survival status and gene expression of ESCC patients in high- and low-risk groups classified by the four-lncRNA signature in the GSE53624 (a), RNA-seq (b) datasets
Association of the signature with clinicopathological characteristics in ESCC patients
| Variables | GSE53624 | RNA-seq | qPCR | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Low* | High* | Low* | Hig * | Low* | High* | ||||
| Age | 0.217 | 0.106 | 0.261 | ||||||
| ≤ 62 | 44 | 36 | 29 | 20 | 13 | 19 | |||
| > 62 | 16 | 23 | 20 | 29 | 29 | 23 | |||
| Sex | > 0.99 | 0.132 | 0.825 | ||||||
| Female | 11 | 10 | 20 | 12 | 17 | 19 | |||
| Male | 49 | 49 | 29 | 37 | 25 | 23 | |||
| TNM stage | 0.654 | 0.588 | 0.681 | ||||||
| Stage I | 2 | 4 | 1 | 2 | 3 | 1 | |||
| Stage II | 25 | 22 | 28 | 23 | 25 | 26 | |||
| Stage III | 33 | 33 | 19 | 21 | 12 | 14 | |||
| Stage IV | 1 | 3 | 2 | 1 | |||||
*Low risk ≤ Median of risk score, High risk > Median of risk score; The Chi-squared test P value < 0.05 was considered significant
Cox regression analysis of the signature for the ESCC patients from the three datasets
| Univariable analysis | Multivariable analysis | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Variables | HR | 95% CI of HR | HR | 95% CI of HR | |||||
| Lower | Upper | Lower | Upper | ||||||
| GSE53624 dataset(n = 119) | |||||||||
| Age | > 61 vs. ≤ 61 | 1.643 | 1.024 | 2.636 | 0.040 | 1.441 | 0.894 | 2.322 | 0.134 |
| Sex | Male vs. Female | 0.827 | 0.468 | 1.461 | 0.513 | 0.782 | 0.440 | 1.391 | 0.403 |
| TNM stage | III, IV vs. I, II | 1.901 | 1.226 | 2.948 | 0.004 | 2.017 | 1.288 | 3.159 | 0.002 |
| lncRNA signature | High risk vs. low risk | 3.264 | 1.998 | 5.331 | < 0.001 | 3.502 | 2.131 | 5.756 | < 0.001 |
| RNA seq set (n = 98) | |||||||||
| Age | > 61 vs. ≤ 61 | 1.287 | 0.744 | 2.225 | 0.367 | 1.314 | 0.711 | 2.430 | 0.383 |
| Sex | Male vs. Female | 1.534 | 0.818 | 2.875 | 0.182 | 1.116 | 0.561 | 2.221 | 0.754 |
| TNM stage | III, IV vs. I, II | 1.795 | 1.152 | 2.796 | 0.010 | 1.629 | 0.887 | 2.991 | 0.115 |
| lncRNA signature | High risk vs. low risk | 1.994 | 1.139 | 3.491 | 0.016 | 1.032 | 1.002 | 1.062 | 0.035 |
| Independent qPCR validation set (n = 84) | |||||||||
| Age | > 61 vs. ≤ 61 | 0.949 | 0.892 | 1.011 | 0.105 | 0.961 | 0.893 | 1.034 | 0.289 |
| Sex | Male vs. Female | 0.797 | 0.352 | 1.804 | 0.585 | 0.872 | 0.381 | 1.997 | 0.747 |
| TNM stage | III, IV vs. I, II | 2.699 | 1.607 | 4.534 | 0.000 | 4.910 | 2.133 | 11.302 | < 0.001 |
| lncRNA signature | High risk vs. low risk | 2.981 | 1.235 | 7.192 | 0.015 | 2.945 | 1.210 | 7.168 | 0.017 |
Fig. 4Stratification analysis of TNM stage using the five-lncRNA signature. a, b Stratification analysis of TNM low/high stage using the signature by Kaplan–Meier curves. c, d Comparing the survival prediction power of the signature with that of TNM stage by ROC in the entire datasets. e TimeROC analysis to explore the survival prediction power of combination of the signature and TNM stage in the entire dataset
Fig. 5Functional prediction of the five-lncRNAs signature. Visualization of the co-expressing genes with the risk score signature in GS53624 and RNA-Seq datasets (a). Functional enrichment analysis of those co-expressing genes by SubpathwayMiner (b)