| Literature DB >> 29069830 |
Xu Meng1, Guo Jin-Cheng2, Zhang Jue1, Ma Quan-Fu1, Yan Bin1, Wu Xu-Feng1.
Abstract
Ovarian cancer is prevalent in women which is usually diagnosed at an advanced stage with a high mortality rate. The aim of this study is to investigate protein-coding gene, long non-coding RNA, and microRNA associated with the prognosis of patients with ovarian serous carcinoma by mining data from TCGA (The Cancer Genome Atlas) public database. The clinical data of ovarian serous carcinoma patients was downloaded from TCGA database in September, 2016. The mean age and survival time of 407 patients with ovarian serous carcinoma were 59.71 ± 11.54 years and 32.98 ± 26.66 months. Cox's proportional hazards regression analysis was conducted to analyze genes that were significantly associated with the survival of ovarian serous carcinoma patients in the training group. Using the random survival forest algorithm, Kaplan-Meier and ROC analysis, we kept prognostic genes to construct the multi-dimensional transcriptome signature with max area under ROC curve (AUC) (0.69 in the training group and 0.62 in the test group). The selected signature composed by VAT1L, CALR, LINC01456, RP11-484L8.1, MIR196A1 and MIR148A, separated the training group patients into high-risk or low-risk subgroup with significantly different survival time (median survival: 35.3 months vs. 64.9 months, P < 0.001). The signature was validated in the test group showing similar prognostic values (median survival: 41.6 months in high-risk vs. 57.4 months in low-risk group, P=0.018). Chi-square test and multivariable Cox regression analysis showed that the signature was an independent prognostic factor for patients with ovarian serous carcinoma. Finally, we validated the expression of the genes experimentally.Entities:
Keywords: biomarker; long non-coding RNAs; microRNAs; ovarian cancer; protein-coding genes
Year: 2017 PMID: 29069830 PMCID: PMC5641173 DOI: 10.18632/oncotarget.20457
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Summary of patient demographics and clinical characteristics
| Characteristic | Training set | Testing set | Total |
|---|---|---|---|
| Median | 59 | 58 | 59 |
| Range | 38∼85 | 30∼87 | 30∼87 |
| Stage I | 0 | 0 | 0 |
| Stage II | 8 | 13 | 21 |
| Stage III | 163 | 158 | 321 |
| Stage IV | 31 | 31 | 62 |
| Living | 91 | 88 | 179 |
| Dead | 112 | 116 | 228 |
Figure 2Identification of the PCGs-lncRNAs-microRNAs signature in the training dataset
(A) Univariate Cox proportional hazards regression analysis of the PCGs, lncRNAs and microRNAs expression profiling data in the training dataset. (B) The procedure for identifying the final signature. The accuracies of all 511 signatures were calculated and the nine highest accuracies for k=1, 2...... 9 were shown in the plot.
Identities of PCGs, lncRNAs and microRNAs in the prognostic expression signature and their univariable cox association with prognosis
| Ensembl ID | Gene symbol | Gene name | Coefficienta | Gene expression level association with poor prognosis | Chromosome location | |
|---|---|---|---|---|---|---|
| ENSG00000171724b | VAT1L | Vesicle amine transport 1 like | 0.47 | 0.01 | high | chr16:77788530-77980107:[+] |
| ENSG00000179218b | CALR | Calreticulin | −0.52 | 0.00 | low | chr19:12938578-12944489:[+] |
| ENSG00000225882b | LINC01456 | −0.66 | 0.05 | low | chrX:17970197-18104644:[−] | |
| ENSG00000267764b | RP11-484L8.1 | 2.56 | 0.01 | high | chr18:48826051-48834770:[−] | |
| hsa-miR-196a-1c | MIR196A1 | 0.14 | 0.03 | high | chr17: 48632490-48632559 [−] | |
| hsa-miR-148ac | MIR148A | −0.19 | 0.02 | low | chr7: 25949919-25949986 [−] |
a: Derived from the univariable Cox regression analysis in the training set.
b: Ensembl database
c: miRBase database
Figure 3The PCGs-lncRNAs-microRNAs signature predicts overall survival of patients with OC and comparison the survival prediction power of the PCGs-lncRNAs-microRNAs signature and TNM stage
(A) Kaplan–Meier survival curves classified patients into high- and low-risk groups using the PCGs-lncRNAs-microRNAs signature in the training and test datasets. P Values were calculated by log-rank test. (B) ROC analysis was used to compare survival prediction power between the PCGs-lncRNAs-microRNAs signature and TNM stage.
Association of the PCG-lncRNA-microRNA signature with clinicopathological characteristics in OV patients (n=203)
| Variables | PCG-lncRNA signature | ||
|---|---|---|---|
| Low risk * | High risk * | ||
| 0.94 | |||
| ≤59 | 51 | 52 | |
| >59 | 51 | 49 | |
| unkown | 1 | 0 | 0.5 |
| II | 5 | 3 | |
| III | 83 | 80 | |
| IV | 13 | 18 | |
* Low risk ≤Median of risk score, High risk >Median of risk score; The Chi-squared test; P value <0.05 was considered significant.
Univariable and multivariable Cox regression analysis of the PCG-lncRNA-microRNA signature and survival of OV patients in the training, test and entire group
| The training set (n=203) | The Test set (n=204) | The entire dataset (n=407) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Variables | HR | 95% CI of HR | HR | 95% CI of HR | HR | 95% CI of HR | |||||||
| lower | upper | lower | upper | lower | upper | ||||||||
| Age | >59 vs.≤59 | 1.19 | 0.82 | 1.73 | 0.36 | 0.92 | 0.55 | 1.53 | 0.74 | 1.24 | 0.96 | 1.61 | 0.10 |
| pTNM stage | IV vs.II+III | 1.65 | 1.06 | 2.57 | 0.03 | 1.30 | 0.90 | 1.88 | 0.16 | 1.20 | 0.84 | 1.70 | 0.31 |
| PCG-lncRNA-microRNA signature | High risk vs. low risk | 2.80 | 1.87 | 4.18 | 0.00 | 1.59 | 1.07 | 2.33 | 0.02 | 2.05 | 1.55 | 2.71 | 0.00 |
| >59 vs.≤59 | 1.24 | 0.85 | 1.81 | 0.43 | 1.48 | 1.01 | 2.16 | 0.04 | 1.36 | 1.05 | 1.77 | 0.02 | |
| pTNM stage | IV vs.II+III | 1.49 | 0.95 | 2.33 | 0.21 | 0.97 | 0.67 | 1.38 | 0.86 | 1.16 | 0.87 | 1.55 | 0.30 |
| PCG-lncRNA-microRNA signature | High risk vs. low risk | 2.68 | 1.79 | 4.01 | 0.00 | 1.75 | 1.19 | 2.57 | 0.00 | 2.16 | 1.64 | 2.84 | 0.00 |
Figure 4Functional enrichment of the co-expressed protein-coding genes with prognostic the two PCGs, two lncRNAs and two microRNAs by GSEA in training and test group
Figure 5Validation of the PCGs, lncRNAs and microRNAs expression by experiment (A–B).
Figure 1Schedule of the study
The order of analyses to develop the risk score model and validate the efficiency of the signature to predict prognostic outcomes.