| Literature DB >> 28260099 |
Yanqin Gu1, Linfeng Lu1, Lingfeng Wu1, Hao Chen1, Wei Zhu1, Yi He1.
Abstract
The present study aimed to analyze RNA-seq data of kidney renal clear cell carcinoma (KIRC) to identify prognostic genes. RNA‑seq data were downloaded from The Cancer Genome Atlas. Feature genes with a coefficient of variation (CV) >0.5 were selected using the genefilter package in R. Gene co‑expression networks were constructed with the WGCNA package. Cox regression analysis was performed using the survive package. Furthermore, a functional enrichment analysis was conducted using Database for Annotation, Visualization and Integrated Discovery tools. A total of 533 KIRC samples were collected, from which 6,758 feature genes with a CV >0.5 were obtained for further analysis. The KIRC samples were divided into two sets: The training set (n=319 samples) and the validation set (n=214 samples). Subsequently, gene co‑expression networks were constructed for the two sets. A total of 12 modules were identified, and the green module was significantly associated with survival time. Genes from the green module were revealed to be implicated in the cell cycle and p53 signaling pathway. In addition, a total of 11 hub genes were revealed, and 10 of them (CCNA2, CDC20, CDCA8, GTSE1, KIF23, KIF2C, KIF4A, MELK, TOP2A and TPX2) were validated as possessing prognostic value, as determined by conducting a survival analysis on another gene expression dataset. In conclusion, a total of 10 prognostic genes were identified in KIRC. These findings may help to advance the understanding of this disease, and may also provide potential biomarkers for therapeutic development.Entities:
Mesh:
Year: 2017 PMID: 28260099 PMCID: PMC5364979 DOI: 10.3892/mmr.2017.6194
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Figure 1.(A) Scale-free fit R2 vs. various soft thresholds. The red line indicates an R2 of 0.85. (B) Mean Connectivity vs. different soft threshold β.
Figure 2.Distribution of genes in terms of degree (soft threshold, 5). (A) Training set; (B) Validation set; X-axis indicates degree k; Y-axis indicates percentage of genes with degree k. (C) Correlation between the training dataset and validation dataset co-expression networks. The x-axis indicates degree k in the training dataset; y-axis indicates degree k in the validation dataset.
Figure 3.Results of a cluster analysis, and 12 modules identified from the gene expression networks. (A) Training set; (B) validation set. Gray represents no module.
Figure 4.Enrichment of survival-associated genes in each module. (A) Training set; (B) validation set. X-axis indicates modules; Y-axis indicates significance of enrichment.
Top 10 GO biological process terms of genes from the green module.
| No. | Biological process | Count | P-value |
|---|---|---|---|
| GO:0007049 | Cell cycle | 98 | 1.85E-74 |
| GO:0022403 | Cell cycle phase | 79 | 3.66E-72 |
| GO:0000279 | M phase | 73 | 4.21E-71 |
| GO:0022402 | Cell cycle process | 83 | 1.43E-66 |
| GO:0000278 | Mitotic cell cycle | 68 | 7.52E-60 |
| GO:0000280 | Nuclear division | 56 | 8.53E-57 |
| GO:0007067 | Mitotic nuclear division | 56 | 8.53E-57 |
| GO:0000087 | M phase of mitotic cell cycle | 56 | 2.56E-56 |
| GO:0048285 | Organelle fission | 56 | 9.80E-56 |
| GO:0051301 | Cell division | 54 | 1.95E-46 |
GO, gene ontology.
Significantly over-represented Kyoto Encyclopedia of Genes and Genomes pathways of genes from the green module.
| No. | Pathway | Count | P-value |
|---|---|---|---|
| hsa04110 | Cell cycle | 25 | 2.63E-23 |
| hsa04114 | Oocyte meiosis | 13 | 7.35E-09 |
| hsa04914 | Progesterone-mediated oocyte maturation | 11 | 8.09E-08 |
| hsa04115 | p53 signaling pathway | 7 | 1.86E-04 |
| hsa03440 | Homologous recombination | 5 | 3.73E-04 |
Summary of the 11 hub genes.
| P-value | ||||||
|---|---|---|---|---|---|---|
| Gene | T set | V set | T set | V set | T set | V set |
| CCNA2 | 2.29E-06 | 8.15E-11 | 85.594 | 57.123 | 68.745 | 48.839 |
| CCNB2 | 9.08E-07 | 1.89E-08 | 94.399 | 68.515 | 72.728 | 55.378 |
| CDC20 | 6.17E-08 | 1.27E-08 | 93.507 | 60.032 | 74.198 | 50.181 |
| CDCA8 | 2.76E-05 | 5.21E-08 | 89.649 | 64.707 | 73.107 | 52.065 |
| GTSE1 | 1.88E-06 | 1.30E-08 | 93.828 | 63.922 | 73.780 | 53.611 |
| KIF23 | 3.21E-08 | 1.07E-08 | 91.183 | 60.441 | 69.097 | 48.626 |
| KIF2C | 3.00E-07 | 8.09E-08 | 88.153 | 64.374 | 70.517 | 54.608 |
| KIF4A | 1.14E-04 | 4.07E-08 | 92.184 | 63.336 | 69.749 | 51.397 |
| MELK | 9.74E-07 | 2.37E-07 | 85.264 | 60.536 | 69.125 | 52.317 |
| TOP2A | 3.88E-08 | 1.72E-08 | 88.265 | 61.531 | 72.680 | 53.977 |
| TPX2 | 7.24E-07 | 1.40E-08 | 88.309 | 68.001 | 71.906 | 57.164 |
T set, training set; V set, validation set; CCNA2, cyclin A2; CCNB2, cyclin B2; CDC20, cell division cycle 20; CDCA8, cell division cycle associated 8; GTSE1, G2 and S-phase expressed 1; KIF23, kinesin family member 23; KIF2C, kinesin family member 2C; KIF4A, kinesin family member 4A; MELK, maternal embryonic leucine zipper kinase; TOP2A, topoisomerase II alpha; TPX2, TPX2 microtubule-associated.
Figure 5.Kaplan-Meier survival curves of CCNA2. Based on gene expression data from (A) TCGA and the (B) E-GEOD-22541 dataset. CCNA2, cyclin A2; TCGA, the Cancer Genome Atlas.