Literature DB >> 26539246

A Five-Gene Signature Predicts Prognosis in Patients with Kidney Renal Clear Cell Carcinoma.

Yueping Zhan1, Wenna Guo1, Ying Zhang2, Qiang Wang3, Xin-jian Xu4, Liucun Zhu1.   

Abstract

Kidney renal clear cell carcinoma (KIRC) is one of the most common cancers with high mortality all over the world. Many studies have proposed that genes could be used to predict prognosis in KIRC. In this study, RNA expression data from next-generation sequencing and clinical information of 523 patients downloaded from The Cancer Genome Atlas (TCGA) dataset were analyzed in order to identify the relationship between gene expression level and the prognosis of KIRC patients. A set of five genes that significantly associated with overall survival time was identified and a model containing these five genes was constructed by Cox regression analysis. By Kaplan-Meier and Receiver Operating Characteristic (ROC) analysis, we confirmed that the model had good sensitivity and specificity. In summary, expression of the five-gene model is associated with the prognosis outcomes of KIRC patients, and it may have an important clinical significance.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26539246      PMCID: PMC4619904          DOI: 10.1155/2015/842784

Source DB:  PubMed          Journal:  Comput Math Methods Med        ISSN: 1748-670X            Impact factor:   2.238


1. Introduction

In recent years, the incidence and mortality of kidney cancer have been rising throughout the world [1]. In 2013, nearly 58,000 new cases occurred, and 130,001 patients died of kidney cancer in the United States [2]. Among them, kidney renal clear cell carcinoma (KIRC) is the most common histological subtype and accounts for 70%–80% of renal cancer cases [3]. KIRC tissue is resistant to traditional chemotherapeutic drugs [4], and patient outcomes varied a lot [5]. Although various researches have been done on KIRC, the clinical prognosis of KIRC patients still remains very poor; the survival time of 90% of patients with metastatic KIRC is less than 5 years [6]. Therefore, there is an urgent need to find potential molecular-based prognostic biomarkers in KIRC, and it is also one of the most important steps for prognostic prediction of patients. Messenger RNA is one of the most common molecular markers. Many studies have suggested that genes were involved in the biological processes of many cancers and related to prognostic survival time of patients. For instance, SIPL1 (Shank-Interacting Protein-Like 1) has reported to have overexpression during breast cancer tumorigenesis, and inhibiting the expression of SIPL1 may contribute to inhibition of breast cancer [7]. PLA2G16 has been proved as an important prognostic factor in primary osteosarcoma patients [8]. Dicerl has been found to be expressed at low level in nasopharyngeal carcinoma tissues no matter whether at the gene or at the protein levels, and it could also be a novel prognostic biomarker [9]. As for KIRC, several studies have been performed to detect gene expression signatures which may provide diagnostic and prognostic information [10-12]. Ge et al. have identified miRNA signature including 22 miRNAs as an independent novel predictor of patient outcomes [13]. Yu et al. have found that the expression of CIDE (cell death-inducing DFF45-like effector) is a novel predictor of prognosis [14]. However, detailed analyses of the associations between gene expression level and survival time of patients in KIRC remain limited. The goal of this paper is identifying genes that are related to overall survival time of KIRC patients by analyzing high-throughput RNA sequencing data downloaded from TCGA [15]. In brief, the main goals are as follows: (1) identify genes that could predict the survival time of KIRC patient, and construct a model; (2) evaluate the prognostic value, sensitivity, and specificity of the model; and (3) investigate the independence and universality of the gene marker in different KIRC stages.

2. Materials and Methods

2.1. KIRC Gene Expression Data from TCGA

Up to January 2015, TCGA database (https://tcga-data.nci.nih.gov/tcga/) contained 533 KIRC patient samples [15]. The gene expression profiling was performed by using the Illumina HiSeq platforms (Illumina Inc., San Diego, CA, USA). After excluding patients without survival status information, UNC RNASeqV2 level 3 expression data for 523 patients including 20,531 human genes and corresponding clinical data were downloaded. Then the 523 KIRC samples were randomly divided into training set (n = 262) and testing set (n = 261). Specimen IDs in the two sets were shown in Supplemental Table S1 (in Supplementary Material available online at http://dx.doi.org/10.1155/2015/842784). Training set was used to identify gene expression signature, and the testing set was used for validation.

2.2. Statistical Analysis

Firstly, log⁡2 transformed was used for normalizing the RNA-seq expression values [16]. Subsequently, as previous reports [17, 18], genes that were significantly (p < 0.001) related to patient survival were identified by Cox regression analysis and random survival forests-variable hunting (RSFVH) algorithm [19]. Considering that a model with a smaller number of genes is generally accompanied with a practically better value, we performed Cox proportional-hazard regression analysis with two genes, three genes, and five genes, respectively, expecting to dig out a better model for predicting survival. Then, based on Cox regression analysis, a risk score formula was built to calculate the risk score for each patient. As reported by Margolin et al. [20] and Meng et al. [18], the survival differences between the low-risk and high-risk groups were evaluated, and the sensitivity and specificity of the model in the survival prediction were also compared.

3. Results

3.1. Patient Characteristics

All 523 patients used in this study were clinically and pathologically diagnosed with KIRC. Clinical stages of the tumor were classified into stages I to IV based on the Fuhrman nuclear grading system [21]. Here, there are 260 patients from stage 1, 57 patients from stage 2, 125 patients from stage 3, and 81 patients from stage 4, respectively. Additionally, the average age and average prognostic survival time of these 523 patients were 61 years and 902 days, respectively. All the statistical information was summarized in Table 1.
Table 1

Summary of patient demographics and clinical characteristics.

CharacteristicPatients
Training set Testing set Total
Age
 Median616061
 Range26–9029–9026–90
Sex
 Male16417433864.63%
 Female988718535.37%
Vital status
 Living17318435768.26%
 Dead897716631.74%
Clinical stage
 Stage I13412626049.71%
 Stage II22355710.9%
 Stage III725312523.9%
 Stage IV34478115.49%

3.2. Detection of Genes Associated with Overall Survival Time of KIRC Patients in Training Set

To identify the gene which would be potentially associated with overall survival time of patients in KIRC, univariable Cox regression analysis (see Materials and Methods) for gene expression data was conducted in training set. With the significance level of 0.001, a total of 3,849 genes were identified (Table S2). Subsequently, 100 genes with the largest importance value in random survival forests analysis with default parameters [22, 23] were selected. Then, 1–5 genes were chosen from 100 genes as covariates by enumeration algorithm and 79,375,495 models were established in multivariate Cox regression analysis. After comparing with each other, the best model (indexed by AUROC) including 5 genes (CKAP4, ISPD, MAN2A2, OTOF, and SLC40A1) was determined, and the risk score formula for this model was (0.422 × expression value of CKAP4) + (−0.443 × expression value of ISPD) + (0.551 × expression value of MAN2A2) + (0.330 × expression value of OTOF) + (−0.369 × expression value of SLC40A1). The information of these five genes was shown in Table 2. And the functions of these genes were also summarized in Table 3. In addition, the error rate (27.27%) and variable importance values of these five genes were obtained with RSFVH (Figure 1). It can be seen from Figure 1 that the five genes have relatively large importance value; CKAP4 has more importance than other predictors. Taking the median risk score as the cut-off, the 262 KIRC patients were separated into low-risk group (n = 131) and high-risk group (n = 131). Survival analysis was performed by using the Kaplan-Meier method with a log-rank statistical test. As shown in Figure 2(a), Kaplan-Meier curves indicated that patients in high-risk group have significantly (p < 0.0001) worse prognosis comparing with the low-risk group (Figure 2(a)).
Table 2

Five genes significantly associated with the survival time of patients in the training set (n = 262).

Gene nameParametric p valueHazard ratioCoefficientVariable importanceRelative importance
CKAP41.80E − 091.5250.4220.03651
SLC40A19.30E − 080.691−0.3690.0360.9862
OTOF4.60E − 101.3910.330.280.7674
MAN2A20.000851.7340.5510.01920.5249
ISPD1.70E − 050.642−0.4430.01470.4012
Table 3

Five-gene functions' analysis.

Gene nameChromosomal positionStart siteEnd siteFunctionStudy
CKAP4chr12106237881106247935Sequence specific DNA binding transcriptional activator or repressorMcHugh et al. [29] Li et al. [30] Zhang et al. [31]

ISPDchr71591685116530558Mutations in ISPD cause Walker-Warburg syndromeWiller et al. [36] Roscioli et al. [37]

MAN2A2chr159090221890922585Catalyzes the committed step in the biosynthesis of complex N-glycansKroes et al. [38]

OTOFchr22645720326558698Triggers membrane fusion and exocytosisPadmanarayana et al. [33] Yildirim-Baylan et al. [22]

SLC40A1chr2189560590189580811Mediates cellular iron effluxMoreno-Carralero et al. [34]
Figure 1

Random survival forests-variable hunting analysis reveals the error rate for the data as a function of trees (a) and the importance values for predictors (b). Importance values show the impact of genes on the model.

Figure 2

Kaplan-Meier curves with two-sided log-rank test show relationship between the risk score resulting from five genes and patients survival. Using the median risk score as a cut-off, patients were divided into the high-risk score and low-risk score. (a) Kaplan-Meier curves for training set patients (n = 262); (b) Kaplan-Meier curves for testing set patients (n = 261). The two-sided log-rank tests were used to determine the survival differences between the high-risk score and low-risk score.

3.3. Verification of Survival-Associated Genes in Testing Set

To determine the prognostic potential of the five-gene signature, Kaplan-Meier survival analysis was performed in testing set. Just as it is in training set, based on the risk score of individual patient, patients in testing set were divided into low-risk and high-risk groups and Kaplan-Meier analysis was used to compare the patient survival differences. Statistically significant differences (p < 0.0001) between high-risk group and low-risk group were observed; in other words, higher risk score was related to shorter survival time (Figure 2(b)), which is in agreement with that in training set, revealing that five-gene signature may play an important role in predicting the survival of KIRC patients. To further confirm the clinical performance of the five-gene model as a biomarker for predicting prognosis, the Receiver Operating Characteristic (ROC) analysis was performed for estimating the effect of the gene signature on patient survival. And the corresponding AUROC were calculated by hiring three years as the cut-off point. The AUROC was 0.783 (Figure 3), showing that the five-gene model has high sensitivity and specificity and could be used as a biomarker to predict the prognostic survival of patients.
Figure 3

Receiver Operating Characteristic (ROC) analysis of the five-gene signature. The AUROC was 0.783 (p < 0.001), showing that the five-gene model has high sensitivity (true positive rate) and specificity (true negative rate) in predicting the survival time of KIRC patients.

3.4. The Independence and Universality of the Five-Gene Model

Studies have shown that age and clinical stage were also related to patient survival [5, 13, 21]. To examine whether the five-gene signature could distinguish the high-risk patients from low-risk patients when age of patients and stage were taken into account, multivariate Cox proportional hazard analyses were performed in both training and testing set. The results confirmed that risk score of five genes is independent of age and stage, as shown in Table 4. Besides, whether the five-gene signature was functional in different KIIRC stages was also investigated by using Kaplan-Meier and ROC analysis. Results showed that, in stage 3 and stage 4, the survival time of patients was dramatically different between high-risk group and low-risk group (p < 0.001, Figure S1). Moreover, the AUROC in stage 2, stage 3, and stage 4 were 0.761, 0.718, and 0.715, respectively (Figure S2), further revealing that the five-gene signature has predictive value in different clinical stages.
Table 4

Univariable and multivariable Cox regression analyses in training and testing set.

VariablesUnivariable modelMultivariable model
HR95% CI of HR p valueHR95% CI of HR p value
Training set (N = 262)
 Five-gene model 2.7172.180–3.387<0.0012.7522.193–3.454<0.001
 Age1.0311.014–1.0500.0011.0321.009–1.0480.003
Testing set (N = 261)
 Five-gene model 1.9361.620–2.315<0.0011.8751.560–2.253<0.001
 Age1.0221.004–1.0410.0191.0110.993–1.0310.234

Training set (N = 262)
 Five-gene model 2.7172.180–3.387<0.0012.1931.726–2.786<0.001
 Stage2.0971.734–2.537<0.0011.7171.394–2.111<0.001
Testing set (N = 261)
 Five-gene model 1.9361.620–2.315<0.0011.7001.390–2.078<0.001
 Stage1.9051.564–2.322<0.0011.6791.367–2.062<0.001

4. Discussion

KIRC is one of the most common primary renal malignancies with high morbidity and mortality [24]. However, the understanding of KIRC is not complete, and there are no clinical tools for predicting patient outcome apart from the traditional clinical parameters. Accurate data from the clinical examination of KIRC specimens could help doctors to decide appropriate treatment for patients [25]. Therefore, the identification and validation of novel biomarkers account for an important part of practical KIRC study [26]. In this study, we identified a five-gene signature that was significantly related to patient survival in KIRC based on genome-wide RNA profiling of 523 KIRC patients from TCGA database. In addition, we confirmed that the five-gene signature could be regarded as an independent predictor of prognostic survival after considering the various variables including age and stage, and it is also universal in different stages. Many previous studies on genes in KIRC have mainly considered some known cancer-associated genes. For instance, Wei et al. have found that high expression of pituitary tumor-transforming gene-1 (PTTG1) in KIRC patients was associated with poor prognosis by using qRT-PCR and immunohistochemistry [27]. Peters et al. have proved that low gene expression levels of GATA1 and GATA2 were related to tumor aggressiveness and short survival time in KIRC [28]. With respect to the five genes we identified in this study, all of them have also been reported to be associated with cancer. It turned out that CKAP4 could be used to distinguish primary salivary oncocytic lesions from metastatic RCC effectively in dubious cases with 100% accuracy [29] and related to lymphatic metastasis [30, 31]. Mutations in OTOF, which functionally triggers membrane fusion and exocytosis, may provide a link between calcium signaling and cancer [22, 32, 33]. SLC40A1 is a cell membrane protein that has been identified to mediate cellular iron efflux [23, 34] and contribute to the invasive phenotype [35]. Mutations in ISPD may cause Walker-Warburg syndrome [36, 37]. MAN2A2 was downregulated in hepatocellular carcinoma [38]. However, up to now, such predictive markers were not analyzed in KIRC patients and the molecular study concerning these genes has not been reported in KIRC. Nevertheless, our research showed that the expressions of these genes were related to survival time of patients. ROC curve showed that the AUROC is approximately 0.8, considering that the larger AUROC usually implies a better model for prediction [6, 39], which further demonstrated that the five-gene signature in our study is a novel prognostic marker with high accuracy and has important clinical significance. Furthermore, the five-gene signature was an independent predictor, which was pervasive in different stages. In different stages, ROC analysis shows high sensitivity and specificity (AUROC >0.7) except stage 1, which is possibly because stage 1 is slow-growing tumor, cancer cells are not invasive and metastatic, and the number of patients that died of KIRC was smaller than that in other stages [40]. We found here that the average age of patients who died in stage 1 was more than 67, which is higher than in other stages, revealing that the age at diagnosis may have some influence on KIRC prognosis, and part of deaths was attributed to increased risk of disease mortality with increasing age. Therefore, these results suggested that the five-gene signature is significantly important in clinic. The functional mechanisms of these genes remain unclear. Moreover, the five-gene signature has not yet been tested in a clinical trial. The experimental studies on these genes and further well-designed studies should be conducted to verify our findings, thereby providing a better understanding of their roles in predicting KIRC prognosis.

5. Conclusions

In summary, a five-gene signature strongly associated with patients' survival was identified by performing Cox regression analysis and Kaplan-Meier analysis in training set. Subsequently, Kaplan-Meier and ROC analysis in testing set further indicated that the five-gene signature could be used as a novel biomarker to predict the treatment outcome of KIRC patient. Additionally, multivariate Cox regression analysis revealed that the five-gene signature was an independent predictor. These results suggested that the five-gene signature could help to predict the survival with significant clinical implications. Table S1 The sample list and information of patients in training set and testing set. Table S2 Univariable Cox regression analysis with a significance level of 0.001 reveal significant relation between gene expression and survival time. Figure S1 Kaplan-Meier curves analysis of the different clinical stages in the testing set. The two-sided log-rank test were used to determine the survival differences. Figure S2 Receiver operating characteristic (ROC) analysis of sensitivity and specificity by five-gene model in predicting survival time of patient with different clinical stages in the testing set.
  40 in total

1.  The International Society of Urological Pathology (ISUP) Vancouver Classification of Renal Neoplasia.

Authors:  John R Srigley; Brett Delahunt; John N Eble; Lars Egevad; Jonathan I Epstein; David Grignon; Ondrej Hes; Holger Moch; Rodolfo Montironi; Satish K Tickoo; Ming Zhou; Pedram Argani
Journal:  Am J Surg Pathol       Date:  2013-10       Impact factor: 6.394

2.  Toward predicting metastatic progression of melanoma based on gene expression data.

Authors:  Yuanyuan Li; Juno M Krahn; Gordon P Flake; David M Umbach; Leping Li
Journal:  Pigment Cell Melanoma Res       Date:  2015-04-24       Impact factor: 4.693

3.  p63 immunohistochemistry differentiates salivary gland oncocytoma and oncocytic carcinoma from metastatic renal cell carcinoma.

Authors:  Jonathan B McHugh; Aaron P Hoschar; Mari Dvorakova; Anil V Parwani; E Leon Barnes; Raja R Seethala
Journal:  Head Neck Pathol       Date:  2007-10-26

4.  Consistency of Random Survival Forests.

Authors:  Hemant Ishwaran; Udaya B Kogalur
Journal:  Stat Probab Lett       Date:  2010-07-01       Impact factor: 0.870

Review 5.  The complex roles of microRNAs in the metastasis of renal cell carcinoma.

Authors:  Lei Zhang; Bin Xul; Shuqiu Chen; Kai Lu; Chunhui Liu; Yiduo Wang; Yuming Zhao; Xiaowen Zhang; Dachuang Liu; Ming Chen
Journal:  J Nanosci Nanotechnol       Date:  2013-05

6.  Identification of CKAP4/p63 as a major substrate of the palmitoyl acyltransferase DHHC2, a putative tumor suppressor, using a novel proteomics method.

Authors:  Jun Zhang; Sonia L Planey; Carolina Ceballos; Stanley M Stevens; Susan K Keay; David A Zacharias
Journal:  Mol Cell Proteomics       Date:  2008-02-22       Impact factor: 5.911

Review 7.  Mammalian iron transporters: families SLC11 and SLC40.

Authors:  Nicolas Montalbetti; Alexandre Simonin; Gergely Kovacs; Matthias A Hediger
Journal:  Mol Aspects Med       Date:  2013 Apr-Jun

8.  Expression of cytoskeleton-associated protein 4 is related to lymphatic metastasis and indicates prognosis of intrahepatic cholangiocarcinoma patients after surgery resection.

Authors:  Min-hong Li; Li-wei Dong; Shuang-xi Li; Gu-sheng Tang; Yu-fei Pan; Jian Zhang; Hui Wang; Hua-bang Zhou; Ye-xiong Tan; He-ping Hu; Hong-yang Wang
Journal:  Cancer Lett       Date:  2013-05-09       Impact factor: 8.679

9.  Systematic evaluation of the prognostic impact and intratumour heterogeneity of clear cell renal cell carcinoma biomarkers.

Authors:  Sakshi Gulati; Pierre Martinez; Tejal Joshi; Nicolai Juul Birkbak; Claudio R Santos; Andrew J Rowan; Lisa Pickering; Martin Gore; James Larkin; Zoltan Szallasi; Paul A Bates; Charles Swanton; Marco Gerlinger
Journal:  Eur Urol       Date:  2014-07-19       Impact factor: 20.096

10.  Prediction models and risk assessment for silicosis using a retrospective cohort study among workers exposed to silica in China.

Authors:  Lap Ah Tse; Juncheng Dai; Minghui Chen; Yuewei Liu; Hao Zhang; Tze Wai Wong; Chi Chiu Leung; Hans Kromhout; Evert Meijer; Su Liu; Feng Wang; Ignatius Tak-sun Yu; Hongbing Shen; Weihong Chen
Journal:  Sci Rep       Date:  2015-06-19       Impact factor: 4.379

View more
  13 in total

1.  The Expression Profile, Clinical Application and Potential Tumor Suppressing Mechanism of hsa_circ_0001675 in Head and Neck Carcinoma.

Authors:  Yujie Cao; Dong Ye; Zhisen Shen; Zan Li; Qun Li; Hao Rong
Journal:  Front Oncol       Date:  2022-05-06       Impact factor: 5.738

2.  Clinical performance of E2Fs 1-3 in kidney clear cell renal cancer, evidence from bioinformatics analysis.

Authors:  Bin Liang; Jianying Zhao; Xuan Wang
Journal:  Genes Cancer       Date:  2017-05

3.  A seven-gene signature model predicts overall survival in kidney renal clear cell carcinoma.

Authors:  Ling Chen; Zijin Xiang; Xueru Chen; Xiuting Zhu; Xiangdong Peng
Journal:  Hereditas       Date:  2020-09-03       Impact factor: 3.271

4.  Comprehensive assessment gene signatures for clear cell renal cell carcinoma prognosis.

Authors:  Peng Chang; Zhitong Bing; Jinhui Tian; Jingyun Zhang; Xiuxia Li; Long Ge; Juan Ling; Kehu Yang; Yumin Li
Journal:  Medicine (Baltimore)       Date:  2018-11       Impact factor: 1.817

5.  Fifteen-gene expression based model predicts the survival of clear cell renal cell carcinoma.

Authors:  Ping Li; He Ren; Yan Zhang; Zhaoli Zhou
Journal:  Medicine (Baltimore)       Date:  2018-08       Impact factor: 1.889

6.  Detection of dysregulated competing endogenous RNA modules associated with clear cell kidney carcinoma.

Authors:  Hong Wang; Dahua Xu; Huiying Huang; Ying Cui; Chunhua Li; Chunrui Zhang; Shengnan Guo; Lining Zhang; Xiaomu Xu; Jiankai Xu; Jianping Lu; Liqiang Wang; Kongning Li
Journal:  Mol Med Rep       Date:  2018-06-19       Impact factor: 2.952

7.  A Two-DNA Methylation Signature to Improve Prognosis Prediction of Clear Cell Renal Cell Carcinoma.

Authors:  Shanping Shi; Shazhou Ye; Xiaoyue Wu; Mingjun Xu; Renjie Zhuo; Qi Liao; Yang Xi
Journal:  Yonsei Med J       Date:  2019-11       Impact factor: 2.759

8.  Protein-coding genes, long non-coding RNAs combined with microRNAs as a novel clinical multi-dimension transcriptome signature to predict prognosis in ovarian cancer.

Authors:  Xu Meng; Guo Jin-Cheng; Zhang Jue; Ma Quan-Fu; Yan Bin; Wu Xu-Feng
Journal:  Oncotarget       Date:  2017-08-24

9.  Two protein-coding genes act as a novel clinical signature to predict prognosis in patients with ovarian serous cystadenocarcinoma.

Authors:  Jue Zhang; Meng Xu; Han Gao; Jin-Chen Guo; Yu-Lin Guo; Miao Zou; Xu-Feng Wu
Journal:  Oncol Lett       Date:  2018-01-12       Impact factor: 2.967

10.  A 16-gene signature predicting prognosis of patients with oral tongue squamous cell carcinoma.

Authors:  Zeting Qiu; Wei Sun; Shaowei Gao; Huaqiang Zhou; Wulin Tan; Minghui Cao; Wenqi Huang
Journal:  PeerJ       Date:  2017-11-17       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.