Literature DB >> 33976979

Prediction and analysis of novel key genes ITGAX, LAPTM5, SERPINE1 in clear cell renal cell carcinoma through bioinformatics analysis.

Yingli Sui1, Kun Lu1, Lin Fu1.   

Abstract

BACKGROUND: Clear Cell Renal Cell Carcinoma (CCRCC) is the most aggressive subtype of Renal Cell Carcinoma (RCC) with high metastasis and recurrence rates. This study aims to find new potential key genes of CCRCC.
METHODS: Four gene expression profiles (GSE12606, GSE53000, GSE68417, and GSE66272) were downloaded from the Gene Expression Omnibus (GEO) database. The TCGA KIRC data was downloaded from The Cancer Genome Atlas (TCGA). Using GEO2R, the differentially expressed genes (DEG) in CCRCC tissues and normal samples were analyzed. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses were performed in DAVID database. A protein-protein interaction (PPI) network was constructed and the hub gene was predicted by STRING and Cytoscape. GEPIA and Kaplan-Meier plotter databases were used for further screening of Key genes. Expression verification and survival analysis of key genes were performed using TCGA database, GEPIA database, and Kaplan-Meier plotter. Receiver operating characteristic (ROC) curve was used to analyze the diagnostic value of key genes in CCRCC, which is plotted by R software based on TCGA database. UALCAN database was used to analyze the relationship between key genes and clinical pathology in CCRCC and the methylation level of the promoter of key genes in CCRCC.
RESULTS: A total of 289 up-regulated and 449 down-regulated genes were identified based on GSE12606, GSE53000, GSE68417, and GSE66272 profiles in CCRCC. The upregulated DEGs were mainly enriched with protein binding and PI3K-Akt signaling pathway, whereas down-regulated genes were enriched with the integral component of the membrane and metabolic pathways. Next, the top 35 genes were screened out from the PPI network according to Degree, and three new key genes ITGAX, LAPTM5 and SERPINE1 were further screened out through survival and prognosis analysis. Further results showed that the ITGAX, LAPTM5, and SERPINE1 levels in CCRCC tumor tissues were significantly higher than those in normal tissues and were associated with poor prognosis. ROC curve shows that ITGAX, LAPTM5, and SERPINE1 have good diagnostic value with good specificity and sensitivity. The promoter methylation levels of ITGAX, LAPTM5 and SERPINE1 in CCRCC tumor tissues were significantly lower than those in normal tissues. We also found that key genes were associated with clinical pathology in CCRCC.
CONCLUSION: ITGAX, LAPTM5, and SERPINE1 were identified as novel key candidate genes that could be used as prognostic biomarkers and potential therapeutic targets for CCRCC. ©2021 Sui et al.

Entities:  

Keywords:  Bioinformatical analysis; CCRCC; Differentially expressed genes; Protein-protein interaction

Year:  2021        PMID: 33976979      PMCID: PMC8063882          DOI: 10.7717/peerj.11272

Source DB:  PubMed          Journal:  PeerJ        ISSN: 2167-8359            Impact factor:   2.984


Introduction

Kidney cancer is a complex disease composed of a variety of cancers, showing different histology, clinical course, genetic changes, and response to treatment (Linehan et al., 2019). Renal Cell Carcinoma is the most common tumor in the kidney, Whose morbidity and mortality are rising worldwide. Renal cell carcinoma is divided into different subtypes, including clear cell renal cell carcinoma (CCRCC), chromogenic cell renal carcinoma (chRCC), and papillary renal cell carcinoma (pRCC) (Linehan & Ricketts, 2019). CCRCC is a metabolic disease (Wettersten et al., 2017), accounting for more than 80% of all renal cell carcinomas (Makhov et al., 2018). It is the most aggressive subtype of renal cell carcinoma with a high rate of metastasis and recurrence (Jiang et al., 2020; Yuan et al., 2018). Although some progress has been made in the treatment of CCRCC, the current treatments of CCRCC still focus on surgical treatment and traditional chemotherapy (Loo et al., 2019; Bex et al., 2019). At the same time, there is a lack of effective early diagnosis methods in the clinic, and some patients still have a relapse and targeted drug tolerance, leading to poor prognosis of radiotherapy and chemotherapy. Therefore, finding new targeted biomarkers relevant to the diagnosis and treatment of CCRCC remains of paramount importance. In this study, bioinformatics methods were used to obtain CCRCC gene expression data from GEO database, and normal samples and CCRCC samples were selected for grouping processing. Next, 738 DEGs were screened, including 289 up-regulated genes and 449 down-regulated genes. And then, GO enrichment analysis and KEGG signal pathway analysis were performed by DAVID. The up-regulated DEGs are mainly concentrated on protein binding, plasma membrane, inflammation, signal transduction, and PI3K-Akt signaling pathways, while the down-regulated genes are mainly concentrated on the extracellular exosome, oxidation–reduction process, integral component of membrane, protein homodimerization activity, and metabolic pathways. Finally, the top 35 hub genes were screened by PPI network. Based on the novelty of ITGAX, LAPTM5 and SERPINE1 that have not been reported in CCRCC, the expression of these three genes is significantly associated with survival prognosis and all have high degrees. Therefore, ITGAX, LAPTM5, and SERPINE1 were finally selected as key genes. Further analysis showed that ITGAX, LAPTM5, and SERPINE1 are highly expressed in CCRCC, which are significantly related to the survival prognosis of CCRCC. The methylation level of ITGAX, LAPTM5, SERPINE1 in CCRCC is reduced. Moreover ITGAX, LAPTM5 and SERPINE1 are related to the Clinical pathology of CCRCC and have good diagnostic value for CCRCC. In conclusion, we provided a systematic and comprehensive analysis of CCRCC and is the first to suggest that ITGAX, LAPTM5, and SERPINE1 might be used as biomarkers for the new clinical diagnosis and treatment of CCRCC.

Materials & Methods

Data collection

Four gene expression profiles ; (GSE12606, GSE53000, GSE68417, and GSE66272) were downloaded from the GEO (Barrett et al., 2013) database (http://www.ncbi.nlm. nih.gov/geo/). The TCGA KIRC data was downloaded from TCGA (https://www.cancer.gov/about-nci /organization /ccg/research/structural-genomics/tcga). The GSE12606 (Stickel et al., 2009) data set was obtained by GPL570 platform, including three adjacent normal kidney specimens, and three CCRCC samples. The GSE53000 (Gerlinger et al., 2014) and GSE68417 (Thibodeau et al., 2016) data sets were based on GPL6244 Platforms. GSE53000 data set was comprised of 60 samples including 6 adjacent normal kidney specimens and 54 CCRCC samples and GSE68417 data set was comprised of 43 samples including 14 adjacent normal kidney specimens and 29 CCRCC samples. GSE66272 (Wotschofsky et al., 2016; Liep et al., 2016) data set was obtained by GPL5029 platform and contained 27 CCRCC tumor samples and 26 normal kidney samples. The TCGA KIRC data contained 539 CCRCC tumor samples and 72 normal kidney samples.

Study design and data processing

In order to clarify our study, we designed a flow chart to demonstrate data collection, processing, analysis and verification (Fig. 1). The online tool GEO2R (https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga) is an online analysis of GEO series based on the R programming language, which is used to screen DEGs between normal kidney and CCRCC samples from the GSE data sets. The data was standardized and filtered to select significant DEGs. P-value < 0.05, log Fold Change (—log FC—) ≥1 were chosen as the cutoff criteria. log Fold Change (— log FC —) ≥ 1 means that the multiple of change is greater than or equal to 2. It is generally considered that there is a difference between 2 times and more. Then, DEGs were further screened according to cutoff criteria: P-value < 0.05, log FC ≥1 as up-regulated genes, P-value < 0.05, log FC ≤1 as down-regulated genes. Finally, importing all the up-regulated genes or down-regulated genes in the 4 datasets into Funrich 3.1.1 software, and taking the intersection of the up-regulated genes or down-regulated genes in the 4 datasets respectively. TCGA RNA-seq simple converter was used to standardize and log2 conversion of TCGA KIRC data. TCGA KIRC data were used for expression verification and ROC curve analysis of Key genes.
Figure 1

Flow chart of data collection, processing, analysis and verification in this study.

Enrichment analysis of DEGs

DAVID 6.8 (Database for Annotation, Visualization and Integrated Discovery, https://david.ncifcrf.gov/)  (Huang da, Sherman & Lempicki, 2009b; Huang da, Sherman & Lempicki, 2009a) database was used to analyze Gene Ontology (GO) (The Gene Ontology Consortium, 2017) such as the biological process (BP) (Berchtold, Csaba & Zimmer, 2017), cellular component (CC) (Borg & Baudino, 2011) andmolecular function (MF), and Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa & Goto, 2000) pathway enrichment analysis of important DEGs, which promoted the visualization of gene and protein function (Dennis Jr et al., 2003). Among them, the count value represents the number of genes enriched in the pathway. The cutoff value was P < 0.05.

Protein-Protein Interaction (PPI) Network construction and hub gene screening

The online tool STRING (Von Mering et al., 2003) (http://string-db.org) was utilized for the analysis of protein-protein interactions . Importing all DEGs (including up-regulated genes and down-regulated genes) into the STRING database for analysis, and the confidence level ≥0.4 is considered to be significant for PPI. STRING and Cytoscape software (version 3.6.1) were used to construct the PPI network, which is regarded as a network software platform for visualizing protein molecular interactions. Using the degree algorithm in the cytohubba plug-in of Cytoscape, the degree value could be calculated and thus the hub genes could also be screened from PPI network. The importance of genes is directly related to the degree of protein. According to the degree value (degree ≥37), the first 35 genes were selected as hub genes. Three key genes ITGAX, LAPTM5 and SERPINE1with high degree were screened through survival and prognosis analysis as well as the novelty of genes.

Key genes analysis and verification

Three key genes were analyzed and verified comprehensively by using TCGA, GEPIA, Oncomine, Kaplan–Meier plotter and UALCAN databases. Similarly, TCGA database also applied to ROC curve analysis. ROC curve was plotted by R software, which could be used to analyze the diagnostic value of key genes in CCRCC. GEPIA (http://gepia.cancer-pku.cn/) analyzed the expression of key genes in CCRCC. Oncomine (Rhodes et al., 2007) (http://www.on comine.org) was a database consisting of microarray data of various tumors, which verified the expression of key genes in CCRCC. Kaplan-Meier Plotter (https://kmplot.com/analysis/) was used to analyze the survival of key genes in CCRCC. UALCAN (Chandrashekar et al., 2017) (http://ualcan.path. uab.edu/index.html) online database was used to analyze the relationship between key genes and Clinical pathology in CCRCC, and the promoter methylation level of key genes in CCRCC was analyzed.

Statistical analysis

Statistically significant differences between the normal tissues group and tumor tissues group were determined using Student’s t tests. Also, Kaplan–Meier analysis was used to assess OS. ROC curve is a graphical plot (Cao & López-de Ullibarri, 2019) that reflects the sensitivity and specificity of continuous variables. P < 0.05 was considered to be statistically significant for all the tests. All the statistical analyses applied GraphPad prism 6.0 or R software.

Results

Volcano plots of the differentially expressed genes in four datasets

The data sets of CCRCC, which were GSE12606, GSE53000, GSE68417 and GSE66272, were downloaded from the GEO database (Table 1) and analyzed by GEO2R separately. A total of 9,560 DEGs were screened from the GSE12606 data sets, among which 1568 genes were up-regulated and 5436 genes were down-regulated instead. A total of 1,286 DEGs were screened from the GSE53000 data sets, among which 583 were up-regulated and 708 were down-regulated. A total of 1,890 DEGs were screened from the GSE68417 data sets among them 726 up-regulated and 1,164 down-regulated genes were selected. There were 3,627 up-regulated genes and 5,170 down-regulated genes among the 8,797 DEGs screened from the GSE66272 data sets. The screening criteria were P-value 0.05, log Fold Change (— log FC —) ≥ 1. All of the DEGs from the four data sets were presented in Volcano plots (Figs. 2A–2D). Among them, red represents high-expressed genes, green represents low-expressed genes, and black represents genes whose expression levels are not significant in each data set. We gained the intersection of four independent data sets through Funrich 3.1.1 software and made a visualization analysis of Venn Diagram. After being overlapped, the common 738 genes (Fig. S1) were identified, including 289 up-regulated and 449 down-regulated genes (Figs. 2E–2F).
Table 1

Summary of TCGA and GEO Clear Cell Renal Cell Carcinoma datasets.

Author, yearSampleGEO no.PlatformNormalTumor(Refs.)
Stickel JS et al, 2009CCRCC GSE12606 GPL57033Stickel et al. (2009)
Gerlinger M et al, 2014CCRCC GSE53000 GPL6244654Gerlinger et al. (2014)
Thibodeau BJ et al, 2016CCRCC GSE68417 GPL62442914Thibodeau et al. (2016)
Wotschofsky Z et al, 2016CCRCC GSE66272 GPL50292726Wotschofsky et al. (2016)
TCGACCRCC72539

Notes.

The Cancer Genome Atlas

Gene Expression Omnibus

Figure 2

Screening of differentially expressed genes.

(A–D) The volcano plot of all DEGs respectively in GSE12606, GSE53000, GSE68417, and GSE66272 datasets. Red and green nodes represent up-regulated genes and down-regulated genes, respectively. (E–F) 738 DEGs were identified in four profile datasets (GSE12606, GSE53000, GSE68417, and GSE66272), 289 upregulated genes, 449 downregulated genes.

Notes. The Cancer Genome Atlas Gene Expression Omnibus

Screening of differentially expressed genes.

(A–D) The volcano plot of all DEGs respectively in GSE12606, GSE53000, GSE68417, and GSE66272 datasets. Red and green nodes represent up-regulated genes and down-regulated genes, respectively. (E–F) 738 DEGs were identified in four profile datasets (GSE12606, GSE53000, GSE68417, and GSE66272), 289 upregulated genes, 449 downregulated genes.

GO and KEGG enrichment analysis of DEGs

To further explore the function of DEGs, enrichment analysis of up-regulated genes and down-regulated genes were displayed respectively. DAVID 6.8 was used to perform GO and KEGG analysis of DEGs in CCRCC (Table 2). In biological processes, up-regulated DEGs are mostly involved in cell adhesion, signal transduction, immune response, especially the regulation of inflammatory response (Fig. 3A); while down-regulated DEGs are mostly involved in oxidation–reduction process proteolysis, ion transmembrane transport, response to drug, and ion transport (Fig. 3E). In terms of cellular components, up-regulated DEGs are mainly distributed in the plasma membrane, integral component of the membrane, extracellular exosome, and membrane (Fig. 3B); down-regulated DEGs are mainly distributed in integral component of membrane, plasma membrane, cellular exosomes, and integral component of plasma membrane (Fig. 3F). In terms of molecular function, up-regulated DEGs generally have protein binding, identical protein bind, ATP binding capacity, and protein homodimerization activity (Fig. 3C); down-regulated DEGs generally have identical protein homodimerization activity, calcium ion binding, sequence-specific DNA binding, and oxidoreductase activity (Fig. 3G). In the KEGG signal pathway, the up-regulation of DEGs mostly involved in the PI3K-Akt signaling pathway, pathways in cancer, focal adhesion, and HIF-1 signaling pathway (Fig. 3D); while the down-regulation of DEGs mainly involved in the metabolic pathway, Biosynthesis of antibiotics, Carbon metabolism, and Aldosterone-regulated sodium reabsorption (Fig. 3H). CCRCC is a kind of metabolic disease. Metabolic reprogramming covers different processes including aerobic glycolysis, fatty acid metabolism and the utilization of tryptophan, glutamine as well as arginine (Lucarelli et al., 2019), which has also been proved by the results of KEGG pathway enrichment analysis in the work. KEGG pathway results show that down-regulated DEGs are enriched in metabolic pathways, Glycolysis/Gluconeogenesis, Glycine, serine and threonine metabolism and so on.
Table 2

GO and KEGG analysis of differentially expressed DEG associated with CCRCC.

ExpressionCategoryTermCountP-value
UpregulatedGO_BPGO:0006954∼inflammatory response301.30E−12
GO_BPGO:0007165∼signal transduction291.31E−02
GO_BPGO:0007155∼cell adhesion265.33E−08
GO_BPGO:0006955∼immune response241.76E−07
GO_BPGO:0042493∼response to drug215.92E−08
GO_BPGO:0045087∼innate immune response191.36E−04
GO_BPGO:0030198∼extracellular matrix organization181.08E−08
GO_BPGO:0050900∼leukocyte migration176.55E−11
GO_BPGO:0001525∼angiogenesis174.06E−07
GO_BPGO:0008284∼positive regulation of cell proliferation172.57E−03
GO_CCGO:0005886∼plasma membrane1081.13E−09
GO_CCGO:0016021∼integral component of membrane1072.03E−04
GO_CCGO:0070062∼extracellular exosome662.68E−04
GO_CCGO:0016020∼membrane644.11E−07
GO_CCGO:0005887∼integral component of plasma membrane626.07E−14
GO_CCGO:0005576∼extracellular region501.98E−06
GO_CCGO:0005615∼extracellular space492.48E−08
GO_CCGO:0009986∼cell surface311.08E−09
GO_CCGO:0005783∼endoplasmic reticulum204.86E−02
GO_CCGO:0000139∼Golgi membrane171.94E−02
GO_MFGO:0005515∼protein binding1546.05E−03
GO_MFGO:0005524∼ATP binding366.15E−03
GO_MFGO:0042803∼protein homodimerization activity217.66E−03
GO_MFGO:0042802∼identical protein binding201.92E−02
GO_MFGO:0005102∼receptor binding197.41E−06
GO_MFGO:0004872∼receptor activity142.70E−05
GO_MFGO:0004672∼protein kinase activity122.09E−02
GO_MFGO:0005215∼transporter activity83.41E−02
GO_MFGO:0004888∼transmembrane signaling receptor activity84.43E−02
GO_MFGO:0005201∼extracellular matrix structural constituent75.17E−04
KEGGhsa04151: PI3K-Akt signaling pathway192.01E−04
KEGGhsa04510: Focal adhesion173.76E−06
KEGGhsa05200: Pathways in cancer175.96E−03
KEGGhsa04066: HIF-1 signaling pathway123.37E−06
KEGGhsa04015: Rap1 signaling pathway123.56E−03
KEGGhsa04145: Phagosome103.39E−03
KEGGhsa05205: Proteoglycans in cancer102.05E−02
KEGGhsa05150: Staphylococcus aureus infection91.14E−05
KEGGhsa04611: Platelet activation94.86E−03
KEGGhsa05133: Pertussis87.79E−04
DownregulatedGO_BPGO:0055114∼oxidation–reduction process333.33E−05
GO_BPGO:0034220∼ion transmembrane transport195.29E−06
GO_BPGO:0042493∼response to drug181.62E−03
GO_BPGO:0006811∼ion transport141.66E−05
GO_BPGO:0055085∼transmembrane transport148.16E−03
GO_BPGO:0007588∼excretion134.97E−11
GO_BPGO:0010628∼positive regulation of gene expression133.13E−02
GO_BPGO:0006814∼sodium ion transport124.94E−06
GO_BPGO:0001822∼kidney development128.92E−06
GO_BPGO:0008152∼metabolic process123.22E−03
GO_CCGO:0016021∼integral component of membrane1714.63E−07
GO_CCGO:0070062∼extracellular exosome1701.01E−33
GO_CCGO:0005886∼plasma membrane1622.99E−12
GO_CCGO:0005887∼integral component of plasma membrane763.16E−11
GO_CCGO:0005615∼extracellular space501.87E−03
GO_CCGO:0016324∼apical plasma membrane444.39E−22
GO_CCGO:0016323∼basolateral plasma membrane282.05E−14
GO_CCGO:0009986∼cell surface245.53E−03
GO_CCGO:0005759∼mitochondrial matrix198.74E−04
GO_CCGO:0043025∼neuronal cell body151.88E−02
GO_MFGO:0042803∼protein homodimerization activity337.43E−04
GO_MFGO:0005509∼calcium ion binding297.77E−03
GO_MFGO:0043565∼sequence-specific DNA binding204.43E−02
GO_MFGO:0016491∼oxidoreductase activity133.24E−03
GO_MFGO:0046983∼protein dimerization activity113.31E−03
GO_MFGO:0008201∼heparin binding115.22E−03
GO_MFGO:0005088∼Ras guanyl-nucleotide exchange factor activity101.78E−03
GO_MFGO:0003824∼catalytic activity103.74E−02
GO_MFGO:0016787∼hydrolase activity104.07E−02
GO_MFGO:0046934∼phosphatidylinositol-4,5-bisphosphate 3-kinase activity91.10E−04
KEGGhsa01100: Metabolic pathways688.56E−08
KEGGhsa01130: Biosynthesis of antibiotics195.17E−05
KEGGhsa01200: Carbon metabolism111.83E−03
KEGGhsa04960: Aldosterone-regulated sodium reabsorption91.45E−05
KEGGhsa04978: Mineral absorption93.69E−05
KEGGhsa00010: Glycolysis / Gluconeogenesis97.57E−04
KEGGhsa04610: Complement and coagulation cascades84.14E−03
KEGGhsa04966: Collecting duct acid secretion71.11E−04
KEGGhsa00260: Glycine, serine and threonine metabolism79.10E−04
KEGGhsa00280: Valine, leucine and isoleucine degradation72.46E−03
Figure 3

Enrichment analysis of GO and KEGG with up-regulated DEG.

(A) The biological process of GO analysis showed that the up-regulation of DEGs was mainly related to cell adhesion, inflammatory response, signal transduction, and immune response. (B) The enrichment analysis of up-regulated DEGs cell components is mainly related to the cellular exosomes, integral component of membrane, plasma membrane, and integral component of plasma membrane. (C) The molecular function of GO analysis showed that the up-regulation of DEGs was mainly related to protein binding, identical protein bind, ATP binding capacity, and protein homodimerization activity. (D) The KEGG pathways related to the up-regulation of DEGs expression mainly include the PI3K-Akt signaling pathway, focal adhesion, pathways in cancer, and HIF-1 signaling pathway. (E) The biological process of GO analysis showed that the downregulation of DEGs was mainly related to oxidation–reduction process proteolysis, ion transmembrane transport, response to the drug, and ion transport. (F) The enrichment analysis of down-regulated DEGs cell components is mainly related to integral components of membrane, plasma membrane, cellular exosomes, and integral component of plasma membrane. (G) The molecular function of GO analysis showed that the downregulation of DEGs was mainly related to protein homodimerization activity, calcium ion binding, oxidoreductase activity, and sequence-specific DNA binding. (H) The KEGG pathways related to the down-regulation of DEGs expression mainly include metabolic pathway, Biosynthesis of antibiotics, Carbon metabolism, and Aldosterone-regulated sodium reabsorption.

In addition, GO and KEGG enrichment analyses were performed for all DEG (up-regulated and down-regulated genes) (Fig. S2). Finally, 11 common enrichment results were found from the three analyses (including up-regulated DEG, down-regulated DEG and all DEG enrichment analyses). The common enrichment results are as follows: response to drug, plasma membrane, integral component of membrane, extracellular space, extracellular exosome, cell surface, cell surface, protein homodimerization activity, identical protein binding. The results of enrichment analysis were consistent with previous studies (Wang, Yu & Chai, 2019; Tian et al., 2019). It suggests that these 11 pathways may be important in CCRCC.

Enrichment analysis of GO and KEGG with up-regulated DEG.

(A) The biological process of GO analysis showed that the up-regulation of DEGs was mainly related to cell adhesion, inflammatory response, signal transduction, and immune response. (B) The enrichment analysis of up-regulated DEGs cell components is mainly related to the cellular exosomes, integral component of membrane, plasma membrane, and integral component of plasma membrane. (C) The molecular function of GO analysis showed that the up-regulation of DEGs was mainly related to protein binding, identical protein bind, ATP binding capacity, and protein homodimerization activity. (D) The KEGG pathways related to the up-regulation of DEGs expression mainly include the PI3K-Akt signaling pathway, focal adhesion, pathways in cancer, and HIF-1 signaling pathway. (E) The biological process of GO analysis showed that the downregulation of DEGs was mainly related to oxidation–reduction process proteolysis, ion transmembrane transport, response to the drug, and ion transport. (F) The enrichment analysis of down-regulated DEGs cell components is mainly related to integral components of membrane, plasma membrane, cellular exosomes, and integral component of plasma membrane. (G) The molecular function of GO analysis showed that the downregulation of DEGs was mainly related to protein homodimerization activity, calcium ion binding, oxidoreductase activity, and sequence-specific DNA binding. (H) The KEGG pathways related to the down-regulation of DEGs expression mainly include metabolic pathway, Biosynthesis of antibiotics, Carbon metabolism, and Aldosterone-regulated sodium reabsorption.

Construction of PPI network and screening of key genes

To identify the key genes, the STRING online database and Cytoscape software were used to analyze all DEGs (including up-regulated genes and down-regulated genes)and construct PPI network (Fig. 4A). Based on the main role of proteins in biological functions, their interaction determines the molecular and cellular mechanisms that control the health and disease state of the organism (Safari-Alighiarloo et al., 2014). Next, using the degree algorithm in the cytohubba plug-in of Cytoscape to screen the hub genes in PPI network. The gene whose degree ≥37 could be defined hub gene, therefore the first 35 genes (Table 3) in the PPI network were chosen to be hub genes (Fig. 4D). Next, GO enrichment analysis was conducted for all DEGs and the first 35 DEGs in the PPI network. The result showed that all DEGs mainly enriched in signal transduction, oxidation–reduction process, cell adhesion, inflammatory response, plasma membrane, integral component of membrane, extracellular exosome, integral component of plasma membrane (Figs. 4B–4C). The top 35 DEGs were enriched in extracellular space, extracellular exosome, inflammatory response, an integral component of the plasma membrane, and cell surface (Fig. 4E). By comparison, it turned out that the enrichment analysis results of all DEGs in PPI network contained the results of the top 35’s. In the end, three new key genes with high degree which were ITGAX, LAPTM5 and SERPINE1, could be screened by using GEPIA and Kaplan–Meier plotter database.
Figure 4

PPI network and GO enrichment analysis of key genes.

(A) The PPI network is constructed, including up-regulated genes and down-regulated genes, and considers that the confidence level ≥ 0.4 is significant for PPI. (B–C) GO enrichment analysis of all DEGs in PPI network. (D) PPI network of the top 35 DEGs according to the degree (degree ≥ 37). (E) GO enrichment analysis of the top 35 DEGs.

Table 3

The top 35 DEGs identified with the degree in the PPI network (Degree ≥ 37).

ExpressionGenesDegreeGenesDegree
UpregulatedEGFR110C3AR152
VEGFA100CYBB51
PTPRC93LCP250
FN190TLR749
ITGB270C345
TLR267CCL544
MMP966TIMP143
CD8664HCK43
CXCR462FCGR2A42
ICAM162CD5341
CCND161CTSS41
TYROBP59LAPTM540
CSF1R58LOX38
PLEK55SERPINE137
ITGAX53CAV137
DownregulatedALB134KNG152
EGF80KIT44
ERBB267

PPI network and GO enrichment analysis of key genes.

(A) The PPI network is constructed, including up-regulated genes and down-regulated genes, and considers that the confidence level ≥ 0.4 is significant for PPI. (B–C) GO enrichment analysis of all DEGs in PPI network. (D) PPI network of the top 35 DEGs according to the degree (degree ≥ 37). (E) GO enrichment analysis of the top 35 DEGs.

The expression of key genes

Among the 35 genes, we focused on ITGAX, LAPTM5, and SERPINE1, which have not been reported to be related to the occurrence and development of CCRCC. Firstly, the expression levels of ITGAX, LAPTM5, and SERPINE1 in CCRCC tumor tissues are significantly higher than those in normal tissues adjacent to cancer according to GEPIA database (Figs. 5A–5C). Furthermore, the data of TCGA KIRC showed that compared with adjacent normal tissues, the mRNA expressions of ITGAX, LAPTM5, and SERPINE1 in 72 pairs of CCRCC tissues were significantly increased (Figs. 5D–5F). In the Oncomine Gumz renal database, the mRNA levels of ITGAX, LAPTM5 and SERPINE1 were also upregulated in CCRCC tissues when compared with adjacent normal kidney tissues (Figs. 5H–5G). According to the UALCAN database, the promoter methylation levels of ITGAX, LAPTM5 and SERPINE were decreased in CCRCC (Figs. 5J–5L). To sum up, according to the GEPIA database, Oncomine Gumz renal database and TCGA database, the expression levels of these three genes in CCRCC tumor tissues are significantly higher than those in normal tissues adjacent to cancer. It could be further speculated that the high expression of ITGAX, LAPTM5, and SERPINE1 in CCRCC tumor tissue might be related to the decrease of promoter methylation.
Figure 5

Key gene expression between normal kidney and CCRCC tissues.

(A–C) Box plot showing the expression of ITGAX, LAPTM5, SERPINE1 in GEPIA database. These three genes are highly expressed in CCRCC. (D–F) In the TCGA database, compared with adjacent normal tissues, the expression of ITGAX, LAPTM5, SERPINE1 mRNA in 72 pairs of CCRCC tissues increased significantly. (G–I) The expression of ITGAX, LAPTM5, SERPINE1 are significantly increased in Oncomine Gumz renal database. * P < 0.05, ** P < 0.01, *** P < 0.001. (J–L) The box plot showing the promoter methylation levels of ITGAX, LAPTM5 and SERPINE1 in the UALCAN database. P < 0.01.

Key gene expression between normal kidney and CCRCC tissues.

(A–C) Box plot showing the expression of ITGAX, LAPTM5, SERPINE1 in GEPIA database. These three genes are highly expressed in CCRCC. (D–F) In the TCGA database, compared with adjacent normal tissues, the expression of ITGAX, LAPTM5, SERPINE1 mRNA in 72 pairs of CCRCC tissues increased significantly. (G–I) The expression of ITGAX, LAPTM5, SERPINE1 are significantly increased in Oncomine Gumz renal database. * P < 0.05, ** P < 0.01, *** P < 0.001. (J–L) The box plot showing the promoter methylation levels of ITGAX, LAPTM5 and SERPINE1 in the UALCAN database. P < 0.01.

The association of key genes expression with clinical pathology in CCRCC

Furthermore, the relationship between the mRNA expression of ITGAX, LAPTM5 and SERPINE1 and different clinical pathology grades were measured. The results showed that their mRNA expression was significantly related to pathological grades (Figs. 6A–6C). And the expression of ITGAX, LAPTM5, and SERPINE1 mRNA in CCRCC samples are also significantly correlated with severe clinical staging (Figs. 6D–6F). Among them, the expression levels of ITGAX, LAPTM5, and SERPINE1 were higher in stage 4 and grade 4. In conclusion, ITGAX, LAPTM5, and SERPINE1 are significantly associated with clinical pathology.
Figure 6

The relationship between ITGAX, LAPTM5, and SERPINE1 expression and clinical pathology of CCRCC.

(A–C) Boxplot showing that ITGAX, LAPTM5, and SERPINE1 mRNA expression were significantly related to pathological grades, and patients in grade 4 have the highest expression in CCRCC according to UALCAN databases. (D–F) Boxplot showing that the expression of ITGAX, LAPTM5, and SERPINE1 mRNA in CCRCC samples are significantly correlated with severe clinical staging and the mRNA expression of ITGAX, LAPTM5, and SERPINE1 were higher in patients with stage 4 according to UALCAN databases. * P < 0.05, ** P < 0.01, *** P < 0.001.

The relationship between ITGAX, LAPTM5, and SERPINE1 expression and clinical pathology of CCRCC.

(A–C) Boxplot showing that ITGAX, LAPTM5, and SERPINE1 mRNA expression were significantly related to pathological grades, and patients in grade 4 have the highest expression in CCRCC according to UALCAN databases. (D–F) Boxplot showing that the expression of ITGAX, LAPTM5, and SERPINE1 mRNA in CCRCC samples are significantly correlated with severe clinical staging and the mRNA expression of ITGAX, LAPTM5, and SERPINE1 were higher in patients with stage 4 according to UALCAN databases. * P < 0.05, ** P < 0.01, *** P < 0.001.

Survival and diagnostic value of ITGAX, LAPTM5, and SERPINE1 in CCRCC

According to the Kaplan–Meier plotter database, the overall survival of ITGAX, LAPTM5, and SERPINE1 genes was tested (Figs. 7A–7C). The results showed that the high expression of three key genes in CCRCC was negatively correlated with prognosis. Then, the ROC curve was used to evaluate the difference between CCRCC and the normal tissues in the TCGA KIRC data. The ROC curves (Figs. 7D–7E) of these three genes showed that their area under the curve (AUC) and the 95% confidence intervals (CI) are as follows: ITGAX (AUC = 96.315, CI = 0.9423–0.9840), LAPTM5, (AUC = 94.808 CI = 0.9243–0.9719), and SERPINE1(AUC = 76.272, CI [0.7068–0.8186]). Since ROC curves had good specificity and sensitivity, ITGAX, LAPTM5, and SERPINE1 had excellent diagnostic efficiency for distinguishing tumors and normal tissues.
Figure 7

Survival and diagnostic value of ITGAX, LAPTM5, and SERPINE1 in CCRCC.

(A–C) The overall survival of ITGAX, LAPTM5, and SERPINE1.The results showed that the high expression of three key genes in CCRCC was negatively correlated with prognosis. (D) ROC curve of ITGAX (AUC = 96.315, cutoff value = 8.836, Sensitivity = 92.579 Specificity = 91.667). (E) ROC curve of LAPTM5, (AUC = 94.808, cutoff value = 12.983 Sensitivity = 85.158 Specificity = 94.444). (F) ROC curve of SERPINE1, AUC = 76.272, cutoff value = 12,646, Sensitivity = 66.234 Specificity = 75).

Survival and diagnostic value of ITGAX, LAPTM5, and SERPINE1 in CCRCC.

(A–C) The overall survival of ITGAX, LAPTM5, and SERPINE1.The results showed that the high expression of three key genes in CCRCC was negatively correlated with prognosis. (D) ROC curve of ITGAX (AUC = 96.315, cutoff value = 8.836, Sensitivity = 92.579 Specificity = 91.667). (E) ROC curve of LAPTM5, (AUC = 94.808, cutoff value = 12.983 Sensitivity = 85.158 Specificity = 94.444). (F) ROC curve of SERPINE1, AUC = 76.272, cutoff value = 12,646, Sensitivity = 66.234 Specificity = 75).

Discussion

Clear Cell Renal Cell Carcinoma (CCRCC) is a metabolic disease whose morbidity is rising worldwide. The feature of kidney cancer is to participate in the target genes’ mutation of metabolic pathways (Lucarelli et al., 2019). Recently, with the application of bioinformatics, the molecular characteristics of CCRCC have been greatly improved and the development of targeted therapy has been promoted. These advances have significantly improved the median survival of patients with advanced disease. However, about 30% of CCRCC local patients will still relapse or metastasize after surgical removal of the tumor (Li et al., 2019). Around 1/3 of the metastatic patients had poor prognosis and rare high drug resistance rate. Under the circumstance of different treatment, the identification of biomarkers was needed urgently so as to predict the drug’s effects (Deleuze et al., 2020). Therefore, identification of CCRCC key genes and prognostic judgment is still very crucial. In our research, some reported genes related to CCRCC, such as VEGFA (Zeng et al., 2016), EGFR (Cossu-Rocca et al., 2016), were also screened out. Vascular Endothelial Growth Factor A (VEGFA) is a member of the PDGF/VEGF growth factor family. VEGFA has a potential role in the diagnosis and treatment of CCRCC. VEGFA can inhibit the proliferation of CCRCC 786-O cells, promote cell apoptosis, and inhibit cell migration and invasion (Zeng et al., 2016). Epidermal growth factor receptor (EGFR) is closely related to the progression of many epithelial malignancies and is an important therapeutic target (Cossu-Rocca et al., 2016). EGFR is a cell surface protein, belonging to the ERBB family. EGFR binds to epidermal growth factor to induce receptor dimerization and tyrosine self phosphorylation, which eventually leads to cell proliferation (Mitsudomi & Yatabe, 2010). EGFR can activate a variety of signaling pathways, mainly MAPK / ERK and PI3K / AKT pathways (Yarden & Sliwkowski, 2001). Compared with the former 4 studies, this study found three new key genes which were ITGAX, LAPTM5 and SERPINE1, and applied several methods to do functional analysis and systematic research to key genes such as methylation level analysis, survival analysis, ROC curve analysis and so on. In previous studies, GSE12606 mainly focused on the functional analysis of HLA ligand in CCRCC, while GSE5300 mainly focused on the structure and revolution of CCRCC genome, and GSE66272 mainly focused on the role of miRNA in CCRCC and GSE68417 mainly focused on the analysis to the gene expression profile of CCRCC. On the contrary, this study put emphasis on screening the biomarkers used to do early diagnosis in CCRCC and analyzing the intersection of DEGs in four datasets. Besides, three new potential marker genes ITGAX, LAPTM5, SERPINE1 were also screened out and proved. Integrin alpha x (ITGAX) is a member of the integrin family, commonly function as a receptor for extracellular matrix. It is reported that ITGAX is involved in the angiogenesis of dendritic cells and tumor angiogenesis (Wang et al., 2019). In addition, ITGAX is identified as a new type of aggressive prostate cancer susceptibility gene (Williams et al., 2014). Lysosomal protein transmembrane 5 (LAPTM5), known as E3 protein, may play a role in hematopoiesis and prevent excessive activation of lymphocytes (Cai et al., 2015). It is reported that LAPTM5 can regulate the proliferation and viability of bladder cancer cells, leading to cell cycle arrest in the G0/G1 phase (Chen et al., 2017). LAPTM5 is also associated with the spontaneous regression of neuroblastoma (Inoue et al., 2009). Studies have shown that Serpin family E member 1 (SERPINE1 is a regulator of Glioblastoma (GBM) cell proliferation, and is related to poor prognosis of patients and mesenchymal GBM. Down-regulation of SERPINE1 in primary GBM cells inhibited the growth and invasiveness of tumors in the brain, and SERPINE1 plays a key role in the spread of GBM (Seker et al., 2019).

Conclusions

To sum up, the expression levels of ITGAX, LAPTM5, SERPINE1 in CCRCC tumor tissues are significantly higher than those in normal tissues adjacent to cancer and are related to the tumor stage and tumor grade. ITGAX, LAPTM5, and SERPINE1 have high diagnostic efficiency for tumors and normal tissues, and their expressions are associated with poor prognosis of CCRCC. The decrease of promoter methylation of ITGAX, LAPTM5 and SERPINE1 in CCRCC tumor tissues indicates that the high expression of key genes in CCRCC might be relevant to the low methylation level. The limitation of this study lied in that the internal molecular mechanisms where key genes played a role remained unclear, which need a further research. Further studies are needed to explore the detailed mechanisms of these key genes in CCRCC. In conclusion, we identified ITGAX, LAPTM5, and SERPINE1 as potential marker genes of CCRCC by bioinformatics methods, providing insights for future therapeutic design. Meanwhile, we conducted a relatively systematic and comprehensive analysis on CCRCC data, thereby providing a theoretical basis for identifying therapeutic targets of CCRCC, promoting early detection, and monitoring tumor progression.

Volcano code

The code is used for visual analysis of volcanoes. Click here for additional data file. Click here for additional data file. Click here for additional data file.

Heat map of logFC values for all DEG in the four datasets

The color labor represents the different values of log FC. Click here for additional data file.

Enrichment analysis of GO and KEGG with all DEGs

(A) Biological processes. (B)Cellular components. (C) Molecular functions (D) KEGG pathway. Click here for additional data file.

Datasets GSE12606

A total of 9560 DEGs were screened from the GSE12606, of which, 2,230 and 7,330 genes were up- and downregulated. Click here for additional data file.

Dataset GSE53000

A total of 1,286 DEGs were screened from the GSE53000 among them 589 upregulated and 697 downregulated genes were selected. Click here for additional data file.

Dataset GSE68417

A total of 1,890 DEGs were screened from the GSE68417 among them 726 upregulated and 1,164 downregulated genes were selected. Click here for additional data file.

Dataset GSE66272

There were 3,627 upregulated genes and 5,170 downregulated genes among the 8797 DEGs screened from the GSE66272 dataset. Click here for additional data file. Click here for additional data file. Click here for additional data file.

PPI network

The raw data of the PPI network, including up-regulated genes and down-regulated genes, and considers that the confidence level ≥ 0.4 is significant for PPI. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  41 in total

1.  DAVID: Database for Annotation, Visualization, and Integrated Discovery.

Authors:  Glynn Dennis; Brad T Sherman; Douglas A Hosack; Jun Yang; Wei Gao; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2003-04-03       Impact factor: 13.583

2.  FOXM1-Activated LINC01094 Promotes Clear Cell Renal Cell Carcinoma Development via MicroRNA 224-5p/CHSY1.

Authors:  Yufeng Jiang; Haimin Zhang; Wei Li; Yang Yan; Xudong Yao; Wenyu Gu
Journal:  Mol Cell Biol       Date:  2020-01-16       Impact factor: 4.272

3.  Expression and Polymorphisms of Lysosome-Associated Protein Transmembrane 5 (LAPTM5) in Patients with Systemic Lupus Erythematosus in a Chinese Population.

Authors:  Xinze Cai; Ying Qiao; Yang Chen; Shuyan Du; Dong Chen; Shuang Yu; Nan Liu; Yi Jiang
Journal:  Biochem Genet       Date:  2015-05-22       Impact factor: 1.890

Review 4.  Untangling the ErbB signalling network.

Authors:  Y Yarden; M X Sliwkowski
Journal:  Nat Rev Mol Cell Biol       Date:  2001-02       Impact factor: 94.444

Review 5.  Challenges and opportunities in the proteomic characterization of clear cell renal cell carcinoma (ccRCC): A critical step towards the personalized care of renal cancers.

Authors:  Qing Kay Li; Christian P Pavlovich; Hui Zhang; Christopher R Kinsinger; Daniel W Chan
Journal:  Semin Cancer Biol       Date:  2018-07-25       Impact factor: 15.707

Review 6.  The Metabolic Basis of Kidney Cancer.

Authors:  W Marston Linehan; Laura S Schmidt; Daniel R Crooks; Darmood Wei; Ramaprasad Srinivasan; Martin Lang; Christopher J Ricketts
Journal:  Cancer Discov       Date:  2019-05-14       Impact factor: 39.397

7.  EGFR kinase-dependent and kinase-independent roles in clear cell renal cell carcinoma.

Authors:  Paolo Cossu-Rocca; Maria R Muroni; Francesca Sanges; Giovanni Sotgiu; Anna Asunis; Luciana Tanca; Daniela Onnis; Giovanna Pira; Alessandra Manca; Simone Dore; Maria G Uras; Sara Ena; Maria R De Miglio
Journal:  Am J Cancer Res       Date:  2015-12-15       Impact factor: 6.166

8.  Comparison of Immediate vs Deferred Cytoreductive Nephrectomy in Patients With Synchronous Metastatic Renal Cell Carcinoma Receiving Sunitinib: The SURTIME Randomized Clinical Trial.

Authors:  Axel Bex; Peter Mulders; Michael Jewett; John Wagstaff; Johannes V van Thienen; Christian U Blank; Roland van Velthoven; Maria Del Pilar Laguna; Lori Wood; Harm H E van Melick; Maureen J Aarts; J B Lattouf; Thomas Powles; Igle Jan de Jong Md PhD; Sylvie Rottey; Bertrand Tombal; Sandrine Marreaud; Sandra Collette; Laurence Collette; John Haanen
Journal:  JAMA Oncol       Date:  2019-02-01       Impact factor: 31.777

9.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

10.  Cooperative Effect of miR-141-3p and miR-145-5p in the Regulation of Targets in Clear Cell Renal Cell Carcinoma.

Authors:  Julia Liep; Ergin Kilic; Hellmuth A Meyer; Jonas Busch; Klaus Jung; Anja Rabien
Journal:  PLoS One       Date:  2016-06-23       Impact factor: 3.240

View more
  4 in total

1.  Transgenerational inheritance of promoter methylation changes in extrauterine growth restriction-induced pulmonary arterial pressure disorders.

Authors:  Lili Tang; Ping Chen; Liu Yang; Jiyuan Liu; Yuanfang Zheng; Jincai Lin; Senhua Chen; Yinzhu Luo; Yanyan Chen; Xiaoying Ma; Liyan Zhang
Journal:  Ann Transl Med       Date:  2021-10

Review 2.  Receiver operating characteristic curve: overview and practical use for clinicians.

Authors:  Francis Sahngun Nahm
Journal:  Korean J Anesthesiol       Date:  2022-01-18

Review 3.  Cancer-Associated Fibroblasts: Mechanisms of Tumor Progression and Novel Therapeutic Targets.

Authors:  Ralf-Peter Czekay; Dong-Joo Cheon; Rohan Samarakoon; Stacie M Kutz; Paul J Higgins
Journal:  Cancers (Basel)       Date:  2022-02-27       Impact factor: 6.639

4.  Key Prognostic Value of Lysosomal Protein Transmembrane 5 in Kidney Renal Clear Cell Carcinoma.

Authors:  Bin Li; Xue-Dong Shi
Journal:  Int J Gen Med       Date:  2022-03-05
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.