Literature DB >> 34266404

Bioinformatics analysis of C3 and CXCR4 demonstrates their potential as prognostic biomarkers in clear cell renal cell carcinoma (ccRCC).

Jing Quan1, Yuchen Bai1, Yunbei Yang1, Er Lei Han2, Hong Bai1, Qi Zhang1, Dahong Zhang3.   

Abstract

BACKGROUND: The molecular prognostic biomarkers of clear cell renal cell carcinoma (ccRCC) are still unknown. We aimed at researching the candidate biomarkers and potential therapeutic targets of ccRCC.
METHODS: Three ccRCC expression microarray datasets (include GSE14762, GSE66270 and GSE53757) were downloaded from the gene expression omnibus (GEO) database. The differentially expressed genes (DEGs) between ccRCC and normal tissues were explored. The potential functions of identified DEGs were analyzed by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). And then the protein - protein interaction network (PPI) was established to screen the hub genes. After that, the expressions of hub genes were identified by the oncomine database. The hub genes' prognostic values of patients with ccRCC were analyzed by GEPIA database.
RESULTS: A total of 137 DEGs were identified by utilizing the limma package and RRA method, including 63 upregulated genes and 74 downregulated genes. It is found that 137 DEGs were mainly enriched in 82 functional terms and 24 pathways in accordance with the research results. Thirteen highest-scoring genes were screened as hub genes (include 10 upregulated genes and 3 downregulated candidate genes) by utilizing the PPI network and module analysis. Through integrating the oncoming database and GEPIA database, the author found that C3 and CXCR4 are not only overexpressed in ccRCC, but also associated with the prognosis of ccRCC. Further results could reveal that patients with high C3 expression had a poor overall survival (OS), while patients with high CTSS and TLR3 expressions had a good OS; patients with high C3 and CXCR4 expressions had a poor disease-free survival (DFS), while ccRCC patients with high TLR3 expression had a good DFS.
CONCLUSION: These findings suggested that C3 and CXCR4 were the candidate biomarkers and potential therapeutic targets of ccRCC patients.
© 2021. The Author(s).

Entities:  

Keywords:  Bioinformatics analysis; Biomarker; Gene; ccRCC

Year:  2021        PMID: 34266404      PMCID: PMC8283915          DOI: 10.1186/s12885-021-08525-w

Source DB:  PubMed          Journal:  BMC Cancer        ISSN: 1471-2407            Impact factor:   4.430


Introduction

Renal cell carcinoma (RCC) is the most common kidney malignancies, which originates in the renal tubular epithelium [1]. Among of RCC, clear cell RCC (ccRCC) is the most important histological subtype, accounting for ∼80% of RCC [2]. The vast majority of RCC are discovered by accident. Less than 5% of RCC are detected by the classic triad (gross hematuria, flank pain and abdominal mass) and are often advanced. Due to resistant to radiotherapy and chemotherapy, surgical resection is still the optimal treatment for RCC [2]. Although the emergence of immunotherapy and targeted therapy has diversified the treatment of RCC, the prognosis of patients with RCC who have lost the opportunity of surgery remains dismal [3]. Therefore, it is particularly important to understand the pathogenesis of RCC and investigate biomarkers to support the treatment and prediction of prognosis. In recent years, bioinformatics analysis of gene expression microarrays could help identify the potential target genes of diseases and provide the molecular characteristics, regulatory pathways and cellular networks of diseases [4]. The gene expression omnibus (GEO, www.ncbi.nlm.nih.gov/geo/) database is an international public functional genomics database, which stores common array and sequence data. In the past decades, more and more scholars had indicated that tumor-related genes were discovered by using GEO databases in their researches. For instance, Guo et al. found that 31 mostly changed hub genes were significant enriched in several pathways through integrated bioinformatical analysis, which mainly associated with cell cycle process, chemokines and G-protein coupled receptor signaling pathways in colorectal cancer [5]. Besides, Liang’s research results indicated that BCL2, CCND1 and COL1A1 might be the key genes in thyroid papillary carcinoma through bioinformatics analysis [6]. What’s more, bioinformatics has been widely used in the diagnosis and prognosis of renal cancer. For example, li et al. found that MMP2, DCN, COL4A1, CASR, GPR4, UTS2, and LDLR can be regarded as potential immunotherapy biomarkers for RCC [3]. And Tao constructed a immune-related gene-based prognostic index, which can effectively predict the prognosis of patients with renal cancer and the associated immunoinfiltrating cells and provide a new method for predicting the prognosis and targeted therapy of renal cancer [7]. Based on the above researches and methods, the author analyzed the gene expression profile of ccRCC by using the GEO database, and then further analyzed the data to provide valuable hub genes for the following translational and clinical research.

Materials and methods

Access to public resources

Three expression profiling datasets (GSE14762 [8], GSE66270 [9] and GSE53757 [10]) were downloaded from the Gene Expression Omni - bus (GEO) database of the National Center for Biotechnology Information (NCBI). The GSE14762 dataset included 11 tumor tissue samples and matched normal tissue samples. The GSE66270 dataset included 14 normal tissue samples and 14 tumor tissue samples. And the GSE53757 dataset included 72 tumor tissue samples and adjacent tissue samples. Among of them, the microarray data of GSE14762 was running at the GPL4866 Plaforms, and the microarray data of GSE66270 and GSE53757 were analyzed at the GPL570 Plaforms. Platforms and series matrix files were downloaded as TXT files. Details for GEO ccRCC data were shown in Table 1.
Table 1

Details for GEO ccRCC data

ReferenceGEOPlatformSample
normaltumor
Furge K [8]1GSE14762GPL48661111
Jung K [9]2GSE66270GPL5701414
Von Roemeling CA [10]3GSE53757GPL5707272

1. Renal Cell Carcinoma: Hypoxia and Endocytosis

2. Genome-wide analysis of gene expression patterns in human kidney cancer [patients without metastasis

3. Gene array analysis of clear cell renal cell carcinoma tissue versus matched normal kidney tissue

Details for GEO ccRCC data 1. Renal Cell Carcinoma: Hypoxia and Endocytosis 2. Genome-wide analysis of gene expression patterns in human kidney cancer [patients without metastasis 3. Gene array analysis of clear cell renal cell carcinoma tissue versus matched normal kidney tissue

Detection of DEGs

The R language software (version 3.5.0; https://www.r-project.org/) and annotation package were used to handle the downloaded data files. Probe name in the downloaded data files was changed into international standard name. The package in the Bioconductor (http://www.bioconductor.org/) was used for gene distinguish expression analysis. Robust Multi-array Average (RMA) algorithm was used for the gene expression profile data preprocessing. And quantile normalization was performed to normalize the above data. P < 0.05 and [log2 Fold Change] ≥ 2 were regarded as the DEGs screening threshold. The Robust Rank Aggreg (RRA) analysis (http://cran.r-project.org/) was used to list the up-regulated and down-regulated genes. DEGs of three datasets were represented by volcano map and hierarchical clustering heat map.

Gene ontology (GO) and KEGG enrichment analyses

The biological processes (BP), molecular functions (MF) and cellular components (CC) of DEGs were explored by applying two online biological tools. The online website g:Profiler (https://biit.cs.ut.ee/gprofiler/gost) was used for Go analysis. And DAVID 6.8 (https://david.ncifcrf.gov/) was used for KEGG analysis. P < 0.05 was considered as the significant threshold for GO and KEGG pathway analysis.

PPI network construction

Online database STRING (http://string-db.org) and Cytoscape software (Version 3.6.1, http://www.cytoscape.org/) were applied to generate the PPI network of DEGs and identify the hub genes. Besides, the Molecular Complex Detection (MCODE) plug-in in Cytoscape software was used to analyze clustered sub-networks of highly intraconnected nodes from the above PPI network. The default parameters of MCODE plug-in were as follows: Degree cutoff ≥2, Node score cutoff ≥0.2, K-core ≥2, and Max depth = 100.

Expression and survival analysis of hub genes

The meta-analysis function of oncomine database (https://www.oncomine.org/) was used to better validate the expression level of hub genes. Besides, online database GEPIA (http://gepia.cancer-pku.cn/detail.php) was an interactive web server, which can analyze the expression of tumor and normal genes. The purpose of this study was to analyze the relationship between the hub genes expression and the survival analysis of [overall survival (OS) and disease free survival (DFS)].

Results

Microarray data information

The RCC expression microarray datasets (GSE14762, GSE66270 and GSE53757) were standardized by RMA algorithm, and the results were shown in Fig. 1. The author obtained 381 DREs from GSE14762 in accordance with the screening criteria (P < 0.05 and [log2 FC] ≥ 2). Moreover, the author obtained 870 DEGs and 1324 DEGs from GSE66270 and GSE53757. The DEGs from the two groups of sample data included in each of the three databases were shown by volcano plot (Fig. 2). The cluster heatmaps of the top 100 DEGs from the three microarrays were shown in Fig. 3.
Fig. 1

Standardization of gene expression by boxplot. The GSE14762 data (A), GSE66270 data (B) and GSE55757 data (C) was standardized

Fig. 2

Volcano plot of differential data expressions between two sample sets. Three figures show the volcano plot of GSE14762 data (A), GSE66270 data (B) and GSE55757 data (C). The red oints represent overexpressed genes (threshold: P < 0.05 and |[log2 FC]| ≥ 2). The green points represent under-expressed genes (threshold: P < 0.05 and |[log2 FC]| ≥ 2). The black points represent undifferentiated genes

Fig. 3

Clustering heatmap of DEGs. Three figures show the heatmap of GSE14762 data (A), GSE66270 data (B) and GSE55757 data (C). Red grid shows that the genes expression is uoverexpressed, green grid shows that the genes expression is under-expressed, black grid shows that there are no significant difference and gray grid shows that genes are too weak to be detected

Standardization of gene expression by boxplot. The GSE14762 data (A), GSE66270 data (B) and GSE55757 data (C) was standardized Volcano plot of differential data expressions between two sample sets. Three figures show the volcano plot of GSE14762 data (A), GSE66270 data (B) and GSE55757 data (C). The red oints represent overexpressed genes (threshold: P < 0.05 and |[log2 FC]| ≥ 2). The green points represent under-expressed genes (threshold: P < 0.05 and |[log2 FC]| ≥ 2). The black points represent undifferentiated genes Clustering heatmap of DEGs. Three figures show the heatmap of GSE14762 data (A), GSE66270 data (B) and GSE55757 data (C). Red grid shows that the genes expression is uoverexpressed, green grid shows that the genes expression is under-expressed, black grid shows that there are no significant difference and gray grid shows that genes are too weak to be detected

DEGs identification in ccRCC

The three microarray databases of RCC were analyzed and sorted by the limma package (threshold: P < 0.05 and [log2 Fold Change] ≥ 2), and then further analyzed by the RRA method. As a result, 137 DEGs were identified, including 63 overexpressed genes and 74 under-expressed genes (Table 2). The heatmap of the top 20 overexpressed and under-expressed genes was revealed by R-heatmap software in Fig. 4.
Table 2

The genes differentially expressed both in GEO database were identified in ccRCC samples

Gene names
Upregulated DEGsEGLN3, CA9, ANGPTL4, IGFBP3, ENO2, NDUFA4L2, SPAG4, HK2, CXCR4, APOC1, NOL3, LAPTM5, LPCAT1, PSMB9, CTSS, TYROBP, NETO2, RRM2, TMEM45A, CAV2, LOC101928916 /// NNMT, TNFAIP6, PFKP, TLR3, LGALS1, MIR6787 /// SLC16A3, C3, COL23A1, C1QA, CSTA, CAV1, ITGB2, SEMA5B, PLOD2, C1QB, TRIB3, MS4A6A, PDK1, BIRC3, DDB2, ENTPD1, TREM2, EVI2A, P2RX7, HILPDA, LOC56901, FBXO16 /// ZNF395, ST8SIA4, CTHRC1, PRKCDBP, ENPP3, ISG20, MNDA, SLC16A3, ZNF395, FCER1G, PLK2, TNFSF13B, FCGR3A /// FCGR3B, RGS1, TLR2, TGFBI, CASP1
Downregulated DEGs

KCNJ1, KNG1, CLCNKB, FGF9, DMRT2, CALB1, RHCG, CLDN8, ATP6V0A4, SFRP1, ATP6V1G3, NPHS2, HS6ST2, ABAT, ATP6V1B1, AQP2, ALDH6A1, DIO1, SLC34A1, ATP6V0D2, RHBG, MAN1C1, FGF1, PVALB, UMOD, GPC3, DPEP1, SERPINA5, XPNPEP2, DCXR, TMEM52B, ACOX2, TMEM213, LPPR1, HEPACAM2, GPR110, TFCP2L1, FXYD4, HRG, GGT6, ERP27, SLC12A3, TYRP1, DUSP9, SH3GL2, SMIM5, SUCLG1, UPP2, SLC4A1, SLC22A8, SLC7A8, HSD11B2, ACAA1, SOST, ENPP6, RP11-999E24.3, ALDH4A1

TCF21, EFHD1, FBP1, HPD, TMEM30B, SLC13A3, SLC22A7, AFM, ACSF2, PCK2, PLG, FABP1, LOC155006, SUCNR1, LINC01187, CRYAA, CHL1

Fig. 4

RRA analyses. This figure shows the top 20 overexpressed and under-expressed genes obtained by RRA analyses. Red grid indicates that the genes expression is upregulated, blue grid indicates that the genes expression is downregulated, and white grid indicates that there is no detected gene expression

The genes differentially expressed both in GEO database were identified in ccRCC samples KCNJ1, KNG1, CLCNKB, FGF9, DMRT2, CALB1, RHCG, CLDN8, ATP6V0A4, SFRP1, ATP6V1G3, NPHS2, HS6ST2, ABAT, ATP6V1B1, AQP2, ALDH6A1, DIO1, SLC34A1, ATP6V0D2, RHBG, MAN1C1, FGF1, PVALB, UMOD, GPC3, DPEP1, SERPINA5, XPNPEP2, DCXR, TMEM52B, ACOX2, TMEM213, LPPR1, HEPACAM2, GPR110, TFCP2L1, FXYD4, HRG, GGT6, ERP27, SLC12A3, TYRP1, DUSP9, SH3GL2, SMIM5, SUCLG1, UPP2, SLC4A1, SLC22A8, SLC7A8, HSD11B2, ACAA1, SOST, ENPP6, RP11-999E24.3, ALDH4A1 TCF21, EFHD1, FBP1, HPD, TMEM30B, SLC13A3, SLC22A7, AFM, ACSF2, PCK2, PLG, FABP1, LOC155006, SUCNR1, LINC01187, CRYAA, CHL1 RRA analyses. This figure shows the top 20 overexpressed and under-expressed genes obtained by RRA analyses. Red grid indicates that the genes expression is upregulated, blue grid indicates that the genes expression is downregulated, and white grid indicates that there is no detected gene expression

GO and KEGG analysis of DEG

The author futher understood the function of hub genes include BP, CC and MF by using DAVID database. Significant results of the GO enrichment analysis of DEGs in ccRCC are shown in Table 3. As shown in Fig. 5a and b, GO analysis (threshold: P < 0.05 and count≥2) demonstrated that ccRCC hub genes were mainly enriched in 50 terms of BP group, such as response to hypoxia, oxidation-reduction process and proteolysis. In CC group, DEGs were enriched in 21 terms, such as extracellular exosome, plasma membrane and membrane integral component. Similarly in MF group, DEGs were enriched in 11 terms, such as identical protein binding, receptor binding and heparin binding. As shown in Fig. 5c, the result illustrated the relationship between the different functions of cytoscape software.
Table 3

Significant results of the GO enrichment analysis of DEGs

FunctionTermCountPValueGenes
biological processesGO:0007588 ~ excretion72.64E-07NPHS2, CLCNKB, UMOD, ATP6V1B1, ATP6V0A4, KCNJ1, AQP2
GO:0001666 ~ response to hypoxia115.42E-07CAV1, NOL3, CA9, PLOD2, CXCR4, EGLN3, TLR2, HSD11B2, ABAT, CASP1, ANGPTL4
GO:0090383 ~ phagosome acidification49.87E-04ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
GO:0033572 ~ transferrin transport40.002115ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
GO:0008286 ~ insulin receptor signaling pathway50.002575CAV2, ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
GO:0032755 ~ positive regulation of interleukin-6 production40.004349P2RX7, TLR2, FCER1G, TLR3
GO:0034220 ~ ion transmembrane transport70.004506FXYD4, CLCNKB, ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2, AQP2
GO:0015695 ~ organic cation transport30.00525RHCG, SLC7A8, RHBG
GO:0006885 ~ regulation of pH30.005972RHCG, ATP6V1B1, ATP6V0A4
GO:0090090 ~ negative regulation of canonical Wnt signaling pathway60.006989CTHRC1, CAV1, SOST, GPC3, SFRP1, PSMB9
GO:0042493 ~ response to drug80.007086P2RX7, CA9, SFRP1, LGALS1, SLC34A1, HSD11B2, ABAT, NNMT
GO:0008152 ~ metabolic process60.007913ENPP6, LPCAT1, SUCLG1, MAN1C1, ACSF2, ACAA1
GO:0032092 ~ positive regulation of protein binding40.010136CTHRC1, CAV1, PLK2, TRIB3
GO:0055074 ~ calcium ion homeostasis30.011171CAV1, ATP6V1B1, CALB1
GO:0006508 ~ proteolysis100.011293C1QA, C1QB, GGT6, SFRP1, C3, CTSS, CASP1, PLG, DPEP1, XPNPEP2
GO:0055114 ~ oxidation-reduction process110.011711ALDH6A1, TYRP1, PLOD2, NDUFA4L2, RRM2, EGLN3, HSD11B2, ALDH4A1, DIO1, DCXR, HPD
GO:0010951 ~ negative regulation of endopeptidase activity50.012067KNG1, C3, SERPINA5, HRG, CSTA
GO:0050900 ~ leukocyte migration50.012407SLC16A3, CAV1, SLC7A8, FCER1G, ITGB2
GO:0045880 ~ positive regulation of smoothened signaling pathway30.013221GPC3, SFRP1, FGF9
GO:0001798 ~ positive regulation of type IIa hypersensitivity20.014597C3, FCER1G
GO:2000054 ~ negative regulation of Wnt signaling pathway involved in dorsal/ventral axis specification20.014597SOST, SFRP1
GO:0061621 ~ canonical glycolysis30.01542ENO2, PFKP, HK2
GO:0001503 ~ ossification40.020939SOST, SLC34A1, ATP6V1B1, ATP6V0A4
GO:0006954 ~ inflammatory response80.021562KNG1, TNFAIP6, P2RX7, CXCR4, C3, TLR2, TLR3, ITGB2
GO:0050717 ~ positive regulation of interleukin-1 alpha secretion20.021815P2RX7, CASP1
GO:2000116 ~ regulation of cysteine-type endopeptidase activity20.021815BIRC3, PSMB9
GO:0070634 ~ transepithelial ammonium transport20.021815RHCG, RHBG
GO:0019065 ~ receptor-mediated endocytosis of virus by host cell20.021815CAV2, CAV1
GO:0015991 ~ ATP hydrolysis coupled proton transport30.022874ATP6V1B1, ATP6V0A4, ATP6V0D2
GO:0051480 ~ regulation of cytosolic calcium ion concentration30.022874CAV1, PVALB, CALB1
GO:0001525 ~ angiogenesis60.024169CAV1, FGF9, TGFBI, HRG, FGF1, ANGPTL4
GO:0002931 ~ response to ischemia30.024234CAV1, NOL3, HK2
GO:0072221 ~ metanephric distal convoluted tubule development20.028982UMOD, CALB1
GO:0002283 ~ neutrophil activation involved in immune response20.028982FCER1G, TYROBP
GO:0007162 ~ negative regulation of cell adhesion30.029996KNG1, TGFBI, HRG
GO:0006955 ~ immune response80.03533RGS1, TNFSF13B, C3, ENPP3, TLR2, CTSS, FCGR3A, FCGR3B
GO:0070836 ~ caveola assembly20.036096CAV2, CAV1
GO:0015696 ~ ammonium transport20.036096RHCG, RHBG
GO:0006873 ~ cellular ion homeostasis20.036096RHCG, SLC4A1
GO:0019740 ~ nitrogen utilization20.036096RHCG, RHBG
GO:0010543 ~ regulation of platelet activation20.036096FCER1G, HRG
GO:0051005 ~ negative regulation of lipoprotein lipase activity20.036096APOC1, ANGPTL4
GO:0034123 ~ positive regulation of toll-like receptor signaling pathway20.036096TLR2, TLR3
GO:0031623 ~ receptor internalization30.039536CAV1, FCER1G, ITGB2
GO:0006094 ~ gluconeogenesis30.041225ENO2, FBP1, PCK2
GO:0050776 ~ regulation of immune response50.041952C3, ITGB2, FCGR3A, TREM2, TYROBP
GO:0030514 ~ negative regulation of BMP signaling pathway30.04294CAV1, SOST, SFRP1
GO:0000187 ~ activation of MAPK activity40.043944P2RX7, CXCR4, FGF1, DUSP9
GO:0007596 ~ blood coagulation50.046407P2RX7, SERPINA5, FCER1G, ENTPD1, PLG
GO:0032760 ~ positive regulation of tumor necrosis factor production30.046448TLR2, FCER1G, TLR3
cell compositionGO:0070062 ~ extracellular exosome614.70E-17FGF9, SLC7A8, CALB1, AQP2, EFHD1, GPC3, PVALB, CXCR4, PLOD2, SERPINA5, TMEM52B, TGFBI, SLC4A1, FCGR3A, ATP6V0D2, FCGR3B, DPEP1, HPD, KNG1, ALDH6A1, CRYAA, SUCLG1, SLC22A8, PFKP, FBP1, C1QA, C1QB, RHCG, MNDA, ABAT, CSTA, CHL1, SH3GL2, ENPP6, C3, ENPP3, APOC1, ITGB2, UMOD, ATP6V1B1, ENO2, HS6ST2, HRG, SUCNR1, ENTPD1, SLC12A3, LGALS1, PCK2, MAN1C1, PLG, PSMB9, XPNPEP2, AFM, GGT6, SFRP1, NPHS2, SLC13A3, FABP1, ATP6V0A4, IGFBP3, DCXR
GO:0016323 ~ basolateral plasma membrane126.23E-08CLDN8, CAV1, RHCG, CA9, SLC22A7, SLC22A8, SLC7A8, RHBG, UMOD, SLC4A1, ATP6V1B1, AQP2
GO:0005886 ~ plasma membrane534.79E-06CLDN8, TLR2, SLC7A8, AQP2, GPC3, CXCR4, TGFBI, SLC4A1, FCGR3A, FCGR3B, DPEP1, KNG1, COL23A1, SLC22A7, SUCLG1, SLC22A8, SLC34A1, TNFSF13B, RHCG, CA9, ATP6V1G3, TREM2, SH3GL2, CHL1, ENPP6, CAV2, CAV1, FXYD4, C3, RHBG, TRIB3, CLCNKB, ITGB2, KCNJ1, ENO2, TMEM30B, FCER1G, HRG, SUCNR1, ENTPD1, TYROBP, SLC12A3, PLG, XPNPEP2, SLC16A3, P2RX7, RGS1, SFRP1, NPHS2, SLC13A3, DIO1, ATP6V0A4, DCXR
GO:0005887 ~ integral component of plasma membrane255.00E-05CAV2, CAV1, FXYD4, SLC12A3, SLC22A7, ENPP3, SLC22A8, TLR2, RHBG, SLC34A1, SLC7A8, TLR3, CLCNKB, AQP2, SLC16A3, P2RX7, LAPTM5, GPC3, RHCG, NPHS2, FCER1G, SLC13A3, SLC4A1, ENTPD1, TYROBP
GO:0016471 ~ vacuolar proton-transporting V-type ATPase complex47.28E-05ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
GO:0072562 ~ blood microparticle81.04E-04KNG1, C1QB, AFM, C3, HRG, SLC4A1, PLG, ANGPTL4
GO:0005578 ~ proteinaceous extracellular matrix101.24E-04CTHRC1, SOST, GPC3, SFRP1, LGALS1, TGFBI, UMOD, FGF1, CHL1, ANGPTL4
GO:0016324 ~ apical plasma membrane102.30E-04CAV1, RHCG, SLC12A3, SLC34A1, UMOD, ATP6V1B1, ATP6V0A4, ATP6V0D2, DPEP1, AQP2
GO:0005615 ~ extracellular space224.94E-04KNG1, CTHRC1, C3, FGF9, LGALS1, HILPDA, CTSS, PLG, TNFAIP6, AFM, GPC3, SOST, TNFSF13B, SFRP1, SERPINA5, TGFBI, ENO2, CSTA, FGF1, IGFBP3, DPEP1, ANGPTL4
GO:0031225 ~ anchored component of membrane60.001254ENPP6, GPC3, UMOD, FCGR3B, DPEP1, XPNPEP2
GO:0016021 ~ integral component of membrane530.002201CLDN8, TLR2, SLC7A8, TLR3, AQP2, CXCR4, EVI2A, TMEM52B, SMIM5, SLC4A1, FCGR3A, FCGR3B, COL23A1, SLC22A7, SLC22A8, SLC34A1, DMRT2, HEPACAM2, TNFSF13B, CA9, RHCG, SPAG4, HSD11B2, TREM2, CHL1, NETO2, CAV2, TYRP1, CAV1, TMEM213, ENPP3, RHBG, CLCNKB, UMOD, KCNJ1, SEMA5B, LPCAT1, TMEM30B, FCER1G, HS6ST2, SUCNR1, ENTPD1, MS4A6A, TYROBP, TMEM45A, SLC12A3, NDUFA4L2, HILPDA, GGT6, SFRP1, SLC13A3, DIO1, ATP6V0A4
GO:0043234 ~ protein complex100.002667CAV2, CAV1, SOST, PVALB, SERPINA5, NPHS2, DDB2, FABP1, PRKCDBP, BIRC3
GO:0005576 ~ extracellular region220.004463KNG1, ENPP6, FGF9, C3, APOC1, UMOD, CTSS, PLG, C1QA, C1QB, AFM, SOST, TNFSF13B, SFRP1, SERPINA5, TGFBI, HRG, FGF1, TREM2, CASP1, IGFBP3, ANGPTL4
GO:0009986 ~ cell surface110.005205SFRP1, CXCR4, LGALS1, TLR2, SLC34A1, FCER1G, TLR3, HILPDA, ITGB2, PLG, TYROBP
GO:0002080 ~ acrosomal membrane30.007858CAV2, CAV1, SERPINA5
GO:0005602 ~ complement component C1 complex20.014107C1QA, C1QB
GO:0045121 ~ membrane raft60.015662CAV2, CAV1, NPHS2, TLR2, SLC34A1, BIRC3
GO:0000139 ~ Golgi membrane100.024687CAV2, CAV1, LPCAT1, ST8SIA4, TLR3, HS6ST2, HEPACAM2, MAN1C1, SH3GL2, HPD
GO:0005581 ~ collagen trimer40.027574C1QA, CTHRC1, C1QB, COL23A1
GO:0005759 ~ mitochondrial matrix70.028942PDK1, ALDH6A1, SUCLG1, ALDH4A1, ABAT, PCK2, ACSF2
GO:0005782 ~ peroxisomal matrix30.042028ACOX2, FABP1, ACAA1
molecular functionGO:0019864 ~ IgG binding46.11E-05FCER1G, UMOD, FCGR3A, FCGR3B
GO:0008201 ~ heparin binding70.001175KNG1, SOST, SFRP1, FGF9, SERPINA5, HRG, FGF1
GO:0005102 ~ receptor binding100.001189KNG1, ACOX2, P2RX7, CAV1, TNFSF13B, C3, HRG, HILPDA, PLG, TYROBP
GO:0030506 ~ ankyrin binding30.009327RHCG, RHBG, SLC4A1
GO:0001530 ~ lipopolysaccharide binding30.011232P2RX7, TLR2, TREM2
GO:0043027 ~ cysteine-type endopeptidase inhibitor activity involved in apoptotic process30.012243NOL3, BIRC3, DPEP1
GO:0015301 ~ anion:anion antiporter activity30.012243SLC22A7, SLC22A8, SLC4A1
GO:0051117 ~ ATPase binding40.017177CAV1, FXYD4, ATP6V1G3, ATP6V0A4
GO:0042802 ~ identical protein binding120.022021CLDN8, CAV1, NOL3, SFRP1, CRYAA, FBP1, ALDH4A1, TLR3, SH3GL2, DCXR, TYROBP, ANGPTL4
GO:0015078 ~ hydrogen ion transmembrane transporter activity30.022995ATP6V1B1, ATP6V0A4, ATP6V0D2
GO:0004869 ~ cysteine-type endopeptidase inhibitor activity30.025762KNG1, HRG, CSTA
Fig. 5

Go and KEGG analysis of DREs in ccRCC. A DEGs were divided into three functional groups by GO analysis, including molecular function, biological processes and cell composition. B GO enrichment significance items of DEGs in three functional groups. C The relationship between the different functions. D Significant pathway enrichment of DEGs. Purple represents the signaling pathway, red represents the overexpressed genes and green represents the under-expressed genes

Significant results of the GO enrichment analysis of DEGs Go and KEGG analysis of DREs in ccRCC. A DEGs were divided into three functional groups by GO analysis, including molecular function, biological processes and cell composition. B GO enrichment significance items of DEGs in three functional groups. C The relationship between the different functions. D Significant pathway enrichment of DEGs. Purple represents the signaling pathway, red represents the overexpressed genes and green represents the under-expressed genes The significantly enriched pathways were submitted to KEGG analysis to further analyze the above DEGs. As shown in Table 4 and Fig. 5d, the significant pathway enrichment of DEGs was indicated by KEGG analysis. These DEGs were enriched in 24 pathways, which mainly related to metabolic pathways, phagosome and other pathways.
Table 4

KEGG pathway analysis of DEGs associated with ccRCC

TermCountP ValueGenes
hsa04966:Collecting duct acid secretion61.77E-05CLCNKB, SLC4A1, ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
hsa05150:Staphylococcus aureus infection75.12E-05C1QA, C1QB, C3, ITGB2, FCGR3A, FCGR3B, PLG
hsa04145:Phagosome109.64E-05C3, TLR2, ITGB2, CTSS, ATP6V1G3, FCGR3A, ATP6V1B1, ATP6V0A4, FCGR3B, ATP6V0D2
hsa05323:Rheumatoid arthritis77.65E-04TNFSF13B, TLR2, ITGB2, ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
hsa05152:Tuberculosis90.001573C3, TLR2, FCER1G, ITGB2, CTSS, FCGR3A, ATP6V0A4, FCGR3B, ATP6V0D2
hsa04610:Complement and coagulation cascades60.001629KNG1, C1QA, C1QB, C3, SERPINA5, PLG
hsa01100:Metabolic pathways260.006292ACOX2, TYRP1, ENPP3, HK2, UPP2, ATP6V1B1, LPCAT1, ENO2, ALDH4A1, ATP6V0D2, HPD, ALDH6A1, NDUFA4L2, SUCLG1, FBP1, PFKP, PCK2, MAN1C1, GGT6, RRM2, ABAT, ATP6V1G3, ATP6V0A4, DCXR, ACAA1, NNMT
hsa03320:PPAR signaling pathway50.009598ACOX2, FABP1, PCK2, ACAA1, ANGPTL4
hsa00010:Glycolysis / Gluconeogenesis50.009598ENO2, FBP1, PFKP, HK2, PCK2
hsa05140:Leishmaniasis50.011719C3, TLR2, ITGB2, FCGR3A, FCGR3B
hsa01200:Carbon metabolism60.013278ALDH6A1, SUCLG1, ENO2, FBP1, PFKP, HK2
hsa05133:Pertussis50.01412C1QA, C1QB, C3, ITGB2, CASP1
hsa05110:Vibrio cholerae infection40.026859ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
hsa05134:Legionellosis40.029616C3, TLR2, ITGB2, CASP1
hsa04721:Synaptic vesicle cycle40.043794ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
hsa05230:Central carbon metabolism in cancer40.045546SLC16A3, PDK1, PFKP, HK2
hsa00640:Propanoate metabolism30.047335ALDH6A1, SUCLG1, ABAT
hsa01130:Biosynthesis of antibiotics70.04889SUCLG1, ENO2, FBP1, PFKP, HK2, PCK2, ACAA1
hsa05120:Epithelial cell signaling in Helicobacter pylori infection40.051005ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
hsa00051:Fructose and mannose metabolism30.060164FBP1, PFKP, HK2
hsa04650:Natural killer cell mediated cytotoxicity50.065949FCER1G, ITGB2, FCGR3A, FCGR3B, TYROBP
hsa00190:Oxidative phosphorylation50.084557NDUFA4L2, ATP6V1G3, ATP6V1B1, ATP6V0A4, ATP6V0D2
hsa04960:Aldosterone-regulated sodium reabsorption30.085035FXYD4, HSD11B2, KCNJ1
hsa05322:Systemic lupus erythematosus50.086365C1QA, C1QB, C3, FCGR3A, FCGR3B
KEGG pathway analysis of DEGs associated with ccRCC

PPI network and module analysis

String database was used to generate PPI networks of DEGs in RCC. Figure 6a showed the relationship between the 137 candidate hub genes. Besides, MCODE application was applied to screen out the highest-scoring nodes. And Fig. 6b displayed the module with the highest score (score = 10, node = 11, edges = 50). As a result, MCODE application selected 13 nodes with the highest score, including 10 upregulated candidate genes (C1QA, C1QB, C3, CTSS, CXCR4, FCER1G, ITGB2, TLR2, TLR3 and TYROBP) and 3 downregulated candidate genes (AQP2, KNG1, PLG).
Fig. 6

Protein-protein interaction network and MCODE application. A PPI network. B Top 13 degree genes by MCODE application

Protein-protein interaction network and MCODE application. A PPI network. B Top 13 degree genes by MCODE application The oncomine database and GEPIA database were applied to further explore the expression and prognosis of the above screened genes. Six analyses were obtained from the oncomine database (Fig. 7). The significant (P < 0.05) expression of 10 genes were suggested by the result of meta-analysis. Figure 8 indicated the OS and DFS of 10 genes. And the result demonstrated that ccRCC patients with high C3 expression had a poor OS, while ccRCC patients with high CTSS and TLR3 expressions had a good OS. Besides, in ccRCC patients, high C3 and CXCR4 expressions indicated a poor DFS, while high TLR3 expression indicated a good DFS. Finally, C3 and CXCR4 were selected to distinguish the prognosis of ccRCC patients.
Fig. 7

The expression level of 13 hub genes. Among 6 different analysis datasets by the ONCOMINE database

Fig. 8

The OS and DFS of 10 candidate genes in ccRCC patients by GEPIA database. (OS: A-J, DFS: K-T)

The expression level of 13 hub genes. Among 6 different analysis datasets by the ONCOMINE database The OS and DFS of 10 candidate genes in ccRCC patients by GEPIA database. (OS: A-J, DFS: K-T)

Discussion

Kidney cancer accounts for about 2 to 3% of adult malignant tumors, and 80 to 90% of adult renal malignancies. In 2012, about 338, 000 kidney cancer cases were newly discovered, accounting for 24% of all tumors; and there were 144,000 death cases, accounting for 17% of all tumors [11]. RCC was the most common kidney malignancies. The early symptoms of RCC were not obvious, and most patients are diagnosed with advanced stage or metastasis [12]. RCC was characteristic of easy recurrence and metastasis because of its complexity of the causes and pathogenesis. Moreover, it was insensitive to the traditional chemoradiotherapy. Under the influence of these reasons, RCC usually leaded to poor clinical outcomes. Hence, it could improve the diagnosis, treatment and prognosis of RCC via understanding more of the biological molecular mechanism. The sequencing technology and bioinformatics are developing gradually, the collection and analysis of previous data will support to explor the pathogenesis of RCC and discover possible biomarkers for diagnosis andtreatment [13]. Bioinformatics method is a highly efficient research pathway, which could promote the development of related gene or group of disease by analyzing the biological data. At the present, Bioinformatics have been widely used at all areas, including medical research, the design of the discover disease-related genes, clinical diagnosis of disease, individualized treatment of diseases and new molecular targets for drug discovery [14]. One hundred thirty-seven DEGs were identified in this study, including 63 overexpressed genes and 74 under-expressed genes. It was found that these DEGs were mainly enriched in 82 terms and 24 pathways through GO and KEGG analysis. Thirteen highest-scoring genes were screened as hub gene through PPI network. Further verification based on the oncomine platform indicated that 10 hub genes (C1QA, C1QB, C3, CTSS, CXCR4, FCER1G, ITGB2, TLR2, TLR3 and TYROBP) had significantly highly expressed. Finally, through the GEPIA platform, the author found that ccRCC patients with high C3 expression had a poor OS, while ccRCC patients with high CTSS and TLR3 expressions had a better OS. Meanwhile, high C3 and CXCR4 expressions were associated with a poor DFS, while patients with high TLR3 expression had a good DFS. As a protein coding gene, complement component 3 (C3) is involved in the occurrence and development of many diseases, including C3 deficiency, Autosomal Recessive and Hemolytic Uremic Syndrome, Atypical 5 [15]. And its related pathways are Immune response Lectin induced complement pathway and Signaling by GPCR. In previous reports, C3 was demonstrated as a potential prognostic marker for non-small cell lung cancer and may be a new immune marker to differentiate the prognosis of patients with non-small cell lung cancer [16, 17]. Besides, Yuan et al. demonstated that overexpressed C3 could activate the JAK2/STAT3 pathway, which affected the progression of gastric cancer [18]. In addition, it had been reported that tumor cell–derived C3 could regulated TAMs through C3a-C3aR-PI3Kγ pathway to suppress the antitumor immunity [19]. CTSS (Cathepsin S) is a protein coding gene. Previous articles in papillary thyroid carcinoma reported that CTSS was highly expressed and related to transformation. These results revealed that the highly expression of CTSS was associated with poor prognosis and lymph node metastasis [20]. Similarly, it had been reported that CTSS was over-expressed in triple-negative breast cancer, and the inhibition of CTSS could be conducted by inhibiting the growth and metastasis of triple-negative breast cancer [21]. Prof. Dheilly found that follicular lymphoma patients harbor a recurrent hotspot mutation targeting tyrosine 132 (Y132D) in cathepsin S (CTSS) that enhances protein activity. Futher study revealed that it could enhanced the anti-tumor immune responses in Non-Hodgkin Lymphoma by inhibiting CTSS [22]. In this study, the author analyzed the research data and found that CTSS was indeed highly expressed in RCC, but the high expression was associated with better prognosis. The prognosis of patients with high expression was even better, which is an opposite effect between expression and prognosis. The potential reasons for the inconsistent findings need further investigations. As a member of the Toll-like receptor (TLR) family, previous studies had reported that TLR3 was abnormally expressed in a variety of tumors, including breast, ovarian and prostate tumors. But TLR3 was associated with the clinical outcomes of various cancers [23, 24]. Francesca revealed that TLR3 could induce apoptpsis in Non-Small-Cell Lung Cancer via boosting the innate immune response [25]. Besides, Fan’s result demonstated that TLR3 suppressed the proliferation by downregulating the EGFR/PI3K/AKT pathway in brest cancer [26]. Similarly, TLR3 was also downregulated in hepatocellular carcinoma. And deep reseach showed that overexpression of TLR3 was associated with longer survival [27]. In this study, TLR3 was highly expressed in RCC but it was related to the better prognosis result. Chemokine receptor-4 (CXCR4) belongs to the super-family of the seven-transmembrane domain, heterotrimeric G-protein-coupled receptors and is associated with cell proliferation, migration, invasion and survival. In the previous reports, it had been demonstrated that CXCR4 was upregulated in sporadic Vestibular schwannomas (VS) as well as in neurofibromatosis type 2 (NF2) tumors [28-30]. Besides, SDF-1 (CXCL12)/CXCR4 signaling has been verified to play a vital role in oncobiology, especially in hypoxia adaptation, metastasis and migration [31]. What’s more, the CXCR4 antagonists (such as AMD3100, Mozobil®) were widely applied in hematopoietic stem cells, which could dramaticly increase the mobilization efficiency and yields of progenitor cells [32]. The results in this study showed that CXCR4 was over-expressed in RCC and associated with poor prognosis. However, the role of CXCR4 in RCC has been poorly studied. Therefore, the further exploration of the mechanism of CXCR4 in RCC will help people to find new therapeutic targets.

Conclusion

In summary, the author identified two ccRCC-associated candidate genes (C3 and CXCR4) with potential prognostic value via bioinformatics analysis of three expression profile datasets from the GEO database. Additionally, in this study, it have been found that CTSS and TLR3 were abnormally expressed in ccRCC and associated with ccRCC prognosis. However, their expression level is contrary to the prognosis. These novel biomarkers may have important clinical significance for the diagnosis and prognosis of RCC, but their detailed action mechanism in the development of renal carcinoma needs to be further explored. In the following studies, the author will further verify the expression of the above genes in renal cancer through RT-QPCT. In addition, its downstream target genes and signaling pathways need to be explored and verified by cell experiments in vitro and animal experiments in vivo, which will help the author to better understand its developmental mechanism in renal cancer. Additional file 1. Additional file 2.
  32 in total

1.  Intracellular Activation of Complement C3 Leads to PD-L1 Antibody Treatment Resistance by Modulating Tumor-Associated Macrophages.

Authors:  Haoran Zha; Xinxin Wang; Ying Zhu; Diangang Chen; Xiao Han; Fei Yang; Jianbao Gao; Chunyan Hu; Chi Shu; Yi Feng; Yulong Tan; Jinyu Zhang; Yongsheng Li; Yisong Y Wan; Bo Guo; Bo Zhu
Journal:  Cancer Immunol Res       Date:  2018-12-04       Impact factor: 11.151

2.  Toll-like receptor 3 acts as a suppressor gene in breast cancer initiation and progression: a two-stage association study and functional investigation.

Authors:  Lei Fan; Peng Zhou; Qi Hong; Ao-Xiang Chen; Guang-Yu Liu; Ke-Da Yu; Zhi-Ming Shao
Journal:  Oncoimmunology       Date:  2019-03-30       Impact factor: 8.110

3.  A bioinformatics approach for precision medicine off-label drug drug selection among triple negative breast cancer patients.

Authors:  Lijun Cheng; Bryan P Schneider; Lang Li
Journal:  J Am Med Inform Assoc       Date:  2016-04-23       Impact factor: 4.497

4.  Sorafenib reduces the percentage of tumour infiltrating regulatory T cells in renal cell carcinoma patients.

Authors:  Ingrid M E Desar; J Hans F M Jacobs; Christina A Hulsbergen-vandeKaa; Wim J G Oyen; Peter F A Mulders; Winette T A van der Graaf; Gosse J Adema; Carla M L van Herpen; I Jolanda J M de Vries
Journal:  Int J Cancer       Date:  2010-11-12       Impact factor: 7.396

5.  Using Machine Learning Modeling to Explore New Immune-Related Prognostic Markers in Non-Small Cell Lung Cancer.

Authors:  Jiasheng Xu; Han Nie; Jiarui He; Xinlu Wang; Kaili Liao; Luxia Tu; Zhenfang Xiong
Journal:  Front Oncol       Date:  2020-10-30       Impact factor: 6.244

6.  Neuronal pentraxin 2 supports clear cell renal cell carcinoma by activating the AMPA-selective glutamate receptor-4.

Authors:  Christina A von Roemeling; Derek C Radisky; Laura A Marlow; Simon J Cooper; Stefan K Grebe; Panagiotis Z Anastasiadis; Han W Tun; John A Copland
Journal:  Cancer Res       Date:  2014-06-24       Impact factor: 12.701

7.  Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012.

Authors:  Jacques Ferlay; Isabelle Soerjomataram; Rajesh Dikshit; Sultan Eser; Colin Mathers; Marise Rebelo; Donald Maxwell Parkin; David Forman; Freddie Bray
Journal:  Int J Cancer       Date:  2014-10-09       Impact factor: 7.396

8.  Regulation of endocytosis via the oxygen-sensing pathway.

Authors:  Yi Wang; Olga Roche; Mathew S Yan; Greg Finak; Andrew J Evans; Julie L Metcalf; Bridgid E Hast; Sara C Hanna; Bill Wondergem; Kyle A Furge; Meredith S Irwin; William Y Kim; Bin T Teh; Sergio Grinstein; Morag Park; Philip A Marsden; Michael Ohh
Journal:  Nat Med       Date:  2009-03-01       Impact factor: 53.440

9.  Identification of Key Candidate Genes and Pathways in Colorectal Cancer by Integrated Bioinformatical Analysis.

Authors:  Yongchen Guo; Yonghua Bao; Ming Ma; Wancai Yang
Journal:  Int J Mol Sci       Date:  2017-03-28       Impact factor: 5.923

10.  Integrated bioinformatics analysis reveals that the expression of cathepsin S is associated with lymph node metastasis and poor prognosis in papillary thyroid cancer.

Authors:  Juan Tan; Xiaoxiao Qian; Bin Song; Xiumin An; Tingting Cai; Zhihua Zuo; Dafa Ding; Yibing Lu; Hong Li
Journal:  Oncol Rep       Date:  2018-05-08       Impact factor: 3.906

View more
  2 in total

1.  DLK2 Acts as a Potential Prognostic Biomarker for Clear Cell Renal Cell Carcinoma Based on Bioinformatics Analysis.

Authors:  Man-Gang Lee; Yung-Kuo Lee; Shih-Chung Huang; Chen-Lin Chang; Chou-Yuan Ko; Wen-Chin Lee; Tung-Yuan Chen; Shiow-Jyu Tzou; Cheng-Yi Huang; Ming-Hong Tai; Yu-Wei Lin; Mei-Lang Kung; Ming-Chao Tsai; Yung-Lung Chen; Yi-Chen Chang; Zhi-Hong Wen; Chao-Cheng Huang; Tian-Huei Chu
Journal:  Genes (Basel)       Date:  2022-04-01       Impact factor: 4.141

2.  Crosstalk of necroptosis and pyroptosis defines tumor microenvironment characterization and predicts prognosis in clear cell renal carcinoma.

Authors:  Liangmin Fu; Jiahao Bao; Jinhui Li; Qiuyang Li; Hansen Lin; Yayun Zhou; Jiangbo Li; Yixuan Yan; Marvin E Langston; Tianhao Sun; Songliang Guo; Xinwei Zhou; Yuhang Chen; Yujun Liu; Yiqi Zhao; Jun Lu; Yong Huang; Wei Chen; Benjamin I Chung; Junhang Luo
Journal:  Front Immunol       Date:  2022-09-30       Impact factor: 8.786

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.