Literature DB >> 31746380

Screening and identification of key biomarkers in prostate cancer using bioinformatics.

Song Li1, Junqing Hou1, Weibo Xu1.   

Abstract

Prostate cancer (PCa) is the second most common cancer amongst males worldwide. In the current study, microarray datasets GSE3325 and GSE6919 from the Gene Expression Omnibus database were screened to identify candidate genes that are associated with the progression of PCa. A total of 273 differentially expressed genes (DEGs) were identified, which included 173 downregulated genes and 100 upregulated genes, and a protein‑protein interaction network was constructed using Search Tool for the Retired of Interacting Genes. The enriched functions and pathways of the identified DEGs included cell adhesion, the negative regulation of cell proliferation, protein binding and focal adhesion. A total of 8 hub genes were identified, of which PDZ binding kinase, Krüppel‑like factor 4, collagen type XII α‑1 chain, RAP1A and RAP39B were indicated to be associated with the progression and recurrence of PCa. In conclusion, the DEGs and hub genes identified in the present study may aid in determining the molecular mechanisms associated with PCa carcinogenesis and progression.

Entities:  

Year:  2019        PMID: 31746380      PMCID: PMC6896273          DOI: 10.3892/mmr.2019.10799

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Prostate cancer (PCa) is the second most common type of cancer among men globally, and constitutes ~15% of all cancer diagnoses worldwide (1). Digital rectal examination, measurement of the serum level of prostate specific antigen (PSA) and biopsy from a prostate transrectal ultrasonography are the most common diagnostic tools for PCa (2). Additionally, with advances in genetic analysis, alterations have been identified in a number of gene regions in patients with PCa, including prostate antigen 3, androgen-dependent transmembrane serine 2 and S-transferase P1 (3–5). However, genetic analysis exhibits a low specificity and can increase the number of unnecessary biopsies performed without reducing patient mortality (6). Previous studies have associated the tribbles pseudokinase 1 gene with the development of a number of tumors, including colorectal leukemia and hepatocellular cancers (7–9). It has been shown that transmembrane protease, serine 2:ETS-related gene (TMPRSS2:ERG) fusion is associated with diagnosing PCa in urine samples and DNA-based molecular templates (10). However, due to the lack of effective diagnostic methods during the early stages of the disease, the mortality rate of PCa remains high (10). Therefore, it is crucial to understand the molecular mechanisms associated with PCa carcinogenesis, proliferation and recurrence. Microarray technology and bioinformatics analysis led to the identification of 273 differentially expressed genes (DEGs) and functional pathways in the carcinogenesis and progression of PCa. In the current study, two mRNA microarray datasets from Gene Expression Omnibus (GEO) were analyzed to identify DEGs between PCa tissues and non-cancerous tissues. Subsequently, the molecular mechanisms of PCa carcinogenesis and progression were investigated using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis and protein-protein interaction (PPI) network analyses. In conclusion, a total of 273 DEGs and 8 hub genes were identified in the current study, and these genes may be candidate biomarkers for PCa.

Materials and methods

Database

GEO (http://www.ncbi.nlm.nih.gov/geo) (11) is a public functional genomics database. GSE3325 (12) and GSE6919 (13) were downloaded from GEO (Affymetrix Human Genome U133 Plus 2.0 Array). The GSE3325 dataset contained 12 PCa tissue samples and 12 non-cancerous samples. GSE6919 contained 8 PCa samples and 8 non-cancerous samples.

Identification of DEGs

The Affy package (version 1.52.0) (14) was used to preprocess the raw expression data in the R statistical software (R ×64 3.5.3; http://cran.r-project.org). DEGs were subsequently identified between PCa and normal samples using the limma (version 3.34.7) package of the R statistical software (https://bioconductor.org/packages/release/bioc/html/limma.html). DEGs with log2FC >1 and P<0.01 were selected in the microarray data.

Enrichment analysis of DEGs

The Database for Annotation, Visualization and Integrated Discovery (DAVID; version 6.7; http://david.ncifcrf.gov) (15), which provides functional annotation information of genes and proteins, was used to perform DEG analysis of KEGG pathway enrichment (16) and GO annotation (17). P<0.05 was set as the threshold value.

Module analysis and construction of the PPI network

The Search Tool for the Retrieval of Interacting Genes (version 10.0; http://string-db.org) (18), which offers comprehensive information on PPIs, was used to create the PPI network. The molecular interaction networks were visualized using Cytoscape (version 3.4.0; http://cytoscape.org/) (19). The Molecular Complex Detection (MCODE) (version 1.4.2) of Cytoscape was used to identify densely connected regions (20). The PPI networks were visualized using Cytoscape software. The hub genes and the most significant module in the PPI networks were identified using MCODE.

Hub genes selection and analysis

The hub genes were selected and their co-expression genes were analyzed using cBioPortal (http://www.cbioportal.org) (21). Hierarchical clustering of hub genes was constructed using the University of California, Santa Cruz Cancer Browser (http://genome-cancer.ucsc.edu) (22). The overall survival and disease-free survival analyses of hub genes (the cutoff was the median expression value) were performed using Kaplan-Meier analysis in cBioPortal. The hazard ratio (HR) with 95% confidence intervals and log-rank P-value was also computed. The expression of PDZ binding kinase (PBK) and Krüppel-like factor 4 (KLF4) in cancer tissues were analyzed and presented using the online database Serial Analysis of Gene Expression (SAGE; http://www.ncbi.nlm.nih.gov/SAGE) (23). The relationship between expression patterns and tumor-node-metastasis (TNM) stage, Gleason grade and recurrence status were analyzed using the online database Oncomine (http://www.oncomine.com) (24).

Results

Identification of DEGs in PCa

After standardization of the microarray results, a total of 1,024 DEGs in GSE6919 and 2,371 DEGs in GSE3325 were identified. The overlap between the 2 datasets contained 273 genes, as presented in Fig. 1A, consisting of 173 downregulated genes and 100 upregulated genes in PCa tissues.
Figure 1.

Venn diagram, PPI network and the most significant module of DEGs. (A) DEGs were selected with a fold change >2 and P-value <0.01. The 2 datasets showed an overlap of 273 genes. (B) PPI network of DEGs constructed using Cytoscape. (C) Most significant module with 20 nodes and 100 edges. Upregulated genes are marked in light red; downregulated genes are marked in light blue. DEGs, differentially expressed genes; PPI, protein-protein interaction.

PPI network and module analysis

The PPI network of DEGs (Fig. 1B) and the most significant module were identified using Cytoscape (Fig. 1C). The functional analyses of DEGs demonstrated that genes in this module were mainly enriched in nucleotide binding, small molecule binding, focal adhesion and the regulation of the actin cytoskeleton (Table I).
Table I.

GO and KEGG pathway enrichment analysis of DEGs in the most significant module.

Pathway IDPathway descriptionCount in gene setFDR
GO:0000166Nucleotide binding60.003
GO:1901265Nucleoside phosphate binding60.003
GO:0036094Small molecule binding60.004
GO:0008092Cytoskeletal protein binding5<0.001
hsa04510Focal adhesion80.007
hsa04810Regulation of actin cytoskeleton80.008

GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; DEG, differentially expressed genes; FDR, false discovery rate.

Functional enrichment analyses of DEGs

Functional and pathway enrichment analyses of DEGs were performed using DAVID. GO analysis revealed that the biological processes of DEGs were significantly enriched in cell adhesion, negative regulation of cell proliferation, cell division and extracellular matrix organization (Table II). Molecular functions of DEGs were enriched in protein binding, GTP binding, mannose-binding, chromatin binding and chromatin binding (Table II). Cell components enriched with DEGs included the nucleus, cytoplasm, perinuclear region of the cytoplasm and focal adhesion (Table II). KEGG pathway analysis revealed that DEGs were mainly enriched in focal adhesion, regulation of the actin cytoskeleton, tight junction, coagulation cascades and gap junctions.
Table II.

KEGG and GO enrichment analyses of DEGs.

TermDescriptionCount in gene setP-value
GO:0007155Cell adhesion120.049
GO:0008285Negative regulation of cell proliferation110.044
GO:0051301Cell division100.050
GO:0030308Negative regulation of cell growth70.006
GO:0030198Extracellular matrix organization70.051
GO:0005634Nucleus99<0.001
GO:0005737Cytoplasm93<0.001
GO:0048471Perinuclear region of cytoplasm170.010
GO:0005925Focal adhesion16<0.001
GO:0009986Cell surface140.033
GO:0005515Protein binding147<0.001
GO:0005525GTP binding130.007
GO:0003682Chromatin binding120.019
GO:0019901Protein kinase binding110.035
hsa04510Focal adhesion110.007
hsa04810Regulation of actin cytoskeleton90.008
hsa04530Tight junction60.003
hsa04540Gap junction60.005

GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; DEG, differentially expressed genes.

Analyses of the 8 hub genes

In the present study, a total of 8 hub genes were identified and these hub genes were presented in Table III. The criteria for selection were as follows: MCODE scores >5, degree cut-off=2, node score cut-off=0.2, Max depth=100 and k-score=2. Among the 8 genes, PBK, RAP1A, GNAS and RAB39B were upregulated, while COPZ2, KLF4, BACE1 and COL12A1 were downregulated A network of the hub genes and their co-expression genes were analyzed using the cBioPortal online platform (Fig. 2A). Hierarchical clustering demonstrated that the hub genes could differentiate PCa samples from noncancerous samples (Fig. 2B). Subsequently, the overall survival analysis of the hub genes was performed using a Kaplan-Meier curve analysis. Patients with PCa and PBK, RAP1A, GNAS, coatomer protein complex subunit ζ 2 (COPZ2), β-secretase 1 (BACE1) and collagen type XII α-1 Chain (COL12A1) upregulation demonstrated decreased overall survival (Fig. 3A). Patients with PCa and PBK, RAP1A, GNAS, COPZ2, BACE1 and COL12A1 upregulation exhibited decreased disease-free survival (Fig. 3B). Additionally, RAB39B and KLF4 upregulation was associated with increased overall survival and disease-free survival. Based on the above survival analysis, PBK and KLF4 were identified to serve important roles in the carcinogenesis or progression of PCa. Oncomine analysis of cancer and normal tissue revealed that PBK and KLF4 were significantly overexpressed in PCa in the different datasets (Fig. 4A and B). In the Taylor prostate of Oncomine dataset, the increased mRNA levels of PBK were associated with TNM stage, Gleason grade and recurrence status (Fig. 5A). In the Tatulippe prostate of Oncomine dataset, decreased KLF4 mRNA levels were associated with TNM stage, Gleason grade and recurrence status (Fig. 5A and B). PBK gene expression in metastatic tissue was higher compared with primary tumor and solid tissue normal via Oncomine (Fig. 6).
Table III.

Functional roles of 8 hub genes with degree ≥10.

No.Gene symbolFull nameFunction
1PBKPDZ binding kinaseActive lymphoid cells and support testicular functions; over expression of this gene has been implicated in tumorigenesis
2RAP1AMember of RAS oncogene familyAffect cell proliferation and adhesion, and may play a role in tumor malignancy
3GNASMember of RAS oncogene familyThe encoded protein regulates signaling pathways that affect cell proliferation and adhesion
4COPZ2Coatomer protein complex subunit ζ 2This gene encodes a member of the adaptor complexes small subunit family
5KLF4Krüppel-like factor 4Control the G1-to-S transition of the cell cycle
6BACE1β-secretase 1This gene encodes a member of the peptidase A1 family of aspartic proteases
7COL12A1Collagen type XII α 1 chainModify the interactions between collagen I fibrils and the surrounding matrix
8RAB39BMember RAS oncogene familyEncodes a member of the Rab family of proteins
Figure 2.

Interaction network and biological process analysis of the hub genes. (A) Hub genes and co-expression genes were analyzed using cBioPortal. Nodes with bold black outline represent hub genes. Nodes with thin black outline represent the co-expression genes. (B) Hierarchical clustering of hub genes. The samples under the red bar are non-cancerous samples and the samples under the blue bar are PCA samples.

Figure 3.

Correlation of expression of individual hubs genes survival. (A) Overall survival and (B) disease-free survival analyses of hub genes were performed using cBioPortal online platform. P<0.05 was considered statistically significant.

Figure 4.

Oncomine analysis of cancer vs. normal tissue of PBK and KLF4. Heat maps of (A) PBK and (B) KLF4 gene expression in clinical PCA carcinoma samples vs. normal tissues. PBK, PDZ binding kinase; KLF4, Krüppel-like factor 4.

Figure 5.

Relationship between the PBK, KLF4 and clinicopathological features. Association between the expression of (A) PBK and (B) KLF4 and TNM stage, Gleason grade and recurrence status. PBK, PDZ binding kinase; KLF4, Krüppel-like factor 4; TNM, tumor-node-mestastasis.

Figure 6.

mRNA expression of 8 hub genes between primary tumor, solid tissue normal and metastatic tissue.

Discussion

Previous studies have demonstrated that the TMPRSS2:ERG fusion is significantly associated with the diagnosis of PCa (25–27). Promoter hypermethylation and downregulated expression of glutathione peroxidase 3 have been observed in a variety of cancer types, including thyroid cancer, hepatocellular carcinoma and PCa (10,26,27). Yu et al (28) identified an association between Piwi-like protein 2 (PIWIL2) gene expression and metastatic PCa. Potential markers for use in the diagnosis and treatment of PCa, which exhibit high efficiency, are urgently required. To increase understanding of the molecular mechanisms of candidate genes, GO, KEGG and PPI analyses were performed. In the current study, the epigenetic and genetic mechanisms in PCa were assessed using microarray technology. A total of two mRNA microarray datasets were selected. A total of 273 DEGs were identified, including 173 downregulated genes and 100 upregulated genes. The interactions of DEGs were investigated using GO and KEGG analyses. DEGs were found to be enriched in focal adhesion, regulation of the actin cytoskeleton, tight junctions, coagulation cascades and gap junctions. However, other studies (29,30) have demonstrated that DEGs were enriched in a number of functional terms, including cellular response to bone morphogenetic protein (BMP) stimulus, response to BMP, extracellular region and pathways that are associated with transforming growth factor-β signaling. GO enrichment analysis revealed that changes in the most significant modules were enriched in nucleotide binding, nucleoside phosphate binding, small molecule binding and cytoskeletal protein binding, while changes in KEGG were mainly enriched in focal adhesion and regulation of the actin cytoskeleton. A total of 8 DEGs were selected as hub genes. The criteria for selection were as follows: MCODE scores >5, degree cut-off=2, node score cut-off=0.2, Max depth=100 and k-score=2. Among the 8 genes, PBK, RAP1A, GNAS and RAB39B were upregulated, while COPZ2, KLF4, BACE1 and COL12A1 were downregulated. PBK and KLF4 were identified to be important genes in the present study. PBK is highly homologous to mitogen-activated protein kinase (31,32). By virtue of target utilization, PBK has been revealed to influence growth and differentiation (33–36). PBK is expressed in the outer cell layer of seminiferous tubules in primary spermatocytes (37), and is often increased in a number of human cancer types from different tissue sources (38,39). However, the function of PBK has not yet been fully determined. In a previous study, the immunohistochemical expression of PBK/T-LAK cell-originated protein kinase (TOPK) was revealed to be significantly associated with human bladder cancer, and was identified as a novel diagnostic biomarker for this disease (40). In the present study, the PPI network revealed that PBK directly interacted with cyclin-dependent kinase 1, Rac GTPase-activating protein 1, baculoviral IAP repeat containing 5 and protein regulator of cytokinesis 1, indicating a key role for PBK in PCa. The expression of PBK was subsequently assessed. Gene upregulation in PBK were associated with reductions in overall and disease free survival. Additionally, the KLF4 upregulation was significantly associated with increased overall survival and disease-free survival. High levels of PBK were associated with advanced stages, Gleason score ≥8 and recurrence (41). Additionally, PBK was significantly increased in PCa (P=0.001), and the expression was higher in the Gleason high-scoring group compared with the low-scoring group (P=0.001) (41). PBK, Gleason score and pathological stage are independent predictors of PCa recurrence, and PBK has been indicated to be significantly associated with survival of no biochemical recurrence (42). As an important mitotic kinase, PBK has been reported to exhibit a close association with patient clinical characteristics (43). The current study indicated that higher mRNA levels of PBK were associated with TNM stage, Gleason grade and recurrence status, demonstrating the vital roles of PBK in the carcinogenesis and progression of PCa. PBK gene expression in metastatic tissue was higher compared with primary tumor and solid tissue normal tissue, and PBK has been indicated to serve an important role in mitosis (43). PBK expression and phosphorylation are significantly increased during cell mitosis (44). Previously, a knock-out study of TOPK revealed that PBK can affect spindle formation (36). When PBK is inhibited during mitosis, the spindle (especially the central part) in mitosis and the subsequent cells become blurred (44). The pulp division cannot be completed smoothly and the cells will subsequently split out of the multinucleated cells. Therefore, PBK has been identified to be associated with the regulation of proliferation and cell cycle changes in malignant tumor cells, and has also been revealed to promote tumor cell transformation (33). KLF4 is a member of the Krüppel-like zinc finger transcription factor family, which serves a role in regulating important processes, including cell proliferation, differentiation and embryo development (36). They are also associated with numerous human cancer types, including gastrointestinal, bladder and lung cancers (33,36,45). A number of KLF4 targeting genes are also biomarker transcription factors in the endothelial-mesenchymal transition (EMT) process (45). Additionally, a previous study indicated that the expression of E-cadherin and α-catenin in the KLF4 overexpression treatment group was significantly higher compared with the control group, while the mesenchymal cell marker vimentin and the expression of vascular endothelial growth was significantly lower compared with the control group (46). It has been shown that KLF4 protein is negatively associated with clinical stages in patients with meningioma, and it promotes or inhibits the EMT process by acting on transcription factors (46). The transcription factor KLF4 in PCa cells promotes the migration and invasion of EMT and tumor cells in vitro (47). These results are consistent with the results of the current study, which indicated that lower mRNA levels of PBK were associated with TNM stage, Gleason grade and recurrence status. KLF4 was also indicated to be downregulated in PCa tissue with metastases. Furthermore, the stable knockdown of KLF4 expression in PCa cells has been identified to upregulate the expression of epithelial-related gene E-cadherin and downregulate the expression of a variety of mesenchymal-associated genes in vitro, and has been revealed to serve a role in the inhibition of tumor cell migration and invasion (48). Katz et al (49) demonstrated that the expression of KLF4 in tumor tissues was significantly decreased in patients with PCa in the USA, and that the upregulation of KLF4 inhibited tumor migration and invasion. Ghaleb et al (50) identified a positive feedback loop control between KLF4 and the androgen receptor, and revealed that the inhibition of KLF4 expression in prostatic adenocarcinoma cells can inhibit the occurrence of EMT in vitro and serve a role in inhibiting tumor cell migration and invasion. It has also been indicated that KLF4 can serve the role as an oncogene or tumor suppressor gene in a number of cellular environments (50). In conclusion, a total of 273 DEGs and 8 hub genes were identified as potential novel diagnostic biomarkers for PCa. The current study identified 2 genes associated with PCa progression, including PBK and KLF4. However, the current study is performed based on bioinformatics methods and no experiments were performed to confirm these conclusions. Therefore, further experimental study is required to support the results gained from the current analysis.
  48 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Interrogating the kinome.

Authors:  Chao Zhang; Gaston Habets; Gideon Bollag
Journal:  Nat Biotechnol       Date:  2011-11-08       Impact factor: 54.908

3.  Haploinsufficiency of Krüppel-like factor 4 promotes adenomatous polyposis coli dependent intestinal tumorigenesis.

Authors:  Amr M Ghaleb; Beth B McConnell; Mandayam O Nandan; Jonathan P Katz; Klaus H Kaestner; Vincent W Yang
Journal:  Cancer Res       Date:  2007-08-01       Impact factor: 12.701

4.  Identification of prostate cancer hub genes and therapeutic agents using bioinformatics approach.

Authors:  Enhao Fang; Xiuqing Zhang; Qi Wang; Daoming Wang
Journal:  Cancer Biomark       Date:  2017-12-06       Impact factor: 4.388

5.  Genome-wide molecular profiles of HCV-induced dysplasia and hepatocellular carcinoma.

Authors:  Elisa Wurmbach; Ying-bei Chen; Greg Khitrov; Weijia Zhang; Sasan Roayaie; Myron Schwartz; Isabel Fiel; Swan Thung; Vincenzo Mazzaferro; Jordi Bruix; Erwin Bottinger; Scott Friedman; Samuel Waxman; Josep M Llovet
Journal:  Hepatology       Date:  2007-04       Impact factor: 17.425

6.  Gene expression patterns in human liver cancers.

Authors:  Xin Chen; Siu Tim Cheung; Samuel So; Sheung Tat Fan; Christopher Barry; John Higgins; Kin-Man Lai; Jiafu Ji; Sandrine Dudoit; Irene O L Ng; Matt Van De Rijn; David Botstein; Patrick O Brown
Journal:  Mol Biol Cell       Date:  2002-06       Impact factor: 4.138

7.  The zinc-finger transcription factor Klf4 is required for terminal differentiation of goblet cells in the colon.

Authors:  Jonathan P Katz; Nathalie Perreault; Bree G Goldstein; Catherine S Lee; Patricia A Labosky; Vincent W Yang; Klaus H Kaestner
Journal:  Development       Date:  2002-06       Impact factor: 6.868

8.  TOPK is highly expressed in circulating tumor cells, enabling metastasis of prostate cancer.

Authors:  Huimin Sun; Lei Zhang; Changhong Shi; Peizhen Hu; Wei Yan; Zhe Wang; Qiuhong Duan; Fan Lu; Lipeng Qin; Tao Lu; Juanjuan Xiao; Yingmei Wang; Feng Zhu; Chen Shao
Journal:  Oncotarget       Date:  2015-05-20

9.  The Oncogenic Role of Tribbles 1 in Hepatocellular Carcinoma Is Mediated by a Feedback Loop Involving microRNA-23a and p53.

Authors:  Ying Ye; Guangdong Wang; Guoyu Wang; Juhua Zhuang; Saifei He; Yanan Song; Jing Ni; Wei Xia; Jiening Wang
Journal:  Front Physiol       Date:  2017-11-10       Impact factor: 4.566

10.  The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists.

Authors:  Da Wei Huang; Brad T Sherman; Qina Tan; Jack R Collins; W Gregory Alvord; Jean Roayaei; Robert Stephens; Michael W Baseler; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

View more
  2 in total

1.  Bioinformatic analysis of dysregulated proteins in prostate cancer patients reveals putative urinary biomarkers and key biological pathways.

Authors:  Rui Vitorino; Margarida Fardilha; Tânia Lima; Rui Henrique
Journal:  Med Oncol       Date:  2021-01-16       Impact factor: 3.064

2.  Screening a novel six critical gene-based system of diagnostic and prognostic biomarkers in prostate adenocarcinoma patients with different clinical variables.

Authors:  Hadia Munir; Fawad Ahmad; Sajid Ullah; Saeedah Musaed Almutairi; Samra Asghar; Tehmina Siddique; Mostafa A Abdel-Maksoud; Rabab Ahmed Rasheed; Fatma Alzahraa A Elkhamisy; Mohammed Aufy; Hamid Yaz
Journal:  Am J Transl Res       Date:  2022-06-15       Impact factor: 3.940

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.