Literature DB >> 29805577

Functional analysis of gene expression profiling-based prediction in bladder cancer.

Ji-Ping Wang1, Ji-Yan Leng2, Rong-Kui Zhang1, Li Zhang1, Bei Zhang1, Wen-Yan Jiang1, Lan Tong1.   

Abstract

The present study aimed to analyze the modification of gene expression in bladder cancer (BC) by identifying significant differentially expressed genes (DEGs) and functionally assess them using bioinformatics analysis. To achieve this, two microarray datasets, GSE24152 (which included 10 fresh tumor tissue samples from urothelial bladder carcinoma patients and 7 benign mucosa samples from the bladder), and GSE42089 (which included 10 tissues samples from urothelial cell carcinoma patients and 8 tissues samples from the normal bladder), were downloaded from the Gene Expression Omnibus database for further analysis. Differentially expressed genes (DEGs) were screened between benign the mucosa and control groups in GSE24152 and GSE42089 datasets. Gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) analysis were performed on overlapping DEGs identified in GSE24152 and GSE42089. Protein-protein interaction (PPI) networks and sub-networks were then constructed to identify key genes and main pathways. GO terms analysis was also performed for the selected clusters. In total, 1,325 DEGs in GSE24152 and 647 DEGs in GSE42089 were screened, in which 619 common DEGs were identified. The DEGs were mainly enriched in pathways and GO terms associated with mitotic and chromosome assembly, including nucleosome assembly, spindle checkpoint and DNA replication. In the interaction network, progesterone receptor (PGR), MAF bZIP transcription factor G (MAFG), cell division cycle 6 (CDC6) and members of the minichromosome maintenance family (MCMs) were identified as key genes. Histones were also considered to be significant factors in BC. Nucleosome assembly and sequence-specific DNA binding were the most significant clustered GO terms. In conclusion, the DEGs, including PGR, MAFG, CDC6 and MCMs, and those encoding the core histone family were closely associated with the development of BC via pathways associated with mitotic and chromosome assembly.

Entities:  

Keywords:  bladder cancer; clustering analysis; differentially expressed genes; interaction network

Year:  2018        PMID: 29805577      PMCID: PMC5950606          DOI: 10.3892/ol.2018.8370

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Bladder cancer (BC) is a heterogeneous disease with a variable disease history and is one of the most common genitourinary malignancies worldwide (1). A total of 72,570 new cases of urinary BC were diagnosed in the United States and 15,210 patients succumbed to BC in 2013 (2). The most common pathological type of BC is urothelial cell carcinoma, and the 5-year survival rate of BC is ~77% in the United States (3). BC is a disease with multifactorial etiology; radical cystectomy with pelvic lymphadenectomy is considered to be standard therapy (4), although the use of radiotherapy is considered as an alternative, particularly in more frail patients (5). However, these therapies are limited in their effectiveness, as BC has high recurrence and mortality rates, meaning that greater understanding of the course of BC development is urgently required. A previous study suggested that different mechanisms have evolved to respond to specific phenotypic alterations in genes and cellular pathways involved in BC, and certain genetic variations in major tumor-associated pathways were proven to induce BC (6). Shen et al (6) assessed the differentially expressed genes (DEGs) and interacting pathways in BC using bioinformatics analysis, given that the genes encoding activator protein 1 (AP-1) and nuclear factor of activated T cells were key in BC. Zhou et al (7) analyzed the gene expression in human BC samples using the GSE42089 microarray dataset, identifying a set of genes associated with mitotic spindle checkpoint dysfunction as being key in BC. However, the fact that only one microarray dataset was used by each of the above studies may prove to be a limitation to the analysis described. In the present study, the gene expression profiles examined had to use the same platform as GSE42089 (Affymetrix GeneChip Human Genome U133 Plus 2.0 Array) and had to be samples composed of bladder cancer specimens and normal bladder tissue. The only other dataset that fit these criteria was GSE24152. Therefore, two microarray profiles, GSE24152 and GSE42089, were used for integrated analysis of gene expression modification in BC in the present study; the use of the two gene expression profiles enabled a more reliable conclusion to be drawn. DEGs were identified, and gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed. Protein-protein interaction (PPI) networks and sub-networks were also constructed to identify the key genes and main pathways involved in BC. Using the aforementioned bioinformatics methods, the modification of gene expression in BC was analyzed by identifying significant DEGs and pathways, which may provide novel insight for the etiology and treatment research of BC.

Materials and methods

Microarray data

Two gene expression profiles, GSE24152 (8) and GSE42089 (7), were downloaded from the Gene Expression Omnibus database in National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/geo/), based on the GPL6791 and GPL9828 platforms in the Affymetrix GeneChip Human Genome U133 Plus 2.0 Array, respectively. The GSE24152 dataset included 17 samples, of which 10 were fresh tumor tissue samples collected from patients with urothelial carcinoma of the bladder and 7 were benign mucosa samples from the bladder. The microarray GSE42089 dataset consisted of 18 samples, of which 10 samples were tissues from urothelial cell carcinoma and 8 were normal bladder tissue.

Data preprocessing and DEGs analysis

The robust multiple average algorithm in the affy package (9) was used to normalize the microarray data and box plots were then generated. The microarray data from GSE24152 and GSE42089 were divided into two groups, a bladder carcinoma group and a normal group. Using the Limma package (10), the probe-level data of two sets were converted into expression measures and DEGs were identified in the bladder carcinoma group compared with the control group in GSE24152 and GSE42089. A Venn diagram was generated using VennDiagram package (11) to screen common DEGs in GSE24152 and GSE42089 for further analysis. A false discovery rate <0.01 and a |log2FC (fold-change) |>0.5 was used as the threshold. Heat maps were generated using the heatmap.2 function in ggplot 2 (12) to display the relative expression differences of DEGs. In addition, the cor.test function was used to evaluate the correlation coefficient of two groups of DEGs.

GO and pathway enrichment analysis

GO terms analysis is a widely used approach for studying large-scale genomic or transcriptomic data in function that consists of three terms: Biological process, cellular component and molecular function categories (13). KEGG is a widely used collection of online databases that deals with genomes, enzymatic pathways, and biological chemicals (14). In the present study, GO analysis and KEGG pathway enrichment were performed on the three categories of the screened DEGs using the DAVID (15) analysis tool, with a statistically significant cut-off of P<0.01.

PPI network and sub-network construction

Cytoscape (16) is an open-source bioinformatics software platform used for the visualization of molecular interaction networks and integrating with gene expression profiles and other state data. In the present study, Biosgenet in Cytoscape (17) was used to predict and visualize the interactions between selected DEGs and proteins using the BIND database (18) with a statistically significant cut-off of P<0.05. Significant sub-networks (clusters) were extracted from the PPI network using ClusterOne in Cytoscape (19) with a statistically significant cut-off of P<0.01. GO terms analysis was also performed for the selected clusters.

Results

DEGs selection

The variations in raw expression data were normalized following preprocessing (Fig. 1). A total of 1,325 DEGs were screened from the GSE24152 dataset and 637 DEGs were screened from the GSE42089 dataset (Fig. 2A and B depicts volcano plots of the two microarrays). A total of 619 common DEGs were identified by a Venn diagram, including 313 upregulated genes and 306 downregulated genes. From the heat maps of the 619 common DEGs generated (Fig. 2C and D), the DEGs could significantly distinguish the bladder carcinoma dataset from normal dataset.
Figure 1.

Box plots of data normalization. (A) Box plot of GSE24152 data normalization. (B) Box plot of GSE42089 data normalization. The x-axis represents samples; the y-axis represents gene expression values. The whiskers of the box plots represent the sample median.

Figure 2.

Volcano plots and heat maps of DEGs of two microarray profiles. Volcano plots of DEGs in (A) GSE24152 and (B) GSE42089. Heat map of DEGs in (C) GSE24152 and (D) GSE42089. DEG, differently expressed gene.

GO and KEGG enrichment analysis of DEGs

GO and KEGG enrichment analysis were performed with a cut-off of P<0.01, obtaining 74 GO terms and 4 KEGG pathways were obtained. The main GO terms and KEGG pathways of 619 common DEGs are listed in Table I. According to the results, DEGs were mainly enriched in chromosomal assembly and cell cycle pathways, including DNA replication, and the main GO terms were also associated with mitosis, including mitotic sister chromatid segregation and spindle checkpoint.
Table I.

Functional enrichment of differentially expressed genes.

CategoryTermP-valueFold enrichment
BPMitotic sister chromatid segregation1.90×10−49.8
BPNegative regulation of cell cycle process1.90×10−311.0
BPSpindle checkpoint9.70×10−316.3
CCCondensed chromosome, centromeric region1.70×10−99.8
CCCondensed chromosome kinetochore2.90×10−910.5
CCMidbody1.20×10−313.5
MFChromatin binding4.50×10−33.8
MFMicrotubule motor activity5.30×10−35.2
MFDamaged DNA binding1.70×10−37.3
KEGGProgesterone-mediated oocyte maturation1.80×10−45.5
KEGGCell cycle1.10×10−86.1
KEGGDNA replication1.70×10−49.1

BP, biological process; CC, cellular component; MF, molecular function; KEGG, Kyoto Encyclopedia of Genes and Genomes.

A PPI network was constructed for all DEGs (Fig. 3) and clustering analysis was performed based on this PPI network. In total, 6 sub-networks (clusters) were obtained (Table II and Fig. 4). Progesterone receptor (PGR) was a key gene in cluster 1 and the proteins enriched in cluster 1 were all histone proteins. In cluster 2, MAF bZIP transcription factor G (MAFG) was identified as a key gene and it was also detected in cluster 5. Cell division cycle 6 (CDC6) was a key gene in clusters 3 and 4. Minichromosome maintenance complex component (MCM) family members, including MCM2, MCM4, MCM7 and MCM10, were also key genes in clusters 3 and 4. Nucleosome assembly and sequence-specific DNA binding were significant GO terms in cluster 1 and cluster 2, respectively. No GO terms were obtained for clusters 3–6.
Figure 3.

Protein-protein interaction network of DEGs. Pink boxes depict DEGs; blue boxes depict target proteins; pink arrows depict interactions between genes and target proteins; blue arrows depict interactions between proteins. DEG, differently expressed gene.

Table II.

Sub-networks obtained by clustering analysis.

ClusterNodesDensityQualityP-valueDEGsSignificant GO term
1290.6080.946<0.001PGR, CHAF1B, ASF1B,Nucleosome assembly
2160.6530.8429.18×10−5HLF, MAFGSequence-specific DNA binding
3150.5140.6354.41×10−5CDC6, MCM2, MCM7, MCM10None found
4130.5130.4942.00×10−3CDC6, MCM2, MCM10, MCM7, MCM4None found
5120.5150.5763.00×10−3MAFGNone found
6  40.5001.0001.00×10−2EFNA4None found

DEGs, differentially expressed genes; GO, gene ontology.

Figure 4.

Sub-networks of 6 clusters. Pink boxes depict differently expressed genes; blue boxes depict target proteins; pink arrows depict interactions between genes and target proteins; blue arrows depict interactions between proteins.

Discussion

BC is a common malignancy that requires a high degree of surveillance, owing to the frequency of recurrence and the poor clinical outcome of invasive disease (20). Bioinformatic analysis of BC cells at the gene level can provide novel insights into this disease. In the present study, using the microarray datasets GSE24152 and GSE42089, significant DEGs in BC were identified. In the significant extracted sub-networks, PGR, MAFG, CDC6 and members of the MCM family were identified as key genes in BC; histones were also considered to have major functions in BC. The main GO terms and pathways were those associated with the cell cycle and chromosome assembly, including nucleosome assembly, spindle checkpoint and DNA replication. PGR encodes a member of the steroid receptor superfamily, which mediates the physiological effects of progesterone (21). In the present study, PGR was an important gene in a cluster that was regulated by a set of transcription factors, revealing the potential significance of PGR in BC. Men are more frequently affected by BC than women, indicating that hormones and their receptors may function as regulatory factors (6). Miyamoto et al (22) clarified that the androgen receptor (AR) was involved in BC. As AR and PGR are determinant modulators of gonadal sex hormones, PGR may perform similar functions in BC with AR, via the progesterone-mediated oocyte maturation pathway, which was enriched in this study (23). Histones are the main structural proteins associated with DNA in eukaryotic cells. Histones are divided into two groups, core histones and the nucleosomal histones (24). Core histones are some of the most highly conserved proteins in eukaryotes and have key roles in the organization of DNA folding (25). The altered patterns of histone modifications in various human cancer types have been studied extensively in recent years (26). Schneider et al (27) demonstrated that global histone modification levels were lower in BC than in normal urinary tissue. The conserved histone H2A has been reported to be overexpressed in BC cells and contributes to the activation of cancer-associated transcription pathways (28). In the present study, a set of core histones was clustered in cluster 1, the main GO term of which was associated with nucleosome assembly. Considering the function of histones in mitosis, it was concluded that the nucleosome and chromatin assembly may be modified in BC. MAF encodes a nuclear transcription-regulating protein characterized by a basic region and leucine zipper domain; it has crucial roles in a variety of cellular processes (29). MAFG is a small member of the MAF protein family that consists of little more than the DNA binding and dimerization motif (30), which is able to partially co-localize with FBJ murine osteosarcoma viral oncogene homolog (FOS) in the nucleus and heterodimize with it (31). Members of the AP-1 family of transcription factors are dimeric complexes involved in cellular proliferation, transformation and death (32), and contains the FOS, jun proto-oncogene (JUN) and activating transcription factor (ATF) protein families, of which FOS is a major member. AP-1 family members are immediate early genes induced by a variety of stress signals and control the stress response including cell proliferation, apoptosis and tumorigenesis (33). By forming heterodimers, FOS proteins aid the binding of AP-1 to DNA and exert oncogene activity, leading to tumorigenesis (34). Previous reports demonstrated that the genes encoding AP-1 participate in the cancer-associated immune and inflammation pathways in BC (6). The data from the present study revealed that the expression of MAFG in BC was upregulated, which may increase tumorigenesis by promoting the formation of FOS dimers and the AP-1 complex. CDC6 is an essential regulator of DNA replication in eukaryotic cells that assembles pre-replicative complexes at origins of replication during the G1 phase of the cell division cycle (35). MCM family members encode highly conserved proteins that act as enzymatically active helicases (36). MCMs drive the formation of pre-replicative complexes (PRCs), the formation of which is the first key event during the G1 phase of the cell cycle (37). MCMs and CDC6 are key proteins in the mechanism of DNA replication and are functionally associated with each other during the cell cycle (38). CDC6 is responsible for the loading of MCM proteins onto origins of replication and, with the presence of CDC6, MCMs bind to the chromatin specifically during the G1 phase of the cell cycle (35). The increased expression of CDC6 and MCM has been observed in dysplastic cells and CDC6 and MCM are consequently considered to be specific biomarkers of proliferating cells (38). Recent studies have revealed the proto-oncogenic activity of CDC6, with its overexpression interfering with the expression of certain tumor suppressor genes and potentially promoting DNA hyper-replication, inducing a senescence response similar to that caused by oncogene activation (39,40). Some members of the MCM family, such as MCM7, were also overexpressed and amplified in a variety of human malignancies (41). In the present study, the expression of CDC6 was found to be upregulated, revealing that in BC cells, DNA replication is aberrant. In conclusion, the present study reveals that the genes PGR, MAFG, CDC6 and MCMs, and a set of histones are important factors in BC and have key roles in mitotic processes, including nucleosome assembly, spindle checkpoint and DNA replication. However, the small size of the microarray sample and the lack of experimental variation are limitations. Thus, further studies with larger sample size and experimental verification should be performed to confirm the conclusions of the current study.
  38 in total

1.  BIND: the Biomolecular Interaction Network Database.

Authors:  Gary D Bader; Doron Betel; Christopher W V Hogue
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

2.  Detecting overlapping protein complexes in protein-protein interaction networks.

Authors:  Tamás Nepusz; Haiyuan Yu; Alberto Paccanaro
Journal:  Nat Methods       Date:  2012-03-18       Impact factor: 28.547

Review 3.  AP-1 as a regulator of cell life and death.

Authors:  Eitan Shaulian; Michael Karin
Journal:  Nat Cell Biol       Date:  2002-05       Impact factor: 28.824

4.  Global histone H4K20 trimethylation predicts cancer-specific survival in patients with muscle-invasive bladder cancer.

Authors:  Ann-Christin Schneider; Lukas C Heukamp; Sebastian Rogenhofer; Guido Fechner; Patrick J Bastian; Alexander von Ruecker; Stefan C Müller; Jörg Ellinger
Journal:  BJU Int       Date:  2011-05-31       Impact factor: 5.588

5.  Radiotherapy with or without chemotherapy in muscle-invasive bladder cancer.

Authors:  Nicholas D James; Syed A Hussain; Emma Hall; Peter Jenkins; Jean Tremlett; Christine Rawlings; Malcolm Crundwell; Bruce Sizer; Thiagarajan Sreenivasan; Carey Hendron; Rebecca Lewis; Rachel Waters; Robert A Huddart
Journal:  N Engl J Med       Date:  2012-04-19       Impact factor: 91.245

6.  BisoGenet: a new tool for gene network building, visualization and analysis.

Authors:  Alexander Martin; Maria E Ochagavia; Laya C Rabasa; Jamilet Miranda; Jorge Fernandez-de-Cossio; Ricardo Bringas
Journal:  BMC Bioinformatics       Date:  2010-02-17       Impact factor: 3.169

Review 7.  Cancer stem cells and hepatocellular carcinoma.

Authors:  Zhixing Yao; Lopa Mishra
Journal:  Cancer Biol Ther       Date:  2009-09       Impact factor: 4.742

8.  Comparative gene expression profiling analysis of urothelial carcinoma of the renal pelvis and bladder.

Authors:  Zhongfa Zhang; Kyle A Furge; Ximing J Yang; Bin T Teh; Donna E Hansel
Journal:  BMC Med Genomics       Date:  2010-12-15       Impact factor: 3.063

9.  Cytoscape 2.8: new features for data integration and network visualization.

Authors:  Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

10.  The present and future burden of urinary bladder cancer in the world.

Authors:  Martine Ploeg; Katja K H Aben; Lambertus A Kiemeney
Journal:  World J Urol       Date:  2009-02-15       Impact factor: 4.226

View more
  1 in total

1.  A Herpes Simplex Virus Thymidine Kinase-Induced Mouse Model of Hepatocellular Carcinoma Associated with Up-Regulated Immune-Inflammatory-Related Signals.

Authors:  Zhijuan Gong; Qingwen Ma; Xujun Wang; Qin Cai; Xiuli Gong; Georgi Z Genchev; Hui Lu; Fanyi Zeng
Journal:  Genes (Basel)       Date:  2018-07-27       Impact factor: 4.096

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.