Literature DB >> 26893748

Identification of genes and pathways associated with pancreatic ductal adenocarcinoma by bioinformatics analyses.

Jin Long1, Zhongbo Zhang1, Zhe Liu1, Yuanhong Xu1, Chunlin Ge1.   

Abstract

This study aimed to explore the underlying genes and pathways associated with pancreatic ductal adenocarcinoma (PDAC) by bioinformatics analyses. Gene expression profile GSE43795 was downloaded from the Gene Expression Omnibus database. The differentially expressed genes (DEGs) between six PDAC and five non-neoplastic pancreatic tissue samples were analyzed using the limma package. Gene ontology (GO) and pathway enrichment analyses of DEGs were performed, followed by functional annotation and protein-protein interaction (PPI) network construction. Finally, the sub-network was identified and pathway enrichment analysis was performed on the contained DEGs. A total of 374 downregulated and 559 upregulated DEGs were identified. The downregulated DEGs were enriched in GO terms associated with digestion and transport and pathways related to metabolism, while the upregulated DEGs were enriched in GO terms associated with the cell cycle and mitosis and pathways associated with the occurrence of cancer including the cell cycle pathway. Following functional annotation, the oncogene pituitary tumor-transforming 1 (PTTG1) was upregulated. In the PPI network and sub-network, cell division cycle 20 (CDC20) and BUB1 mitotic checkpoint serine/threonine kinase B (BUB1B) were hub genes with high connectivity degrees. Additionally, DEGs in the sub-network including cyclin B1 (CCNB1) were mainly enriched in the cell cycle and p53 signaling pathways. In conclusion, the cell cycle and p53 signaling pathways may play significant roles in PDAC, and DEGs including CDC20, BUB1B, CCNB1 and PTTG1 may be potential targets for PDAC diagnosis and treatment.

Entities:  

Keywords:  differentially expressed genes; pancreatic ductal adenocarcinoma; pathway enrichment analysis; protein-protein interaction network

Year:  2015        PMID: 26893748      PMCID: PMC4734321          DOI: 10.3892/ol.2015.4042

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Pancreatic ductal adenocarcinoma (PDAC) ranks among the most malignant of human cancers (1). Currently, the annual number of associated mortalities is similar to the disease's annual incidence (2). The poor prognosis of PDAC is correlated with the nonspecificity of symptoms, advanced disease at presentation and lack of effective adjuvant and systemic therapy (3). Currently, surgical extirpation for localized disease offers the only chance of long-term survival (3). Therefore, understanding the pathological mechanisms to detect PDAC as early as possible is an urgent requirement to enable further advances in therapeutic modalities and agents. Previous studies have identified that the development of PDAC may involve certain genetic factors including the overexpression of oncogenes, inactivation of tumor suppressor genes or the deregulation of various signaling pathways (4). For instance, kirsten rat sarcoma viral oncogene homolog mutations have been observed to occur with increasing frequency in progressively later stages of pancreatic adenocarcinoma (5). Iacobuzio-Donahue et al demonstrated that tumor antigen p97, cathepsin L2 and kallikrein 10 were differentially expressed among PDACs (6). Additionally, the phosphoinositide 3-kinase signaling pathway is known to be activated in pancreatic cancer, which is due to the aberrant expression of phosphatase and tensin homolog. Progress achieved in understanding the mechanism of PDAC is likely to contribute to the treatment of this disease. However, no breakthrough treatments have been identified, so the present knowledge would appear to be insufficient. In the present study, we downloaded microarray data of GSE43795 and identified the differentially expressed genes (DEGs) between PDAC and non-neoplastic pancreatic tissue (NN) samples to explore the molecular mechanisms of PDAC. Park et al (7) used the dataset GSE43795 to study the characterization of gene expression and activated signaling pathways in solid pseudopapillary neoplasms of the pancreas. However, the functional annotation and protein-protein interaction (PPI) of DEGs are still far from being clear. In the present study, we performed functional enrichment analyses and functional annotation. Finally, PPI networks and sub-networks were constructed and analyzed to study and identify target genes for the diagnosis and treatment of PDAC. We aimed to explore the underlying genes and pathways associated with PDAC. The findings from this study are likely to play a significant role in PDAC genesis and may potentially serve as biomarkers in the diagnosis and treatment of PDAC.

Materials and methods

Affymetrix microarray data

The microarray data of GSE43795 were downloaded from the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) database based on the platform of GPL10558 Illumina HumanHT-12 V4.0 expression beadchip. A total of six PDAC and five NN samples were used in this study to develop the Affymetrix microarray data (Affymetrix, Inc., Santa Clara, CA, USA).

Data pre-processing and differential expression analysis

Background correction, quartile data normalization and probe summarization were performed for the original array data, then they were converted into expression measures by the robust multiarray average (8) algorithm in the R affy package (9) (http://www.bioconductor.org). For the GSE43795 dataset, the limma eBayes (10) method in Bioconductor (http://www.bioconductor.org) was used to identify genes which were differentially expressed between PDAC and NN samples. The log2-fold change (log2FC) was calculated. |log2FC| ≥3 and false discovery rate (FDR) <0.01 were considered as the cutoff values for DEG screening.

Gene ontology and pathway enrichment analyses

Gene ontology (GO) (11) is a tool used for collecting a large number of gene annotation terms. The Kyoto Encyclopedia of Genes and Genomes (KEGG) knowledge database (12) is a collection of online databases dealing with genomes, enzymatic pathways and biological chemicals. The Database for Annotation, Visualization and Integrated Discovery (DAVID) (13), as a comprehensive set of functional annotation tools, has been developed for relating the functional terms with gene lists using a clustering algorithm. In order to analyze the DEGs at the functional level, we performed GO and KEGG pathway enrichment analyses using the DAVID online tool to obtain the enriched biological processes (BPs) and pathways. P<0.01 was set as the threshold value.

Functional annotation of DEGs

Based on the data information of transcription factors (TFs), the DEGs were selected and annotated to determine whether these genes had the function of transcriptional regulation. The tumor suppressor gene database (TSGene) (14) integrates TSGs with large-scale experimental evidence to provide a comprehensive resource for the further investigation of TSGs and their molecular mechanisms in cancer. The tumor-associated gene (TAG) database (15) is used to save new genes that play a role in carcinogenesis. In this study, we extracted all known oncogenes and TSGs from the TAG and TSG databases.

PPI network construction and sub-network identification

The Search Tool for the Retrieval of Interacting Genes (STRING) database (16) is a precomputed global resource which was designed to evaluate PPI information. In the PPI network, each node stands for a gene and the edges represent the interactions between nodes. The degree indicates the number of edges linked to a given node and the nodes with a high degree are defined as the hub genes that possess essential biological functions. In this study, the STRING online tool was applied to analyze the PPI network of DEGs and only those experimentally validated interactions with a combined score >0.9 were selected as significant. The BioNet package (http://www.bioconductor.org/packages/release/bioc/html/BioNet.html) (17) provides a comprehensive set of methods for the integrated analysis of gene expression data and biological networks. In the current study, we used BioNet to identify the sub-network in the PPI network with FDR <1.0E-06. Based on the DEGs in the sub-network, we performed KEGG pathway enrichment analysis with P<0.01.

Results

Identification of DEGs

For the dataset GSE43795, a total of 979 transcripts were obtained following data pre-processing. Among them, 393 were downregulated transcripts corresponding to 374 DEGs and 586 were upregulated transcripts which corresponded to 559 DEGs.

GO and pathway enrichment analyses

Following GO and pathway analyses for down- and upregulated DEGs, the top five GO BP terms were collected and are shown in Table I. The GO terms enriched by downregulated DEGs were mainly related to digestion, transport and signaling while the GO terms enriched by upregulated DEGs were mainly associated with the cell cycle and mitosis.
Table I.

Gene ontology functional enrichment analysis for down- and upregulated differentially expressed genes (top 5).

TermPathwayCountP-value
Downregulated DEGs
  GO: 0007586Digestion  181.16E-10
  GO: 0006811Ion transport  531.73E-08
  GO: 0007267Cell-cell signaling  532.21E-08
  GO: 0044765Single-organism transport1043.34E-08
  GO: 0006810Transport1147.52E-07
Upregulated DEGs
  GO: 0000280Nuclear division  446.22E-15
  GO: 0007067Mitosis  446.22E-15
  GO: 0000278Mitotic cell cycle  711.64E-14
  GO: 0048285Organelle fission  449.46E-14
  GO: 0051301Cell division  489.29E-12

Count signifies the number of enriched DEGs. DEG, differentially expressed gene.

The pathways enriched by downregulated DEGs were mainly related to digestion, absorption and metabolism, and included protein digestion and absorption and metabolism of xenobiotics by cytochrome P450. The pathways enriched by upregulated DEGs were mainly related to the occurrence and spread of cancer, including the cell cycle and p53 signaling pathways (Table II).
Table II.

Pathway functional enrichment analysis for down- and upregulated differentially expressed genes.

TermPathwayCountP-value
Downregulated DEGs
  04972Pancreatic secretion194.79E-12
  04974Protein digestion and absorption139.90E-08
  00260Glycine, serine and threonine metabolism  89.43E-07
  04964Proximal tubule bicarbonate reclamation  61.78E-05
  04950Maturity onset diabetes of the young  62.99E-05
  00980Metabolism of xenobiotics by cytochrome P450  84.11E-04
  00982Drug metabolism-cytochrome P450  84.97E-04
  00480Glutathione metabolism  61.58 E-03
  04971Gastric acid secretion  72.65 E-03
  04973Carbohydrate digestion and absorption  54.96 E-03
  04020Calcium signaling pathway115.38 E-03
  04610Complement and coagulation cascades  68.03 E-03
  00250Alanine, aspartate and glutamate metabolism  48.44 E-03
  00750Vitamin B6 metabolism  29.09 E-03
  04976Bile secretion  69.21 E-03
Upregulated DEGs
  04110Cell cycle159.27E-06
  04115p53 signaling pathway119.35E-06
  04512ECM-receptor interaction103.77E-04
  05200Pathways in cancer204.11E-03
  05146Amoebiasis  97.08E-03
  05412Arrhythmogenic right ventricular cardiomyopathy  79.61E-03

Count signifies the number of enriched DEGs. DEG, differentially expressed gene.

After researching the expression of TFs and TAGs, 15 TFs were downregulated, including prospero homeobox 1 and PBX/knotted 1 homeobox 2, and 19 TFs were upregulated, including vitamin D receptor and upstream transcription factor 1. In addition, among the downregulated DEGs, 22 genes were TAGs. Of these, 3 were oncogenes, 16 were TSGs and the effect of other 3 genes was uncertain. In the upregulated DEGs, 12 were oncogenes, including pituitary tumor-transforming 1 (PTTG1), 32 were TSGs and the effect of other 11 genes was uncertain (Table III).
Table III.

Results of functional annotation of differentially expressed genes.

TF countTF nameTAG countTAG name
Downregulated DEGs
15PROX1, PKNOX2, PBX1, PAX6, ONECUT1, NR5A2, NR4A2, NKX2.5, NKX2.2, LMO3, KLF15, INSM1, GATA4, FOSB, ESRRG22TAG oncogenes: PBX1, GATA4, FGFR1;
TSGs: ZBTB16, WNK2, SFRP5, SFRP1, SERPINI2, PROX1, PLCE1, PAX6, ONECUT1, NRCAM, GNMT, DIRAS3, C2orf40, BTG2, BEX2, ARID3B;
Others: SLC43A1, NR4A2, CHRM3
Upregulated DEGs
19VDR, USF1, SPI1, RUNX2, RUNX1, PITX1, MYCBP, LEF1, HOXC4, HOXB8, HOXB7, HOXA13, HOXA10, FOXM1, FOXF2, FOXD2, FOXD1, FOXA1, E2F755TAG oncogenes: WISP1, TNFRSF6B, SPI1, RUNX2, PTTG1, NRAS, LCN2, LAMC2, HOXA10, HMMR, CEP55, CCNA2;
TSGs: TES, SFN, SERPINB5, SEC14L2, RASAL1, RARRES3, RARRES1, PYCARD, PRODH, MMP11, MFHAS1, JUP, ISG15, INPP4B, IGFBP3, HTRA1, HPGD, HOPX, GPRC5A, GLIPR1, GJB2, FANCD2, EGLN3, CHEK1, CEACAM1, CDH11, CASP8, CAPG, BUB1B, BLM, BIK, ABLIM3;
Others: TFAP2A, TACC3, RUNX1, PTK6, OAS1, ITGB4, FHL2, DHDH, CCNE2, BUB1, BIRC5;

TF, transcription factor; TAG, tumor-associated gene; DEG, differentially expressed gene; TSG, tumor suppressor gene.

PPI network construction and sub-network pathway enrichment analysis

Based on data from the STRING database, the PPI network was constructed (Fig. 1). Ten nodes were selected as hub genes (degree ≥12), including cell division cycle 20 (CDC20, degree=18) and BUB1 mitotic checkpoint serine/threonine kinase B (BUB1B, degree=16).
Figure 1.

Protein-protein interaction network of differentially expressed genes (DEGs). Green nodes represent downregulated DEGs; red nodes represent upregulated DEGs. The size of the node indicates the connectivity degree and larger circles indicate a higher degree.

Using the BioNet package, sub-neworks with 34 nodes were detected. Fig. 2 shows that CDC20 was a hub gene with degree=16.
Figure 2.

Sub-network of differentially expressed genes (DEGs). Color indicates log2-fold change (from low to high: green, pink, red). Green nodes represent downregulated DEGs; red nodes represent upregulated DEGs. Circles indicate that the node is significant in the network while squares indicate less significance.

Following KEGG pathway enrichment analyses of the DEGs in the sub-network, we observed that these DEGs including cyclin B1 (CCNB1) were mainly enriched in the cell cycle, p53 signaling and oocyte meiosis pathways.

Discussion

The identification of DEGs in PDAC is critical to the development of novel strategies to detect and treat this highly malignant cancer. In the present study, a total of 933 DEGs were identified between PDAC and NN samples through gene expression profiling of GSE43795. The downregulated DEGs were mainly enriched in the BP terms associated with digestion, transport and signaling, and pathways associated with digestion, absorption and metabolism. The upregulated DEGs were mainly enriched in the BP terms associated with cell cycle and mitosis, and in the cell cycle and p53 signaling pathways. The oncogene PTTG1 was upregulated following functional annotation. In the PPI network, the hub genes CDC20 and BUB1B had higher connectivity degrees. Additionally, CCNB1, CDC20 and BUB1B were enriched in several pathways in the sub-networks. This result suggested that these genes and pathways may play significant roles in the progression of PDAC. Cancer is characterized by uncontrolled cell proliferation, and tumor cells have typically acquired damage to genes that directly regulate their cell cycles (18). Research has revealed that the loss of appropriate cell cycle regulation leads to genomic instability (19). The cell cycle is believed to play a role in the etiology of spontaneous cancers (20). In the present study, the cell cycle pathway was observed to be enriched by several upregulated DEGs, including CDC20 and BUB1B, which were also hub genes in the PPI network and sub-network. CDC20 is one of the regulators of spindle checkpoints, and appears to act as a regulatory protein interacting with several other proteins at multiple points in the cell cycle (21). In mammals, CDC20 is involved in anaphase onset and late mitotic events (22) and its expression is essential for cell division (23). At present, CDC20 is frequently reported to be upregulated in numerous types of malignancies including pancreatic cancer (23). Chang et al suggested that CDC20 expression may play a role in facilitating PDAC cell mitosis. For the other DEG, BUB1B, its encoded protein is also a key protein in the mitotic spindle checkpoint (24). It has been reported that overexpression of spindle assembly checkpoint molecules may result in DNA aneuploidy and carcinogenesis in mice (25). The high expression of BUB1B has often been reported to be associated with chromosomal instability in several malignancies, including kidney carcinomas, breast cancer and bladder cancer (24,26,27). Our results further confirm that aberrant CDC20 and BUB1B expression and cell cycle pathways in which the two DEGs participated may play key roles in PDAC tumorigenesis and progression and may thus be useful as therapeutic targets. In this study, the p53 signaling pathway was also observed to be enriched by upregulated DEGs including CCNB1. The p53 protein inhibits malignant transformation through direct and indirect regulation of transcription of the genes associated with the cell cycle and apoptosis (21). Presently, TP53 is the most frequently mutated gene in human cancer and its mutation frequency is up to 96% in pancreatic adenocarcinoma (28,29). CCNB1 is a regulatory protein involved in mitosis, and plays an essential role in cell proliferation (30). In normal tissues, the expression level of CCNB1 is low; however, it was noted to be overexpressed in tumors with TP53 mutation, including colorectal, cervical and pancreatic cancer (31–33). In addition, TP53 has been demonstrated to regulate the promoter of CCNB1 in opposing ways (34). Briefly, the p53 signaling pathway and its enriched DEGs including CCNB1 were closely associated with PDAC; thus, this pathway and these genes may be used as potential targets for PDAC treatment. The results of functional annotation of DEGs revealed that the PTTG1 oncogene was upregulated; in addition, it was noted to participate in the PPI network and sub-network. Our findings also revealed that PTTG1 was enriched in the BP terms associated with mitosis. PTTG1 is a regulatory protein, and plays a central role in chromosome stability, cell transformation and gene regulation (35,36). In particular, PTTG1 is a critical mitotic checkpoint protein that helps hold sister chromatids together before entering anaphase (37). Research has identified that PTTG1 expression is highly activated in rapidly proliferating cells (38). To date, the overexpression of PTTG1 has been identified in numerous cancers tissues as well as in cell lines, including colon, ovarian, breast and various other solid tumors (39–41). In our study, the overexpression of PTTG1 is consistent with the observations above. Taken together, these data support the hypothesis that PTTG1 may be a candidate molecular marker associated with PDAC progression and prognosis. Although bioinformatics technologies have the potential to identify and validate candidate agents for critical diseases, certain limitations remain in this study. Firstly, the sample size for microarray analysis was small, which may have caused a high rate of false positive results. Secondly, this study lacked experimental verification. Further genetic and experimental studies with a larger sample size are still required in the future to confirm the results. However, our data provide a comprehensive bioinformatics analysis of the DEGs and pathways which may be involved in PDAC. The findings of the present study may contribute to our understanding of the underlying molecular mechanisms of PDAC. DEGs including CDC20, BUB1B, CCNB1 and PTTG1 as well as the cell cycle pathway and p53 signaling pathway have the potential to be used as targets for PDAC diagnosis and treatment.
  39 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Expression of pituitary-tumour transforming gene in colorectal tumours.

Authors:  A P Heaney; R Singson; C J McCabe; V Nelson; M Nakashima; S Melmed
Journal:  Lancet       Date:  2000-02-26       Impact factor: 79.321

Review 3.  Pancreatic cancer: pathogenesis, prevention and treatment.

Authors:  Fazlul H Sarkar; Sanjeev Banerjee; Yiwei Li
Journal:  Toxicol Appl Pharmacol       Date:  2006-11-11       Impact factor: 4.219

4.  Structure, expression, and function of human pituitary tumor-transforming gene (PTTG).

Authors:  X Zhang; G A Horwitz; T R Prezant; A Valentini; M Nakashima; M D Bronstein; S Melmed
Journal:  Mol Endocrinol       Date:  1999-01

5.  Overexpression of cyclin B1 in human colorectal cancers.

Authors:  A Wang; N Yoshimi; N Ino; T Tanaka; H Mori
Journal:  J Cancer Res Clin Oncol       Date:  1997       Impact factor: 4.553

Review 6.  Cell cycle control and cancer.

Authors:  L H Hartwell; M B Kastan
Journal:  Science       Date:  1994-12-16       Impact factor: 47.728

7.  Activated Kras and Ink4a/Arf deficiency cooperate to produce metastatic pancreatic ductal adenocarcinoma.

Authors:  Andrew J Aguirre; Nabeel Bardeesy; Manisha Sinha; Lyle Lopez; David A Tuveson; James Horner; Mark S Redston; Ronald A DePinho
Journal:  Genes Dev       Date:  2003-12-17       Impact factor: 11.361

8.  Identification of K-ras mutations in pancreatic juice in the early diagnosis of pancreatic cancer.

Authors:  P Berthélemy; M Bouisson; J Escourrou; N Vaysse; J L Rumeau; L Pradayrol
Journal:  Ann Intern Med       Date:  1995-08-01       Impact factor: 25.391

9.  Genomic instability and increased expression of BUB1B and MAD2L1 genes in ductal breast carcinoma.

Authors:  Marina Scintu; Rita Vitale; Maria Prencipe; Antonietta Pia Gallo; Loriana Bonghi; Vanna Maria Valori; Evaristo Maiello; Monica Rinaldi; Emanuela Signori; Carla Rabitti; Massimo Carella; Bruno Dallapiccola; Vittorio Altomare; Vito M Fazio; Paola Parrella
Journal:  Cancer Lett       Date:  2007-05-10       Impact factor: 8.679

10.  Overexpression of the mitotic checkpoint genes BUB1 and BUBR1 is associated with genomic complexity in clear cell kidney carcinomas.

Authors:  Mafalda Pinto; Joana Vieira; Franclim R Ribeiro; Maria J Soares; Rui Henrique; Jorge Oliveira; Carmen Jerónimo; Manuel R Teixeira
Journal:  Cell Oncol       Date:  2008       Impact factor: 6.730

View more
  6 in total

1.  NEAT1 regulates pancreatic cancer cell growth, invasion and migration though mircroRNA-335-5p/c-met axis.

Authors:  Jia Cao; Yi Zhang; Jiachun Yang; Sijia He; Mingming Li; Shiyan Yan; Ying Chen; Chunying Qu; Leiming Xu
Journal:  Am J Cancer Res       Date:  2016-10-01       Impact factor: 6.166

2.  Identification of key regulators of pancreatic ductal adenocarcinoma using bioinformatics analysis of microarray data.

Authors:  Nan Li; Xin Zhao; Shengyi You
Journal:  Medicine (Baltimore)       Date:  2019-01       Impact factor: 1.889

3.  Overexpression of BUB1B, CCNA2, CDC20, and CDK1 in tumor tissues predicts poor survival in pancreatic ductal adenocarcinoma.

Authors:  Shu Dong; Fei Huang; Hao Zhang; Qiwen Chen
Journal:  Biosci Rep       Date:  2019-02-26       Impact factor: 3.840

4.  Evaluation of Oncogene NUP37 as a Potential Novel Biomarker in Breast Cancer.

Authors:  Kangdi Li; Ting Liu
Journal:  Front Oncol       Date:  2021-07-27       Impact factor: 6.244

5.  Comprehensive Multiomics Analysis Reveals Potential Diagnostic and Prognostic Biomarkers in Adrenal Cortical Carcinoma.

Authors:  Xiunan Li; Jiayi Li; Leizuo Zhao; Zicheng Wang; Peizhi Zhang; Yingkun Xu; Guangzhen Wu
Journal:  Comput Math Methods Med       Date:  2022-08-09       Impact factor: 2.809

6.  Identification of the lncRNA-miRNA-mRNA network associated with gastric cancer via integrated bioinformatics analysis.

Authors:  Xiao-Yu Ma; Yu Ma; Huan Zhou; Hui-Jing Zhang; Ming-Jun Sun
Journal:  Oncol Lett       Date:  2019-09-25       Impact factor: 2.967

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.