Literature DB >> 29387216

Bioinformatics analysis of RNA sequencing data reveals multiple key genes in uterine corpus endometrial carcinoma.

Liang Shen1, Ming Liu1, Wei Liu1, Jing Cui2, Changzhong Li1.   

Abstract

In the present study, the RNA sequencing (RNA-seq) data of uterine corpus endometrial carcinoma (UCEC) samples were collected and analyzed using bioinformatics tools to identify potential genes associated with the development of UCEC. UCEC RNA-seq data were downloaded from The Cancer Genome Atlas database. Differential analysis was performed using edgeR software. A false discovery rate <0.01 and |log2(fold change)|>1 were set as the cut-off criteria to screen for differentially expressed genes (DEGs). Differential gene co-expression analysis was performed using R/EBcoexpress package in R. DEGs in the gene co-expression network were subjected to Gene Ontology analysis using the Database for Annotation, Visualization and Integration Discovery. Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis was also performed on the DEGs using KOBAS 2.0 software. The ConnectivityMap database was used to identify novel drug candidates. A total of 3,742 DEGs were identified among the 552 UCEC samples and 35 normal controls, and comprised 2,580 upregulated and 1,162 downregulated genes. A gene co-expression network consisting of 129 DEGs and 368 edges was constructed. Genes were associated with the cell cycle and the tumor protein p53 signaling pathway. Three modules were identified, in which genes were associated with the mitotic cell cycle, nuclear division and the M phase of the mitotic cell cycle. Multiple key hub genes were identified, including cell division cycle 20, cyclin B2, non-SMC condensin I complex subunit H, BUB1 mitotic checkpoint serine/threonine kinase, cell division cycle associated 8, maternal embryonic leucine zipper kinase, MYB proto-oncogene like 2, TPX2, microtubule nucleation factor and non-SMC condensin I complex subunit G. In addition, the small molecule drug esculetin was implicated in the suppression of UCEC progression. Overall, the present study identified multiple key genes in UCEC and clinically relevant small molecule agents, thereby improving our understanding of UCEC and expanding perspectives on targeted therapy for this type of cancer.

Entities:  

Keywords:  differentially expressed genes; gene co-expression network; small molecule drugs; uterine corpus endometrial carcinoma

Year:  2017        PMID: 29387216      PMCID: PMC5769370          DOI: 10.3892/ol.2017.7346

Source DB:  PubMed          Journal:  Oncol Lett        ISSN: 1792-1074            Impact factor:   2.967


Introduction

Endometrial cancer is the most commonly diagnosed female genital cancer and, in 2012, was ranked the fourth most common cancer in women worldwide, after breast, lung and colorectal cancer (1). In China, an estimated 63,400 new cases and 21,800 mortalities were reported for endometrial cancer in 2015 (2). Uterine corpus endometrial carcinoma (UCEC) is a common type of endometrial cancer. The incidence of UCEC increases with age and it is most frequently diagnosed in women aged between 45 and 65 years (3). Previous studies have sought to identify tumor biomarkers and have identified multiple UCEC biomarkers, including activated leukocyte cell adhesion molecule (4), sperm-associated antigen 9 (5), L1 cell adhesion molecule (6), progestogen-associated endometrial protein (7), heat shock protein family A (Hsp70) member 5 (8) and CD151 molecule (Raph blood group) (9). To enhance our understanding of UCEC and explore more effective and targeted therapies, the present study aimed to identify gene co-expression networks, hub genes and small molecule drugs associated with the development of UCEC. The Cancer Genome Atlas (TCGA) is a comprehensive genomic database that holds data for >20 types of cancer obtained from thousands of patients. Data in TCGA includes whole-genome measurements of multiple genomic features, including DNA copy numbers, DNA methylation, and gene and microRNA expression, thereby assisting researchers in assessing cancer mechanisms at multiple molecular and regulatory levels (10). Furthermore, TGGA data is open access and available to all researchers in individual work settings. In the present study, UCEC RNA sequencing data were collected and analyzed using bioinformatics tools. Differentially expressed genes (DEGs) were identified and a gene co-expression network was constructed, enabling the hub genes to be identified. Relevant small molecule drugs were also assessed. The results of the present study may provide novel insights into the pathogenesis of UCEC and thereby expand perspectives on the treatment of this type of tumor.

Materials and methods

Gene expression data

The RNA expression data (level 3) of UCEC patients and normal controls without UCEC were downloaded from the TCGA data portal up until December 2016 (http://cancergenome.nih.gov/) (11). A total of 552 UCEC samples and 35 normal control samples were included in the dataset. The exclusion criteria were as follows: A histological diagnosis not of UCEC; and samples without complete data for analysis.

DEG screening

The DEGs between UCEC and normal control tissues were screened using edgeR software (v..5; http://www.bioconductor.org/packages/release/bioc/html/edgeR.html). A false discovery rate <0.01 and |log2(fold change)|>1 were set as the cut-off values to identify the DEGs.

Cluster analysis

Bidirectional hierarchical clustering according to the expression of the DEGs was performed using the pheatmap package (v.1.0.8; https://CRAN.R-project.org/package=pheatmap) in R, as previously described (12). A heat map was used to represent the data (Fig. 1).
Figure 1.

Bidirectional hierarchical clustering of the 3,742 differentially expressed genes within the 587 samples (552 uterine corpus endometrial carcinoma and 35 control) in the dataset.

Gene co-expression network construction

Correlations among the DEGs were calculated using the R/EBcoexpress package in R (version 3.5; http://www.bioconductor.org/packages/release/bioc/html/EBcoexpress.html), as previously described (13). Genes with a correlation coefficient >0.9 were included in the gene co-expression network, which was visualized using Cytoscape software (version 3.5.1; http://www.cytoscape.org/).

Functional enrichment analysis

A Gene Ontology (GO) analysis (14) was performed on the DEGs in the gene co-expression network using the Database for Annotation, Visualization and Integration Discovery (http://david.abcc.ncifcrf.gov/) (15). P<0.05 was set as the cut-off criteria. Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis (16) was performed using KOBAS 2.0 software (http://kobas.cbi.pku.edu.cn/) (17), with P<0.05 set as the threshold.

Module analysis

Modules were identified using the MCODE plug-in in Cytoscape with the cut-off criteria of degree ≥2 and k-core ≥3. Each module was functionally annotated using the Cytoscape plug-in BiNGO based on hypergeometric distribution (adjusted P-value <0.01).

Screening of relevant small molecule drugs

ConnectivityMap (cMap) (18) is a public database containing >7,000 expression profiles representing 1,309 compounds (www.broad.mit.edu/cmap/). Relevant small molecule drugs were predicted using the cMap tool and those with a |score|>0.6 were included.

Results

DEGs among UCEC and control samples

The differential expression analysis identified 3,742 DEGs, comprising 2,580 upregulated and 1,162 downregulated genes. The results of bidirectional hierarchical clustering of the 3,742 DEGs in the 587 samples (552 UCEC and 35 normal control samples) that were screened were provided (Fig. 1). The UCEC and control samples differed in gene expression pattern, suggesting that DEGs could distinguish between the two types of sample.

GO annotations of DEGs

The GO annotations of the DEGs were provided (Fig. 2). Cancer-associated processes, including cell adhesion, ion transport and biological adhesion, were among the significantly associated terms.
Figure 2.

Gene Ontology annotations of the differentially expressed genes.

Gene co-expression network

Correlated genes with a correlation coefficient >0.9 were included in the gene co-expression network (Fig. 3). A total of 129 DEGs (nodes) and 368 edges (lines between nodes) were included, divided into 110 upregulated and 19 downregulated genes. The sizes of the nodes were proportional to the number of genes in the gene set, and a number of hub genes (nodes with degrees ≥8) were highlighted, including cell division cycle 20 (CDC20), cyclin B2, non-SMC condensin I complex subunit H, BUB1 mitotic checkpoint serine/threonine kinase, cell division cycle-associated 8, maternal embryonic leucine zipper kinase, MYB proto-oncogene like 2, TPX2, microtubule nucleation factor and non-SMC condensin I complex subunit G.
Figure 3.

Gene co-expression network of differentially expressed genes. Red, upregulated genes; green, downregulated genes.

Co-expression network functional enrichment analysis

GO enrichment analysis identified 20 significantly overrepresented terms for gene function in the co-expression network (Table I), including M phase, nuclear division, mitosis and the M phase of the mitotic cell cycle. KEGG pathway enrichment analysis indicated that pathways such as those of the cell cycle, oocyte meiosis, progesterone-mediated oocyte maturation and tumor protein p53 signaling were significantly enriched among genes in the co-expression network (Table II).
Table I.

GO terms significantly overrepresented in the genes from the gene co-expression network.

GO categoryGO termCountP-valueFDR
GO:0000279M phase451.27×10−461.93×10−43
GO:0000280Nuclear division405.98×10−469.07×10−43
GO:0007067Mitosis405.98×10−469.07×10−43
GO:0000087M phase of the mitotic cell cycle401.27×10−451.93×10−42
GO:0048285Organelle fission403.20×10−454.85×10−42
GO:0022403Cell cycle phase461.33×10−432.02×10−40
GO:0000278Mitotic cell cycle433.22×10−414.89×10−38
GO:0007049Cell cycle532.40×10−403.63×10−37
GO:0022402Cell cycle process483.05×10−404.62×10−37
GO:0051301Cell division354.17×10−336.33×10−30
GO:0005819Spindle248.32×10−279.68×10−24
GO:0015630Microtubule cytoskeleton352.86×10−263.33×10−23
GO:0007059Chromosome segregation183.26×10−214.94×10−18
GO:0044430Cytoskeletal part371.25×10−201.45×10−17
GO:0005856Cytoskeleton415.11×10−195.94×10−16
GO:0043228Non-membrane-bounded organelle525.37×10−186.24×10−15
GO:0043232Intracellular non-membrane-bounded organelle525.37×10−186.24×10−15
GO:0000775Chromosome, centromeric region172.29×10−172.67×10−14
GO:0000793Condensed chromosome174.39×10−175.10×10−14
GO:0000779Condensed chromosome, centromeric region148.86×10−171.33×10−13

GO, Gene Ontology; FDR, false discovery rate.

Table II.

Kyoto Encyclopedia of Genes and Genomes pathways significantly overrepresented in the genes from the gene co-expression network.

TermCountP-value
hsa04110: Cell cycle133.65×10−16
hsa04114: Oocyte meiosis113.06×10−13
hsa04914: Progesterone-mediated oocyte maturation  73.60×10−8
hsa04115: p53 signaling pathway  48.02×10−5
hsa05166: HTLV-I infection  51.41×10−3
hsa04270: Vascular smooth muscle contraction  37.17×10−3
hsa04068: FoxO signaling pathway  39.19×10−3

HTLV-I, human T-cell lymphotropic virus type 1; FoxO, forkhead box O; p53, tumor protein 53.

Modules and functions

Three modules (designated A-C) were identified from the gene co-expression network (Fig. 4). Module A included 11 DEGs implicated in the mitotic cell cycle and cell cycle phases. Module B contained 6 DEGs associated with nuclear division. Module C comprised 7 DEGs associated with the M phase of the mitotic cell cycle (Table III).
Figure 4.

Modules identified from the gene co-expression network.

Table III.

Functional terms of the three modules.

GO-IDP-valueFDRNo. of genes
Module A
  GO:0000279~M phase9.60×10−71.12×10−36
  GO:0000278~mitotic cell cycle1.71×10−62.01×10−36
  GO:0022403~cell cycle phase2.98×10−63.50×10−36
  GO:0007067~mitosis8.04×10−69.43×10−35
  GO:0000280~nuclear division8.04×10−69.43×10−35
  GO:0000087~M phase of mitotic cell cycle8.64×10−61.01×10−25
  GO:0048285~organelle fission9.43×10−61.11×10−25
  GO:0022402~cell cycle process1.37×10−51.60×10−26
  GO:0007059~chromosome segregation1.69×10−51.98×10−24
  GO:0000793~condensed chromosome3.41×10−53.10×10−24
  GO:0005819~spindle5.04×10−54.59×10−24
  GO:0005694~chromosome5.32×10−54.83×10−25
Module B
  GO:0000280~nuclear division3.36×10−73.63×10−45
  GO:0007067~mitosis3.36×10−73.63×10−45
  GO:0000087~M phase of mitotic cell cycle3.61×10−73.90×10−45
  GO:0048285~organelle fission3.95×10−74.27×10−45
  GO:0000279~M phase1.69×10−61.82×10−35
  GO:0000278~mitotic cell cycle2.69×10−62.93×10−35
  GO:0015630~microtubule cytoskeleton3.37×10−62.94×10−35
  GO:0022403~cell cycle phase4.22×10−64.56×10−35
  GO:0022402~cell cycle process1.46×10−51.57×10−25
Module C
  GO:0000280~nuclear division3.36×10−73.63×10−45
  GO:0007067~mitosis3.36×10−73.63×10−45
  GO:0000087~M phase of mitotic cell cycle3.61×10−73.90×10−45
  GO:0048285~organelle fission3.95×10−74.27×10−45
  GO:0000279~M phase1.69×10−61.82×10−35
  GO:0000278~mitotic cell cycle2.69×10−62.91×10−35
  GO:0015630~microtubule cytoskeleton3.37×10−62.94×10−35
  GO:0022403~cell cycle phase4.22×10−64.56×10−35
  GO:0022402~cell cycle process1.46×10−51.57×10−25

GO-ID, Gene Ontology identification; FDR, false discovery rate.

Relevant small molecule drugs

A total of 7 small molecule drugs were identified, including esculetin, antazoline and isometheptene. Esculetin had the highest negative connectivity score compared with the 6 other small molecule drugs (−0.844; Table IV).
Table IV.

Relevant small molecule drugs.

cMap nameCorrelationP-value
Esculetin−0.8447.55×10−3
Antazoline−0.7291.07×10−2
Isometheptene−0.7221.20×10−2
Oxamniquine0.7171.31×10−2
Pyrimethamine−0.6421.49×10−2
Carmustine−0.7951.75×10−2
Cefapirin0.6364.28×10−2

cMap, connectivity map.

Discussion

The results of the present study indicated that multiple DEGs are associated with UCEC pathophysiology. Matrix metalloproteinases (MMPs) serve an important function in degrading the extracellular matrix and basement membrane components (19). Karahan et al identified that MMP-9 was expressed in a high percentage of primary endometrial carcinoma tissues and that MMP-9 expression was potentially associated with parameters of tumor aggressiveness (20). Furthermore, MMP-9 has been revealed to be overexpressed in UCEC and associated with UCEC progression (21). Topoisomerase 2α, a key enzyme in DNA replication, has been implicated as a predictive marker in treating endometrial cancer with taxane-containing adjuvant chemotherapy (22). Enhancer of zeste homolog 2 (EZH2), a critical component of the polycomb repressive complex 2, is associated with cell proliferation, invasion, adhesion and metastasis in several cancer types (23), and is overexpressed in numerous types of cancer, including breast cancer (24), colorectal cancer (25), ovarian cancer (26) and nasopharyngeal cancer (27). A previous study indicated that EZH2 may predict a more aggressive endometrial carcinoma and may therefore potentially represent a therapeutic target for this type of cancer (28). Baculoviral inhibitor of apoptosis (IAP) repeat containing 5 gene (BIRC5), a member of the IAP gene family, may inhibit the activation of caspase to negatively regulate apoptosis (29). In endometrial cancer, increased BIRC5 expression has been associated with poor progression-free survival and may serve as an independent prognostic factor (30). It is well established that CDC20 is an essential developmental gene; disrupting CDC20 function in mice induces mortality in embryos (31). Previous studies have also suggested that CDC20 functions oncogenically in human tumorigenesis. CDC20 serves important functions in the cell cycle (32), cell apoptosis (33) and in targeting downstream substrates for ubiquitination and subsequent degradation (34). Furthermore, CDC20 has been revealed to be upregulated and may represent a useful prognostic biomarker in endometrial cancer (35). In the present study, esculetin was identified as a significant small molecule drug in the development of UCEC. Esculetin (6,7-dihydroxycoumarin), a coumarin derived from natural plants, inhibits cancer cells from proliferating and induces multiple types of human cancer cells to apoptose, including breast (36), colon (37,38) and gastric cancer (39), malignant melanoma (40), hepatocellular carcinoma (41,42) and oral squamous cancer (43,44). Esculetin has also been reported to assist in inducing cancer cells to apoptose (45,46). In the present study, esculetin had a higher negative connectivity score compared with the 6 other small molecule drugs, suggesting that esculetin could inhibit UCEC from progressing. To the best of our knowledge, no study has yet reported on the association between esculetin and endometrial cancer. Verifying whether esculetin represents a novel therapeutic agent in treating UCEC requires further studies. Overall, the present study identified multiple key genes in UCEC and clinically relevant small molecule agents, thereby improving our understanding of UCEC and expanding perspectives on targeted therapy for this type of cancer.
  46 in total

1.  Activated leukocyte cell adhesion molecule (ALCAM) is a marker of recurrence and promotes cell migration, invasion, and metastasis in early-stage endometrioid endometrial cancer.

Authors:  Laura Devis; Cristian P Moiola; Nuria Masia; Elena Martinez-Garcia; Maria Santacana; Tomita Vasilica Stirbat; Françoise Brochard-Wyart; Ángel García; Francesc Alameda; Silvia Cabrera; Jose Palacios; Gema Moreno-Bueno; Miguel Abal; William Thomas; Sylvie Dufour; Xavier Matias-Guiu; Anna Santamaria; Jaume Reventos; Antonio Gil-Moreno; Eva Colas
Journal:  J Pathol       Date:  2017-01-23       Impact factor: 7.996

2.  Cell cycle control of Wnt/β-catenin signalling by conductin/axin2 through CDC20.

Authors:  Michel V Hadjihannas; Dominic B Bernkopf; Martina Brückner; Jürgen Behrens
Journal:  EMBO Rep       Date:  2012-04       Impact factor: 8.807

3.  The endoplasmic reticulum stress marker, glucose-regulated protein-78 (GRP78) in visceral adipocytes predicts endometrial cancer progression and patient survival.

Authors:  Koji Matsuo; Michael J Gray; Dong Yun Yang; Sucheta A Srivastava; Prem B Tripathi; Laura A Sonoda; Eun-Jeong Yoo; Louis Dubeau; Amy S Lee; Yvonne G Lin
Journal:  Gynecol Oncol       Date:  2012-11-28       Impact factor: 5.482

4.  Expression of gelatinase (MMP-2 and MMP-9) and cyclooxygenase-2 (COX-2) in endometrial carcinoma.

Authors:  N Karahan; M Güney; S Baspinar; B Oral; N Kapucuoglu; T Mungan
Journal:  Eur J Gynaecol Oncol       Date:  2007       Impact factor: 0.196

Review 5.  Update on pathology, staging and molecular pathology of endometrial (uterine corpus) adenocarcinoma.

Authors:  Joanne K L Rutgers
Journal:  Future Oncol       Date:  2015-11-09       Impact factor: 3.404

6.  Functional polymorphism -31C/G in the promoter of BIRC5 gene and risk of nasopharyngeal carcinoma among chinese.

Authors:  Fuchao Ma; Hongxing Zhang; Yun Zhai; Wenfeng Huang; Chang Zhao; Shengqiu Ou; Hong Zhou; Wenzhao Yuan; Zhifu Wang; Hongxue Wang; Wei Yue; Lixia Yu; Peiyao Li; Xia Xia; Mi Cai; Yang Zhang; Ying Cui; Fuchu He; Yilong Ma; Gangqiao Zhou
Journal:  PLoS One       Date:  2011-02-03       Impact factor: 3.240

7.  PathwayVoyager: pathway mapping using the Kyoto Encyclopedia of Genes and Genomes (KEGG) database.

Authors:  Eric Altermann; Todd R Klaenhammer
Journal:  BMC Genomics       Date:  2005-05-03       Impact factor: 3.969

8.  Esculetin, a coumarin derivative, exerts in vitro and in vivo antiproliferative activity against hepatocellular carcinoma by initiating a mitochondrial-dependent apoptosis pathway.

Authors:  J Wang; M L Lu; H L Dai; S P Zhang; H X Wang; N Wei
Journal:  Braz J Med Biol Res       Date:  2014-12-12       Impact factor: 2.590

9.  EZH2 contributes to the response to PARP inhibitors through its PARP-mediated poly-ADP ribosylation in breast cancer.

Authors:  H Yamaguchi; Y Du; K Nakai; M Ding; S-S Chang; J L Hsu; J Yao; Y Wei; L Nie; S Jiao; W-C Chang; C-H Chen; Y Yu; G N Hortobagyi; M-C Hung
Journal:  Oncogene       Date:  2017-09-18       Impact factor: 9.867

10.  miR-26a inhibits invasion and metastasis of nasopharyngeal cancer by targeting EZH2.

Authors:  Li Yu; Juan Lu; Bao Zhang; Xiong Liu; Lu Wang; Si-Yang Li; Xiao-Hong Peng; Xia Xu; Wen-Dong Tian; Xiang-Ping Li
Journal:  Oncol Lett       Date:  2013-02-04       Impact factor: 2.967

View more
  10 in total

1.  Identification of early stage recurrence endometrial cancer biomarkers using bioinformatics tools.

Authors:  María José Besso; Luciana Montivero; Ezequiel Lacunza; María Cecilia Argibay; Martín Abba; Laura Inés Furlong; Eva Colas; Antonio Gil-Moreno; Jaume Reventos; Ricardo Bello; Mónica Hebe Vazquez-Levin
Journal:  Oncol Rep       Date:  2020-06-16       Impact factor: 3.906

2.  Differential gene expression induced by Verteporfin in endometrial cancer cells.

Authors:  Lisa Gahyun Bang; Venkata Ramesh Dasari; Dokyoon Kim; Radhika P Gogoi
Journal:  Sci Rep       Date:  2019-03-07       Impact factor: 4.379

3.  Identifying an Eight-Gene Signature to Optimize Overall Survival Prediction of Esophageal Adenocarcinoma Using Bioinformatics Analysis of ceRNA Network.

Authors:  Yuanyong Wang; Naixin Liang; Zhiqiang Xue; Xinying Xue
Journal:  Onco Targets Ther       Date:  2020-12-22       Impact factor: 4.147

4.  Identification of Tumor Tissue of Origin with RNA-Seq Data and Using Gradient Boosting Strategy.

Authors:  Ruixi Li; Bo Liao; Bo Wang; Chan Dai; Xin Liang; Geng Tian; Fangxiang Wu
Journal:  Biomed Res Int       Date:  2021-02-17       Impact factor: 3.411

5.  COVID-19-associated lncRNAs as predictors of survival in uterine corpus endometrial carcinoma: A prognostic model.

Authors:  Yang Ding; Xia Li; Jiena Li
Journal:  Front Genet       Date:  2022-09-06       Impact factor: 4.772

6.  Novel DNA Damage-Related Subtypes Characterization Identifies Uterine Corpus Endometrial Carcinoma (UCEC) Based on Machine Learning.

Authors:  Qi Yu; Xinqi Ge; Zheng Wang; Shu Ding; Yunfeng Jin; Liping Chen
Journal:  J Oncol       Date:  2022-08-28       Impact factor: 4.501

7.  Identification of a six-gene signature with prognostic value for patients with endometrial carcinoma.

Authors:  Yizi Wang; Fang Ren; Peng Chen; Shuang Liu; Zixuan Song; Xiaoxin Ma
Journal:  Cancer Med       Date:  2018-10-10       Impact factor: 4.452

8.  Prognostic Genes of Breast Cancer Identified by Gene Co-expression Network Analysis.

Authors:  Jianing Tang; Deguang Kong; Qiuxia Cui; Kun Wang; Dan Zhang; Yan Gong; Gaosong Wu
Journal:  Front Oncol       Date:  2018-09-11       Impact factor: 6.244

9.  Functional genomics of AP-2α and AP-2γ in cancers: in silico study.

Authors:  Damian Kołat; Żaneta Kałuzińska; Magdalena Orzechowska; Andrzej K Bednarek; Elżbieta Płuciennik
Journal:  BMC Med Genomics       Date:  2020-11-19       Impact factor: 3.063

10.  Stemness Related Genes Revealed by Network Analysis Associated With Tumor Immune Microenvironment and the Clinical Outcome in Lung Adenocarcinoma.

Authors:  Hao Zeng; Jianrui Ji; Xindi Song; Yeqian Huang; Hui Li; Juan Huang; Xuelei Ma
Journal:  Front Genet       Date:  2020-09-16       Impact factor: 4.599

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.