Literature DB >> 26366417

Systematic Analysis of Endometrial Cancer-Associated Hub Proteins Based on Text Mining.

Huiqiao Gao1, Zhenyu Zhang1.   

Abstract

OBJECTIVE: The aim of this study was to systematically characterize the expression of endometrial cancer- (EC-) associated genes and to analysis the functions, pathways, and networks of EC-associated hub proteins.
METHODS: Gene data for EC were extracted from the PubMed (MEDLINE) database using text mining based on NLP. PPI networks and pathways were integrated and obtained from the KEGG and other databases. Proteins that interacted with at least 10 other proteins were identified as the hub proteins of the EC-related genes network.
RESULTS: A total of 489 genes were identified as EC-related with P < 0.05, and 32 pathways were identified as significant (P < 0.05, FDR < 0.05). A network of EC-related proteins that included 271 interactions was constructed. The 17 proteins that interact with 10 or more other proteins (P < 0.05, FDR < 0.05) were identified as the hub proteins of this PPI network of EC-related genes. These 17 proteins are EGFR, MET, PDGFRB, CCND1, JUN, FGFR2, MYC, PIK3CA, PIK3R1, PIK3R2, KRAS, MAPK3, CTNNB1, RELA, JAK2, AKT1, and AKT2.
CONCLUSION: Our data may help to reveal the molecular mechanisms of EC development and provide implications for targeted therapy for EC. However, corrections between certain proteins and EC continue to require additional exploration.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26366417      PMCID: PMC4561104          DOI: 10.1155/2015/615825

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Endometrial cancer is one of the most common gynecologic malignancies, and the incidence of this cancer continues to increase [1]. During the prior several decades, progress in molecular biology has improved our understanding of the occurrence and development of EC. It has been established that the biological behavior of tumors is controlled by functional proteins within cells and the signaling pathways in which these proteins are involved. Therefore, studies on the structure and function of hub proteins in signaling pathways may be valuable for diagnosing EC and for determining targeted therapies for this disease. To date, research has examined a large number of EC-related genes and proteins that could potentially be used as biomarkers or targets for diagnosis or treatment [2, 3]. However, most published papers regarding EC have focused on only a handful of genes and proteins. Although the research objectives of molecular biology are shifting from single genes or proteins to genomics or proteomics, there are a limited number of systematic studies of whole-genome expression in the context of EC. At present, text mining (TM) technology is widely used in biomedical research to extract information from large quantities of biomedical literature and construct databases of disease-related genes, proteins, and molecular interactions [4, 5]. In this study, we systematically characterized the expression of EC-associated genes by mining data from the PubMed document retrieval system. In addition, we used bioinformatics methods to analyze the functions, pathways, and networks of relevant hub proteins.

2. Materials and Methods

The extraction of data by TM was based on natural language processing (NLP). Using “Endometrial Cancer” and “Endometrium Carcinoma” as search terms, we searched the PubMed database for article abstracts published before March 2014 and formatted the documents that were obtained. Genes and proteins that appeared in the abstracts of these documents were located and tagged using ABNER (A Biomedical Named Entity Recognizer; an open source tool for automatically tagging genes, proteins, and other entity names in text) [6]. Gene names were normalized based on the Entrez Gene database (the National Center for Biotechnology Information's database for gene-specific information) [7]. The frequency at which each gene occurred was then counted. A hypergeometric distribution was used to calculate the probabilities that genes would be cocited with EC at frequencies higher than theoretical expectations; genes of which P < 0.05 were considered relevant. Gene ontology (GO) analysis was performed using GSEABase software package from the R statistical platform (http://www.r-project.org/). Genes were classified by biological process, cellular component, and molecular function. The EC-related protein-protein interaction (PPI) network was integrated from the KEGG (Kyoto Encyclopedia of Genes and Genomes), MIPS (Munich Information Center for Protein Sequences), and PubMed databases. GenMAPP v2.1 was used to map EC-related genes to the KEGG database to determine the pathways in which these genes were involved. A threshold of 0.05 was established for P values and false discovery rate (FDR).

3. Results

3.1. EC-Related Genes and GO Analysis

After the retrieval of documents from PubMed, 15157 abstracts were examined, and 832 genes were obtained. Eventually, a total of 489 genes were identified as EC-related with P < 0.05; among these genes, PGR, TP53, and MLH1 were mentioned most frequently. Table 1 lists the 20 most significant EC-related genes.
Table 1

The 20 most significant EC-related genes based on text mining.

GeneDescriptionCount P value
PGRProgesterone receptor3230
TP53Tumor protein p532960
MLH1mutL homolog 11500
PTENPhosphatase and tensin homolog1300
MSH2mutS homolog 21120
VEGFAVascular endothelial growth factor A820
ERBB2erb-b2 receptor tyrosine kinase 2 (HER2)770
MSH6mutS homolog 6 750
EGFREpidermal growth factor receptor680
MKI67Antigen identified by monoclonal antibody Ki-67664.80E − 09
BCL2B-cell CLL/lymphoma 2540
CCND1Cyclin D1531.02E − 08
ESR1Estrogen receptor 1480
TCEAL1Transcription elongation factor A (SII)-like 1470
CDKN2ACyclin-dependent kinase inhibitor 2A (p16)390
CYP19A1Cytochrome P450, family 19, subfamily A, polypeptide 1390
INSinsulin360
PTGS2Prostaglandin-endoperoxide synthase 2 (COX2)340
PMS2Postmeiotic segregation increased 2 330
PCNAProliferating cell nuclear antigen320
Classification results for biological processes, cellular components, and molecular functions by GO analysis are presented in Table 2. Developmental processes, protein metabolism, and signal transduction were the major biological processes associated with EC-related genes; with respect to molecular function, the primary activities of these genes included signal transduction, nucleic acid binding, and transcriptional regulation. These genes were related to various cellular components, including the nucleus, plasma membrane, and nonstructural extracellular matrix.
Table 2

Classification results for biological processes, cellular components, and molecular functions by GO analysis.

Term Count P value
Biological process
 Cell cycle and proliferation2244.05E − 11
 Stress response1605.51E − 11
 Developmental processes3368.54E − 11
 RNA metabolism1880.00031
 DNA metabolism670
 Protein metabolism2541.07E − 10
 Other metabolic processes2292.58E − 10
 Cell organization and biogenesis1787.72E − 11
 Cell-cell signaling448.21E − 08
 Signal transduction2450.00089
 Cell adhesion510.00284
 Death1412.77E − 11
 Other biological processes4365.94E − 06

Molecular function
 Transcription regulatory activity1078.46E − 10
 Signal transduction activity2403.60E − 05
 Enzyme regulator activity480.01638
 Nucleic acid binding activity1942.53E − 07
 Kinase activity841.32E − 08
 Other molecular function7442.86E − 07

Cellular component
 Extracellular matrix341.42E − 05
 Nonstructural extracellular1801.09E − 10
 Cytosol535.50E − 11
 Nucleus3061.77E − 09
 Plasma membrane1860.00014
 Translational apparatus220.00148
 Other cellular component4469.02E − 08

3.2. Pathway and PPI Analysis

Following pathway analysis, 32 pathways were identified as significant (P < 0.05, FDR < 0.05); among these pathways, the cytokine-cytokine receptor interaction, MAPK, and focal adhesion signaling pathways involved the largest number of genes. Table 3 lists the 20 most significant EC-related pathways.
Table 3

The 20 most significant pathways in which EC-related genes were involved.

TermCount P value
Cytokine-cytokine receptor interaction641.95E − 09
MAPK signaling pathway622.91E − 08
Focal adhesion521.12E − 08
Cell cycle488.71E − 15
Regulation of actin cytoskeleton462.43E − 05
Jak-STAT signaling pathway392.25E − 06
Toll-like receptor signaling pathway363.72E − 10
Chemokine signaling pathway360.00170
p53 signaling pathway341.63E − 14
Apoptosis333.75E − 10
T cell receptor signaling pathway331.55E − 07
Insulin signaling pathway333.00E − 05
ErbB signaling pathway321.77E − 09
Wnt signaling pathway326.43E − 04
Neurotrophin signaling pathway313.51E − 05
Natural killer cell-mediated cytotoxicity280.00168
Steroid hormone biosynthesis269.99E − 13
Adherens junction247.63E − 06
Fc epsilon RI signaling pathway249.69E − 06
NOD-like receptor signaling pathway234.62E − 07
We constructed a network of EC-related proteins that included 271 interactions (Figure 1). The 17 proteins that interact with at least 10 other proteins (P < 0.05,  FDR < 0.05) were identified as the hub proteins of the EC-related PPI network. These proteins are EGFR, MET, PDGFRB, CCND1, JUN, FGFR2, MYC, PIK3CA, PIK3R1, PIK3R2, KRAS, MAPK3, CTNNB1, RELA, JAK2, AKT1, and AKT2 (Figure 2). EGFR, which interacts with 33 other proteins, was the EC-related protein that exhibited the greatest number of interactions.
Figure 1

Network analysis of EC-related genes.

Figure 2

Hub proteins for EC.

4. Discussion

In the present study, by extracting information from biomedical literature, we obtained a dataset of EC-related proteins and identified 17 hub proteins. Most relationships between EC and certain hub proteins, such as EGFR, IGF1R, and MET, have been extensively studied, and all of the aforementioned proteins are known to be closely related to the occurrence and development of EC. However, relative to these proteins, PDGFRB, FGFR2, MAPK3, and JAK2 have been reported less frequently in the context of EC.

4.1. PI3K and AKT

PI3K is a heterodimeric enzyme that consists of a regulatory subunit (p85) encoded by PIK3R1, PIK3R2, and PIK3R3 and a catalytic subunit (p110) encoded by PIK3CA, PIK3CB, and PIK3CD [8]. Mutations in PIK3CA, PIK3R1, and PIK3R2 occur at high rates in EC [9, 10]. AKT is the downstream target gene of PI3K, and AKT1 and AKT2 are two subtypes of AKT. Based on data mining, we found that PI3K and AKT are involved in many pathways, including the focal adhesion pathway, the toll-like receptor signaling pathway, and, most notably, the PI3K/AKT signaling pathway. PI3K phosphorylates PIP2 to PIP3, which can activate AKT. Subsequently, activated AKT stimulates the regulation of cellular metabolism, growth and survival by CCND1, Myc, NF-κB, and a variety of downstream factors [11]. AKT plays a key role in this pathway. The PI3K/AKT signaling pathway can inhibit cell apoptosis and promote cell proliferation [12]. In EC, molecular alterations lead to increased PI3K/AKT signaling; in particular, the dominant activation event is the loss of the PTEN protein, which is a tumor suppressor that negatively affects the PI3K signaling pathway [11, 13]. Many recent studies have demonstrated that the PI3K/AKT pathway is activated in all types of EC and that this activation is associated with the aggressiveness of this disease [14, 15]. Recently, certain PI3K/AKT pathway inhibitors have been evaluated in preclinical or early clinical trials [16].

4.2. RAS and MAPK

RAS is an oncogene that serves as a central focus for many signal transduction pathways associated with a high percentage of human tumors. Activating mutations in KRAS can be observed in EC [17]. A recent analysis of EC signal transduction indicated that KRAS mutation is associated with elevated phosphorylation of MEK1/2, ERK1/2, and p38MAPK [9]. In fact, many studies have indicated that the RAS/MAPK pathway is frequently upregulated in EC [18, 19]. Moreover, KRAS also interacts with the PI3K pathway. Notably, KRAS-induced carcinogenesis can be inhibited when the interaction between RAS and the PI3K catalytic subunit P110α is blocked in vitro [20]. In this study, we found that KRAS and MAPK were involved in many signaling pathways, such as the MAPK signaling pathway, pathways involved in regulating the actin cytoskeleton, and the ErbB signaling pathway. As a hub of various pathways, MAPK regulates a cascade of downstream genes that participate in cell proliferation and differentiation, including Bcl-2, c-Myc, rock, and RSK2, among others.

4.3. FGFR2

FGFR2 is one type of fibroblast growth factor receptor and a member of the RTK family. RTKs are well known for their role in tumorigenesis [21]. In addition, it has been demonstrated that activating mutations in FGFR2 are associated with multiple types of tumors, including EC. By utilizing immunohistochemistry and PCR to examine FGFR2 expression and the presence of FGFR2 mutations in endometrial carcinoma, Gatius et al. determined that FGFR2 acted as an oncogene in EC and that FGFR2 expression was positively correlated with tumor stage and grade [22]. In our study, FGFR2 was mainly found to be involved in the MAPK signaling pathway and the regulation of the actin cytoskeleton. In fact, FGF signaling can activate several downstream pathways, including both the RAS-MAPK pathway and the PI3K-AKT pathway [23]. There has long been interest in FGFR inhibitors, and many studies have demonstrated that FGFR inhibition can block the progression of FGFR2-mutated EC [24, 25]. The targeting of FGFR2 is a possible treatment strategy for endometrial carcinoma.

4.4. PDGFRB

PDGF is a major mitogen that mediates the growth of fibroblasts, smooth muscle cells, and other cell. This protein also has significant effects on the angiogenesis of endothelial cells. PDGF exerts its biological effects by binding to its two receptors, α-receptor (PDGFRA) and β-receptor (PDGFRB), which are located on the cell membrane. These PDGF receptors are also members of the RTK family. In vivo and in vitro research have indicated that the excessive expression of PDGF and PDGFR can be detected in breast, pancreas, colorectal, and other tumors [26, 27]. Liegl et al. demonstrated that PDFGRB can be detected in the endothelial cells of endometrial stromal sarcomas [28]. PDGFR-mediated signaling contributes to tumor angiogenesis, and PDGF can upregulate the expression of VEGF, which also has angiogenic effects. Our TM indicated that PDGFRB participated as an upstream factor in cytokine-cytokine receptor interaction, the MAPK signaling pathway, focal adhesion, and the regulation of actin cytoskeleton. Moreover, the targeting of PDGFR to inhibit tumor cell signal transduction may play a crucial antitumor role [29, 30].

4.5. JAK2

JAK2, a member of the JAK family, is widely distributed in the cytoplasm. This protein is involved in signal transduction during hematopoiesis and in the immune system; in particular, JAK2 plays important roles in the production of red blood cells and the activation of immune cells. Research has demonstrated that JAK2 is associated with multiple tumors. The constitutive activation of JAK2 has been detected in many malignant solid tumors, such as colon cancer, head and neck cancer, leukemia, multiple myeloma, and other blood diseases [31-33]. Several JAK2 inhibitors are currently being evaluated in clinical trials in patients [34, 35]. JAK2 forms several signal transduction pathways in combination with multiple members of the STAT family; among these pathways, the JAK2-STAT3 pathway is particularly prominent. The JAK2-STAT3 signaling pathway, which mediates cell proliferation, differentiation, and apoptosis, is a focal point of the cellular signaling network and is closely associated with tumorigenesis [36]. However, there exists little research addressing the correlation between EC and JAK2-STAT3. The research of Liu et al. and Gao et al. indicated that the leptin can promote EC growth via activating the JAK2-STAT3 signal pathway in obese patient [37, 38]. In our study, JAK2 not only participates in the JAK-STAT pathway but also can activate the downstream PI3K-AKT pathway. In summary, in this investigation, we systematically analyzed EC-related genes and identified certain hub proteins and their pathways and networks. This systematic study may help to reveal the molecular mechanisms of EC development. However, the study results were obtained based on TM, which only considered previously published literatures; thus, the correlations between certain proteins and EC require additional explorations. Moreover, our data also provide implications for targeted therapy for EC. After obtaining deeper insight into the EC-related signaling network, additional hub protein inhibitors with stronger specificities will be developed. Anyhow, multiple hub proteins-targeted drugs will have broad potential for tumor treatment.
  38 in total

1.  JAK/STAT signalling pathway in colorectal cancer: a new biological target with therapeutic implications.

Authors:  Jean-Philippe Spano; Gerard Milano; Clivier Rixe; Remi Fagard
Journal:  Eur J Cancer       Date:  2006-09-11       Impact factor: 9.162

Review 2.  Defining the role of the JAK-STAT pathway in head and neck and thoracic malignancies: implications for future therapeutic approaches.

Authors:  Stephen Y Lai; Faye M Johnson
Journal:  Drug Resist Updat       Date:  2010-05-14       Impact factor: 18.500

3.  FGFR2 alterations in endometrial carcinoma.

Authors:  Sonia Gatius; Ana Velasco; Ainara Azueta; Maria Santacana; Judit Pallares; Joan Valls; Xavier Dolcet; Jaime Prat; Xavier Matias-Guiu
Journal:  Mod Pathol       Date:  2011-07-01       Impact factor: 7.842

4.  Regulation of the phosphatidylinositol 3-kinase-Akt and the mitogen-activated protein kinase pathways by ursolic acid in human endometrial cancer cells.

Authors:  Yumiko Achiwa; Kiyoshi Hasegawa; Yasuhiro Udagawa
Journal:  Biosci Biotechnol Biochem       Date:  2007-01-07       Impact factor: 2.043

5.  A unique spectrum of somatic PIK3CA (p110alpha) mutations within primary endometrial carcinomas.

Authors:  Meghan L Rudd; Jessica C Price; Sarah Fogoros; Andrew K Godwin; Dennis C Sgroi; Maria J Merino; Daphne W Bell
Journal:  Clin Cancer Res       Date:  2011-01-25       Impact factor: 12.531

6.  Lipocalin-2 and matrix metalloproteinase-9 expression in high-grade endometrial cancer and their prognostic value.

Authors:  Sanja Srdelić Mihalj; Ivana Kuzmić-Prusac; Sandra Zekić-Tomaš; Ivana Šamija-Projić; Vesna Čapkun
Journal:  Histopathology       Date:  2015-02-16       Impact factor: 5.087

7.  Another surprise from Metformin: novel mechanism of action via K-Ras influences endometrial cancer response to therapy.

Authors:  David A Iglesias; Melinda S Yates; Dharini van der Hoeven; Travis L Rodkey; Qian Zhang; Ngai Na Co; Jennifer Burzawa; Sravanthi Chigurupati; Joseph Celestino; Jessica Bowser; Russell Broaddus; John F Hancock; Rosemarie Schmandt; Karen H Lu
Journal:  Mol Cancer Ther       Date:  2013-09-27       Impact factor: 6.261

8.  PDGFRalpha/beta expression correlates with the metastatic behavior of human colorectal cancer: a possible rationale for a molecular targeting strategy.

Authors:  Thomas C Wehler; Kirsten Frerichs; Claudine Graf; Daniel Drescher; Katrin Schimanski; Stefan Biesterfeld; Martin R Berger; Stephan Kanzler; Theodor Junginger; Peter R Galle; Markus Moehler; Ines Gockel; Carl C Schimanski
Journal:  Oncol Rep       Date:  2008-03       Impact factor: 3.906

9.  Malignant stroma increases luminal breast cancer cell proliferation and angiogenesis through platelet-derived growth factor signaling.

Authors:  Mauricio P Pinto; Wendy W Dye; Britta M Jacobsen; Kathryn B Horwitz
Journal:  BMC Cancer       Date:  2014-10-01       Impact factor: 4.430

Review 10.  Molecular alterations of PI3K/Akt/mTOR pathway: a therapeutic target in endometrial cancer.

Authors:  Athanasia Pavlidou; Nikos F Vlahos
Journal:  ScientificWorldJournal       Date:  2014-01-12
View more
  4 in total

1.  A novel method for crosstalk analysis of biological networks: improving accuracy of pathway annotation.

Authors:  Christoph Ogris; Dimitri Guala; Thomas Helleday; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2016-09-22       Impact factor: 16.971

2.  Systematic analysis of molecular mechanisms for HCC metastasis via text mining approach.

Authors:  Cheng Zhen; Caizhong Zhu; Haoyang Chen; Yiru Xiong; Junyuan Tan; Dong Chen; Jin Li
Journal:  Oncotarget       Date:  2017-02-21

3.  Bioinformatic analysis of the molecular mechanism underlying bronchial pulmonary dysplasia using a text mining approach.

Authors:  Weitao Zhou; Fei Shao; Jing Li
Journal:  Medicine (Baltimore)       Date:  2019-12       Impact factor: 1.817

4.  Comprehensive Analysis of Prognostic Alternative Splicing Signatures in Endometrial Cancer.

Authors:  Peigen Chen; Junxian He; Huixia Ye; Senwei Jiang; Yunhui Li; Xiaomao Li; Jing Wan
Journal:  Front Genet       Date:  2020-05-29       Impact factor: 4.599

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.