Literature DB >> 32756534

Integrated Analysis of Hub Genes and Pathways In Esophageal Carcinoma Based on NCBI's Gene Expression Omnibus (GEO) Database: A Bioinformatics Analysis.

Tan Yu-Jing1, Tang Wen-Jing1, Tang Biao1.   

Abstract

BACKGROUND Esophageal carcinoma (ESCA) is a health challenge with poor prognosis and limited treatment options. Our aim is to screen for hub genes and pathways associated with ESCA pathology as diagnostic or therapeutic targets. MATERIAL AND METHODS We downloaded 2 ESCA-related datasets from the Gene Expression Omnibus (GEO) database. Subsequently, differentially expressed genes (DEGs) of ESCA were determined by statistical analysis. Both Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis of DEGs were performed using online analytic tools. Network analysis was employed to construct a protein-protein interaction (PPI) network and to filter hub genes. We evaluated the expression level and impact of hub genes on survival of ESCA patients using the OncoLoc webserver. RESULTS A total of 210 DEGs were identified. The GO analysis showed that the DEGs were enriched in cell division. The KEGG pathway analysis showed DEGs that were enriched in cell cycle regulation, known cancer pathways, the PI3K-Akt signaling pathway, and the cGMP-PKG signaling pathway. The top 10 hub genes were markedly upregulated in ESCA tissue compared with normal esophageal tissue. Moreover, the expression level of the hub genes was different at different pathological stages of ESCA. Further prognostic analysis identified that the top 10 hub genes were related to late survival of ESCA patients, while exhibiting few associations with early survival time. CONCLUSIONS The signaling pathways involving the DEGs probably represent the pathological mechanism underlying ESCA. The hub genes were associated with survival of ESCA patients, and as such have the potential to serve as diagnostic indicators and therapeutic targets.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32756534      PMCID: PMC7431388          DOI: 10.12659/MSM.923934

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


Background

Esophageal carcinoma (ESCA) is the eighth most common cancer and the sixth most common cancer leading to death, with more than 456 000 sufferers worldwide [1,2]. Previous studies have documented that the overall 5-year survival of ESCA patients is approximately 15% to 25% [2]. ESCA includes two major subtypes: esophageal squamous cell carcinoma (ESCC), the predominant form, and esophageal adenocarcinoma (EAC), which affects a growing percentile of patients [3]. There are distinct differences between the pathological risks underlying these histological subtypes that affect their incidence. The mechanism underlying ESCC is complicated and embraces a broad spectrum of hazards contributing to its rapidly increasing incidence. There are two primary types of risk factors: inheritable and environmental factors. There are several environmental risks, including alcohol and tobacco consumption, low vegetable and fruit intake, and low socioeconomic status [4]. As for EAC, almost all cases are complicated by precancerous lesions called Barrett’s oesophagus, which mainly derive from gastroesophageal reflux disease, a common condition throughout the human population [5]. Despite this, the risk of progression from this kind of chronic premalignant condition into EAC is low; approximately 0.12%, which makes it difficult to perform predictions of EAC from Barrett’s oesophagus. Fortunately, there are some mechanistic connections between heritable elements and EAC. A meta-analysis demonstrated that single nucleotide polymorphisms (SNPs) located in the LPA, TPPP, and CEP72 genes were associated with high risk of both EAC and its premalignant precursor. SNPs near HTR3C and ABCC5 implicated this region as a specific locus for EAC [6]. Given its extremely invasive nature and poor prognosis among gastrointestinal malignancies, a majority of people die from ESCA, placing a heavy burden on the international economy [7]. When it comes to treatment, ESCA demands multidisciplinary knowledge, covering endoscopic procedures, neoadjuvant therapy with chemotherapy, chemoradiotherapy, surgery, and other procedures [8]. If early-stage, mild symptoms are neglected by patients and clinical workers, treatment will be even more difficult. Studies have demonstrated that early diagnosis of dysplasia allows for decisive therapy and ultimately lengthens survival time [9]. In addition, immuno-oncology therapies with molecular targets have emerged, promising early results among ESCA patients [10]. Pursuit of available biomarkers therefore has the potential to exert positive effects via early diagnosis and treatment. At present, exploring pathological mechanisms of diseases based on bioinformatics theories has becoming an increasingly important and effective method [11]. With bioinformatics analysis, researchers can gain comprehensive knowledge regarding the studied diseases from molecular data. More crucially, it can provide novel insight leading to early diagnosis, definitive treatment, and survival prediction [12]. Previous research in the bioinformatics field has yielded some achievements in ESCA knowledge, such as identification of associated genes and pathways and altered methylation associated with ESCA pathology [13,14]. However, analytical ability for ESCA has been limited due to insufficient sample size and a lack of in-depth evaluation of key genes which play a dominant role in the malignant development of ESCA. Learning from our predecessors, we added the expression data from two microarrays to our research to enlarge the sample size, in order to reduce errors. We aimed to seek out hub genes and pathways involved in ESCA using integrated analysis of the chosen microarray data. Moreover, we aimed to conduct further expression and survival assessments of hub genes, which may provide reference and direction for early diagnosis and practical therapeutic strategies for ESCA. To our knowledge, the present study represents the first assessment of key hub genes associated with ESCA using bioinformatics methods to obtain information on altered expression of these genes at different stages, as well as their prognostic impact for diagnosed patients.

Material and Methods

Acquisition of datasets

The appropriate datasets were obtained from the Gene Expression Omnibus (GEO) database (). Inclusion criteria were as follows: (1) sample from human esophageal tissue (cell-line and animal experiments excluded); (2) normalized expression data; (3) coverage of diverse types of ESCA, including ESCC, EAC, and rare types of ESCA; (4) control group from non-carcinoma oesophageal tissue; (5) inclusion of mRNA expression level. After an overall scrutiny, the datasets of GSE100942 [15] and GSE111044 [16] were selected as they suited the inclusion standards we set up. Moreover, both of them had been created recently, in the past 5 years, which met the need for timeliness. GSE100942 consisted of ten samples, including five samples from ESCC and five samples from normal esophageal tissue. GSE111044 was made up of six samples, including three samples from small cell esophageal carcinoma (SCEC) and three samples from corresponding normal tissue.

Identification of DEGs

GEO2R (), an online web tool affiliated with GEO, already provided access to the data set and collection. Further analysis was applied to the prepared datasets using GEO2R. The selected datasets were divided into two groups: the normal tissue group and the cancer group. Subsequently, direct statistical analysis of each dataset was conducted with cutoff values of: p<0.01 and 2 |log FC| ≥3. The intersection of the two filtered datasets, representing DEGs, was identified by a web-based tool named Venny v2.1. ().

Enrichment analysis

Gene ontology (GO) enrichment analysis was conducted using the online platform Metascape [17] (). A set of functional classifications and annotations are included in Metascape, such as biological processes (BP), cellular components (CC), and molecular functions (MF). The norm for statistical significance of p<0.05 was determined as the threshold value for our statistical analysis, to deeply screen enriched GO terms. The Kyoto Encyclopedia of Genes and Genomes (KEGG) Orthology Based Annotation System, third version (KOBAS v3.0) [18] () was used to perform the KEGG pathway enrichment analysis. A corrected p value <0.05 was considered to represent statistical significance. Moreover, we used the remaining pathways to draw a bubble chart of the top 20 pathways as a visualization, using OmicShare tools ().

Construction of the PPI network

The screened DEGs were imported into the Search Tool for the Retrieval of Interacting Genes (STRING) [19] () to begin network analysis. STRING possesses a potent ability to process multiple proteins or amino acid sequences input by users, which are then used to construct a protein-protein interaction (PPI) network. Conveniently, STRING can provide various types of exported files, including tsv, PNG, and tables. In our research, we downloaded the requisite files (tsv and PNG) to prepare for validation of hub genes and further research.

Expression analysis of hub genes

In light of degree level, hub genes were identified by cytohubba, an analytic tool attached to Cytoscape software [20] (). Cytohubba can rank nodes in a network by their network features in order to find a requested objective from imported elements of a PPI network, at levels of degree, closeness, and so on. We successfully used Gene Expression Profiling Interactive Analysis (GEPIA2) [21] () to compare mRNA expression levels between ESCA tissue and paired normal tissue. Additionally, differential expression levels in patients at each stage of ESCA were compared in GEPIA.

Survival analysis of hub genes

OncoLnc [22] () was used to analyze predictive hub gene data. OncoLnc is an efficient online analytic box, which can supply survival data correlated with mRNA expression level based on The Cancer Genome Atlas (TCGA) data. We set 50 as the cutoff to divide paired data into high- and low-expression groups, for a Kaplan plot of input genes related to ESCA.

Results

Confirmation of DEGs

After GEO2R analysis of the two screened datasets, the criterion p<0.01 and 2 |log FC| ≥3 was utilized for statistical analysis. A total of 3170 and 318 gene fragments were selected from the GSE111044 and GSE100942 datasets. Genes found to be present in both datasets were selected as DEGs. Consequently, a total of 210 DEGs were screened from the two genetic datasets (Table 1, Figure 1).
Table 1

Details of DEGs filtered from GSE111044 and GSE100942.

SourceTotalElements
Intersection of GSE111044 and GSE100942210SLMAP, SAMD4A, KCNMA1, LDB3, LINC01279, TPX2, EPB41L4B, WISP2, IGF2BP3, CCNB1, MSRB3, ANP32E, NFIA, CNN1, COL1A1, ANLN, BIRC5, GHR, BCHE, DNMT3B, SMAD9, LRRK2, HLF, ADAMTS2, AURKA, KIF14, FHL1, SORBS2, ADRB1, CSRP1, TCF21, KIF4A, KIT, RAD54B, C1QTNF7, NEGR1, FAM72A, KCNAB1, MELK, CDH11, NDC80, CCNA2, GTSE1, NUF2, KRT78, PDK4, MMP1, MYOCD, EPCAM, ANGPTL1, DAAM2, P2RY14, ECT2, KIF23, DEPDC1, EMCN, LPAR3, OGN, POSTN, NBEA, MYLK, CYSLTR1, CCNB2, PRC1, IGFBP6, TNXB, CEP55, MYOC, CCNE2, MCM2, MCM4, CGNL1, ADIRF, HEY1, SRPX, IL1RN, DLGAP5, RAD51AP1, SMTN, TMEM108, SCN7A, CCDC69, CDCA2, MYL9, LRRN4CL, RHPN2, CDH19, PGM5, SOBP, C2orf40, AOX1, COL1A2, LAPTM4B, GULP1, CFD, RNASE4, NEK2, ABLIM1, KIF26B, LPP, FGL2, CHRDL1, HSPB8, RAP1A, DEPDC1B, CILP, ITIH5, HMGB3, TMEM110, NETO2, SYNPO2, SOX4, PLN, MRVI1, EZH2, TNFSF10, CD36, CAPN14, CFH, MYOT, MT1M, RASEF, EMP1, PGM5-AS1, FAM149A, STEAP4, AOC3, LMOD1, ASPA, TMEM35A, CCL14, MYH11, KIF2C, EBF1, EREG, AGTR1, CDCA7, TNS1, CDC20, RBPMS2, C7, BUB1, CCL2, PRR11, IL6, CPED1, RGS2, COL3A1, DPT, SCARA5, PTPRN2, ATP1A2, CDCA3, IFI6, ATAD2, KNL1, EXOSC7, ADAMTS1, ANOS1, SORBS1, FGF7, STIL, UBE2C, STXBP6, MTURN, CCDC80, CLU, CXCL12, IL36A, SLC35F6, TOP2A, AVPR1A, COL14A1, HELLS, FANCI, IRX2, NEXN, SPC25, KIF18A, BUB1B, DKK1, HJURP, AQP1, LMO3, SOCS2, DTL, ACTG2, ADH1B, MEST, ARHGAP6, HMMR, CAB39L, FAM107A, GPX3, KIF20A, SHISA2, ENC1, PTGS1, CASQ2, CTHRC1, UHRF1, SGO2, MAMDC2, TTK, CDKN3, NCAPG, FXYD3, CENPF, NUSAP1, CRCT1
Figure 1

Venn diagram of filtered differentially expressed genes (DEGs) shared by the GSE111044 and GSE100942 datasets. The overlapping area, which contained 210 items, was considered to contain DEGs for ESCA.

GO enrichment analysis

In total, 114 GO terms mediated by the DEGs were identified as enriched, spanning the categories of molecular function (MF), cellular component (CC), and biological process (BP) (Table 2). The MF results implicated DEGs that were profoundly enriched in actin binding, extracellular matrix structural constituents, and structural constituents of muscle (Figure 2). The CC results indicated that DEGs were enriched in chromosome, centromere, spindle, extracellular matrix, and stress fibers (Figure 3). With respect to BP, DEGs were noticeably enriched in processes related to cell division, mitotic cell cycle phase transition, muscle system process, and muscle structural development (Figure 4). We then obtained the top 20 terms of the GO functional enrichment analysis (Figure 5). The results indicated that ESCA-related DEGs were mainly involved in cell division, muscle system process, development and attachment of spindle microtubules to the kinetochore at the midbody, and extracellular matrix.
Table 2

GO enrichment analysis of ESCA-related DEGs.

CategoryTermAnnotationCountP ValueFDR
MFGO: 0042393Histone binding50.04095643.57063
GO: 0004672Protein kinase activity90.03785841.02474
GO: 0017048Rho GTPase binding30.03528738.83208
GO: 0008083Growth factor activity60.02921733.3507
GO: 0003678DNA helicase activity30.02648430.73641
GO: 0042826Histone deacetylase binding50.02326527.53669
GO: 0019901Protein kinase binding100.01906923.15887
GO: 0016887ATPase activity70.01345316.91675
GO: 0048407Platelet-derived growth factor binding30.0057757.618473
GO: 0005201Extracellular matrix structural constituent50.0055747.362384
GO: 0005524ATP binding280.0043155.745381
GO: 0008574ATP-dependent microtubule motor activity, plus-end-directed47.15E-040.97331
GO: 0008017Microtubule binding103.85E-040.525754
GO: 0008201Heparin binding93.10E-040.423032
GO: 0003777Microtubule motor activity72.05E-040.280292
GO: 0005515Protein binding1181.98E-040.271172
GO: 0008307Structural constituent of muscle67.85E-050.107306
GO: 0003779Actin binding151.63E-060.002223
CCGO: 0005814Centriole50.03431136.49999
GO: 0097149Centralspindlin complex20.03208234.56688
GO: 0005694Chromosome50.027230.14128
GO: 0070062Extracellular exosome420.025728.72761
GO: 0005680Anaphase-promoting complex30.02534228.38631
GO: 0005584Collagen type i trimer20.02150424.62943
GO: 0016363Nuclear matrix50.02105624.17972
GO: 0051233Spindle midzone30.01761820.64245
GO: 0005925Focal adhesion110.01024712.53825
GO: 0015629Actin cytoskeleton80.00968211.8868
GO: 0005654Nucleoplasm440.00879410.8543
GO: 0005576Extracellular region290.00817410.12583
GO: 0048471Perinuclear region of cytoplasm150.0077289.598831
GO: 0005813Centrosome120.0068228.519486
GO: 0000922Spindle pole60.0066218.277421
GO: 0015630Microtubule cytoskeleton70.0037714.795012
GO: 0001725Stress fiber50.0027473.514485
GO: 0045120Pronucleus30.0023563.021411
GO: 0005737Cytoplasm770.0016172.083603
GO: 0005829Cytosol540.001541.984823
GO: 0031012Extracellular matrix110.0014771.904168
GO: 0005876Spindle microtubule50.0012821.655013
GO: 0031262Ndc80 complex36.88E-040.890813
GO: 0000942Condensed nuclear chromosome Outer kinetochore36.88E-040.890813
GO: 0005874Microtubule125.90E-040.765086
GO: 0030018Z disc82.97E-040.385939
GO: 0005581Collagen trimer86.25E-050.081295
GO: 0000776Kinetochore82.74E-050.035668
GO: 0005871Kinesin complex72.22E-050.028929
GO: 0005615Extracellular space332.02E-050.026247
GO: 0005578Proteinaceous extracellular matrix147.32E-060.009524
GO: 0005819Spindle106.68E-060.008694
GO: 0000775Chromosome, centromeric region82.59E-060.003375
GO: 0030496Midbody111.40E-060.001819
GO: 0000777Condensed chromosome kinetochore113.37E-084.38E-05
BPGO: 0007015Actin filament organization40.04818855.68998
GO: 0051436Negative regulation of ubiquitin-protein ligase activity involved in mitotic cell cycle40.04655454.41983
GO: 0001666Response to hypoxia60.04602654.00186
GO: 0046777Protein autophosphorylation60.04602654.00186
GO: 0044344Cellular response to fibroblast growth factor stimulus30.04503353.20648
GO: 2000660Negative regulation of interleukin-1-mediated signaling pathway20.04450152.77484
GO: 0010574Regulation of vascular endothelial growth factor production20.04450152.77484
GO: 0043154Negative regulation of cysteine-type endopeptidase activity involved in apoptotic process40.04337251.8463
GO: 0097421Liver regeneration30.04233950.98273
GO: 0051384Response to glucocorticoid40.03734846.59797
GO: 0060326Cell chemotaxis40.03734846.59797
GO: 0007049Cell cycle70.03714746.41426
GO: 0045840Positive regulation of mitotic nuclear division30.03464544.07219
GO: 0071173Spindle assembly checkpoint20.03356443.03115
GO: 0060414Aorta smooth muscle tissue morphogenesis20.03356443.03115
GO: 0006977DNA damage response, signal transduction by p53 class mediator resulting in cell cycle arrest40.03313442.61195
GO: 0000070Mitotic sister chromatid segregation30.03221541.70664
GO: 0006306DNA methylation30.03221541.70664
GO: 0042787Protein ubiquitination involved in ubiquitin-dependent protein catabolic process60.03010439.57493
GO: 0051966Regulation of synaptic transmission, glutamatergic30.0275736.92013
GO: 0051439Regulation of ubiquitin-protein ligase activity involved in mitotic cell cycle30.0275736.92013
GO: 0043547Positive regulation of GTPase activity130.02734936.68291
GO: 0070301Cellular response to hydrogen peroxide40.02669835.9813
GO: 1903779Regulation of cardiac conduction40.025534.66984
GO: 0008283Cell proliferation100.02360132.53933
GO: 0048146Positive regulation of fibroblast proliferation40.02319332.07322
GO: 0000022Mitotic spindle elongation20.02250231.27789
GO: 0001501Skeletal system development60.01982728.11186
GO: 0010881Regulation of cardiac muscle contraction by regulation of the release of sequestered calcium ions30.01919227.33961
GO: 0006939Smooth muscle contraction30.01729924.99359
GO: 0051726Cell cycle regulation60.01342219.96503
GO: 0031145Anaphase-promoting complex-dependent catabolic process50.01234518.5125
GO: 0006957Complement activation, alternative pathway30.00915214.06012
GO: 0051310Metaphase plate congression30.00780112.10953
GO: 0055119Relaxation of cardiac muscle30.00780112.10953
GO: 0048565Digestive tract development40.00715211.1566
GO: 0008284Positive regulation of cell proliferation130.00688810.76681
GO: 0030574Collagen catabolic process50.0059479.363414
GO: 0034501Protein localization to kinetochore30.0053998.535467
GO: 0032060Bleb assembly30.0053998.535467
GO: 0007019Microtubule depolymerization30.0053998.535467
GO: 0006268DNA unwinding involved in DNA replication30.0053998.535467
GO: 0009612Response to mechanical stimulus50.004457.087078
GO: 0071549Cellular response to dexamethasone stimulus40.0041966.694943
GO: 0019221Cytokine-mediated signaling pathway70.0037696.034147
GO: 0032355Response to estradiol60.003735.973802
GO: 0001558Regulation of cell growth60.002133.453634
GO: 0000910Cytokinesis50.0020953.397158
GO: 0007094Mitotic spindle assembly checkpoint40.001412.299434
GO: 0000086G2/M transition of mitotic cell cycle89.49E-041.552707
GO: 0007080Mitotic metaphase plate congression57.83E-041.283031
GO: 0007155Cell adhesion157.52E-041.232171
GO: 0007018Microtubule-based movement73.11E-040.511889
GO: 0030199Collagen fibril organization67.41E-050.122099
GO: 0006936Muscle contraction92.91E-050.047888
GO: 0007052Mitotic spindle organization61.99E-050.032833
GO: 0000281Mitotic cytokinesis61.68E-050.027614
GO: 0007059Chromosome segregation81.15E-050.018962
GO: 0007062Sister chromatid cohesion122.35E-083.87E-05
GO: 0007067Mitotic nuclear division205.66E-119.33E-08
GO: 0051301Cell division251.55E-122.55E-09
Figure 2

Top 16 enriched molecular function GO terms. The horizontal axis is a logarithmic calculation of p value, while GO terms are listed on the y axis. The length of each bar represents lower p value (higher significance). Actin binding was the term with the highest significance level.

Figure 3

Top 16 enriched cellular component GO terms. The horizontal axis is a logarithmic calculation of p value, while GO terms are listed on the y axis. The length of each bar represents lower p value (higher significance). Chromosome and centromeric region was the term with the highest significance level.

Figure 4

Top 20 enriched biological process GO terms. The horizontal axis is a logarithmic calculation of p value, while GO terms are listed on the y axis. The length of each bar represents lower p value (higher significance). Cell division was the term with the highest significance level.

Figure 5

Top 20 results from GO functional enrichment analysis. The size of the filled circles indicates statistical significance. Larger circle size indicates lower p value. Circle color indicates type of GO term, as listed in the legend (lower left corner).

Enrichment analysis of the KEGG pathway

In total, 107 pathways connected with DEGs were procured through KOBAS v3.0. The corrected p value <0.05 was set as the statistical cutoff, and 60 pathways without statistical difference were rejected. The remaining 47 pathways with statistical significance are presented in Table 3. Among these remaining pathways, we chose the top 20 enriched pathways using OmicShare tools (Figure 6). These pathways revealed that a broad spectrum of DEGs participated in the cell cycle, pathways in cancer, the phosphatidylinositol-3-kinase (PI3K)/protein kinase B (AKT) signaling pathway, the cGMP-protein kinase G (PKG) signaling pathway, cytokine-cytokine receptor interaction, and vascular smooth muscle contraction.
Table 3

KEGG pathway enrichment analysis of ESCA-related DEGs.

TermNumberCorrected p valueGenes
Cell cycle101.19E-08CCNB2, CCNB1, CCNA2, BUB1B, CCNE2, CDC20, TTK, MCM4, MCM2, BUB1
cGMP-PKG signaling pathway91.51E-06MRVI1, MYLK, KCNMA1, ADRB1, ATP1A2, RGS2, PLN, AGTR1, MYL9
Vascular smooth muscle contraction82.68E-06MYH11, MRVI1, AVPR1A, ACTG2, MYLK, KCNMA1, AGTR1, MYL9
AGE-RAGE signaling pathway in diabetic complications60.000105714CCL2, IL6, COL1A2, COL1A1, COL3A1, AGTR1
Pathways in cancer110.00016471CXCL12, CCNE2, BIRC5, KIT, IL6, HLF, FGF7, LPAR3, AGTR1, HEY1, MMP1
PI3K-Akt signaling pathway90.000181898GHR, CCNE2, KIT, IL6, COL1A2, EREG, COL1A1, FGF7, LPAR3
Platelet activation60.000181898PTGS1, MYLK, RAP1A, COL1A2, COL1A1, COL3A1
Calcium signaling pathway70.000181898AVPR1A, CASQ2, MYLK, ADRB1, PLN, AGTR1, CYSLTR1
Oocyte meiosis60.000181898CCNB2, CCNB1, BUB1, CCNE2, CDC20, AURKA
Cytokine-cytokine receptor interaction80.000256568CCL2, IL1RN, TNFSF10, GHR, DTL, IL6, IL36A, CXCL12
Protein digestion and absorption50.000372719COL3A1, COL14A1, COL1A2, ATP1A2, COL1A1
Rheumatoid arthritis50.000372719IL6, CCL2, CXCL12, DTL, MMP1
Progesterone-mediated oocyte maturation50.000505745CCNB2, CCNB1, CCNA2, AURKA, BUB1
p53 signaling pathway40.001955985CCNE2, CCNB2, CCNB1, GTSE1
Human T-cell leukemia virus 1 infection60.001955985CCNB2, CCNA2, BUB1B, CCNE2, CDC20, IL6
Adrenergic signaling in cardiomyocytes50.002613273ADRB1, AGTR1, SCN7A, ATP1A2, PLN
Neuroactive ligand-receptor interaction70.002678945AVPR1A, GHR, P2RY14, ADRB1, LPAR3, AGTR1, CYSLTR1
ECM-receptor interaction40.003021888HMMR, COL1A2, CD36, COL1A1
Cellular senescence50.003023275IL6, CCNB2, CCNB1, CCNE2, CCNA2
Jak-STAT signaling pathway50.003030745IL6, AOX1, FHL1, GHR, SOCS2
Tyrosine metabolism30.003030745AOX1, AOC3, ADH1B
Amoebiasis40.003552075IL6, COL3A1, COL1A2, COL1A1
Viral protein interaction with cytokine and cytokine receptor40.004092655IL6, CCL2, CXCL12, TNFSF10
Malaria30.005771037IL6, CCL2, CD36
Intestinal immune network for IgA production30.005771037IL6, CXCL12, DTL
Focal adhesion50.005771037MYL9, COL1A2, MYLK, RAP1A, COL1A1
cAMP signaling pathway50.007342941ADRB1, MYL9, ATP1A2, RAP1A, PLN
Regulation of actin cytoskeleton50.007342941FGF7, CFD, CXCL12, MYLK, MYL9
Relaxin signaling pathway40.008337936COL3A1, COL1A1, COL1A2, MMP1
FoxO signaling pathway40.008510363IL6, CCNB2, CCNB1, TNFSF10
Renin secretion30.011964411ADRB1, AGTR1, KCNMA1
Phospholipase D signaling pathway40.011964411LPAR3, AGTR1, AVPR1A, KIT
PPAR signaling pathway30.015020846SORBS1, CD36, MMP1
Hepatitis B40.015736213IL6, CCNE2, BIRC5, CCNA2
Complement and coagulation cascades30.015736213CLU, C7, CFH
Tight junction40.017269759MYH11, EPB41L4B, RAP1A, MYL9
Salivary secretion30.021192296ADRB1, KCNMA1, ATP1A2
MicroRNAs in cancer50.021964309DNMT3B, CCNE2, EZH2, KIF23, SOX4
IL-17 signaling pathway30.021964309IL6, CCL2, MMP1
Hematopoietic cell lineage30.023983878IL6, CD36, KIT
Pancreatic secretion30.024052267KCNMA1, RAP1A, ATP1A2
Human papillomavirus infection50.029816413CCNE2, CCNA2, COL1A1, COL1A2, HEY1
Rap1 signaling pathway40.029816413FGF7, LPAR3, RAP1A, KIT
Prion diseases20.030030738IL6, C7
DNA replication20.03062247MCM4, MCM2
Leukocyte transendothelial migration30.03062247CXCL12, RAP1A, MYL9
AMPK signaling pathway30.035976653CAB39L, CCNA2, CD36
Figure 6

The top 20 KEGG pathways involving the differentially expressed genes (DEGs). Rich factor is represented on the x axis. KEGG pathway annotation is represented on the y axis. The size of the filled circles symbolizes the number of genes involved in the pathways. The color of the filled circles denotes significance, with gradation from red to blue symbolizing increasing p value.

Construction of the PPI network and identification of hub genes

The PPI network exported by STRING comprised 202 nodes and 1571 edges (Figure 7). The modular interaction network was viewed as statistically enriched, due to the enrichment p value <1.0e-16 (<0.05). The top 10 genes in the PPI network, in terms of degree ranking, were regarded as hub genes (Figure 8); these genes probably play crucial roles in the incidence and development of ESCA. TOP2A was in the highest rank among all the DEGs, followed by UBE2C, BUB1, KIF20A, TTK, CCNB2, KIF2C, TPX2, CCNA2, and CCNB1.
Figure 7

PPI network of ESCA-related differentially expressed genes (DEGs). The number of nodes was 202 (with 8 untouched genes). The number of edges was 1571; much higher than the expected 298 edges. The average node degree was 15.6. The average local clustering coefficient was 0.582. The PPI enrichment p value was less than 1.0e-16.

Figure 8

Topological diagram of the top 10 hub genes. In accordance with degree ranking, the top 10 differentially expressed genes (DEGs) were accepted as hub genes. Among all the DEGs, TOP2A was in the highest degree rank, followed by UBE2C, BUB1, KIF20A, TTK, CCNB2, KIF2C, TPX2, CCNA2, and CCNB1.

The expression boxplots of the hub genes, conveyed on the basis of TCGA and GTEx data, demonstrated that these hub genes were upregulated in ESCA tissue compared with normal esophageal tissue (Figure 9). Further, we evaluated the expression level of the hub genes at different pathological stages among ESCA patients (stage I, stage II, stage III, and stage IV patients). The stage-plots were scored using GEPIA (Figure 10). As is shown below, the expression level of the hub genes differed at different stages.
Figure 9

Differences in expression level of the top 10 hub genes between ESCA tissue and matched normal tissue. The plots are based on TCGA and GTEx data, and reveal that these hub genes are significantly upregulated in ESCA tissue compared with normal tissue (p<0.01). The red * denotes statistically significant difference.

Figure 10

GEPIA stage-plots for the top 10 hub genes in patients at different stages of ESCA. The stage-plot is considered statistically significant when Pr >F. Greater F values represent increasing significance.

Survival analysis for the hub genes

The survival curve output by OncoLnc showed that all the hub genes with high expression, in comparison with those with low expression, improved the percentage of late survival, after the 2000-day timepoint (>2000 on the x axis) (Figure 11). Moreover, when the surviving percentile of ESCA patients was stationary, high expression of the hub genes prolonged survival time. However, the hub genes exhibited few associations with early survival.
Figure 11

Effects of the top 10 hub genes on prognosis of ESCA patients, as calculated by OncoLnc. All of the hub genes improved late survival time of ESCA patients (x>2000), while few exhibited any association with early survival time.

Discussion

ESCA is a major global health challenge with multifactorial etiology, including both genetic and environmental components. Efforts to detect inchoate changes have attenuated the development of malignant cancer, and have even had some success in prevention [9]. Therefore, it is of vital importance to seek out predictive indicators and therapeutic markers for ESCA. In our research, 210 DEGs were screened from two genetic expression datasets, via the GEO bioinformatics portal. Then, GO function enrichment analysis and KEGG pathway enrichment analysis were applied to filter the DEGs. A PPI network was constructed using STRING to obtain hub genes that are likely to be central to the ESCA pathological process. Most previous studies in this field stopped at this point, resulting in limited clinical usefulness. Therefore, we implemented further analysis targeting the crucial hub genes identified in the first steps of our study. We focused on the assessment of screened hub genes with greater possibility as new therapeutic targets for treating ESCA. We conducted expression analysis of these hub genes in patients at different stages of ESCA, as well as prognostic analysis. These analyses showed the potential of these hub genes as biomarkers for appraisal in the process of therapy and post-therapy. The results of GO functional enrichment analysis identified several DEGs exhibiting a distinction from genes expressed in healthy esophageal tissue. The DEGs were principally targeted to the midbody, with functions related to cell division and attachment of spindle microtubules to the kinetochore. The results suggested that multitudes of DEGs were closely associated with nuclear activities, especially cell division. Cell division as a functional category includes mechanisms to properly orient and position the mitotic spindle, which is important because incorrect activity related to the spindle contributes to disease, even carcinogenesis [23,24]. Echoing the KEGG pathway enrichment analysis, the DEGs were strongly related to the cell cycle. Dysregulation of the cell cycle leading to endless proliferation of cells has been implicated in tumorigenesis [25]. Jing Wen et al. [26] found that, in vitro and in vivo, transcriptional activation of miR-424 elevated proliferation of ESCC cell lines and led to poor survival in ESCC patients; this activation facilitates both G1/S and G2/M cell cycle transitions. However, inhibition of cell cycle progression alleviates the development of malignancy. Chinese researchers have demonstrated that a novel Notch inhibitor called FLI-06 exerts antitumor activity in a dose-dependent manner via cell-cycle arrest and proliferation suppression in ESCC cells [27]. Above all, therapeutic interventions targeting cell division may generate antitumor activity. Additionally, the KEGG pathway enrichment analysis showed that the DEGs were enriched in the PI3K-Akt signaling pathway, the cGMP-PKG signaling pathway, cytokine-cytokine receptor interaction, and vascular smooth muscle contraction. Intriguingly, those findings differed from those of a similar study using bioinformatics methods conducted by He et al. [25]. Their findings indicated that DEGs related to ECA were mainly enriched in complement and coagulation cascades, mature- onset diabetes in the young, and retinol metabolism. Given that their sample was extracted from ECA patients and ours from patients diagnosed with ESCC and SCEC, it was inevitable for the two studies to find differences. Still, the evidence supporting our scientific results are as follows: The PI3K-Akt signaling pathway is implicated in several cancers, including breast cancer, colorectal carcinoma, hepatocellular cancer, and others [28-30]. Invariably, this pathway plays an important role in metastasis and prognosis among ESCA patients [31,32]. Ni Shi et al. [33] showed that MK2206/BEZ235 enhances apoptosis and inhibits tumor growth targeted at AKT phosphorylation in a mouse xenograft model of ESCC and in vitro. Moreover, the combination of MK2206 and BEZ235 boost antitumor effects because of dual PI3K and AKT inhibition. Another study demonstrated that Osthole, extracted from the herb Cnidium monnieri (L.) Cuss, reduced expression of PI3K and phosphorylated AKT (p-AKT), and thereby decreased ESCC proliferation [34]. Hence, treatments targeting the PI3K-Akt signaling pathway provide a promising approach to prevent ESCA and/or to slow its progression. The cGMP-PKG signaling pathway modulates massive physiological and pathological parameters [35]. Correlation between inflammation and the cGMP-PKG signaling pathway has been identified, involving mechanisms of neutrophil migration, mitochondrial permeability transition, and oscillations of calcium ions [36,37]. Interestingly, the mechanism responsible for dysplasia-induced (especially by Barrett’s esophagus) early EAC included a substantial promotion in the levels of inflammatory cytokines [38]. This implies that the cGMP-PKG signaling pathway is possibly connected with ESCA pathology in the early stages. Although studies testing the potential role of pharmacological modulation of the cGMP-PKG signaling pathway as an anticancer therapy are scant, we still hold a positive belief in their potential. Tumor events are elicited by the collective expression of multiple genes, while single genes function as carcinogens particularly associated with severity [39]. As revealed in the PPI network, coordinated DEGs mediated the occurrence and development of ESCA. To select the critical genes for the interlaced construction, we input information for all DEGs to Cytoscape to screen the top 10 nodes in terms of degree. As a consequence, the degree of TOP2A ranked the highest, followed by UBE2C, BUB1, KIF20A, TTK, CCNB2, KIF2C, TPX2, CCNA2, and CCNB1. These nodes were regarded as hub genes for ESCA. In contrast, He et al. [25] identified IL8, IVL, TIMP1, FN1, SERPINE1, SERPINA1, CFTR, SPP1, COL1A1, and AGT as hub genes for ESCA. They focused on EAC while we focused on ESCC and SCEC. It has been established that there are substantial differences between EAC and ESCC, especially in terms of pathology [3], so it is not surprising to obtain distinct results. Taken together, all 10 hub genes we identified were linked to cell proliferation, involving the cell cycle or chromosome activity. Details about key hub genes are demonstrated as follows: CCNA2 controls both the G1/S and the G2/M transition phases of the cell cycle, via the formation of specific serine/threonine protein kinase holoenzyme complexes with the cyclin-dependent protein kinases CDK1 or CDK2 [40]. Emerging evidence indicates that miR-219-5p mitigates ESCC cell proliferation by decreasing expression levels of CCNA2. Knockdown of CCNA2 enhances the impacts of miR-219-5p in terms of cell proliferation and cell cycle process [41]. CCNB2 and CCNB1 are elementary parts of the G2/M (mitosis) transition. One of the reasons why the overexpression of erythrocyte membrane protein band 4.1 like 3 (EPB41L3) noticeably improved overall survival rates among ESCA patients was its activation of CCNB1 signaling to induce G2/M cell cycle arrest [42]. A number of studies have found the high expression of CCNB2 to be related to tumor growth and poor prognosis in several cancers, including non-small cell lung carcinomas (NSCLC), hepatocellular carcinoma, and gastric cancer [43-45]. Without laboratory data on ESCA, the role of CCNB2 is still controversial. Given that CCNB2, CCNA2, and CCNB1 are essential for cell cycle regulation, and are subsidiary to the cyclin family, they have potential as therapeutic targets for ESCA. KIF20A and KIF2C are implicated in chromosome activity. They belong to the kinesin superfamily, and probably possess the potential to ameliorate malignancy. The kinesin family plays a significant role in microtubule motor activity of several cellular and extracellular structures, including Golgi membranes, chromosomes, associated vesicles along microtubules, and the central spindle [46]. KIF20A is a member of the kinesin-6 subfamily, and is crucial for cytokinesis and spindle assembly. Its expression pattern is linked to tumor development and prognosis [47]. KIF2C is responsible for the majority of microtubule plus-end depolymerizing activity in mitotic cells. It is required for chromosome segregation and congression during mitosis, and regulates the turnover and conversion of microtubules [48]. An immunohistochemical analysis conducted in ESCA tissue and adjacent non-cancerous tissues revealed that higher KIF2C expression led to higher pathologic tumor status and poorer tumor differentiation, but only for male patients [49]. TOP2A controls topological states of DNA, a crucial role for proper segregation of daughter chromosomes during mitosis and meiosis [50]. One clinical study suggested that high TOP2A could serve as a biomarker driving medical therapy [51]. Another study showed that the knockdown of the long noncoding RNA (lncRNA) DDX11-AS1 reversed paclitaxel (PTX) resistance by means of inhibiting the expression level of TOP2A [52]. Both of these results imply that targeting the expression of TOP2A may have promise as a pharmacological treatment. BUB1 is identified as a mitotic checkpoint that is important for spindle-assembly checkpoint signaling and correct chromosome alignment [53]. The transcriptional level of BUB1 in Barrett’s patients, leading to early chromosomal instability (CI), can be helpful in EAC [54]. For the time being, we can hypothesize that treatment modifying BUB1 expression may benefit ESCA patients. Meanwhile, the expression of UBE2C, TTK, and TPX2 is involved in ESCA pathology and prognosis. As a ubiquitin-conjugating enzyme, UBE2C promotes the degradation of several cell cycle-regulated proteins that control progression through mitosis [55]. Knockdown of UBE2C significantly inhibits cell proliferation via cell-cycle regulation, in in-vitro ESCA cell models, particularly in TE-1 cell lines [56]. TTK is associated with cell proliferation and chromosome alignment [57]. Previous clinical trials found that immunotherapies using epitope peptides derived from TTK potentiated cell response and immunity against ESCA [58,59]. TPX2 is required for normal assembly of microtubules as well as mitotic spindles during apoptosis. As detected by RT-qPCR, the relative expression of TPX2 is upregulated in ESCA tumor tissue, and in lymph node metastasis [60]. Moreover, low expression of miR-491 aimed at TPX2 promotes cell invasion in ESCA clinical samples, affecting carcinogenesis and cancer development [61]. Collectively, we identified 210 DEGs and implemented GO analysis, KEGG pathway enrichment analysis, and PPI network construction to understand their possible roles in ESCA carcinogenesis. The enriched KEGG pathway implicated pathological mechanisms in ESCA, including the cell cycle and the PI3K-Akt and cGMP-PKG signaling pathways. Furthermore, we identified 10 hub genes and assessed their prognostic value. The 10 hub genes were TOP2A (highest ranking in terms of degree), UBE2C, BUB1, KIF20A, TTK, CCNB2, KIF2C, TPX2, CCNA2, and CCNB1. These genes showed marked connections with late survival of ESCA patients, indicating their value in diagnosis and survival improvement. Our research provides new insights for ESCA early diagnosis and survival evaluation, at the level of genes, pathways, and molecular functions. Even though we analyzed two microarray datasets containing data from patients diagnosed with ESCA, the chosen samples in our research are still inadequate for definitive conclusions, especially in light of the scarcity of experimental evidence related to our findings. It will be necessary to validate our research findings through practical experiments, such as immunohistochemistry on tumor tissue and paired healthy control tissue.

Conclusion

In this research, a total of 210 ESCA-related DEGs were found, and their enriched functions and pathways were analyzed using bioinformatics methods. Taking the mechanistic link between enriched pathways and ESCA into consideration, our results may provide helpful information for understanding ESCA pathology and ESCA treatment. The PPI network we constructed from the DEGs revealed that these genes mediate the occurrence and development of ESCA through multi-gene interactions. From these, we selected the top 10 hub genes: TOP2A (highest ranking of degree), UBE2C, BUB1, KIF20A, TTK, CCNB2, KIF2C, TPX2, CCNA2, and CCNB1. These genes are closely associated with the etiology and prognosis of ESCA, and have potential as therapeutic or early-diagnostic biomarkers. Nevertheless, there are some limitations in our research. The absence of laboratory evidence imposes restrictions on the conclusions that can be drawn from our results. In addition, there were no more than 10 ESCA patients in each sample; this sample size may be insufficient. Therefore, in order to verify our findings, further experimental research, such as immunohistochemistry studies and large clinical studies, is needed.
  58 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

Review 2.  The cGMP/PKG pathway as a common mediator of cardioprotection: translatability and mechanism.

Authors:  Javier Inserte; David Garcia-Dorado
Journal:  Br J Pharmacol       Date:  2015-03-16       Impact factor: 8.739

Review 3.  The stem cell division theory of cancer.

Authors:  Miguel López-Lázaro
Journal:  Crit Rev Oncol Hematol       Date:  2018-01-31       Impact factor: 6.312

Review 4.  Barrett's Esophagus - State of the Art.

Authors:  Marcel Tanţău; Mihaela Laszlo; Alina Tanţău
Journal:  Chirurgia (Bucur)       Date:  2018 Jan-Feb

5.  FLI-06 suppresses proliferation, induces apoptosis and cell cycle arrest by targeting LSD1 and Notch pathway in esophageal squamous cell carcinoma cells.

Authors:  Zhaoming Lu; Yandan Ren; Mengying Zhang; Tianli Fan; Yang Wang; Qi Zhao; Hong-Min Liu; Wen Zhao; Guiqin Hou
Journal:  Biomed Pharmacother       Date:  2018-08-31       Impact factor: 6.529

6.  CCNB2 overexpression is a poor prognostic biomarker in Chinese NSCLC patients.

Authors:  Xiaotao Qian; Xuekun Song; Yuan He; Zhiyong Yang; Tao Sun; Jing Wang; Guiqi Zhu; Weihai Xing; Changxuan You
Journal:  Biomed Pharmacother       Date:  2015-08-28       Impact factor: 6.529

7.  Pan-cancer analysis of somatic mutations and transcriptomes reveals common functional gene clusters shared by multiple cancer types.

Authors:  Hyeongmin Kim; Yong-Min Kim
Journal:  Sci Rep       Date:  2018-04-16       Impact factor: 4.379

8.  UBE2C Is a Transcriptional Target of the Cell Cycle Regulator FOXM1.

Authors:  Pedro Nicolau-Neto; Antonio Palumbo; Marco De Martino; Francesco Esposito; Tatiana de Almeida Simão; Alfredo Fusco; Luiz Eurico Nasciutti; Nathalia Meireles Da Costa; Luis Felipe Ribeiro Pinto
Journal:  Genes (Basel)       Date:  2018-03-29       Impact factor: 4.096

9.  Inhibition of esophageal cancer growth through the suppression of PI3K/AKT/mTOR signaling pathway.

Authors:  Ni Shi; Hao Yu; Tong Chen
Journal:  Onco Targets Ther       Date:  2019-09-17       Impact factor: 4.147

10.  The PI3K/AKT signaling pathway: Associations of miRNAs with dysregulated gene expression in colorectal cancer.

Authors:  Martha L Slattery; Lila E Mullany; Lori C Sakoda; Roger K Wolff; John R Stevens; Wade S Samowitz; Jennifer S Herrick
Journal:  Mol Carcinog       Date:  2017-11-19       Impact factor: 4.784

View more
  4 in total

1.  The Bioinformatical Identification of Potential Biomarkers in Heart Failure Diagnosis and Treatment.

Authors:  Xiaodong Sheng; Xiaoqi Jin; Yanqi Liu; Tao Fan; Zongcheng Zhu; Jing Jin; Guanqun Zheng; Zhixian Chen; Min Lu; Zhiqiang Wang
Journal:  Genet Res (Camb)       Date:  2022-05-16       Impact factor: 1.375

2.  Expression of the CLCA4 Gene in Esophageal Carcinoma and Its Impact on the Biologic Function of Esophageal Carcinoma Cells.

Authors:  Xin Song; Shuai Zhang; Shouchuan Li; Ye Wang; Xinming Zhang; Feng Xue
Journal:  J Oncol       Date:  2021-06-04       Impact factor: 4.375

3.  m5C RNA Methylation Primarily Affects the ErbB and PI3K-Akt Signaling Pathways in Gastrointestinal Cancer.

Authors:  Shixin Xiang; Yongshun Ma; Jing Shen; Yueshui Zhao; Xu Wu; Mingxing Li; Xiao Yang; Parham Jabbarzadeh Kaboli; Fukuan Du; Huijiao Ji; Yuan Zheng; Xiang Li; Jing Li; Qinglian Wen; Zhangang Xiao
Journal:  Front Mol Biosci       Date:  2020-12-07

4.  Drug repositioning for esophageal squamous cell carcinoma.

Authors:  Adam N Bennett; Rui Xuan Huang; Qian He; Nikki P Lee; Wing-Kin Sung; Kei Hang Katie Chan
Journal:  Front Genet       Date:  2022-09-28       Impact factor: 4.772

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.