Literature DB >> 34987580

Weighted Gene Correlation Network Analysis Identifies Specific Functional Modules and Genes in Esophageal Cancer.

Wei Xu1, Jian Xu1, Zhiqiang Wang1, Yuequan Jiang1.   

Abstract

OBJECTIVE: Esophageal cancer (ESCA) is one of the most aggressive malignancies globally with an undesirable five-year survival rate. Here, this study was conducted for determining specific functional genes linked with ESCA initiation and progression.
METHODS: Gene expression profiling of ESCA was curated from TCGA (containing 160 ESCA and 11 nontumor specimens) and GSE38129 (30 paired ESCA and nontumor tissues) datasets. Differential expression analysis was conducted between ESCA and nontumor tissues with adjusted p value <0.05 and |log2fold-change|>1. Weighted gene coexpression network analysis (WGCNA) was conducted for determining the ESCA-specific coexpression modules and genes. Thereafter, ESCA-specific differentially expressed genes (DEGs) were intersected. Functional enrichment analysis was then presented with clusterProfiler package. Protein-protein interaction was conducted, and hub genes were determined. Association of hub genes with pathological staging was evaluated, and survival analysis was presented among ESCA patients.
RESULTS: This study determined 91 ESCA-specific DEGs following intersection of DEGs and ESCA-specific genes in TCGA and GSE38129 datasets. They were remarkably linked to cell cycle progression and carcinogenic pathways like the p53 signaling pathway, cellular senescence, and apoptosis. Ten ESCA-specific hub genes were determined, containing ASPM, BUB1B, CCNA2, CDC20, CDK1, DLGAP5, KIF11, KIF20 A, TOP2A, and TPX2. They were prominently associated with pathological staging. Among them, KIF11 upregulation was in relation to undesirable prognosis of ESCA patients.
CONCLUSION: Collectively, we determined ESCA-specific coexpression modules and hub genes, which offered the foundation for future research concerning the mechanistic basis of ESCA.
Copyright © 2021 Wei Xu et al.

Entities:  

Year:  2021        PMID: 34987580      PMCID: PMC8723838          DOI: 10.1155/2021/8223263

Source DB:  PubMed          Journal:  J Oncol        ISSN: 1687-8450            Impact factor:   4.375


1. Introduction

Esophageal cancer (ESCA) ranks the eighth major cancer type as well as the sixth major cause of cancer-relevant deaths across the globe [1]. Tobacco and alcohol consumption are the main environmental risk factors of ESCA. The five-year survival rate is nearly 15% [2]. It mainly contains two histological subtypes: esophageal squamous cell carcinoma (approximately 90%) and esophageal adenocarcinoma (around 10%) [3]. Patients' advanced clinical presentation is linked to locally late and distant metastasis, which contributes to undesirable survival outcome. Additionally, because of tumor heterogeneity and acquired drug resistance, inherent resistance to radiotherapy and chemotherapy triggers therapeutic failure and unfavorable survival rate [2]. ESCA therapy depends upon patients' and tumors' features, especially the tumor, node, metastasis (TNM) staging system [4]. In the early stage, patients are suitable for endoscopic resection, while those in the advanced stage receive surgical resection, chemotherapy, chemoradiotherapy, or their combination [4]. For patients with unresectable ESCA, systemic chemotherapy is applied. Additionally, immunotherapy has emerged as a therapeutic option for advanced or metastatic patients [5]. Although the therapeutic options have been steadily increasing, the molecular mechanisms underlying ESCA remain indistinct. The pathogenesis of ESCA is a multistep process, involving distinct stages until eventually cancers [6]. Hence, to focus on the molecular mechanisms underlying the initiation and progression of ESCA may assist uncover underlying diagnostic markers or treatment targets. Weighted gene coexpression network analysis (WGCNA) is a reliable systematic biological algorithm, which may emphasize coexpression genomic modules and effectively evaluate the interactions between coexpression modules and clinical phenotypes [7]. This algorithm has been widely utilized for discovering cancer-specific modules and hub genes like bladder cancer [7], hepatocellular carcinoma [8], and lung cancer [9]. Limited studies have applied the WGCNA method to uncover the pathogenesis of ESCA. For instance, Nangraj et al. identified hub genes shared between Barrett's esophagus and esophageal adenocarcinoma through integrated analysis of protein-protein interaction (PPI) and WGCNA [10]. Through WGCNA, miR-92b-3p was determined as a pathogenic gene in ESCA [11]. Integrated analysis of WGCNA and network pharmacology deciphered the molecular mechanisms of compound Kushen injection in ESCA treatment [12]. Here, this study adopted the WGCNA algorithm for determining specific functional modules and genes in ESCA, offering the foundation for future research concerning the mechanistic basis of ESCA.

2. Materials and Methods

2.1. Data Collection and Preprocessing

The RNA-seq data of ESCA were retrieved from the Cancer Genome Atlas (TCGA) GDC Application Programming Interface. Gene expression profiling data (read counts) were processed and transformed into gene ID Ensembl (version 90). In total, 160 ESCA and 11 normal tissues were included. Microarray expression profiling of 30 ESCC tumors and adjacent normal tissues was curated from the GSE38129 dataset [13] in the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/gds/) repository. This dataset was in accordance with the GPL571 platform ((HG-U133 A_2) Affymetrix Human Genome U133A 2.0 Array). The raw expression profiling was background-corrected and normalized by quantile utilizing the robust multiarray average (RMA) method.

2.2. Differential Expression Analysis

Differentially expressed genes (DEGs) were selected utilizing the linear models for microarray data (limma; version 3.50.0) package through comparison of the expression profiling between ESCA and normal tissues [14]. The matched p values of gene symbols following the t-test were calculated, and adjusted p value <0.05 and |log2fold-change|>1 were set as the selection criteria. The volcano and heatmap of the DEGs were drawn.

2.3. WGCNA

Coexpression networks were separately established in TCGA and GSE38129 datasets utilizing WGCNA package (version 1.69) [15]. The genes with the first 25% standard deviation were chosen as the input genes. For constructing a scale-free network, the optimal soft threshold power value (β; ranging from 1 to 20) was determined with the “pickSoftThreshold” function through calculation of the scale-free fit index. Pearson's correlation matrix was conducted for evaluating the similarity among the pairwise genes utilizing the “cor” function. Thereafter, the adjacency was determined in accordance with β and Pearson's correlation matrix utilizing the “TOMsimilarity” function. Meanwhile, the corresponding dissimilarity (dissTOM) was determined. The modules were segmented with a dynamic cut tree algorithm, and similar modules were merged into one. Module eigengenes (MEs) that were the first principal component of gene expression patterns within a specific module were identified for each module.

2.4. Identification of ESCA-Relevant Coexpression Models

In this study, the most crucial critical feature was tissue type that was designated as ESCA tumor and normal specimens. Pearson correlation between MEs and clinical feature was analyzed. Modules that possessed the strongest correlation coefficient were determined as the ESCA-relevant coexpression models. Module membership indicates the intramodule connectivity of any gene within a given module. The higher the absolute value of module membership, the higher the negative or positive correlation between the gene with the module eigengenes. Gene significance was utilized for incorporating external information to the coexpression network. The higher the absolute value of gene significance, the higher the biological significance of a gene for tissue type. ESCA-relevant genes within the ESCA-relevant coexpression models were determined in accordance with module membership >0.8 and gene significance >0.5.

2.5. Identification of ESCA-Specific DEGs

For achieving the intersection of DEGs and coexpressed genes, an online web tool (http://bioinfogp.cnb.csic.es/tools/venny/index.html) was adopted for plotting Venn diagram.

2.6. Function Enrichment Analysis

Functional annotation of ESCA-specific DEGs was presented with the clusterProfiler package (version 4.2.0), containing Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis [16]. GO terms comprised of the biological process (BP), cellular component (CC), and molecular function (MF).

2.7. Protein-Protein Interaction (PPI) Analysis

The PPI network of ESCA-specific DEGs was conducted on the basis of the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING; version 11.0; https://string-db.org) online tool [17]. The CytoHubba plugin [18] of Cytoscape software (version 3.7.2) [19] was adopted for selecting the hub genes within the PPI network [18]. Herein, the first 10 genes were determined as hub genes.

2.8. Survival Analysis

In accordance with the optimal cutoff value determined by survival package, ESCA patients were stratified into high and low expression groups of the 10 ESCA-specific hub genes. Kaplan–Meier curves of overall survival were conducted between groups, and log-rank tests were utilized for comparing the survival differences.

2.9. Statistical Analysis

All the analyses in this study were implemented utilizing R software (version 3.5.1). Student's t test or Wilcoxon test was adopted for comparisons between groups. Spearman correlation analysis was carried out to evaluate the interactions of the 10 ESCA-specific hub genes with pathological staging of ESCA patients. P value<0.05 indicated the statistical significance.

3. Results

3.1. Exploration of DEGs in ESCA

For investigating the genetic alterations during the progression from normal to ESCA, we conducted differential expression analysis between ESCA tumors and normal tissues both in TCGA and GSE38129 datasets. In TCGA cohort, compared with 11 normal tissues, 1221 genes presented remarkable downregulation while 1169 genes displayed prominent upregulation in 160 ESCA tumors in accordance with adjusted p value <0.05 and |log2fold-change|>1 (Figures 1(a) and 1(b); Supplementary Table 1). With the same selection criteria, in the GSE38129 dataset, we determined 360 upregulated and 376 downregulated genes in 30 ESCA tumors in comparison to 30 nontumor tissues (Figures 1(c) and 1(d); Supplementary Table 2).
Figure 1

Analysis of DEGs of ESCA both in TCGA and GSE38129 datasets. (a) Volcano plots depict the results of differential expression analysis between 160 ESCA tumors and 11 normal tissues in TCGA cohort. Red bubble indicates upregulated gene in ESCA; green bubble represents downregulated gene in ESCA; black bubble is indicative of nonsignificant gene. (b) Heatmap visualizes the expression patterns of DEGs with adjusted p value <0.05 and |log2fold-change|>1 in 160 ESCA tumors (T) and 11 normal tissues (N) in TCGA cohort. Red represents upregulation, while green indicates downregulation. (c) Volcano plots present the results of differential expression analysis between 30 paired ESCA tumors and nontumor tissues in the GSE38129 dataset. Red bubble expresses upregulated gene in ESCA; green bubble is indicative of downregulated gene in ESCA; black bubble represents nonsignificant gene. (d) Heatmap displays the expression patterns of DEGs with adjusted p value <0.05 and |log2fold-change|>1 in 30 paired ESCA tumors (T) and nontumor tissues (N) in the GSE38129 dataset. Red is indicative of upregulation while green is indicative of downregulation.

3.2. Establishment of a Coexpression Network and Discovery of ESCA-Specific Coexpression Module in the TCGA Dataset

We first curated gene expression matrix of ESCA patients from TCGA cohort and chose the genes with the top 25% variances for subsequent analysis. No outlier sample was found, and we conducted a sample clustering tree, as shown in Figure 2(a). Thereafter, the soft threshold power value was set as 10 (scale-free topology R2 = 0.90) for constructing a scale-free network (Figure 2(b)). The adjacency matrix and the topological overlap matrix were separately developed. In total, 9 coexpression modules were clustered in accordance with the average hierarchical clustering and dynamic cutting tree (Figure 2(c)). The association of coexpression modules with clinical trait was analyzed. In Figure 2(d), the yellow module displayed the strongest correlation to tissue type, indicating that this module was strongly linked to ESCA progression. In line with module membership >0.8 and gene significance >0.5, we determined ESCA-specific genes (Figures 2(e) and 2(f)).
Figure 2

Establishment of a coexpression network and discovery of ESCA-specific coexpression module in the TCGA dataset. (a) Sample cluster analysis. (b) The scale-free network topology (left) as well as mean connectivity (right) under distinct soft threshold power values. (c) Gene dendrogram clustered in accordance with a dissimilarity measure. The upper panel indicates gene tree, and the bottom panel represents gene modules identified by diverse colors. (d) Heatmap visualizes the interaction between coexpression modules and clinical trait-tissue type. The upper number in each cell presents Pearson correlation coefficient between each module and tissue type. Meanwhile, the lower number indicates the p value. (e) Scatter plots depict the interaction between module membership and gene significance for normal tissue type for the yellow module. (f) Scatter plots present the interaction between module membership and gene significance for ESCA tumor tissue type for the yellow module.

3.3. Development of a Coexpression Network and Discovery of ESCA-Specific Coexpression Module in the GSE38129 Cohort

The coexpression network was also developed in the GSE38129 dataset. In accordance with the mRNA expression matrix, we selected the genes with the top 25% variances. As shown in Figure 3(a), there was no outlier sample among 30 paired ESCA tumors and nontumors. Afterwards, we established a scale-free network in line with the soft threshold power value = 20 (scale-free topology R2 = 0.90; Figure 3(b)). Following construction of the adjacency matrix and the topological overlap matrix, we determined 7 coexpression modules on the basis of the average hierarchical clustering and dynamic cutting tree (Figure 3(c)). In Figure 3(d), the turquoise module presented the strongest association with tissue type, demonstrating that this module was strongly linked to ESCA progression. Following module membership >0.8 and gene significance >0.5, ESCA-specific genes were determined (Figures 3(e) and 3(f)).
Figure 3

Development of a coexpression network and discovery of ESCA-specific coexpression module in the GSE38129 cohort. (a) Sample cluster analysis of 30 paired ESCA tumors and nontumors. (b) The scale-free network topology (left) and mean connectivity (right) following diverse soft threshold power values. (c) Gene dendrogram clustered in line with a dissimilarity measure. The upper panel presents gene tree and the bottom panel is indicative of gene modules signed by diverse colors. (d) Heatmap displays the relationship between coexpression modules and clinical trait-tissue type. The upper number in each cell presents Pearson correlation coefficient between each module and tissue type. Additionally, the lower number represents the p value. (e) Scatter plots showing the association between module membership and gene significance for normal tissue type for the turquoise module. (f) Scatter plots present the correlation between module membership and gene significance for ESCA tumor tissue type for the turquoise module.

3.4. Identification of ESCA-Specific DEGs and Their Biological Significance

For determining ESCA-specific DEGs, we intersected the DEGs and the ESCA-specific genes in TCGA and GSE38129 cohorts. As a result, 91 ESCA-specific DEGs were finally identified (Figure 4(a) and Table 1). Their biological significance was further evaluated through GO and KEGG enrichment analysis. In Figure 4(b) and Table 2, we noted that the ESCA-specific DEGs were remarkably linked to cell cycle progression like chromosome segregation, nuclear division, mitotic nuclear division, and sister chromatid segregation. Additionally, the ESCA-specific DEGs were in relation to ESCA progression-relevant KEGG pathways like cell cycle, DNA replication, cellular senescence, base excision repair, mismatch repair, p53 signaling pathway, homologous recombination, nucleotide excision repair, and apoptosis (Figure 4(c) and Table 3).
Figure 4

Identification of ESCA-specific DEGs and their biological significance. (a) Venn diagram depicts the intersection of the DEGs and the ESCA-specific genes in TCGA and GSE38129 cohorts. (b) GO enrichment results of the ESCA-specific DEGs. The first 10 enrichment results of BP, CC, and MF categories are separately displayed. (c) KEGG pathway enrichment results of the ESCA-specific DEGs.

Table 1

The list of ESCA-specific DEGs.

ESCA-specific DEGs
CBX3FOXM1DBF4LMNB2BLM
KAT2BDLGAP5MCM10ASPMC1orf112
KIF4APCNANUSAP1UBE2SAURKB
AURKAKIF18BCDT1POLE2FBXO5
CKS1BCENPEBUB1BOIP5MYBL2
ECT2CDC6BIRC5CCNA2CHEK1
HOXB7CEP55NCAPHCCNB1TFRC
KIF14MCM6DTLMKI67UBE2C
TRIP13PRC1NCAPGDEPDC1CDKN3
CITED2CDCA3FAM189A2LMNB1KIF20 A
MCM2FEN1HJURPNDC1GINS2
WDHD1NUDT1ORC6KIF11CENPF
RAD51AP1RNASEH2AHMMRGINS1STMN1
MAD2L1PBKECHDC2RUVBL1EXO1
NDC80CKS2FYCO1CENPMDNMT1
NEK2KIF18ADDX39AKNTC1CDK1
KIF2CSECISBP2LMELKCDC20TPX2
KIF23RAD54LSHCBP1TK1TOP2A
SPC25
Table 2

The detailed information of GO enrichment results of ESCA-specific genes.

IDDescriptionGene ratioBgRatio P valueAdjusted p Q valueCount
GO:0000280Nuclear division39/90436/188624.56 E − 406.17 E − 374.56 E − 3739
GO:0140014Mitotic nuclear division34/90296/188622.13 E − 381.44 E − 351.07 E − 3534
GO:0048285Organelle fission39/90486/188623.31 E − 381.49 E − 351.10 E − 3539
GO:0007059Chromosome segregation35/90337/188625.24 E − 381.77 E − 351.31 E − 3535
GO:0000819Sister chromatid segregation28/90199/188625.58 E − 341.51 E − 311.12 E − 3128
GO:0098813Nuclear chromosome segregation30/90273/188624.07 E − 339.16 E − 316.77 E − 3130
GO:0000070Mitotic sister chromatid segregation26/90164/188626.09 E − 331.18 E − 308.70 E − 3126
GO:0051983Regulation of chromosome segregation17/9089/188624.02 E − 236.80 E − 215.02 E − 2117
GO:0030071Regulation of mitotic metaphase/anaphase transition15/9059/188621.50 E − 222.25 E − 201.66 E − 2015
GO:0007091Metaphase/anaphase transition of mitotic cell cycle15/9061/188622.63 E − 223.55 E − 202.63 E − 2015
GO:0005819Spindle30/90381/195203.83 E − 294.34 E − 272.77 E − 2730
GO:0098687Chromosomal region29/90345/195205.72 E − 294.34 E − 272.77 E − 2729
GO:0000775Chromosome, centromeric region21/90196/195203.40 E − 231.72 E − 211.10 E − 2121
GO:0000793Condensed chromosome21/90217/195203.00 E − 221.14 E − 207.26 E − 2121
GO:0000779Condensed chromosome, centromeric region17/90117/195203.24 E − 219.84 E − 206.27 E − 2017
GO:0000776Kinetochore17/90137/195205.31 E − 201.35 E − 188.57 E − 1917
GO:0000777Condensed chromosome kinetochore14/90106/195205.13 E − 171.11 E − 157.10 E − 1614
GO:0000228Nuclear chromosome18/90250/195207.75 E − 171.47 E − 159.38 E − 1618
GO:0072686Mitotic spindle15/90157/195205.26 E − 168.89 E − 155.67 E − 1515
GO:0030496Midbody15/90193/195201.17 E − 141.77 E − 131.13 E − 1315
GO:0008017Microtubule binding15/89269/183373.07 E − 125.56 E − 103.91 E − 1015
GO:0015631Tubulin binding16/89368/183372.33 E − 112.11 E − 091.49 E − 0916
GO:0003777Microtubule motor activity9/8969/183374.41 E − 112.66 E − 091.87 E − 099
GO:0016887ATPase activity16/89478/183371.09 E − 094.92 E − 083.46 E − 0816
GO:0140097Catalytic activity, acting on DNA11/89204/183374.22 E − 091.53 E − 071.08 E − 0711
GO:0003774Motor activity9/89129/183371.26 E − 083.67 E − 072.59 E − 079
GO:0008574ATP-dependent microtubule motor activity, plus-end-directed5/8917/183371.42 E − 083.67 E − 072.59 E − 075
GO:1990939ATP-dependent microtubule motor activity5/8935/183376.96 E − 071.57 E − 051.11 E − 055
GO:0035173Histone kinase activity4/8916/183379.02 E − 071.81 E − 051.28 E − 054
GO:0003688DNA replication origin binding4/8923/183374.28 E − 067.06 E − 054.97 E − 054
Table 3

The detailed information of KEGG pathways enriched by ESCA-specific genes.

IDDescriptionGene ratioBgRatio P valueAdjusted p Q valueCount
hsa04110Cell cycle13/43126/81044.08 E − 142.08 E − 121.59 E − 1213
hsa03030DNA replication6/4336/81042.69 E − 086.85 E − 075.23 E − 076
hsa04114Oocyte meiosis6/43131/81045.94 E − 050.0010090.0007716
hsa04218Cellular senescence6/43156/81040.0001560.0019230.0014686
hsa04914Progesterone-mediated oocyte maturation5/43102/81040.0001890.0019230.0014685
hsa03410Base excision repair3/4333/81040.000680.0057760.0044113
hsa05166Human T cell leukemia virus 1 infection6/43222/81040.0010310.0075130.0057376
hsa05203Viral carcinogenesis5/43204/81040.0042570.027140.0207265
hsa03430Mismatch repair2/4323/81040.0064830.0340970.0260392
hsa04115p53 signaling pathway3/4373/81040.0066860.0340970.0260393
hsa03440Homologous recombination2/4341/81040.0197840.0917260.0700492
hsa03420Nucleotide excision repair2/4347/81040.0255650.1086510.0829742
hsa04210Apoptosis3/43136/81040.0350270.1374120.1049383
hsa04120Ubiquitin mediated proteolysis3/43142/81040.0390490.142250.1086333

3.5. Establishment of a PPI Network and Discovery of ESCA-Specific Hub Genes

For uncovering the interactions of the ESCA-specific DEGs, we conducted a PPI network in accordance with the STRING online tool. As shown in Figure 5(a), there were close interactions of proteins derived from the ESCA-specific DEGs. Utilizing CytoHubba plugin, we further determined the 10 ESCA-specific hub genes among them, containing TOP2A (score = 4.45 E + 23), ASPM (score = 4.45 E + 23), CDK1 (score = 4.45 E + 23), CDC20 (score = 4.45 E + 23), CCNA2 (score = 4.45 E + 23), KIF20 A (score = 4.45 E + 23), KIF11 (score = 4.45 E + 23), DLGAP5 (score = 4.45 E + 23), TPX2 (score = 4.45 E + 23), and BUB1B (score = 4.45 E + 23; Figure 5(b)). These ESCA-specific hub genes might exert crucial roles in ESCA progression.
Figure 5

Establishment of a PPI network and discovery of ESCA-specific hub genes. (a) The PPI network of ESCA-specific DEGs through the STRING online tool. (b) Discovery of the ESCA-specific hub genes utilizing CytoHubba plugin. The ten hub genes are marked in orange.

3.6. Association of the ESCA-Specific Hub Genes with Pathological Staging of ESCA

Further analysis was carried out for evaluating the associations of the ten ESCA-specific hub genes with diverse pathological staging of ESCA patients in TCGA cohort. Our results demonstrated that ASPM, BUB1B, CCNA2, CDC20, CDK1, DLGAP5, KIF11, KIF20 A, TOP2A, and TPX2 presented the different expression in diverse pathological stages across ESCA patients (Figures 6(a)–6(j)). This indicated that the 10 ESCA-specific hub genes were remarkably linked to pathological staging of ESCA.
Figure 6

Association of the ESCA-specific hub genes with pathological staging of ESCA patients. (a–j) Box plots depict the difference in (a) ASPM, (b) BUB1B, (c) CCNA2, (d) CDC20, (e) CDK1, (f) DLGAP5, (g) KIF11, (h) KIF20A, (i) TOP2A, and (j) TPX2 among diverse pathological staging of ESCA patients in TCGA cohort.

3.7. Association of the ESCA-Specific Hub Genes with ESCA Patients' Prognosis

In accordance with the optimal cutoff value of the expression of the ESCA-specific hub genes, we stratified ESCA patients in TCGA cohort into high and expression groups of ASPM, BUB1B, CCNA2, CDC20, CDK1, DLGAP5, KIF11, KIF20A, TOP2A, and TPX2 (Figures 7(a)–7(j)). Among them, we noted that ESCA patients in the high expression of the KIF11 group presented more undesirable overall survival outcome in comparison to those in the low expression of the KIF11 group.
Figure 7

Association of the ESCA-specific hub genes with ESCA patients' prognosis in TCGA cohort. (a–j) Kaplan–Meier curves display the difference in overall survival between high and low expression of (a) ASPM, (b) BUB1B, (c) CCNA2, (d) CDC20, (e) CDK1, (f) DLGAP5, (g) KIF11, (h) KIF20A, (i) TOP2A, and (j) TPX2 groups. Survival difference between groups is determined with the log-rank test.

4. Discussion

High-throughput sequencing technologies have improved our understanding about the heterogeneity and molecular basis underlying ESCA. At present, available biomarkers for prediction of ESCA patients' survival outcome remain nonsufficiently sensitive and specific. Hence, this study was conducted for discovering novel biomarkers for efficiently predicting ESCA patients' prognosis through the WGCNA algorithm, eventually lowering patients' morbidity and mortality. Combining the DEGs and ESCA-specific genes in TCGA and GSE38129 cohorts, we determined 91 ESCA-specific DEGs. Our functional enrichment analyses uncovered that the ESCA-specific DEGs were remarkably linked to cell cycle progression and carcinogenic pathways like the p53 signaling pathway, cellular senescence, and apoptosis. This indicated that the ESCA-specific DEGs exerted crucial roles in ESCA progression. Additionally, there were prominent interactions between proteins derived from the ESCA-specific DEGs in accordance with the PPI network. Among them, the 10 ESCA-specific hub genes were finally determined, containing ASPM, BUB1B, CCNA2, CDC20, CDK1, DLGAP5, KIF11, KIF20A, TOP2A, and TPX2. The tumorigenic roles of ASPM have been proposed in diverse cancer types. For instance, ASPM triggers prostate carcinoma stemness and progression through enhancing the Wnt-Dvl-3-beta-catenin pathway [20]. It is predictive of undesirable prognosis and modulates cellular proliferation in bladder carcinoma [21]. Its upregulation accelerates glioblastoma growth through modulating G1 restriction point progression as well as the Wnt-beta-catenin pathway [22]. Aberrantly expressed ASPM regulated by transcriptional factor FoxM1 triggers the malignant progression of gliomas [23]. Additionally, it is linked to poor survival outcome as well as induces carcinogenesis in diffuse large B cell lymphoma [24]. Abnormally expressed ASPM induces the progression of lung squamous cell carcinoma through modulating CDK4 [25]. Increasing evidences demonstrate the crucial role of ASPM in cancer progression. For example, BUB1B accelerates prostate carcinoma proliferation through transcriptionally modulating MELK [26]. It triggers hepatocellular carcinoma development through activating mTORC1 signaling [27]. It can facilitate extrahepatic cholangiocarcinoma development through JNK/c-Jun signaling [28]. Moreover, it participates in the tumorigenicity and radioresistance of glioblastoma [29]. For CCNA2, it can be suppressed by miR-219-5p, thereby affecting cellular proliferation and cell cycle progression in ESCA [30]. A previous study has proposed that CDC20 modulates E2F1 degradation and thymidylate synthase expression, thereby triggering ESCA chemoresistance [31]. Furthermore, CDK1 has been considered as an underlying diagnostic and cancer progression biomarker as well as a drug target for ESCA [32]. Previous bioinformatics and experimental evidences have demonstrated the tumorigenic role of DLGAP5 in ESCA [33]. KIF11 is essential for spheroid formation of ESCA cells [34]. ScRNA-seq and qPCR analysis uncovered that KIF20A possesses the potential to diagnose and predict ESCA patients' prognosis [35]. For TOP2A, experimental data demonstrate that it can affect the resistance of ESCA cells to paclitaxel [36]. Targeting TPX2 relieves ESCA progression through weakening tumor growth and invasion [37, 38]. Additionally, its upregulation is mediated by LINC00337 and triggers autophagy and resistance to cisplatin in ESCA cells [39]. On the basis of previously published literature and our findings, ASPM, BUB1B, CCNA2, CDC20, CDK1, DLGAP5, KIF11, KIF20A, TOP2A, and TPX2 play crucial roles in ESCA progression. Currently, approach of predicting ESCA patients' prognosis primarily depends on the conventional TNM staging system. Although conventional TNM staging is crucial for diagnosis and treatment interventions, it cannot roundly uncover the intrinsic biological processes and pathological development due to the high heterogeneity in tumor microenvironment and individual discrepancy. Our results demonstrated that the 10 ESCA-specific hub genes (ASPM, BUB1B, CCNA2, CDC20, CDK1, DLGAP5, KIF11, KIF20A, TOP2A, and TPX2) presented the remarkable associations with pathological staging, indicating that their roles in ESCA progression. Among them, KIF11 upregulation was indicative of an unfavorable survival outcome of ESCA patients, indicative of the potential of KIF11 as a prognostic indicator of ESCA. However, there are certain drawbacks in our study. First, the influence of expression alteration of the ESCA-specific hub genes upon patients' prognosis remains to be explored. Hence, in our future, the interactions of the ESCA-specific hub genes with patients' prognosis will be monitored and verified in the large-scale clinical data. Additionally, it is of importance to consider statistical bias because the sample size is relatively small. Moreover, in-depth investigation will be presented for validating the biological significance of the ESCA-specific hub genes through in vitro and in vivo experiments.

5. Conclusion

Overall, this study determined the 10 ESCA-specific hub genes as novel markers for ESCA with the WGCNA algorithm based on distinct datasets, which offered promising targets for ESCA precision medicine. Nevertheless, in-depth exploration is required for validating the biological function of the specific hub genes in large‐scale clinical cohorts.
  39 in total

1.  clusterProfiler: an R package for comparing biological themes among gene clusters.

Authors:  Guangchuang Yu; Li-Gen Wang; Yanyan Han; Qing-Yu He
Journal:  OMICS       Date:  2012-03-28

2.  A general framework for weighted gene co-expression network analysis.

Authors:  Bin Zhang; Steve Horvath
Journal:  Stat Appl Genet Mol Biol       Date:  2005-08-12

3.  The resistance of esophageal cancer cells to paclitaxel can be reduced by the knockdown of long noncoding RNA DDX11-AS1 through TAF1/TOP2A inhibition.

Authors:  Shuyao Zhang; Hong Jiang; Zhe Xu; Yi Jiang; Yuqi She; Xiaoting Huang; Shanna Feng; Wanying Chen; Shuang Chen; Yun Chen; Guodong Qiu; Shilong Zhong
Journal:  Am J Cancer Res       Date:  2019-10-01       Impact factor: 6.166

4.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

5.  The FOXM1/BUB1B signaling pathway is essential for the tumorigenicity and radioresistance of glioblastoma.

Authors:  Qing Ma; Yanmei Liu; Liang Shang; Jiao Yu; Qiumin Qu
Journal:  Oncol Rep       Date:  2017-10-12       Impact factor: 3.906

6.  Identification of cancer stem cell characteristics in liver hepatocellular carcinoma by WGCNA analysis of transcriptome stemness index.

Authors:  Kun-Hao Bai; Si-Yuan He; Ling-Ling Shu; Wei-Da Wang; Shi-Yong Lin; Qian-Yi Zhang; Liang Li; Lei Cheng; Yu-Jun Dai
Journal:  Cancer Med       Date:  2020-04-20       Impact factor: 4.452

7.  MiR-219-5p suppresses cell proliferation and cell cycle progression in esophageal squamous cell carcinoma by targeting CCNA2.

Authors:  Qiang Ma
Journal:  Cell Mol Biol Lett       Date:  2019-02-05       Impact factor: 5.787

8.  miR-92b-3p Functions As A Key Gene In Esophageal Squamous Cell Cancer As Determined By Co-Expression Analysis.

Authors:  Wanpeng Wang; Sengwang Fu; Xiaolu Lin; Jinhui Zheng; Juan Pu; Yun Gu; Weijun Deng; Yanyan Liu; Zhongxiang He; Wei Liang; Chengshi Wang
Journal:  Onco Targets Ther       Date:  2019-10-14       Impact factor: 4.147

9.  BUB1B promotes extrahepatic cholangiocarcinoma progression via JNK/c-Jun pathways.

Authors:  Chen Yu Jiao; Qin Chao Feng; Chang Xian Li; Dong Wang; Sheng Han; Yao Dong Zhang; Wang Jie Jiang; Jiang Chang; Xuehao Wang; Xiang Cheng Li
Journal:  Cell Death Dis       Date:  2021-01-11       Impact factor: 8.469

10.  ASPM Predicts Poor Clinical Outcome and Promotes Tumorigenesis for Diffuse Large B-cell Lymphoma.

Authors:  Jingjing Wu; Zhengmei He; Yaning Zhu; Chao Jiang; Yuan Deng; Bin Wei
Journal:  Curr Cancer Drug Targets       Date:  2021       Impact factor: 3.428

View more
  1 in total

1.  Esophageal squamous cell carcinoma: Integrated bioinformatics analysis for differential gene expression with identification of hub genes and lncRNA.

Authors:  F M Yasir Hasib
Journal:  Biochem Biophys Rep       Date:  2022-04-16
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.