| Literature DB >> 35071597 |
Md Rakibul Islam1, Mohammad Khursheed Alam2,3,4, Bikash Kumar Paul1,5, Deepika Koundal6, Atef Zaguia7, Kawsar Ahmed5,8.
Abstract
Esophageal carcinoma (EsC) is a member of the cancer group that occurs in the esophagus; globally, it is known as one of the fatal malignancies. In this study, we used gene expression analysis to identify molecular biomarkers to propose therapeutic targets for the development of novel drugs. We consider EsC associated four different microarray datasets from the gene expression omnibus database. Statistical analysis is performed using R language and identified a total of 1083 differentially expressed genes (DEGs) in which 380 are overexpressed and 703 are underexpressed. The functional study is performed with the identified DEGs to screen significant Gene Ontology (GO) terms and associated pathways using the Database for Annotation, Visualization, and Integrated Discovery repository (DAVID). The analysis revealed that the overexpressed DEGs are principally connected with the protein export, axon guidance pathway, and the downexpressed DEGs are principally connected with the L13a-mediated translational silencing of ceruloplasmin expression, formation of a pool of free 40S subunits pathway. The STRING database used to collect protein-protein interaction (PPI) network information and visualize it with the Cytoscape software. We found 10 hub genes from the PPI network considering three methods in which the interleukin 6 (IL6) gene is the top in all methods. From the PPI, we found that identified clusters are associated with the complex I biogenesis, ubiquitination and proteasome degradation, signaling by interleukins, and Notch-HLH transcription pathway. The identified biomarkers and pathways may play an important role in the future for developing drugs for the EsC.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35071597 PMCID: PMC8769846 DOI: 10.1155/2022/5908402
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Flow diagram of this study. This diagram explains we start our first step of this study from the GEO database; from the database, we select 4 datasets for statistical analysis and identify DEGs maintaining our cut-off filtration. After that, we categorize the identified DEGs according to their expression (upregulated and downregulated). After categorization, we implement function analysis and protein-protein interaction analysis, which are the two most key analyses of this study.
Dataset analysis details (a) before filtration and (b) after filtration.
| Accession number | Amount of sample | Upregulated DEGs | Downregulated DEGs | Total DEGs |
|---|---|---|---|---|
| (A) Before logFC filtration | ||||
| GSE93756 | 4 samples | 6197 | 13905 | 20102 |
| GSE94012 | 6 samples | 4464 | 15621 | 20085 |
| GSE104958 | 46 samples | 22238 | 11524 | 33762 |
| GSE143822 | 8 samples | 14552 | 17660 | 32212 |
| Overlapped | 1003 | 3818 | 4821 | |
| (B) After logFC filtration | ||||
| GSE93756 | 4 samples | 1094 | 4708 | 5802 |
| GSE94012 | 6 samples | 1520 | 3873 | 5393 |
| GSE104958 | 46 samples | 1128 | 4617 | 5945 |
| GSE143822 | 8 samples | 841 | 6183 | 7024 |
| Overlapped | 380 | 703 | 1083 | |
Top 10 (a) upregulated and (b) downregulated DEG name and LogFC value.
| DEG symbol | LogFC |
|---|---|
| (A) Upregulated DEGs | |
| AFG3L2 | 8.456189 |
| CAMKK2 | 8.268455 |
| EIF4H | 6.039133 |
| SLC6A19 | 5.983653 |
| OR2L3 | 5.71944 |
| FUNDC2P2 | 5.401373 |
| KRT6B | 5.195899 |
| WRAP53 | 5.106136 |
| OR56A3 | 5.054809 |
| LINC01465 | 5.031665 |
| (B) Downregulated DEGs | |
| COPS5 | -11.2045 |
| C3orf59 | -9.20752 |
| NOX6 | -8.2488 |
| RAB3B | -7.23927 |
| LINC01279 | -6.98733 |
| NEDD5 | -6.46107 |
| USP26 | -6.41274 |
| TTLL9 | -6.22158 |
| NKD1 | -6.09627 |
| FCAR | -6.01488 |
Gene Ontology analysis of upregulated DEGs using DAVID functional tools.
| Category | GO ID | GO term | Count | % |
|
|---|---|---|---|---|---|
| BP | GO:0045892 | Negative regulation of transcription, DNA-templated | 22 | 0.035989 | 0.001478 |
| BP | GO:0046513 | Ceramide biosynthetic process | 5 | 0.008179 | 0.001541 |
| BP | GO:0016567 | Protein ubiquitination | 17 | 0.02781 | 0.00305 |
| BP | GO:0051726 | Regulation of cell cycle | 9 | 0.014723 | 0.003954 |
| BP | GO:0048013 | Ephrin receptor signaling pathway | 7 | 0.011451 | 0.008317 |
| BP | GO:0016477 | Cell migration | 10 | 0.016359 | 0.008953 |
| BP | GO:0000045 | Autophagosome assembly | 5 | 0.008179 | 0.009555 |
| BP | GO:0045893 | Positive regulation of transcription, DNA-templated | 20 | 0.032717 | 0.009702 |
| BP | GO:0055007 | Cardiac muscle cell differentiation | 4 | 0.006543 | 0.017191 |
| BP | GO:0051865 | Protein autoubiquitination | 5 | 0.008179 | 0.018845 |
| CC | GO:0005789 | Endoplasmic reticulum membrane | 37 | 0.060527 | 1.84E-05 |
| CC | GO:0005654 | Nucleoplasm | 83 | 0.135776 | 8.75E-05 |
| CC | GO:0005634 | Nucleus | 136 | 0.222477 | 7.26E-04 |
| CC | GO:0005802 | Trans-Golgi network | 10 | 0.016359 | 0.001509 |
| CC | GO:0031965 | Nuclear membrane | 13 | 0.021266 | 0.002026 |
| CC | GO:0005622 | Intracellular | 39 | 0.063798 | 0.013664 |
| CC | GO:0005737 | Cytoplasm | 123 | 0.201211 | 0.015049 |
| CC | GO:0005829 | Cytosol | 80 | 0.130869 | 0.036397 |
| CC | GO:0005671 | Ada2/Gcn5/Ada3 transcription activator complex | 3 | 0.004908 | 0.038709 |
| CC | GO:0000139 | Golgi membrane | 19 | 0.031081 | 0.046953 |
| MF | GO:0005515 | Protein binding | 237 | 0.387698 | 1.25E-08 |
| MF | GO:0003677 | DNA binding | 56 | 0.091608 | 5.22E-04 |
| MF | GO:0003684 | Damaged DNA binding | 7 | 0.011451 | 0.002026 |
| MF | GO:0004842 | Ubiquitin-protein transferase activity | 16 | 0.026174 | 0.004151 |
| MF | GO:0061630 | Ubiquitin protein ligase activity | 11 | 0.017994 | 0.00625 |
| MF | GO:0005070 | SH3/SH2 adaptor activity | 6 | 0.009815 | 0.006276 |
| MF | GO:0008565 | Protein transporter activity | 6 | 0.009815 | 0.017551 |
| MF | GO:0017137 | Rab GTPase binding | 8 | 0.013087 | 0.022939 |
| MF | GO:0003676 | Nucleic acid binding | 31 | 0.050712 | 0.02606 |
| MF | GO:0032794 | GTPase activating protein binding | 3 | 0.004908 | 0.03379 |
∗GO: Gene Ontology; ∗BP: biological process; ∗CC: cellular component; ∗MF: molecular function.
Figure 2Gene Ontology analysis of upregulated DEGs using DAVID functional tools. Different colors of dots mean different categories of GO terms. The green-colored dot indicates biological process, the blue-colored dot indicates cellular component, and the red-colored dot defines molecular functions. The x-axis indicates the |Log (P value)| of associated GO terms. y-axis indicates the GO term name. The size of a dot represents gene count.
Gene Ontology analysis of downregulated DEGs using DAVID functional tools.
| Category | GO ID | Term | Count | % |
|
|---|---|---|---|---|---|
| BP | GO:0006413 | Translational initiation | 37 | 0.014347 | 4.26E-09 |
| BP | GO:0006614 | SRP-dependent cotranslational protein targeting to membrane | 27 | 0.010469 | 1.96E-07 |
| BP | GO:0006412 | Translation | 50 | 0.019388 | 3.60E-07 |
| BP | GO:0019083 | Viral transcription | 29 | 0.011245 | 6.66E-07 |
| BP | GO:0000184 | Nuclear-transcribed mRNA catabolic process, nonsense-mediated decay | 30 | 0.011633 | 7.51E-07 |
| BP | GO:0048245 | Eosinophil chemotaxis | 8 | 0.003102 | 1.64E-06 |
| BP | GO:0007155 | Cell adhesion | 69 | 0.026755 | 4.88E-05 |
| BP | GO:0006364 | rRNA processing | 38 | 0.014735 | 1.20E-04 |
| BP | GO:0002548 | Monocyte chemotaxis | 13 | 0.005041 | 2.75E-04 |
| BP | GO:0007156 | Homophilic cell adhesion via plasma membrane adhesion molecules | 29 | 0.011245 | 5.11E-04 |
| Category | Term | Count | % |
| |
| CC | GO:0031012 | Extracellular matrix | 55 | 0.021326 | 3.51E-07 |
| CC | GO:0005840 | Ribosome | 37 | 0.014347 | 4.78E-07 |
| CC | GO:0022625 | Cytosolic large ribosomal subunit | 20 | 0.007755 | 5.24E-06 |
| CC | GO:0030424 | Axon | 40 | 0.01551 | 3.48E-05 |
| CC | GO:0005578 | Proteinaceous extracellular matrix | 45 | 0.017449 | 6.17E-05 |
| CC | GO:0005788 | Endoplasmic reticulum lumen | 35 | 0.013571 | 9.14E-05 |
| CC | GO:0005747 | Mitochondrial respiratory chain complex I | 14 | 0.005429 | 2.80E-04 |
| CC | GO:0022627 | Cytosolic small ribosomal subunit | 13 | 0.005041 | 8.53E-04 |
| CC | GO:0098793 | Presynapse | 15 | 0.005816 | 0.001358 |
| CC | GO:0015935 | Small ribosomal subunit | 9 | 0.00349 | 0.00193 |
| Category | Term | Count | % |
| |
| MF | GO:0003735 | Structural constituent of ribosome | 49 | 0.019 | 1.85E-08 |
| MF | GO:0008137 | NADH dehydrogenase (ubiquinone) activity | 13 | 0.005041 | 0.001124 |
| MF | GO:0044822 | Poly(A) RNA binding | 134 | 0.051959 | 0.001979 |
| MF | GO:0008237 | Metallopeptidase activity | 17 | 0.006592 | 0.002751 |
| MF | GO:0005201 | Extracellular matrix structural constituent | 15 | 0.005816 | 0.002881 |
| MF | GO:0003723 | RNA binding | 70 | 0.027143 | 0.004922 |
| MF | GO:0047555 | 3′,5′-cyclic-GMP phosphodiesterase activity | 6 | 0.002327 | 0.006619 |
| MF | GO:0042056 | Chemoattractant activity | 8 | 0.003102 | 0.009717 |
| MF | GO:0001077 | Transcriptional activator activity, RNA polymerase II core promoter proximal region sequence-specific binding | 34 | 0.013184 | 0.01087 |
| MF | GO:0008009 | Chemokine activity | 11 | 0.004265 | 0.012935 |
Figure 3Gene Ontology analysis of downregulated DEGs using DAVID functional tools. Different colors of dots mean different categories of GO terms. The green-colored dot indicates biological process, the blue-colored dot indicates cellular component, and the red-colored dot defines molecular functions. The x-axis indicates the |Log (P value)| of associated GO terms. y-axis indicates the GO term name. The size of a dot represents gene count.
Pathway enrichment analysis of (a) upregulated and (b) downregulated DEGs using DAVID functional tools.
| Pathway term | Benjamini |
| Source |
|---|---|---|---|
| (A) Upregulated | |||
| Protein export | 0.169923 | 9.40E-04 | KEGG |
| Axon guidance | 0.284819 | 0.00338 | KEGG |
| RHO GTPases Activate Formins | 0.959243 | 0.007813 | REACTOME |
| HATs acetylate histones | 0.882682 | 0.010449 | REACTOME |
| Sphingolipid metabolism | 0.583473 | 0.013182 | KEGG |
| Pathogenic Escherichia coli infection | 0.580496 | 0.017396 | KEGG |
| Golgi associated vesicle biogenesis | 0.975818 | 0.026998 | REACTOME |
| ErbB signaling pathway | 0.671562 | 0.027725 | KEGG |
| Sphingolipid signaling pathway | 0.637004 | 0.030241 | KEGG |
| XBP1(S) activates chaperone genes | 0.956916 | 0.030359 | REACTOME |
| Lysosome | 0.593513 | 0.031324 | KEGG |
| The information-processing pathway at the IFN-beta enhancer | 0.978374 | 0.034876 | BIOCARTA |
| Activation of RAC1 | 0.961151 | 0.039023 | REACTOME |
| Signaling of hepatocyte growth factor receptor | 0.890962 | 0.040208 | BIOCARTA |
| Epithelial cell signaling in helicobacter pylori infection | 0.654812 | 0.042066 | KEGG |
| (B) Downregulated | |||
| L13a-mediated translational silencing of ceruloplasmin expression | 3.71E-07 | 4.17E-10 | REACTOME |
| Formation of a pool of free 40S subunits | 4.51E-07 | 5.06E-10 | REACTOME |
| GTP hydrolysis and joining of the 60S ribosomal subunit | 4.84E-07 | 5.43E-10 | REACTOME |
| Ribosome | 3.48E-07 | 1.26E-09 | KEGG |
| Peptide chain elongation | 4.55E-06 | 5.11E-09 | REACTOME |
| Selenocysteine synthesis | 1.04E-05 | 1.17E-08 | REACTOME |
| Eukaryotic translation termination | 1.37E-05 | 1.53E-08 | REACTOME |
| Viral mRNA translation | 1.99E-05 | 2.24E-08 | REACTOME |
| Nonsense mediated decay (NMD) independent of the exon junction complex (EJC) | 2.30E-05 | 2.58E-08 | REACTOME |
| SRP-dependent cotranslational protein targeting to membrane | 3.08E-04 | 3.45E-07 | REACTOME |
| Nonsense mediated decay (NMD) enhanced by the exon junction complex (EJC) | 3.78E-04 | 4.24E-07 | REACTOME |
| Formation of the ternary complex, and subsequently, the 43S complex | 0.008265 | 9.31E-06 | REACTOME |
| Ribosomal scanning and start codon recognition | 0.012429 | 1.40E-05 | REACTOME |
| Translation initiation complex formation | 0.012429 | 1.40E-05 | REACTOME |
| Chemokine | 0.001886 | 3.85E-05 | BBID |
Figure 4Bar plot diagram to demonstrate pathway analysis outcomes of upregulated DEGs. Different color of bars indicates the different database name. The x-axis indicates the value of |log10 (P value)|, and y-axis indicates the pathway term name.
Figure 5Bar plot diagram to demonstrate pathway analysis outcomes of downregulated DEGs. Different color of bars indicates the different database name. The x-axis indicates the value of |log10 (P value)|, and y-axis indicates the pathway term name.
Figure 6PPI network using identified DEGs. Nodes represent DEGs, and edge represents the connection between DEGs. The network has 646 nodes and 2055 connections. Green nodes represent upregulated DEGs, and red nodes represent downregulated DEGs. Eclipse-shaped nodes indicate the hub genes of the network. Hub genes are explored using 3 combined methods.
Figure 7Bar plot diagram to represent the values of degree, MNC, and bottleneck for specific hub genes. The red bar indicates degree value, the blue bar indicates MNC value, and the black bar indicates bottleneck value. The x-axis represents the gene name, and the y-axis represents numerical values of the corresponding method.
Rank of 10 hub genes based on degree, MNC, and bottleneck methods.
| Gene name | Rank degree | Rank MNC | Rank bottleneck |
|---|---|---|---|
| IL6 | 1 | 1 | 1 |
| CDH1 | 2 | 3 | 2 |
| NOTCH1 | 3 | 2 | 3 |
| ATP5C1 | 4 | 4 | 5 |
| BPTF | 5 | 10 | 4 |
| MRPS11 | 6 | 5 | 6 |
| MRPS15 | 7 | 6 | 8 |
| MRPL1 | 8 | 7 | 9 |
| NDUFB7 | 9 | 8 | 7 |
| NDUFS5 | 10 | 9 | 10 |
Associated pathways of significant 4 clusters.
| Pathway terms | Count | % |
|
|---|---|---|---|
| (A) Cluster 1 | |||
| Complex I biogenesis | 15 | 45.45455 | 6.48E-24 |
| Mitochondrial translation termination | 15 | 45.45455 | 6.35E-21 |
| Mitochondrial translation initiation | 15 | 45.45455 | 6.35E-21 |
| Mitochondrial translation elongation | 15 | 45.45455 | 6.35E-21 |
| Respiratory electron transport | 13 | 39.39394 | 3.91E-17 |
| (B) Cluster 2 | |||
| Ubiquitination and proteasome degradation | 12 | 0.593178 | 5.78E-17 |
| (C) Cluster 3 | |||
| Interferon alpha/beta signaling | 9 | 0.3159 | 1.32E-13 |
| Signaling by interleukins | 3 | 0.1053 | 0.003542 |
| ISG15 antiviral mechanism | 3 | 0.1053 | 0.008028 |
| (D) Cluster 4 | |||
| B-WICH complex positively regulates rRNA expression | 3 | 0.194805 | 9.95E-05 |
| Notch-HLH transcription pathway | 2 | 0.12987 | 0.002863 |
Figure 8Top 4 clusters and their associated pathways for (a) cluster 1, (b) cluster 2, (c) cluster 3, and (d) cluster 4. Hexagonal-shaped nodes present pathway name, and eclipse-shaped presents the gene name.