Literature DB >> 32058995

Identification of Hub Genes and Pathways in Gastric Adenocarcinoma Based on Bioinformatics Analysis.

Jieping Qiu1, Mengyu Sun1, Yaoqun Wang1, Bo Chen2.   

Abstract

BACKGROUND Gastric adenocarcinoma accounts for 95% of all gastric malignant tumors. The purpose of this research was to identify differentially expressed genes (DEGs) of gastric adenocarcinoma by use of bioinformatics methods. MATERIAL AND METHODS The gene microarray datasets of GSE103236, GSE79973, and GSE29998 were imported from the GEO database, containing 70 gastric adenocarcinoma samples and 68 matched normal samples. Gene ontology (GO) and KEGG analysis were applied to screened DEGs; Cytoscape software was used for constructing protein-protein interaction (PPI) networks and to perform module analysis of the DEGs. UALCAN was used for prognostic analysis. RESULTS We identified 2909 upregulated DEGs (uDEGs) and 7106 downregulated DEGs (dDEGs) of gastric adenocarcinoma. The GO analysis showed uDEGs were enriched in skeletal system development, cell adhesion, and biological adhesion. KEGG pathway analysis showed uDEGs were enriched in ECM-receptor interaction, focal adhesion, and Cytokine-cytokine receptor interaction. The top 10 hub genes - COL1A1, COL3A1, COL1A2, BGN, COL5A2, THBS2, TIMP1, SPP1, PDGFRB, and COL4A1 - were distinguished from the PPI network. These 10 hub genes were shown to be significantly upregulated in gastric adenocarcinoma tissues in GEPIA. Prognostic analysis of the 10 hub genes via UALCAN showed that the upregulated expression of COL3A1, COL1A2, BGN, and THBS2 significantly reduced the survival time of gastric adenocarcinoma patients. Module analysis revealed that gastric adenocarcinoma was related to 2 pathways: including focal adhesion signaling and ECM-receptor interaction. CONCLUSIONS This research distinguished hub genes and relevant signal pathways, which contributes to our understanding of the molecular mechanisms, and could be used as diagnostic indicators and therapeutic biomarkers for gastric adenocarcinoma.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32058995      PMCID: PMC7034404          DOI: 10.12659/MSM.920261

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


Background

Gastric cancer (GC) is a common malignant disease with a mortality rate of about 10% [1], which does a great harm to global health. Gastric adenocarcinoma (GAC) is the most common pathological type of gastric cancer, accounting for 95% of gastric malignant tumors [2], and it is characterized by easy invasion and metastasis [3]. Most GC patients are diagnosed in advanced stages, which is the major reason for its poor prognosis [4]. Although multimodal therapy, including surgery, chemotherapy, radiotherapy, and targeted therapy, has recently improved, the 5-year overall survival rate of patients with terminal GC is still less than 20% [5], and it can be as high as 90% if GC is detected in the early stage [6]. Accordingly, the early diagnosis and treatment of GAC is crucial. Studies have shown that many biochemical molecular markers are involved in the occurrence and development of tumors and can be used for early screening of tumors. However, many markers are highly expressed in various types of tumors and do not have good specificity [7]. Therefore, it is necessary to further explore new and specific diagnostic markers of gastric adenocarcinoma as an auxiliary detection project for early diagnosis. Recently, bioinformatics has become a promising and effective tool for screening significant genetic or epigenetic variations that occur in carcinogenesis and determine the diagnosis and prognosis of cancer [8]. Various bioinformatics databases, such as the GEO database, provide opportunities for data mining for gene expression profiles of cancer. In this study, we imported 3 gastric adenocarcinoma datasets from the GEO database. We screened differentially expressed genes (DEGs) by comparing the gene expression between gastric adenocarcinoma samples and paired normal mucosa samples. Then, function annotations and signal pathway analysis of DEGs were performed using Gene ontology (GO) and KEGG signal pathway enrichment analysis in the DAVID database. Subsequently, to study the mechanism of occurrence and development of GAC at the molecular level, we used UALCAN for prognosis analysis and GEPIA for verification of the mRNA expression level, which may provide valuable insights for diagnosis, targeted drug research, and prognosis evaluation of GAC.

Material and Methods

Datasets

The Gene Expression Omnibus database (GEO, ) is a communal functional genic database including array-based and sequence-based data, and is available to users free of charge. The gene expression datasets of GSE103236 [9], GSE79973 [10], and GSE29998 [11] were acquired from the GEO database. The 3 datasets selected in this experiment all met 3 criteria: (1) samples from human gastric tissue; (2) with case-control group; and (3) sample number ≥18, and only for the pathological type of GAC. GSE103236 was based on the GPL4133 platform (Agilent-014850 Whole Human Genome Microarray 4x44K G4112F). GSE79973 was based on the GPL570 platform ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array). GSE29998 was based on the GPL6947 platform (Illumina HumanHT-12 V3.0 expression BeadChip). GSE103236 contains 19 samples, including 10 gastric adenocarcinoma samples and 9 matched normal mucosa samples. GSE79973 contains 20 samples, including 10 gastric adenocarcinoma samples and 10 matched normal mucosa samples. GSE29998 contains 99 samples, including 50 gastric adenocarcinoma samples and 49 matched normal mucosa samples.

Data processing

GEO2R () is an online tool with which different groups of samples from the GEO database can be compared to identify DEGs [12]. The data were divided into a gastric adenocarcinoma group and a normal group for further analysis by GEO2R. The benchmark adj. p<0.05 and |log2FC|>1 were determined as the cutoff values for statistical analysis of each dataset, and the intersecting parts of the 3 datasets were determined by use of the online tool Draw Venn diagram (bioinformatics. psb. ugent. be/webtools/Venn/).

Gene ontology (GO) and KEGG signal pathway analysis of DEGs

The GO () database [13] can provide functional classification for genomic data, including biological processes (BP), cellular component (CC), and molecular function (MF). GO analysis is a widely used annotating tool of genes and genic productions. The Kyoto Encyclopedia of Genes and Genomes (KEGG, ) database [14] is a networked website designed for genic function analysis, exegesis, and visualizing. The Database for Annotation, Visualization and Integrated Discovery (DAVID, ) [15] is an online tool for genic functional classification, which can be applied for gene analysis to assess the biological function of genes. In this research, GO enrichment analysis and KEGG pathway analysis were applied using the DAVID website to study the functions of DEGs. p<0.05 was set as the cutoff point for statistical significance.

Integration of protein-protein interaction (PPI) network and module analysis

The Search Tool for the Retrieval of Interacting Genes (STRING, ) [16] is a biological database designed for predicting PPI networks. The DEGs were imported to STRING to assess the interactive relationships, and a confidence score >0.9 was considered as significant. Then, we used Cytoscape [17], which biological graph visualization software that can construct comprehensive models of biologic molecular interaction. The Molecular Complex Detection (MCODE), a pluggable unit of Cytoscape, was applied for screening the modules of the PPI network. The benchmarks were determined as: degree cutoff=2, node score cutoff=0.2, k-core=4, and maximum depth=100. The KEGG signal pathway enrichment analysis was reapplied to DEGs located in the modules to study their major functions.

Expression levels and prognostic analysis of hub genes

GEPIA (Gene Expression Profiling Interactive Analysis) [18] is a well-known platform that can be used to analyze differences in the mRNA expression levels of a specific gene in specific cancers between cancerous tissues and paired normal tissues. We used GEPIA to study mRNA expression levels of hub genes in GAC and paired normal tissues. UALCAN () [19] was used to assess the prognosis of hub genes. For each gene, cancer patients were automatically separated into high-expression and low-expression groups in accordance with the expression value of RNA, and the difference p<0.05 was regarded as significant.

Results

Screening of DEGs

In the aggregate, 70 gastric adenocarcinoma samples and 68 matched normal mucosa samples from 3 datasets were analyzed. In view of the GEO2R analysis, using the adj. p<0.05 and |log2FC|>1 criteria, 2909 upregulated DEGs (uDEGs) and 7106 downregulated DEGs (dDEGs) were screened in GAC tissues compared with normal tissues (Figure 1). A total of 250 genes were collected from all 3 datasets, including 92 uDEGs (Figure 2, Table 1) and 158 dDEGs (Figure 2, Table 1).
Figure 1

Volcano plot of all significant DEGs. Volcano plot of all significant DEGs, including a total of 2909 uDEGs and 7106 dDEGs. Red color means uDEGs, green color represents dDEGs, and blue color represents genes that are not significantly different in expression. The criterion: |foldchange|>1, p<0.05 is determined as the cutoff value.

Figure 2

(A, B) Venn diagram of all screened DEGs. Venn diagram shows: uDEGs shared by GSE103236, GSE79973, and GSE29998 microarrays. A total of 92 uDEGs and 158 dDEGs were identified in the intersections.

Table 1

DEGs in gastric adenocarcinoma shared in 3 microarrays.

DEGSTotalElements
uDEGs92IGF2BP3, CDC25B, C5AR1, ZMYND15, COL1A1, GDPD5, ANLN, CHRNA5, FNDC1, COL18A1, PRRX1, PDGFRB, COL5A2, KIF4A, THY1, ASCL2, ANTXR1, SPP1, WNT5A, OLR1, MSR1, MELK, CDH11, TIMP1, BGN, COL8A1, TEAD4, ECT2, MMP11, KRT80, DDX31, FSCN1, SRPX2, WNT2, LRP8, CEMIP, BMP1, DIO2, ARPC1B, MFAP2, WISP1, VMO1, COL4A1, SLC1A3, SULF1, CLDN1, COL11A1, TREM1, COL1A2, APOC1, COL12A1, ESM1, ARHGAP11A, PLAU, RFC3, TGM2, OSMR, FOXC1, CHEK1, TNFRSF11B, VSNL1, IGFBP7, CST1, RCC2, LEF1, IL13RA2, LZTS1, SPHK1, KIF2C, AGPAT4, BUB1, TNFRSF12A, TROAP, ANGPT2, COL3A1, TMEM158, SERPINH1, FAP, INHBA, CDCA3, SLC5A6, CKAP2, THBS2, OLFML2B, S100A10, COL6A3, CSF2RA, HSD11B1, PMEPA1, CTHRC1, GAD1, NCAPG
dDEGs158EPB41L4B, COL4A5, ZNF385B, NRG4, IRX3, KLF4, NDRG2, IGFBP2, SLC9A2, SCNN1B, ESRRB, HTR4, F13A1, CNTD1, ADHFE1, CELA3B, OSBPL7, ADGRG2, MYOC, GIF, GPER1, HBB, GNG7, COL4A6, ERO1B, SOSTDC1, SBSPON, TMED6, ARHGAP24, CKMT2, KCNJ16, SLC26A7, ADAMTSL1, SYT4, PTGER3, ATP4A, DNASE1L3, CKB, MAL, AQP4, ESRRG, STOX2, CPA2, OXCT1, ABCA8, TTLL7, AXDND1, FXYD4, PER3, SHMT1, ETNPPL, DPT, PACRG, CAPN13, SLC5A5, FGA, NTN1, LGI1, ATP4B, EPM2A, SLC2A4, ADH1A, KLF15, SCUBE2, GPX3, KCNMB2, MAMDC2, SELENBP1, GPAT3, CLIC6, HAPLN1, TRIM50, B3GAT1, MS4A2, BHMT, GHRL, PNOC, GPRC5C, KCNJ13, DGKD, BMP6, GC, ALDH6A1, PDGFD, HTR1E, KIT, ADH1C, TOX, GCNT2, SST, PLCXD3, XYLT2, SLC6A16, CWH43, PDK4, GFRA2, GPR155, GREM2, NR3C2, PLA2G1B, MUC6, LINGO2, RNASE1, MYZAP, GUCA1C, AKR1C1, ACACB, PNPLA7, FXYD1, BAALC, PPP2R3A, BMP5, CKM, FBXL13, COBLL1, RGMB, FBP2, FAM150B, HIF3A, GSTA4, FGG, CHGA, RAB26, CAPN9, MT1M, SGSM1, ASPA, SCGN, SULT2A1, GAMT, CDHR3, CCKBR, ADRB2, GRIA3, SCGB2A1, SLC7A8, DUOX1, SORCS1, ARHGEF37, SCARA5, PACSIN1, LIFR, FNDC5, GLUL, CHIA, METTL7A, FAM189A2, SSTR1, PAIP2B, ACER2, ADH1B, MYRIP, KCNE2, PPP1R3C, TCEA3, PDE8B, SIGLEC11, KBTBD12

GO term enrichment analysis

GO analysis outcomes showed that for biological process (BP), uDEGs were markedly enriched in skeletal system development, cell adhesion, and biological adhesion (Figure 3, Table 2); the dDEGs are mainly in ion transport, homeostatic process, and chemical homeostatic (Figure 3, Table 2). For molecular function (MF), the uDEGs are enriched in structural molecule activity, extracellular matrix (ECM) structural constituent, and growth factor binding (Figure 3, Table 2); and the dDEGs are enriched in channel activity, passive transmembrane transporter activity, and substrate specific channel activity (Figure 3, Table 2). Cellular component (CC) analysis revealed that uDEGs are concentrated in extracellular region, extracellular region part, and proteinaceous extract (Figure 3, Table 2); and dDEGs are concentrated in extracellular region, plasma membrane part, and extracellular region part (Figure 3, Table 2).
Figure 3

Gene ontology analysis of DEGs related to gastric adenocarcinoma. The x-axis stands for the number of DEGs, and the vertical axis stands for GO terms, (A) The top 10 enriched biological processes (BP), cellular component (CC), and molecular function (MF) of 92 uDEGs. (B) The top 10 enriched BP, CC, and MF for 158 dDEGs.

Table 2

Gene ontology analysis of DEGs related to gastric adenocarcinoma.

ExpressionCategoryTermCount%P valueFDR
UpregulatedGOTERM_BP_FATGO: 0001501~skeletal system development193.1404958686.25E-171.67E-13
GOTERM_BP_FATGO: 0007155~cell adhesion203.3057851244.39E-126.55E-09
GOTERM_BP_FATGO: 0022610~biological adhesion203.3057851244.50E-126.72E-09
GOTERM_BP_FATGO: 0030199~collagen fibril organization81.322314051.19E-111.78E-08
GOTERM_BP_FATGO: 0030198~extracellular matrix organization81.322314051.30E-071.95E-04
GOTERM_BP_FATGO: 0001503~ossification81.322314052.60E-073.89E-04
GOTERM_BP_FATGO: 0060348~bone development81.322314054.12E-076.16E-04
GOTERM_BP_FATGO: 0043062~extracellular structure organization81.322314052.75E-060.0041046
GOTERM_BP_FATGO: 0032963~collagen metabolic process50.8264462813.71E-060.005542186
GOTERM_BP_FATGO: 0043588~skin development50.8264462814.29E-060.006410681
GOTERM_CC_FATGO: 0005578~proteinaceous extracellular matrix254.1322314051.86E-242.10E-21
GOTERM_CC_FATGO: 0031012~extracellular matrix254.1322314051.14E-231.28E-20
GOTERM_CC_FATGO: 0005576~extracellular region406.6115702486.86E-207.75E-17
GOTERM_CC_FATGO: 0044421~extracellular region part304.9586776868.27E-199.34E-16
GOTERM_CC_FATGO: 0044420~extracellular matrix part142.3140495871.64E-151.88E-12
GOTERM_CC_FATGO: 0005581~collagen91.4876033061.49E-121.68E-09
GOTERM_CC_FATGO: 0005583~fibrillar collagen50.8264462811.48E-071.67E-04
GOTERM_CC_FATGO: 0005604~basement membrane60.9917355372.04E-050.023013251
GOTERM_CC_FATGO: 0031093~platelet alpha granule lumen50.8264462812.76E-050.031185256
GOTERM_CC_FATGO: 0060205~cytoplasmic membrane-bounded vesicle lumen50.8264462813.67E-050.041403378
GOTERM_MF_FATGO: 0005201~extracellular matrix structural constituent111.8181818185.06E-135.90E-10
GOTERM_MF_FATGO: 0005198~structural molecule activity142.3140495873.15E-073.67E-04
GOTERM_MF_FATGO: 0005518~collagen binding50.8264462818.88E-060.010355681
GOTERM_MF_FATGO: 0005539~glycosaminoglycan binding71.1570247931.19E-050.013923194
GOTERM_MF_FATGO: 0001871~pattern binding71.1570247932.05E-050.023961179
GOTERM_MF_FATGO: 0030247~polysaccharide binding71.1570247932.05E-050.023961179
GOTERM_MF_FATGO: 0008201~heparin binding60.9917355373.72E-050.043401787
GOTERM_MF_FATGO: 0050840~extracellular matrix binding40.6611570251.30E-040.151895387
GOTERM_MF_FATGO: 0005509~calcium ion binding121.9834710744.20E-040.488103889
GOTERM_MF_FATGO: 0008237~metallopeptidase activity60.9917355375.52E-040.641829759
DownregulatedGOTERM_BP_FATGO: 0007586~digestion121.0380622847.45E-121.09E-08
GOTERM_BP_FATGO: 0055114~oxidation reduction151.2975778553.07E-050.045029161
GOTERM_BP_FATGO: 0006081~cellular aldehyde metabolic process40.3460207616.53E-040.952399247
GOTERM_BP_FATGO: 0030001~metal ion transport100.8650519030.002138183.08666015
GOTERM_BP_FATGO: 0006812~cation transport100.8650519030.0066666169.333089916
GOTERM_BP_FATGO: 0006811~ion transport121.0380622840.0070657849.865306976
GOTERM_BP_FATGO: 0046903~secretion70.6055363320.01024922114.00683938
GOTERM_BP_FATGO: 0015672~monovalent inorganic cation transport70.6055363320.0133718917.89646714
GOTERM_BP_FATGO: 0006813~potassium ion transport50.4325259520.01682180722.00279704
GOTERM_BP_FATGO: 0022600~digestive system process30.2595155710.01837448523.78773935
GOTERM_CC_FATGO: 0005576~extracellular region312.68166091.26E-050.014253173
GOTERM_CC_FATGO: 0045177~apical part of cell70.6055363320.0013710311.545706714
GOTERM_CC_FATGO: 0016324~apical plasma membrane60.5190311420.0021130342.373127171
GOTERM_CC_FATGO: 0005624~membrane fraction110.9515570930.04764480342.55171216
GOTERM_MF_FATGO: 0031420~alkali metal ion binding70.6055363320.0040138755.198242704
GOTERM_MF_FATGO: 0004033~aldo-keto reductase activity30.2595155710.0043057875.566366984
GOTERM_MF_FATGO: 0004198~calcium-dependent cysteine-type endopeptidase activity30.2595155710.0048998066.31139254
GOTERM_MF_FATGO: 0008289~lipid binding90.7785467130.00979699812.2496158
GOTERM_MF_FATGO: 0016620~oxidoreductase activity, acting on the aldehyde or oxo group of donors, NAD or NADP as acceptor30.2595155710.01002500112.51741656
GOTERM_MF_FATGO: 0030955~potassium ion binding50.4325259520.01027704112.81257072
GOTERM_MF_FATGO: 0008233~peptidase activity100.8650519030.01351707616.52574449
GOTERM_MF_FATGO: 0015267~channel activity80.6920415220.01931684522.80964632
GOTERM_MF_FATGO: 0022803~passive transmembrane transporter activity80.6920415220.0195469523.04969236
GOTERM_MF_FATGO: 0008900~hydrogen: potassium-exchanging ATPase activity20.1730103810.01974230823.25294859

KEGG signal pathway analysis

The most remarkably enriched pathways of uDEGs and dDEGs identified by KEGG analysis are shown in Table 3. The uDEGs are enriched in focal adhesion, ECM-receptor interaction, and cytokine-cytokine receptor interaction, while the dDEGs are enriched in pathways in arginine and proline metabolism, as well as glycine, serine, and threonine metabolism.
Table 3

KEGG pathway analysis of DEGs associated with gastric adenocarcinoma.

ExpressionTermCount%P valueFDR
Upregulatedhsa04512: ECM-receptor interaction111.8181818182.66E-132.14E-10
hsa04510: Focal adhesion111.8181818181.80E-091.44E-06
hsa04350: TGF-beta signaling pathway40.6611570250.0051491744.056689024
Downregulatedhsa00982: Drug metabolism80.6920415221.46E-071.41E-04
hsa00830: Retinol metabolism70.6055363321.38E-060.001338048
hsa00980: Metabolism of xenobiotics by cytochrome P45070.6055363322.60E-060.002518197
hsa00010: Glycolysis/Gluconeogenesis40.3460207610.0078180337.311759835
hsa00591: Linoleic acid metabolism30.2595155710.01555289314.070282
hsa00350: Tyrosine metabolism30.2595155710.03634833430.10544025

PPI network construction, module analysis and hub genes determination

The interaction between DEGs was calculated using the STRING database, and 250 DEGs differently expressed in all 3 data sets were imported into Cytoscape software for visualization. PPI network involves 143 nodes and 578 edges (Figure 4). The top 10 genes in connectivity ranking in the PPI network were selected as hub genes. The results showed that COL1A1 ranked highest among all DEGs, with 34 degree, followed by COL3A1, COL1A2, BGN, COL5A2, THBS2, TIMP1, SPP1, PDGFRB, and COL4A1 (Table 4).
Figure 4

PPI network construction, module analysis, and pathway enrichment analysis. Protein-protein interaction network for products of DEGs. A total of 143 nodes and 419 interaction associations were identified. (A) The nodes mean proteins; the edges mean the interactions of proteins; green circles meant dDEGs and orange circles meant uDEGs. (B) Module analysis based on Cytoscape software. (C) KEGG pathway enrichment analysis of DEGs in the module.

Table 4

Connectivity and regulation of the top 10 hub genes.

Gene symbolGene titleConnectivityRegulation
COL1A1Collagen type I alpha 1 chain34Up
COL3A1Collagen type III alpha 1 chain30Up
COL1A2Collagen type I alpha 2 chain29Up
BGNBiglycan29Up
COL5A2Collagen type V alpha 2 chain23Up
THBS2Thrombospondin 223Up
TIMP1TIMP metallopeptidase inhibitor 121Up
SPP1Secreted phosphoprotein 120Up
PDGFRBPlatelet-derived growth factor receptor beta20Up
COL4A1Collagen type IV alpha 1 chain19Up
The module analysis of 143 nodes showed that the most important module with higher score involves 15 nodes and 143 edges (Figure 4). All 15 nodes are all upregulated genes, which suggests the vital role of uDEGs in GAC. KEGG signal pathway analysis of the 15 genes showed that they mainly participated in 2 pathways: ECM-receptor interaction and focal adhesion. It is noteworthy that 8 of the 15 genes in the module (COL4A1, COL6A3, COL3A1, COL1A2, COL1A1, COL11A1, COL4A6, and THBS2) are involved in both pathways. GEPIA database showed that all 10 hub genes are upregulated in GAC (Figure 5). To assess the prognostic value of 10 hub genes, we used UALCAN for prognostic analysis. The results of the prognostic analysis showed the upregulated expression of COL3A1, COL1A2, BGN, and THBS2 significantly reduce the survival time of GAC patients (Figure 6).
Figure 5

Validation of the gene expression levels of COL1A1, COL3A1, COL1A2, BGN, COL5A2, THBS2, TIMP1, SPP1, PDGFRB, and COL4A1 between GAC and normal gastric tissues in the GEPIA database. They are significantly upregulated in GAC compared with normal tissues (P<0.01). The red * represents P<0.01.

Figure 6

UALCAN overall survival analysis plot of the top 10 hub genes expressed in gastric adenocarcinoma patient samples and 4 DEGs among the top 10 hub genes that are significantly related to the survival of gastric adenocarcinoma patients (P<0.05). (A) COL3A1; (B) COL1A2; (C) BGN; (D) THBS2.

Discussion

Gastric cancer is a leading cause of death. Early diagnosis and treatment are essential to prolong the survival time of GC patients. GAC is the most common type of gastric cancer. Therefore, it is crucial to further explore the predictive indicators and therapeutic targets of GAC. Recently, with the rapid development of bioinformatics, DNA microarray is increasingly applied to explore the early diagnosis, treatment, and prognosis of cancer [20]. Therefore, the present study explored the potential target genes and pathways of GAC by use of bioinformatics methods. In this study, 2909 uDEGs and 7106 dDEGs were identified from the GSE103236, GSE79973, and GSE29998 datasets downloaded from the GEO database, among which, 92 uDEGs and 158 dDEGs were significantly expressed in all 3 datasets. To further define the role of these DEGs in gastric adenocarcinoma, we performed a series of bioinformatics and prognostic analysis of these DEGs. GO analysis revealed uDEGs are highly involved in cell adhesion, biological adhesion, and skeletal system development, while the dDEGs are mainly in ion transport, homeostatic process, and chemical homeostasis. Studies [21] have shown that the decrease of cell adhesion is a key step in the metastasis of cancer, which agrees with our GO analysis results. For MF, the uDEGs are markedly enriched in ECM structural constituent, structural molecule activity, and growth factor binding, while the dDEGs were enriched in channel activity, passive transmembrane transporter activity, and substrate specific channel activity. GO CC analysis revealed that uDEGs were concentrated in extracellular region part, proteinaceous extract, and extracellular region, while dDEGs were concentrated in extracellular region, plasma membrane part, and extracellular region part. The role of ECM and collagen binding in development and progression of tumors has been confirmed in some previous studies [22,23], which agrees with results of the present study. To better understanding the relationships and interactions between these DEGs, we used Cytoscape software to construct a PPI network of DEGs-encoded proteins, and screened out 10 hub genes with high degrees. The order of degree from high to low was COL1A1, COL3A1, COL1A2, BGN, COL5A2, THBS2, TIMP1, SPP1, PDGFRB, and COL4A1. COL1A1, COL1A2, COL3A1, COL5A2, and COL4A1, which belong to the collagen (COL) family, are the top 10 hub genes, which suggests that the collagen gene is likely to be a potential target for gastric adenocarcinoma. Collagen is the main protein in bone and teeth, and is involved in the adhesion of tumor cells, gap junction, and formation of extracellular matrix (ECM) [24]. COL1A1 is the major component of type I collagen. Some studies have shown that mir-129-5p stops the invasion and proliferation of gastric cancer cells by inhibiting COL1A1[25]. Ma et al. [26] found silencing the collagen gene inhibits tumor proliferation and metastasis. Our study also found the uDEGs are enriched in cell adhesion and biological adhesion at the BP level, which suggests that DEGs belonging to the COL family play a vital role in invasion and metastasis of tumor cells. Studies [27,28] showed that COL1A2 is highly expressed in colorectal cancer and medulloblastoma. Research has revealed the high expression of COL3A1 is independently associated with the low survival rate of colorectal carcinoma [29]. However, the relationship between COL3A1 and gastric adenocarcinoma has not been studied. Zhang et al. [30] found that the high expression of COL4A1 is closely related to the depth of invasion, TNM stage, and lymph node metastasis. Makito et al. [32] demonstrated that COL4A1 can promote invasive ability and invasive growth pattern by activating the AKT pathway and upregulating epithelial-mesenchymal transition. Zhao et al. [32] also used bioinformatics methods show that COL5A2 is a key factor in gastric cancer, but there is no laboratory evidence to prove that COL5A2 is involved in gastric adenocarcinoma. ECM is a protein compound that plays an indispensable role in cell migration and cancer development [33]. BGN, as an integral part of ECM, is considered to be a pathway for malignant tumor cells to acquire migration and invasiveness [34]. Studies have shown that the expression of BGN in GC is notably upregulated, and correlated with depth of tumor invasion and TNF staging [35]. Thromboreactive protein (THBS) is an extracellular glycoprotein that plays roles in cell matrix and intercellular interactions [36]. Studies have shown that high THBS2 expression is correlated with low proliferation rate of gastric cancer cells [37]. Tissue inhibitor matrix metalloproteinase-1 (TIMP-1) is classified into the family of tissue inhibitors of metalloproteinases, and the proteins encoded by TIMP-1 are considered to be the key biofactors in the invasion and metastasis of tumors [38]. Wang et al. [39] showed that the expression level of TIMP1 in peripheral blood was associated with the stage of cancer, and the upregulation of TIMP1 may be an adverse prognostic factor for recurrence of gastric cancer. SPP1 is an ECM-related protein that has carcinogenic and anti-tumor effects [40]. Li et al. [41] also identified SPP1 as a prognostic pivotal gene in gastric cancer by bioinformatics. Sharvesh et al. [42] found that SPP1 is highly expressed in gastric cancer tissues compared with normal adjacent tissues, and its expression increased with the depth of tumor invasion. Platelet-derived growth factor receptors (PDGFRs) can induce activation of intracellular signal transduction pathways, which can promote cell proliferation, metastasis, and invasion [43]. Chen et al. [44] identified PDGFRB as a candidate gene for gastric cancer by constructing a gene co-expression network, which is consistent with the results of our study. It has also been affirmed that PDGFRB is upregulated in gastric cancer tissues, and its high expression is positively correlated with poor prognosis of gastric cancer patients [45]. These results suggest that BGN, THBS2, TIMP1, SPP1, and PDGFRB are key factors in GAC. Module analysis from the PPI network showed that gastric adenocarcinoma is closely related to focal adhesion and ECM-receptor interaction. Focal adhesion is a complex, dynamic process involving the driving activity of actin cytoskeleton and the participation of specific receptors and signal transduction [46]. Studies have found that focal adhesions are intensely involved in multiple key pathways of tumor migration and metastasis [47]. Research by Lu et al. showed that abnormal ECM can promote the growth and metastasis of tumors by directly promoting cell metastasis on the one hand, and indirectly by promoting the formation of tumor microvessels on the other hand [48]. It is noteworthy that 8 of the 15 genes in the module (COL4A1, COL6A3, COL3A1, COL1A2, COL1A1, COL11A1, COL4A6, and THBS2) are involved in both pathways, and most of them belong to the COL family, which strengthens the findings of the role of the COL family in gastric adenocarcinoma. To study the expression levels and prognostic value of 10 hub genes, we used GEPIA database and UALCAN for expression validation and prognostic analysis. The GEPIA database showed all the 10 hub genes are upregulated in GAC compared to normal gastric tissue. The results of the prognostic analysis showed that the upregulated expression of COL3A1, COL1A2, BGN, and THBS2 significantly reduced the survival time of GAC patients. Therefore, COL3A1, COL1A2, BGN, and THBS2 appear to be ideal prognostic indicators for gastric adenocarcinoma. In sum, we identified DEGs and performed GO analysis, pathway enrichment analysis, and PPI network construction to understand their roles in gastric adenocarcinoma. In addition, we identified COL3A1, COL1A2, BGN, and THBS2 as hub genes and evaluated their prognostic value. This study provided evidence for early diagnosis and prognostic evaluation of gastric adenocarcinoma at the molecular level, but these findings need to be confirmed by subsequent laboratory studies.

Conclusions

In this study, we used bioinformatics to predict the DEGs of gastric adenocarcinoma and its enriched pathways and screened and evaluated some hub genes to provide some ideas and references for the early diagnosis and treatment of gastric adenocarcinoma at the molecular level. However, the limitation of our research lies in the lack of laboratory evidence. Therefore, further laboratory studies are needed to validate these findings.
  47 in total

Review 1.  Practical Points in Gastric Pathology.

Authors:  Sangjeong Ahn; Do Youn Park
Journal:  Arch Pathol Lab Med       Date:  2016-05       Impact factor: 5.534

2.  MiR-129-5p suppresses gastric cancer cell invasion and proliferation by inhibiting COL1A1.

Authors:  Quan Wang; Jinhai Yu
Journal:  Biochem Cell Biol       Date:  2017-05-08       Impact factor: 3.626

3.  NCBI GEO: archive for functional genomics data sets--update.

Authors:  Tanya Barrett; Stephen E Wilhite; Pierre Ledoux; Carlos Evangelista; Irene F Kim; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Michelle Holko; Andrey Yefanov; Hyeseung Lee; Naigong Zhang; Cynthia L Robertson; Nadezhda Serova; Sean Davis; Alexandra Soboleva
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

4.  TIMP1 is a prognostic marker for the progression and metastasis of colon cancer through FAK-PI3K/AKT and MAPK pathway.

Authors:  Guohe Song; Shifeng Xu; Hong Zhang; Yupeng Wang; Chao Xiao; Tao Jiang; Leilei Wu; Tao Zhang; Xing Sun; Lin Zhong; Chongzhi Zhou; Zhaowen Wang; Zhihai Peng; Jian Chen; Xiaoliang Wang
Journal:  J Exp Clin Cancer Res       Date:  2016-09-20

5.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

6.  A panel of collagen genes are associated with prognosis of patients with gastric cancer and regulated by microRNA-29c-3p: an integrated bioinformatics analysis and experimental validation.

Authors:  Qiang-Nu Zhang; Hui-Li Zhu; Meng-Ting Xia; Juan Liao; Xiao-Tao Huang; Jiang-Wei Xiao; Cong Yuan
Journal:  Cancer Manag Res       Date:  2019-05-24       Impact factor: 3.989

7.  Gibbon genome and the fast karyotype evolution of small apes.

Authors:  Lucia Carbone; R Alan Harris; Sante Gnerre; Krishna R Veeramah; Belen Lorente-Galdos; John Huddleston; Thomas J Meyer; Javier Herrero; Christian Roos; Bronwen Aken; Fabio Anaclerio; Nicoletta Archidiacono; Carl Baker; Daniel Barrell; Mark A Batzer; Kathryn Beal; Antoine Blancher; Craig L Bohrson; Markus Brameier; Michael S Campbell; Oronzo Capozzi; Claudio Casola; Giorgia Chiatante; Andrew Cree; Annette Damert; Pieter J de Jong; Laura Dumas; Marcos Fernandez-Callejo; Paul Flicek; Nina V Fuchs; Ivo Gut; Marta Gut; Matthew W Hahn; Jessica Hernandez-Rodriguez; LaDeana W Hillier; Robert Hubley; Bianca Ianc; Zsuzsanna Izsvák; Nina G Jablonski; Laurel M Johnstone; Anis Karimpour-Fard; Miriam K Konkel; Dennis Kostka; Nathan H Lazar; Sandra L Lee; Lora R Lewis; Yue Liu; Devin P Locke; Swapan Mallick; Fernando L Mendez; Matthieu Muffato; Lynne V Nazareth; Kimberly A Nevonen; Majesta O'Bleness; Cornelia Ochis; Duncan T Odom; Katherine S Pollard; Javier Quilez; David Reich; Mariano Rocchi; Gerald G Schumann; Stephen Searle; James M Sikela; Gabriella Skollar; Arian Smit; Kemal Sonmez; Boudewijn ten Hallers; Elizabeth Terhune; Gregg W C Thomas; Brygg Ullmer; Mario Ventura; Jerilyn A Walker; Jeffrey D Wall; Lutz Walter; Michelle C Ward; Sarah J Wheelan; Christopher W Whelan; Simon White; Larry J Wilhelm; August E Woerner; Mark Yandell; Baoli Zhu; Michael F Hammer; Tomas Marques-Bonet; Evan E Eichler; Lucinda Fulton; Catrina Fronick; Donna M Muzny; Wesley C Warren; Kim C Worley; Jeffrey Rogers; Richard K Wilson; Richard A Gibbs
Journal:  Nature       Date:  2014-09-11       Impact factor: 49.962

8.  Epithelial but not stromal expression of collagen alpha-1(III) is a diagnostic and prognostic indicator of colorectal carcinoma.

Authors:  Xiao-Qing Wang; Zu-Xiong Tang; Dong Yu; Shu-Jian Cui; Ying-Hua Jiang; Qian Zhang; Jie Wang; Peng-Yuan Yang; Feng Liu
Journal:  Oncotarget       Date:  2016-02-23

9.  Identification of Key Genes and Pathways in Tongue Squamous Cell Carcinoma Using Bioinformatics Analysis.

Authors:  Huayong Zhang; Jianmin Liu; Xiaoyan Fu; Ankui Yang
Journal:  Med Sci Monit       Date:  2017-12-14

10.  Candidate genes in gastric cancer identified by constructing a weighted gene co-expression network.

Authors:  Jian Chen; Xiuwen Wang; Bing Hu; Yifu He; Xiaojun Qian; Wei Wang
Journal:  PeerJ       Date:  2018-05-04       Impact factor: 2.984

View more
  5 in total

1.  A tumour-resident Lgr5+ stem-cell-like pool drives the establishment and progression of advanced gastric cancers.

Authors:  A Fatehullah; Y Terakado; S Sagiraju; T L Tan; T Sheng; S H Tan; K Murakami; Y Swathi; N Ang; R Rajarethinam; T Ming; P Tan; B Lee; N Barker
Journal:  Nat Cell Biol       Date:  2021-12-02       Impact factor: 28.824

2.  Identification of Potential Biomarkers Associated with Prognosis in Gastric Cancer via Bioinformatics Analysis.

Authors:  Dong Li; Yi Yin; Muqun He; Jianfeng Wang
Journal:  Med Sci Monit       Date:  2021-02-14

3.  A Deep Neural Network for Gastric Cancer Prognosis Prediction Based on Biological Information Pathways.

Authors:  Jili Hu; Weiqiang Yu; Yuting Dai; Can Liu; Yongkang Wang; Qingfa Wu
Journal:  J Oncol       Date:  2022-09-09       Impact factor: 4.501

4.  Association of FLG mutation with tumor mutation load and clinical outcomes in patients with gastric cancer.

Authors:  Fu Yicheng; Liu Xin; Yu Tian; Liu Huilin
Journal:  Front Genet       Date:  2022-08-15       Impact factor: 4.772

5.  Silencing of SPP1 Suppresses Progression of Tongue Cancer by Mediating the PI3K/Akt Signaling Pathway.

Authors:  Qiaoli Zhang; Lifeng Li; Yueli Lai; Tong Zhao
Journal:  Technol Cancer Res Treat       Date:  2020 Jan-Dec
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.