Literature DB >> 35726219

Integrated Bioinformatics Analysis for Identifying the Significant Genes as Poor Prognostic Markers in Gastric Adenocarcinoma.

Yamei Li1, Yan Luo2, Qiang Tian2, Yu Lai3, Lei Xu2, Hailong Yun2, Yang Liang2, Dandan Liao2, Rui Gu4, Liye Liu5, Mu Yuan6, Yijiao Li7, Yufan Li4, Mingze Lu8, Xin Yong9, Hua Zhang2,10.   

Abstract

Gastric adenocarcinoma (GAC) is the most common histological type of gastric cancer and imposes a considerable health burden globally. The purpose of this study was to identify significant genes and key pathways participated in the initiation and progression of GAC. Four datasets (GSE13911, GSE19826, GSE54129, and GSE79973) including 171 GAC and 77 normal tissues from Gene Expression Omnibus (GEO) database were collected and analyzed. Through integrated bioinformatics analysis, we obtained 69 commonly differentially expressed genes (DEGs) among the four datasets, including 20 upregulated and 49 downregulated genes. The prime module in protein-protein interaction network of DEGs, including ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, BGN, and SPP1, was enriched in protein digestion and absorption, ECM-receptor interaction, focal adhesion, PI3K-Akt signaling pathway, and amoebiasis. Furthermore, expression and survival analysis found that all seven hub genes were highly expressed in GAC tissues and 6 of them (except for SPP1) were able to predict poor prognosis of GAC. Finally, we verified the 6 high-expressed hub genes in GAC tissues via immunohistochemistry, Western blot, and RNA quantification analysis. Altogether, we identified six significantly upregulated DEGs as poor prognostic markers in GAC based on integrated bioinformatical methods, which could be potential molecular markers and therapeutic targets for GAC patients.
Copyright © 2022 Yamei Li et al.

Entities:  

Year:  2022        PMID: 35726219      PMCID: PMC9206555          DOI: 10.1155/2022/9080460

Source DB:  PubMed          Journal:  J Oncol        ISSN: 1687-8450            Impact factor:   4.501


1. Introduction

Gastric adenocarcinoma (GAC), the predominant histological type of gastric cancer, is the fifth most common cancer ranked after lung, breast, colorectal, and prostate cancers [1, 2]. GAC, also known as stomach adenocarcinoma (STAD), has increased more than 1,000,000 new cases and led to deaths of more than 768,000 people worldwide in 2020 [3]. Although improvements in endoscopic, surgical, and systemic treatments have been made for decades, the mortality rate of GAC is still high and the global 5-year survival rates remain unsatisfactory [1, 4]. Thus, GAC still imposes a considerable health burden globally. Although the global 5-year survival rates are relatively low, the rates in Japan and South Korea are far more optimistic [5, 6], owing to early detection and screening efforts in these Asian countries [7]. Furthermore, it is reported that the 5-year survival rate of early-stage T1 GAC (according to the TNM classification of malignant tumors) is ∼95%, while advanced-stage GAC (which cannot be surgically treated) has a median survival of ∼9-10 months [8, 9], which further emphasizes the critical importance of early detection and diagnosis. Molecular markers are vital for early detection of cancer [10-12]. To date, several biomarkers have been used for the diagnosis and determination of the clinical stage of GAC. Among them, carcinoembryonic antigen (CEA), carbohydrate antigen 19-9 (CA19-9), and erb-b2 receptor tyrosine kinase 2 (HER2) are the most frequently used biomarkers for GAC in clinical setting [13, 14]. However, due to the insufficient specificity and sensitivity of the current markers, novel specific and sensitive molecular markers are still on urgent demand, especially in the field of early diagnosis and prognosis [13-15]. Bioinformatics analysis is a powerful and comprehensive tool for analyzing gene expression data from multiple datasets, which is perfect for excavating the potential molecular markers laid in Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) datasets. Therefore, in the current study, we mainly focused on exploring the commonly differential expressed genes among different GEO datasets. Gene ontology (GO) and KEGG enrichment analysis were further conducted to identify the hub genes and key pathways enriched in the commonly DEGs. Protein-protein interaction (PPI) network of the DEGs was constructed, and core genes were determined via the Cytoscape Molecular Complex Detection (MCODE). In addition, DAVID, GEPIA, and Kaplan–Meier plotter were applied to re-analyze the expression and survival information of the core genes, respectively [16, 17]. Finally, immunohistochemistry, Western blot, and RNA quantification analysis were performed to validate the expressions of the identified genes in GAC tissue samples.

2. Materials and Methods

2.1. Microarray Data Information

NCBI-GEO is a free public database and provides us with gene expression profile of numerous cancers. The following criteria were used to screen the datasets and ensure relevant data were recorded: (I) the sample includes gastric adenocarcinoma and normal tissues; (II) the study type is expression profiling by array; (III) the species is limited to Homo sapiens; (IV) access to raw data is allowed. We obtained the gene expression profiles of GSE13911, GSE19826, GSE54129, and GSE79973 in gastric adenocarcinoma and paired normal tissues. Microarray data of GSE13911, GSE19826, GSE54129, and GSE79973 were all on account of GPL570 platforms ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array) which included a total of 171 GAC tissues and 77 normal tissues.

2.2. DEGs Identification

Background correction and normalization were conducted through robust multi-array average (RMA) and Microarray Suite (MAS) approach. The GEO2R online tools [18] were used to identify DEGs between the GAC specimen and normal specimen with |log2FC| > 2 and an adjusted P value <0.05 [16-18]. Then, the raw data were analyzed in Venn software online to identify the commonly DEGs among the original four datasets. The DEGs with log2FC > 0 were considered as upregulated genes, while the DEGs with log2FC < 0 were considered as downregulated genes [16, 17].

2.3. GO and Pathway Enrichment Analysis

The functions and pathways enrichment of candidate DEGs were analyzed using DAVID (the Database for Annotation, Visualization and Integrated Discovery, https://david.ncifcrf.gov/) [19], which is an online bioinformatic tool with the function of integrating the GO and pathway enrichment analysis [20, 21]. Through DAVID, we identified the unique biological properties of the commonly DEGs and visualized the DEGs enrichment of molecular function (MF), cellular component (CC), biological process (BP), and Kyoto Encyclopedia of Gene and Genome (KEGG) pathways (P < 0.05) [16, 17].

2.4. Integration of PPI Network and Modular Analysis

STRING (Search Tool for the Retrieval of Interacting Genes, https://cn.string-db.org/) online tool [22] was used to evaluate the PPI information of DEGs. Then, the STRING app in Cytoscape [23] was employed to examine the potential correlation between these DEGs (maximum number of interactors = 0 and confidence score ≥0.4). In addition, the MCODE app in Cytoscape was used to identify the modules and hub genes of the PPI network (degree cutoff = 2, max. depth = 100, k-core = 2, and node score cutoff = 0.2) [16, 17]. PPI network properties, such as node degree and betweenness centrality, were visualized by shape size and label font size, respectively.

2.5. Survival and RNA Sequencing Expression Analysis

Kaplan–Meier plotter is a common website tool (https://kmplot.com/), which contains considerable information among several cancers, including breast and gastric cancer [24]. The survival analysis was conducted by Kaplan–Meier plotter, and the log rank P value and hazard ratio (HR) with 95% confidence intervals were computed and shown on the plot. To validate these DEGs with significant expression pattern, we applied the GEPIA (Gene Expression Profiling Interactive Analysis, https://gepia.cancer-pku.cn/) website to analyze the data of RNA sequencing expression based on the GTEx projects and TCGA datasets [25].

2.6. Immunohistochemical (IHC) Staining

IHC staining was performed to detect the protein level of certain genes in GAC and normal human tissue samples and performed according to the standard protocols using following antibodies: anti-ADAMTS2 (bs-5858R, 1 : 500), anti-COL10A1 (BA2023, 1 : 400), anti-COL1A1 (BA0325, 1 : 400), anti-COL1A2 (BM4017, 1 : 100), anti-COL8A1 (bs-7529R, 1 : 500), and anti-BGN (bs-7552R, 1 : 500).

2.7. Western Blot

GAC and adjacent normal tissue samples were grinded and lysed with RIPA buffer supplemented with protease inhibitor cocktail. Protein concentrations of the extracts were measured with BCA assay. The Western blot analysis was performed according to the standard protocols using the above antibodies.

2.8. RNA Quantification

Total RNA was extracted from GAC and adjacent normal tissues with TRIzol reagent (Invitrogen) and reverse-transcribed using PrimeScript™ RT reagent kit (Takara). Quantitative real-time PCR analysis was performed on LightCycler (Roche) with TB Green® Premix Ex Taq™ II (Takara). Data were normalized to GAPDH expression. The primers used for real-time PCR were as follows: GAPDH (forward: 5′-GGA GCG AGA TCC CTC CAA AAT-3′, reverse: 5′-GGC TGT TGT CAT ACT TCT CAT GG-3′), ADAMTS2 (forward: 5′-GTG CAT GTG GTG TAT CGC C-3′, reverse: 5′-AGG ACC TCG ATG TTG TAG TCA-3′), COL10A1 (forward: 5′-CAT AAA AGG CCC ACT ACC CAA C-3′, reverse: 5′-ACC TTG CTC TCC TCT TAC TGC-3′), COL1A1 (forward: 5′-GAG GGC CAA GAC GAA GAC ATC-3′, reverse: 5′-CAG ATC ACG TCA TCG CAC AAC-3′), COL1A2 (forward: 5′-GGC CCT CAA GGT TTC CAA GG-3′, reverse: 5′-CAC CCT GTG GTC CAA CAA CTC-3′), COL8A1 (forward: 5′-GCT GCC ACC TCA AAT TCC TC-3′, reverse: 5′-CTT TCT TGG GTA CGG CTT CCT-3′), and BGN (forward: 5′-GAG ACC CTG AAT GAA CTC CAC C-3′, reverse: 5′-CTC CCG TTC TCG ATC ATC CTG-3′).

3. Results

3.1. Identification of DEGs in Gastric Adenocarcinoma

GEO2R online tool was used to determine the DEGs from GSE13911, GSE19826, GSE54129, and GSE79973, which resulted in 484, 388, 971, and 524 DEGs, respectively (Figure 1(a)) (|log2FC| > 2 and adjust P value <0.05). Then, the commonly DEGs among the above four datasets were identified by Venn diagram software. Results showed that a total of 69 commonly DEGs were identified, including 20 upregulated genes (log2FC > 2) and 49 downregulated genes (log2FC < −2) in GAC tissues (Figures 1(b) and 1(c) and Table 1).
Figure 1

Identification of 69 commonly differentially expressed genes (DEGs) among the four datasets (GSE13911, GSE19826, GSE54129, and GSE79973) through Venn diagram software (available online: https://bioinformatics.psb.ugent.be/webtools/Venn/). (a) Volcano plot of DEGs from the four datasets. (b) Twenty DEGs were upregulated among the four datasets (log2FC > 0). (c) Forty-nine DEGs were downregulated among the four datasets (log2FC < 0). Different colors represented different datasets in B and C.

Table 1

All 69 commonly differentially expressed genes (DEGs) were identified from four profile datasets, including 20 upregulated genes and 49 downregulated genes in the gastric adenocarcinoma tissues compared to normal tissues.

DEGsGenes name
UpregulatedIGF2BP3 SULF1 FAP RARRES1 INHBA SPP1 COL1A1 COL10A1 FNDC1 COL11A1 CEMIP CTHRC1 THBS2 BGN COL1A2 CST1 MFAP2 ADAMTS2 WISP1 COL8A1

DownregulatedCNTN3 LIPF TRIM74///TRIM73 CAPN13 FBP2 AKR1B10 B4GALNT3 CYP2C18 ALDH3A1 ATP4A UGT2B15 KIAA1324 GKN1 ADGRG2 RDH12 GIF CA2 GATA5 ATP4B MAL CAPN9 SLC26A9 ESRRG ADTRP VSTM2A SSTR1 ACER2 MFSD4A DPCR1 ADH7 VSIG1 PGC KCNE2 SOSTDC1 TPCN2 CA9 MUC5AC VSIG2 SPINK7 TMED6 SCNN1B LINC00982 HPGD PSAPL1 CWH43 KCNJ16 KCNJ15 GKN2 CXCL17

3.2. GO and KEGG Analysis of DEGs in Gastric Adenocarcinoma

In order to examine the biological properties of the 69 DEGs, DAVID software was applied to conduct GO and KEGG analysis. Results of GO analysis indicated that (1) for biological processes (BP), upregulated DEGs were particularly enriched in endodermal cell differentiation, collagen fibril organization, protein heterotrimerization, skin morphogenesis, and cell adhesion, and downregulated DEGs in digestion, potassium ion import, oxidation-reduction process, and bicarbonate transport (Figure 2(a)); (2) for cell component (CC), upregulated DEGs were significantly enriched in extracellular space, collagen trimer, collagen type I trimer, and proteinaceous extracellular matrix, and downregulated DEGs in extracellular space, lysosome, apical plasma membrane, integral component of plasma membrane, and integral component of membrane (Figure 2(b)); (3) for molecular function (MF), upregulated DEGs were enriched in extracellular matrix structural constituent and heparin binding, and downregulated DEGs in inward rectifier potassium channel activity, hydrogen:potassium-exchanging ATPase activity, and G-protein activated inward rectifier potassium channel activity (Figure 2(c) and Table 2, P < 0.05).
Figure 2

Gene ontology analysis of commonly DEGs in gastric adenocarcinoma. DAVID software was applied to conduct gene ontology. Biological processes (BP) (a), cell component (CC) (b), and molecular function (MF) (c) of the up/downregulated DEGs were shown (P < 0.05).

Table 2

Gene ontology analysis of differentially expressed genes in gastric adenocarcinoma.

ExpressionCategoryTermCount% P-valueFDR
UpregulatedGOTERM_BP_DIRECTGO:0035987∼endodermal cell differentiation315.02.41E − 040.277424
GOTERM_BP_DIRECTGO:0030199∼collagen fibril organization315.03.59E − 040.413041
GOTERM_BP_DIRECTGO:0070208∼protein heterotrimerization210.00.0071327.921921
GOTERM_BP_DIRECTGO:0043589∼skin morphogenesis210.00.0071327.921921
GOTERM_BP_DIRECTGO:0007155∼cell adhesion315.00.0085429.418433
GOTERM_CC_DIRECTGO:0005615∼extracellular space840.04.61E − 050.038561
GOTERM_CC_DIRECTGO:0005581∼collagen trimer315.00.0012851.069804
GOTERM_CC_DIRECTGO:0005584∼collagen type I trimer210.00.0024912.064654
GOTERM_CC_DIRECTGO:0005578∼proteinaceous extracellular matrix315.00.01432711.37209
GOTERM_MF_DIRECTGO:0005201∼extracellular matrix structural constituent315.05.25E − 040.415248
GOTERM_MF_DIRECTGO:0008201∼heparin binding210.00.08739851.56318

DownregulatedGOTERM_BP_DIRECTGO:0007586∼digestion510.21.81E − 050.023045
GOTERM_BP_DIRECTGO:0010107∼potassium ion import48.24.56E − 050.058116
GOTERM_BP_DIRECTGO:0055114∼oxidation-reduction process714.30.0033334.16404
GOTERM_BP_DIRECTGO:0015701∼bicarbonate transport36.10.0054066.671731
GOTERM_CC_DIRECTGO:0005615∼extracellular space1122.40.0016461.61346
GOTERM_CC_DIRECTGO:0005764∼lysosome510.20.0024992.44067
GOTERM_CC_DIRECTGO:0016324∼apical plasma membrane510.20.0061375.897225
GOTERM_CC_DIRECTGO:0005887∼integral component of plasma membrane1020.40.0080077.631398
GOTERM_CC_DIRECTGO:0016021∼integral component of membrane2244.90.0091038.634041
GOTERM_MF_DIRECTGO:0005242∼inward rectifier potassium channel activity36.17.31E − 040.820368
GOTERM_MF_DIRECTGO:0008900∼hydrogen:potassium-exchanging ATPase activity24.10.006036.584128
GOTERM_MF_DIRECTGO:0015467∼G-protein activated inward rectifier potassium channel activity24.10.01996520.31405
Results of KEGG analysis showed that upregulated DEGs were particularly enriched in ECM-receptor interaction, focal adhesion, protein digestion and absorption, PI3K-Akt signaling pathway, amoebiasis, and platelet activation (Figure 3(a)), while downregulated DEGs in gastric acid secretion, retinol metabolism, chemical carcinogenesis, collecting duct acid secretion, glycolysis/gluconeogenesis, drug metabolism-cytochrome P450, metabolism of xenobiotics by cytochrome P450, and metabolic pathways (Figure 3(b) and Table 3, P < 0.05).
Figure 3

KEGG analysis of commonly DEGs in gastric adenocarcinoma. KEGG analysis of upregulated DEGs (a) and downregulated DEGs (b) was shown (P < 0.05).

Table 3

KEGG pathway analysis of differentially expressed genes in gastric adenocarcinoma.

ExpressionTermCount% P-valueGenes
Upregulatedptr04512:ECM-receptor interaction525.08.56E − 07COL1A2, COL1A1, THBS2, COL11A1, SPP1
ptr04510:focal adhesion525.02.60E − 05COL1A2, COL1A1, THBS2, COL11A1, SPP1
ptr04974:protein digestion and absorption420.05.98E − 05COL1A2, COL1A1, COL11A1, COL10A1
ptr04151:PI3K-Akt signaling pathway525.01.78E − 04COL1A2, COL1A1, THBS2, COL11A1, SPP1
ptr05146:amoebiasis315.00.004694COL1A2, COL1A1, COL11A1
ptr04611:platelet activation315.00.00728COL1A2, COL1A1, COL11A1

Downregulatedhsa04971:gastric acid secretion612.22.68E − 06KCNJ16, KCNJ15, ATP4A, ATP4B, KCNE2, CA2
hsa00830:retinol metabolism48.20.001043RDH12, CYP2C18, ADH7, UGT2B15
hsa05204:chemical carcinogenesis48.20.001989CYP2C18, ADH7, UGT2B15, ALDH3A1
hsa04966:collecting duct acid secretion36.10.003265ATP4A, ATP4B, CA2
hsa00010:glycolysis/gluconeogenesis36.10.019041ADH7, FBP2, ALDH3A1
hsa00982:drug metabolism—cytochrome P45036.10.019581ADH7, UGT2B15, ALDH3A1
hsa00980:metabolism of xenobiotics by cytochrome P45036.10.022951ADH7, UGT2B15, ALDH3A1
hsa01100:metabolic pathways918.40.029288RDH12, CYP2C18, ACER2, AKR1B10, ADH7, FBP2, UGT2B15, ALDH3A1, LIPF

3.3. PPI Network and Modular Analysis of DEGs

The DEGs PPI network complex was constructed via Cytoscape software. Results showed that 44 DEGs including 16 upregulated and 28 downregulated genes were enrolled, and 75 edges were formed (Figure 4(a)). There were 25 DEGs which were not included into the DEGs PPI network. Then, we applied Cytoscape MCODE to further analyze the prime module and ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, BGN, and SPP1 were identified among the 44 nodes. Results also showed that the above seven hub nodes were all upregulated genes (Figure 4(b)).
Figure 4

DEGs PPI network was constructed by Cytoscape software. (a) There were a total of 44 nodes and 75 edges in the DEGs PPI network complex. The nodes meant proteins; the edges meant the interaction between proteins; red circles meant upregulated DEGs; and green circles meant downregulated DEGs. (b) Modular analysis via Cytoscape software (degree cutoff = 2, max. depth = 100, k-core = 2, and node score cutoff = 0.2). Seven central nodes were screened. Circle size represents node degree, and label font size represents betweenness centrality.

3.4. Re-Analysis of Seven Hub Genes by KEGG Pathway Enrichment

To further understand the possible enriched pathways of the seven hub DEGs, KEGG pathway enrichment was re-analyzed via DAVID. Results showed that seven core genes were significantly enriched in several cancer-related pathways. In detail, COL1A2, COL1A1, and COL10A1 were enriched in protein digestion and absorption; COL1A2, COL1A1, and SPP1 were enriched in ECM-receptor interaction, focal adhesion, and PI3K-Akt signaling pathway; COL1A2 and COL1A1 were further enriched in amoebiasis (Figure 5 and Table 4, P < 0.05).
Figure 5

KEGG re-analysis of the seven hub genes.

Table 4

Re-analysis of seven selected genes via KEGG pathway enrichment.

Pathway IDNameCount% P-valueGenes
cfa04974Protein digestion and absorption342.94.55E − 04COL1A2, COL1A1, COL10A1
cfa04512ECM-receptor interaction342.94.89E − 04COL1A2, COL1A1, SPP1
cfa04510Focal adhesion342.90.002751COL1A2, COL1A1, SPP1
cfa04151PI3K-Akt signaling pathway342.90.00721COL1A2, COL1A1, SPP1
cfa05146Amoebiasis228.60.046382COL1A2, COL1A1

3.5. Analysis of Hub Genes via the GEPIA and Kaplan–Meier Plotter

To further validate the significance of the seven central genes, GEPIA and Kaplan–Meier plotter online tools were utilized to identify the expression data and survival data, respectively. GEPIA expression data showed that all seven hub genes were highly expressed in GAC tissues compared to normal tissues (Figure 6 and Table 5, P < 0.05). Kaplan–Meier plotter survival data showed that high expression of ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN had a significantly worse survival probability, while high expression of SPP1 showed no effect on patient survival (Figure 7 and Table 6, P < 0.05).
Figure 6

Expression level of the seven hub genes in gastric adenocarcinoma patients compared to healthy people. To further validate the expression level between GAC patients and normal people, seven genes were analyzed via GEPIA website. All seven genes were significantly highly expressed in GAC specimen compared to normal specimen (P < 0.05). Red color meant GAC tissues (n = 408), and grey color meant normal tissues (n = 211).

Table 5

Validation of seven hub genes via GEPIA.

CategoryGenes
Genes with high expression in GAC (P < 0.05)ADAMTS2 COL10A1 COL1A1 COL1A2 COL8A1 BGN SPP1
Figure 7

Prognostic information of the seven hub genes. Kaplan–Meier plotter online tools were used to analyze the prognostic information of the seven hub genes. High expression of ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN had a significantly worse survival rate (P < 0.05).

Table 6

The prognostic information of the seven key candidate genes.

CategoryGenes
Genes with significantly worse survival (P < 0.05)ADAMTS2 COL10A1 COL1A1 COL1A2 COL8A1 BGN
Genes without significantly survival (P > 0.05)SPP1

3.6. Validation of the Expression Levels of Six Core Genes in GAC Patients

Finally, we detected the expression levels of the above six genes in GAC specimens and adjacent normal specimens by immunohistochemistry (Figure 8(a)), Western blot (Figure 8(b)), and RNA quantification (Figure 8(c)) analysis. Results showed that ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN were highly expressed in GAC tissues compared to adjacent normal tissues (Figure 8), consistent with the GEPIA expression data.
Figure 8

Validation of expression levels of ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN in GAC patients. To further validate the expression level in GAC patients, six genes were re-analyzed via immunohistochemistry (a), Western blot (b), and real-time PCR (c) analysis. Representative images of IHC staining were shown. Scale bar, 200 μm. Real-time PCR data were normalized to GAPDH expression. All six genes were highly expressed in GAC tissue compared to adjacent normal tissue.

4. Discussion

Gastric adenocarcinoma is a lethal malignance cancer. In this study, we applied bioinformatical methods on the basis of four gene expression profile datasets to identify more useful prognostic molecular markers in GAC. A total of 171 GAC specimens and 77 normal specimens were enrolled. First, we revealed a total of 69 commonly DEGs via GEO2R and Venn software (|log2FC| > 2 and adjust P value <0.05), including 20 upregulated and 49 downregulated DEGs. Second, GO and KEGG pathway enrichment analysis showed that 20 upregulated genes enriched in endodermal cell differentiation, protein heterotrimerization, ECM-receptor interaction, focal adhesion, protein digestion and absorption, PI3K-Akt signaling pathway, amoebiasis, and platelet activation, while 49 downregulated genes enriched in digestion, potassium ion import, oxidation-reduction process, bicarbonate transport, inward rectifier potassium channel activity, hydrogen:potassium-exchanging ATPase activity, gastric acid secretion, retinol metabolism, and metabolic pathways (P < 0.05). Third, DEGs PPI network complex of 44 nodes and 75 edges was constructed via Cytoscape software and prime module analysis identified 7 hub genes (ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, BGN, and SPP1), which were all upregulated genes and were significantly enriched in several cancer-related pathways. Furthermore, GEPIA analysis showed that all the seven hub genes were highly expressed in GAC tissues (P < 0.05). In addition, Kaplan–Meier plotter analysis showed that high expression of ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN had a significantly worse survival probability (P < 0.05), while SPP1 had no significance (P > 0.05). Finally, the 6 highly expressed core genes were validated via immunohistochemistry, Western blot, and RNA quantification analysis in tissue samples. Altogether, we identified six significant upregulated genes as poor prognosis markers in gastric adenocarcinoma via bioinformatical analysis, which could be potential new molecular markers and effective targets for early detection and further research. The hub genes in the main module of the PPI network of the commonly DEGs are mainly associated with protein digestion and absorption, ECM-receptor interaction, focal adhesion, PI3K-Akt signaling pathway, and amoebiasis. The family of collagen genes (CLO10A1, COL1A1, COL1A2, etc.) is tightly clustered and participates in the above cancer-related pathways. Furthermore, studies have demonstrated the close relation between collagen genes and gastric adenocarcinoma, including COL10A1, COL1A1, COL1A2, and COL8A1. What's more, it is well known that PI3K-Akt signaling pathway (COL1A2, COL1A1, etc.) plays a vital role in the cell cycle and is activated in various cancers, including GAC [26]. For ADAMTS2, a member of the ADAMTS family is a procollagen N-proteinase [27]. Researches have shown that ADAMTS2 participated in major biological pathways and human disorders [28], but the relation between ADAMTS2 and GAC has rarely been studied [27]. Furthermore, BGN, a key member of the small leucine-rich proteoglycan family, has been shown to participate in many cancers and is associated with poor prognosis in cancer patients, including gastric adenocarcinoma [29]. The results and related studies have provided solid evidence to prove the relation between the hub genes along with the enriched pathways and GAC. Expression and survival analysis have demonstrated that ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN are all highly expressed in GAC and their high expression has a significantly worse survival. Previous studies have also showed that the abnormal expression level of the six hub genes could be indicators of the initiation, progression, and clinical outcome of GAC. Till now, little is known about the exact mechanism of the six genes in GAC initiation and progression. In our study, we have provided more helpful information and direction for the future study of GAC via integrated bioinformatical methods, which would be new perspective and clues for early detection and diagnosis of GAC.

5. Conclusion

Altogether, our bioinformatics analysis study identified six upregulated DEGs (ADAMTS2, COL10A1, COL1A1, COL1A2, COL8A1, and BGN) between gastric adenocarcinoma and normal tissues based on four different microarray datasets. Results showed that these six genes were poor prognostic markers, which may play key roles in the initiation and progression of GAC. These data presented in this study may provide new perspectives and clues into the early detection and therapeutic targets of GAC. However, more experiments and details are needed to verify the prediction and underlying mechanisms in the near future.
  29 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

Review 2.  ADAMTS proteins in human disorders.

Authors:  Timothy J Mead; Suneel S Apte
Journal:  Matrix Biol       Date:  2018-06-06       Impact factor: 11.583

3.  Recent cancer survival in Europe: a 2000-02 period analysis of EUROCARE-4 data.

Authors:  Arduino Verdecchia; Silvia Francisci; Hermann Brenner; Gemma Gatta; Andrea Micheli; Lucia Mangone; Ian Kunkler
Journal:  Lancet Oncol       Date:  2007-09       Impact factor: 41.316

Review 4.  Gastric adenocarcinoma.

Authors:  Jaffer A Ajani; Jeeyun Lee; Takeshi Sano; Yelena Y Janjigian; Daiming Fan; Shumei Song
Journal:  Nat Rev Dis Primers       Date:  2017-06-01       Impact factor: 52.329

Review 5.  Screening for gastric cancer in Asia: current evidence and practice.

Authors:  Wai K Leung; Ming-shiang Wu; Yasuo Kakugawa; Jae J Kim; Khay-guan Yeoh; Khean Lee Goh; Kai-chun Wu; Deng-chyang Wu; Jose Sollano; Udom Kachintorn; Takuji Gotoda; Jaw-town Lin; Wei-cheng You; Enders K W Ng; Joseph J Y Sung
Journal:  Lancet Oncol       Date:  2008-03       Impact factor: 41.316

Review 6.  Global Cancer Incidence and Mortality Rates and Trends--An Update.

Authors:  Lindsey A Torre; Rebecca L Siegel; Elizabeth M Ward; Ahmedin Jemal
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2015-12-14       Impact factor: 4.254

7.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

8.  Identification of significant genes with poor prognosis in ovarian cancer via bioinformatical analysis.

Authors:  Hao Feng; Zhong-Yi Gu; Qin Li; Qiong-Hua Liu; Xiao-Yu Yang; Jun-Jie Zhang
Journal:  J Ovarian Res       Date:  2019-04-22       Impact factor: 4.234

Review 9.  Validation of miRNA prognostic power in hepatocellular carcinoma using expression data of independent datasets.

Authors:  Ádám Nagy; András Lánczky; Otília Menyhárt; Balázs Győrffy
Journal:  Sci Rep       Date:  2018-06-15       Impact factor: 4.379

Review 10.  Biomarkers of gastric cancer: Current topics and future perspective.

Authors:  Tasuku Matsuoka; Masakazu Yashiro
Journal:  World J Gastroenterol       Date:  2018-07-14       Impact factor: 5.742

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.