Literature DB >> 35117911

Identification of key biomarkers and potential signaling pathway associated with poor progression of gastric cancer.

Yangzhi Hu1,2, Zhili Hu2, Hui Ding1, Yuan Li2, Xiaoxu Zhao1, Mingtao Shao3, Yunlong Pan1.   

Abstract

BACKGROUND: We aimed to identify the key differentially expressed genes (DEGs) associated with poor prognosis in gastric cancer (GC) and to elucidate the underlying molecular mechanisms in order to provide a therapeutic target for this disease.
METHODS: The DEGs common in two datasets, GSE54129 and GSE79973, were screened. GO and KEGG enrichment analyses were then performed for these DEGs using DAVID's tool. STRING and the Cytoscope software were also used to analyze the protein-protein interaction (PPI) networks of the DEGs common between the two datasets.
RESULTS: A total of 164 common DEGs were identified from GSE79973 and GSE54129 datasets, 42 were up-regulated and 122 were down-regulated in GC. KEGG analysis demonstrated that up-regulated DEGs were mainly enriched for focal adhesion, ECM-receptor interaction, PI3K-Akt signaling pathway, protein digestion and absorption, and vascular smooth muscle contraction, while down-regulated DEGs were enriched for chemical carcinogenesis, metabolism of xenobiotics by cytochrome P450, drug metabolism-cytochrome P450, and retinol metabolism (P<0.05). Obtained PPI network for the 164 DEGs via Cytotype software, using MCODE app of Cytotype software we identified 13 hub genes. Twelve of these genes were found to be associated with poor prognosis in GC by survival analysis. Post validation by the GEPIA, Oncomine, and Human Protein Atlas databases, eight genes (COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, and FN1) were found to be up-regulated in GC tissues and correlated with poor prognosis of GC.
CONCLUSIONS: COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, and FN1 could serve as potential targets for GC diagnosis and prognosis. 2020 Translational Cancer Research. All rights reserved.

Entities:  

Keywords:  Gastric cancer (GC); Oncomine; bioinformatics analysis; differentially expressed genes (DEGs); gene expression omnibus (GEO)

Year:  2020        PMID: 35117911      PMCID: PMC8799033          DOI: 10.21037/tcr-20-926

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


Introduction

Gastric cancer (GC) is one of the most common cancers today, and the third-most common cause of cancer-related deaths (1). Most GC patients are diagnosed in the advanced stages of the disease because it is often asymptomatic in the early stages (2), and therefore, the prognosis is poor (3). However, the molecular mechanisms of GC initiation and development are still unclear (3), and it is necessary to further investigate these mechanisms. Gene expression omnibus (GEO) is a public and free database for storage and extraction of genomics data and currently stores 4,348 datasets, 115,586 series, and 3,146,641 samples (July 2019). We screen for differentially expressed genes (DEGs) in the GEO database to be able to explore molecular signals, correlate regulatory genes, and analyze protein-protein interaction (PPI) networks to ultimately obtain a deeper understanding of tumors. In recent years, there have been numerous studies based on the GEO database to discover DEGs in a variety of cancers. Tang et al. (4) and Jin et al. (5) used GEO datasets in their studies to obtain a deeper understanding of the molecular mechanisms involved in tumor formation and proliferation. In this study, we mined two GEO datasets to identify significant DEGs associated with poor GC prognosis and to elucidate the underlying mechanisms. We present the following article in accordance with the MDAR checklist (http://dx.doi.org/10.21037/tcr-20-926).

Methods

The two datasets used

We downloaded the data of GSE54129 and GSE79973 in gastric tumor tissues and healthy gastric tissues from the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) database. The GEO is a publicly functional genomics data repository with available tools to answer the users’ queries, download experiments, and curate the gene expression profiles. The two datasets are all based on GPL570 (HG-U133_Plus_2) Affymetrix Human Genome U133 Plus 2.0 Array and consist of gastric cancer samples and healthy gastric tissue samples. GSE54129 comprises the data of 111 cancer and 21 healthy tissue samples, and GSE79973 comprises the data of 10 cancer and 10 healthy tissue samples.

Identification of DEGs

We identified DEGs with fold change >2 and adjusted the P value <0.05 via the GEO website tool-GEO2R (6). Following this, the online Venn software was used to detect common DEGs from the raw data (7). The DEGs identified were defined as those that were up-regulated (log FC >2) or down-regulated (log FC <−2).

Gene ontology and KEGG analyses

DAVID’s tool can identify the functions of genes or proteins (8), and it was employed for gene ontology (GO) and KEGG analyses (P<0.05). GO analysis is used to identify genes and their RNA or protein products in order to determine unique biological properties from high-throughput transcriptomic or genomic data (9). KEGG is a database that deals with genomes, biological pathways, diseases, drugs, and chemical substances (10).

PPI networks and module analysis

The PPI information was evaluated by STRING (11). To examine the potential correlation between the identified DEGs, we imported the raw data to the Cytoscape software (12) and set the following parameters: maximum number of interactors =0 and confidence score ≥0.4. In addition, we checked the modules of the PPI network via the MCODE app in Cytoscape, with the following parameters: degree cutoff =2, maximum depth =100, k-core =2, and node score cutoff =0.2.

Survival analysis

The survival of GC patients expressing the core genes was analyzed using the Kaplan Meier-plotter (12), which was based on some public datasets (13). The P value and hazard ratio were computed with 95% confidence.

Determination of mRNA expression levels of hub genes

Oncomine and GEPIA databases were used to test the expression levels of the mRNAs of the hub genes in GC. Gene expression profiling interactive analysis (GEPIA v1.0) performs DEG analysis, correlation analysis, patient survival analysis, similar gene detection, and dimensionality reduction analysis based on the data from TCGA and GTEx (14). Oncomine (v4.5) was used to collect 729 gene expression datasets and the data of 86,733 samples. Using Oncomine, differential expression analysis and co-expression analysis can be performed to identify DEGs in a certain cancer and determine the target gene (15). In this study, we discovered the expression of eight core genes using GEPIA, with a threshold of P<0.05 and fold change =2, and using Oncomine, with a P value <1E-4, fold change =2, and gene rank =10%.

Determination of the protein expression levels of the hub genes

The human protein atlas database (HPA v18.1) provides abundant transcriptome and proteome data via immunohistochemistry and RNA-sequencing analyses (16). In this study, the protein expression levels of the core genes were determined by immunohistochemistry.

Results

DEGs of GC in the two GEO datasets

We used 121 cancer and 31 healthy tissue samples. Using GEO2R website tool, we identified 415 DEGs from GSE79973 and 768 DEGs from GSE54129, and these genes were plotted on a Volcano plot using software R (version 3.6.0) (). We used an online tool to produce a Venn diagram in order to extract the DEGs common between the two datasets. Finally, 164 common DEGs were detected. Of these, 42 were found to be up-regulated and 122 were found to be down-regulated genes in the GC tissue samples (, ).
Figure 1

Volcano plot. (A) Volcano plot of GSE79973; (B) volcano plot of GSE54129. Different colors represent different expression levels, green: down-regulated, red: up-regulated. |LogFC| >2, P<0.05.

Table 1

All commonly differentially expressed genes (DEGs) were detected from two profile datasets

DEGsGenes name
Up-regulatedPDPN, COL4A1, CAP2, IGF2BP3, SULF1, PI15, FAP, LY6E, RARRES1, THY1, INHBA, PDLIM7, COL6A3, SPP1, CRISPLD1, COL1A1, MIR675///H19, COL10A1, SFRP4, SPARC, FNDC1, COL11A1, HOXC6, CEMIP, CTHRC1, THBS1, TIMP1, NRP2, THBS2, BGN, COL1A2, CST1, MFAP2, ADAMTS2, WISP1, COL8A1, CXCL8, COL12A1, FN1, PRRX1, ASPN, SPOCK1
Down-regulatedSMIM24, CAPN8, LYPD6B, SH3RF2, CNTN3, MGAM, LIPF, GSTA1, STYK1, TRIM74///TRIM73, S100P, XK, PROM2, KLHDC7A, CAPN13, FBP2, BTNL8, AKR1B10, SLC28A2, CYP2C19, AADAC, IGH, ADAM28, APOBEC1, B4GALNT3, CYP2C18, ALDH3A1, ATP4A, LOC101930400///AKR1C2, PCAT18, UGT2B15, SCIN, LINC00992, KRT20, KIAA1324, GKN1, HRASLS2, ADGRG2, RDH12, GIF, SMPD3, CA2, LTF, STX19, GATA5, ATP4B, MAL, BCAS1, SULT1C2, FCGBP, LINC00675, CAPN9, ATP13A4, SLC26A9, PKIB, ADH1A, SMIM6, ESRRG, AKR7A3, PBLD, ADTRP, VSTM2A, VILL, SSTR1, RFX6, ACER2, LRRC66, KAZALD1, RNASE1, MFSD4A, STS, CYP3A5, LINC01133, GC, RAB27B, ACKR4, FA2H, PLLP, DPCR1, ADH7, HHIP, VSIG1, PGC, AKR1C1, UPK1B, DDX60, KCNE2, SOSTDC1, TPCN2, TPH1, CA9, AMPD1, LOC643201, MUC5AC, VSIG2, ADH1C, CYP2C9, GATA6-AS1, SGK2, PIK3C2G, SPINK7, HEPACAM2, TMED6, AXDND1, SCNN1B, LINC00982, ANG, HPGD, PSAPL1, CWH43, KCNJ16, KCNJ15, SLC26A7, PGA4///PGA3///PGA5, LOC101930400///AKR1C2///AKR1C1, SULT1B1, RASSF6, OASL, GKN2, JCHAIN, CXCL17, HAPLN1
Figure 2

The common differentially expressed genes in the two datasets (GSE79973, GSE54129). Different colors represent different datasets. (A) Up-regulated differentially expressed genes in the two datasets (logFC >2, P<0.05). (B) Down-regulated differentially expressed genes in the two datasets (logFC >−2, P<0.05).

Volcano plot. (A) Volcano plot of GSE79973; (B) volcano plot of GSE54129. Different colors represent different expression levels, green: down-regulated, red: up-regulated. |LogFC| >2, P<0.05. The common differentially expressed genes in the two datasets (GSE79973, GSE54129). Different colors represent different datasets. (A) Up-regulated differentially expressed genes in the two datasets (logFC >2, P<0.05). (B) Down-regulated differentially expressed genes in the two datasets (logFC >−2, P<0.05).

GO and KEGG analyses

All 164 DEGs were annotated using the DAVID online analysis tool. Results showed that: (I) in biological processes, up-regulated DEGs were mainly enriched for endodermal cell differentiation, cell adhesion, collagen fibril organization, negative regulation of angiogenesis, and negative regulation of endothelial cell proliferation, while down-regulated DEGs were enriched for regulation of cell proliferation, potassium ion import, myelination, regulation of intracellular pH, and secretion; (II) in cellular components, up-regulated DEGs were significantly enriched for the proteinaceous extracellular matrix, extracellular space, collagen trimer, and extracellular matrix, while down-regulated DEGs were enriched for the extracellular exosome, integral component of plasma membrane, and extracellular space; (III) for molecular function, up-regulated DEGs were mainly involved in extracellular matrix binding, extracellular matrix structural constituent, and heparin binding, while down-regulated DEGs were involved in iron ion binding, inward rectifier potassium channel activity, and ribonuclease A activity (). KEGG analysis demonstrated that up-regulated DEGs were mainly enriched for focal adhesion, ECM-receptor interaction, PI3K-Akt signaling pathway, protein digestion and absorption, and vascular smooth muscle contraction, while down-regulated DEGs were enriched for chemical carcinogenesis, metabolism of xenobiotics by cytochrome P450, drug metabolism-cytochrome P450, and retinol metabolism (P<0.05) ().
Table 2

Gene ontology analysis of differentially expressed genes in gastric cancer

ExpressionCategoryTermCount%P valueFDR
Up-regulatedGOTERM_BP_DIRECTGO:0035987: endodermal cell differentiation50.084E-070.00053
GOTERM_BP_DIRECTGO:0007155: cell adhesion60.092.44E-050.032387
GOTERM_BP_DIRECTGO:0030199: collagen fibril organization40.065.04E-050.066786
GOTERM_BP_DIRECTGO:0016525: negative regulation of angiogenesis40.060.0001660.220091
GOTERM_BP_DIRECTGO:0001937: negative regulation of endothelial cell proliferation30.040.0012491.642253
GOTERM_CC_DIRECTGO:0005578: proteinaceous extracellular matrix130.194.06E-154.04E-12
GOTERM_CC_DIRECTGO:0005615: extracellular space140.215.60E-085.51E-05
GOTERM_CC_DIRECTGO:0005581: collagen trimer60.096.88E-086.77E-05
GOTERM_CC_DIRECTGO:0031012: extracellular matrix40.060.0020421.990259
GOTERM_MF_DIRECTGO:0050840: extracellular matrix binding40.062.15E-050.020885
GOTERM_MF_DIRECTGO:0005201: extracellular matrix structural constituent40.069.22E-050.089555
GOTERM_MF_DIRECTGO:0008201: heparin binding40.060.0011221.085096
Down-regulatedGOTERM_BP_DIRECTGO:0042127: regulation of cell proliferation60.040.0015622.065131
GOTERM_BP_DIRECTGO:0010107: potassium ion import30.020.0077339.844791
GOTERM_BP_DIRECTGO:0042552: myelination30.020.01466117.89412
GOTERM_BP_DIRECTGO:0051453: regulation of intracellular pH30.020.01645919.87216
GOTERM_CC_DIRECTGO:0070062: extracellular exosome220.150.0066566.917043
GOTERM_CC_DIRECTGO:0005887: integral component of plasma membrane110.080.01541815.36148
GOTERM_CC_DIRECTGO:0005615: extracellular space120.080.01595515.85574
GOTERM_MF_DIRECTGO:0005506: iron ion binding70.051.27E-040.150032
GOTERM_MF_DIRECTGO:0005242: inward rectifier potassium channel activity30.020.0022362.613104
GOTERM_MF_DIRECTGO:0004522: ribonuclease A activity20.010.01640617.77084
Table 3

KEGG pathway analysis of differentially expressed genes in gastric cancer

ExpressionPathway IDNameCount%P valueGenes
Uphsa04510Focal adhesion260.046.22E-11 TLN1, TNC, MYL9, COMP, COL6A3, COL6A2, COL6A1, ZYX, THBS1, COL11A1, THBS2, PIK3R1, SPP1, THBS4, FN1, COL4A2, COL4A1, IGF1, FLNA, VEGFC, ITGA5, FYN, ITGA7, COL1A2, COL1A1, MYLK
hsa04512ECM-receptor interaction170.034.52E-10 COL4A2, COL4A1, TNC, ITGA5, COMP, ITGA7, COL6A3, COL6A2, COL1A2, COL6A1, COL1A1, THBS1, THBS2, COL11A1, SPP1, FN1, THBS4
hsa04151PI3K-Akt signalling pathway260.042.14E-06 MCL1, OSMR, TNC, BCL2L1, IL7R, COMP, COL6A3, COL6A2, COL6A1, IL2RG, THBS1, THBS2, COL11A1, PIK3R1, SPP1, FN1, THBS4, COL4A2, COL4A1, IGF1, YWHAE, VEGFC, ITGA5, ITGA7, COL1A2, COL1A1
hsa04974Protein digestion and absorption120.021.37E-05 COL4A2, COL14A1, COL4A1, COL6A3, ELN, COL1A2, COL6A2, COL12A1, COL6A1, COL1A1, COL11A1, COL10A1
hsa04270Vascular smooth muscle contraction120.021.95E-04 EDNRA, GNA13, ACTG2, ACTA2, CALD1, PLA2G2A, GUCY1A3, GUCY1B3, CALCRL, MYLK, KCNMB1, MYL9
Downhsa05204Chemical carcinogenesis160.021.79E-08 GSTA1, CYP3A4, CYP3A5, GSTA3, SULT2A1, CYP2C19, CYP2C18, CYP2C9, CYP2C8, ADH1C, ADH1A, ADH7, CYP2E1, ALDH3A1, CBR1, UGT2B15
hsa00980Metabolism of xenobiotics by cytochrome P450150.024.83E-08 GSTA1, CYP3A4, CYP3A5, GSTA3, SULT2A1, CYP2C9, ADH1C, ADH1A, ADH7, CYP2E1, ALDH3A1, CBR1, AKR7A3, UGT2B15, AKR1C1
hsa00982Drug metabolism-cytochrome P450140.021.30E-07 GSTA1, CYP3A4, CYP3A5, GSTA3, CYP2C19, CYP2C9, CYP2C8, ADH1C, ADH7, ADH1A, CYP2E1, ALDH3A1, FMO5, UGT2B15
hsa00830Retinol metabolism130.025.01E-07 CYP3A4, CYP3A5, CYP2C18, CYP2C9, CYP2C8, ADH1C, DHRS9, ADH7, ADH1A, RDH12, ALDH1A1, SDR16C5, UGT2B15

PPI network and modular analysis

The 164 DEGs were imported into Cytotype software to obtain a PPI network which included 109 nodes and 269 edges (). Using Cytotype MCODE to carry out an in-depth analysis, we identified 13 central nodes among the 109 nodes, all of which corresponded to up-regulated genes ().
Figure 3

PPI network of the differentially expressed genes. The nodes indicate proteins; the edges indicate the interaction between proteins.

PPI network of the differentially expressed genes. The nodes indicate proteins; the edges indicate the interaction between proteins.

Survival analysis of core genes

To evaluate the survival data for the 13 core genes, we used the Kaplan Meier-plotter. This revealed that 12 of the genes had a significantly worse survival rate while data for THBS1 was not significant (P<0.05, ).
Figure 4

Prognostic information of the 12 core genes. Red: high expression; black: low expression. (A) BGN; (B) COL1A1; (C) COL1A2; (D) COL4A1; (E) COL6A3; (F) COL11A1; (G) COL12A1; (H) FN1; (I) SPP1; (J) SPARC; (K) THBS2; (L) TIMP1.

Prognostic information of the 12 core genes. Red: high expression; black: low expression. (A) BGN; (B) COL1A1; (C) COL1A2; (D) COL4A1; (E) COL6A3; (F) COL11A1; (G) COL12A1; (H) FN1; (I) SPP1; (J) SPARC; (K) THBS2; (L) TIMP1.

mRNA expression levels of hub genes

mRNA levels of the 13 hub genes were evaluated in cancer and healthy tissue samples via GEPIA. This revealed that 12 of these genes (all except THBS1) were highly expressed in GC specimen in contrast to normal gastric samples (P<0.05, ).
Figure 5

Significantly expressed genes in gastric cancer patients compared to healthy individuals. Red: tumor tissue; grey: normal tissues. (A) BGN; (B) COL1A1; (C) COL1A2; (D) COL4A1; (E) COL6A3; (F) COL11A1; (G) COL12A1; (H) FN1; (I) SPP1; (J) SPARC; (K) THBS2; (L) TIMP1 (*, P<0.05).

Significantly expressed genes in gastric cancer patients compared to healthy individuals. Red: tumor tissue; grey: normal tissues. (A) BGN; (B) COL1A1; (C) COL1A2; (D) COL4A1; (E) COL6A3; (F) COL11A1; (G) COL12A1; (H) FN1; (I) SPP1; (J) SPARC; (K) THBS2; (L) TIMP1 (*, P<0.05).

KEGG pathway enrichment re-analysis the hub genes

To obtain enrichment pathway information related to the 12 selected DEGs, we re-analyzed KEGG pathway enrichment using the DAVID online analysis tool. This revealed that eight of the genes (COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, and FN1) were enriched for the ECM-receptor interaction pathway (P=1.6E-12, , ).
Table 4

Re-analysis of 12 selected genes via KEGG pathway enrichment

Pathway IDNameCountPercentageP valueGenes
cfa04512ECM-receptor interaction80.411.63E-12 COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, FN1
cfa04510Focal adhesion80.417.89E-10 COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, FN1
cfa04151PI3K-Akt signalling pathway80.412.40E-08 COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, FN1
cfa04974Protein digestion and absorption60.303.21E-08 COL4A1, COL6A3, COL1A2, COL12A1, COL1A1, COL11A1
cfa05146Amoebiasis50.256.82E-06 COL4A1, COL1A2, COL1A1, COL11A1, FN1
cfa04611Platelet activation30.150.011596 COL1A2, COL1A1, COL11A1
Figure 6

KEGG pathway enrichment re-analysis of the eight hub genes (COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, FN1). Red star: hub genes.

KEGG pathway enrichment re-analysis of the eight hub genes (COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, FN1). Red star: hub genes.

Hub gene expression in cancer tissues

mRNA expression levels of the eight core DEGs were analyzed via Oncomine databases shown in . Protein expression of the eight core DEGs was analyzed in human GC tissue samples using The Human Protein Atlas (). Three proteins COL4A1, COL6A3, and FN1 () were expressed at low levels in both GC and healthy gastric tissue, and three proteins COL1A2, COL1A1, and THBS2 () showed medium expression levels in both. Only SPP1 () showed differential expression between GC and healthy gastric tissue samples (, ).
Figure 7

The hub genes expression in gastric cancer tissues vs. healthy gastric tissues. Red: up-regulation, blue: down-regulation.

Figure 8

The hub genes protein expression in gastric cancer tissues. Images were taken from the Human Protein Atlas (http://www.proteinatlas.org) online database (HE, ×4). (A) COL1A1; (B) COL1A2; (C) COL4A1; (D) COL6A3; (E) FN1; (F) SPP1; (G) THBS2.

Table 5

Eight DEGs protein expression in human gastric cancer tissues and normal tissues

Gene nameStaining
Normal tissueCancer tissue
HighMediumLowNot detectedHighMediumLowNot detected
COL4A1 000601/121/1210/12
COL6A3 003/63/611/11
COL1A2 03/30005/82/81/8
COL1A1 0/113/113/115/111/203/204/1011/20
THBS2 05/50006/103/101/10
COL11A1 NANANANANANANANA
SPP1 2/119/117/224/2211/22
FN1 3/129/125/185/188/18

NA, not applicable.

The hub genes expression in gastric cancer tissues vs. healthy gastric tissues. Red: up-regulation, blue: down-regulation. The hub genes protein expression in gastric cancer tissues. Images were taken from the Human Protein Atlas (http://www.proteinatlas.org) online database (HE, ×4). (A) COL1A1; (B) COL1A2; (C) COL4A1; (D) COL6A3; (E) FN1; (F) SPP1; (G) THBS2. NA, not applicable.

Discussion

GC is the fifth most frequent cancer and shows the third highest cancer-related mortality in the world (17). According to statistics, about 1,033,701 new GC cases occurred in 2018, with 782,685 resulting in death (18). The majority of GC cases are diagnosed in advanced stages, resulting in a relatively poor prognosis for survival (19). Therefore, it is extremely important to identify sensitive markers to improve the diagnosis and prognosis of GC. To identify effective prognostic biomarkers for GC, we used bioinformatics to analyze two datasets (GSE79973 and GSE54129). Through a variety of methods and tools, we finally identified that eight genes (COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1, and FN1) were associated with poor prognosis of GC, all of which were enriched for the ECM-receptor interaction pathway. SPP1 or secreted phosphoprotein 1, containing six introns and seven exons, is located on chromosome four. SPP1 participates in pathological processes such as tumorigenesis, invasion, and metastasis (20) and is highly expressed in many cancer tissues (21-23), with tumor progression promoted by SPP1 overexpression (24). In colorectal cancer (CRC) cells, up-regulated SPP1 expression accelerates proliferation and enhances invasion (25). However, when SPP1 expression is down-regulated, tumor growth is suppressed (26,27). SPP1 affects tumor cell metabolism via the PI3K/AKT signaling pathway. Silencing the SPP1 gene inhibits the AKT pathway, thereby preventing the growth of mouse ovarian cancer (28). Additionally, SPP1 is considered a prognostic biomarker for renal cancer (23). Another study demonstrated that the higher the levels of SPP1, the poorer the prognosis of GC (29). Significant research is being carried out on SPP1 and broadening its role in GC. Many studies have demonstrated that members of the fibrillar collagen family play a key role in various cancers. Collagen type I consists of COL1A1 and COL1A2 (30), which is the most abundant collagen in the human body (31). Some studies have shown that COL1 is a tumor-related gene (32,33). COL1A1 and COL1A2 mRNAs are overexpressed in GC and other cancer tissues (34,35). COL1A1 participates in tumor proliferation, migration, and invasion (36). Furthermore, up-regulation of COL1A1 expression contributes to cisplatin resistance in ovarian cancer cells (37). Collagen type IV is most abundant in basement membranes (BMs) (38). COL4A1 is up-regulated in bladder cancer cells, promoting tumor invasion (38). Overexpression of COL4A1 contributes to proliferation in breast cancer cells (39). COL4A1 has also been considered to be a biomarker for the prognosis of intrahepatic cholangiocarcinoma (40). Both COL1A1 (37) and COL4A1 (41) were shown to be associated with chemotherapy resistance. COL6A3, expressed in stromal cancer-associated fibroblasts, is an independent prognostic factor in some cancers. Knockout of the COL6A3 gene in CRC cells decreases proliferation, invasion, and migration (42). COL11A was also confirmed play a role in proliferation, migration, and invasion of GC (43). Thrombospondin 2 (THBS2) is a member of the Ca2+-binding glycoprotein family, and plays a critical role in some cancers (44,45). Many studies have indicated that THBS2 is related to tumor prognosis. Sun et al. (46) found that higher THBS2 levels in GC were correlated with better prognosis; however, patients with lower THBS2 mRNA expression show a higher histological grade of malignancy. Another study on colon cancer yielded similar results; higher expression of THBS2 led to a significantly lower metastasis rate (47). THBS2 may be exert its effects by inhibiting the process of tumor angiogenesis (48).

Conclusions

COL4A1, COL6A3, COL1A2, COL1A1, THBS2, COL11A1, SPP1 and FN1 were identified from two datasets, which associated with the poor prognosis of GC. Bioinformatic analysis revealed that these genes are effective and reliable molecular biomarkers for the diagnosis and prognosis of GC, providing a new and potential therapeutic target for GC. The limitations in our study should be mentioned, the crucial roles of these hub genes in GC were only based on public databases theoretical predication. Further research is required to substantiate the findings of the present study.
  48 in total

1.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

2.  GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor.

Authors:  Sean Davis; Paul S Meltzer
Journal:  Bioinformatics       Date:  2007-05-12       Impact factor: 6.937

3.  Suppression of tumor growth in lung cancer xenograft model mice by poly(sorbitol-co-PEI)-mediated delivery of osteopontin siRNA.

Authors:  Won-Young Cho; Seong-Ho Hong; Bijay Singh; Mohammad Ariful Islam; Somin Lee; Ah Young Lee; Nomundelger Gankhuyag; Ji-Eun Kim; Kyeong-Nam Yu; Kwang-Ho Kim; Young-Chan Park; Chong-Su Cho; Myung-Haing Cho
Journal:  Eur J Pharm Biopharm       Date:  2015-06-30       Impact factor: 5.571

Review 4.  RNA protein interactions governing expression of the most abundant protein in human body, type I collagen.

Authors:  Branko Stefanovic
Journal:  Wiley Interdiscip Rev RNA       Date:  2013-05-28       Impact factor: 9.957

5.  Outcome of gastric cancer patients after successful gastrectomy: influence of the type of recurrence and histology on survival.

Authors:  Pooja R Rohatgi; James C Yao; Kenneth Hess; Isac Schnirer; Asif Rashid; Paul F Mansfield; Peter W Pisters; Jaffer A Ajani
Journal:  Cancer       Date:  2006-12-01       Impact factor: 6.860

6.  Identification of SPP1 as a promising biomarker to predict clinical outcome of lung adenocarcinoma individuals.

Authors:  Shicheng Li; Ronghua Yang; Xiao Sun; Shuncheng Miao; Tong Lu; Yuanyong Wang; Yang Wo; Wenjie Jiao
Journal:  Gene       Date:  2018-09-18       Impact factor: 3.688

7.  GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses.

Authors:  Zefang Tang; Chenwei Li; Boxi Kang; Ge Gao; Cheng Li; Zemin Zhang
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

8.  Identification of significant genes with poor prognosis in ovarian cancer via bioinformatical analysis.

Authors:  Hao Feng; Zhong-Yi Gu; Qin Li; Qiong-Hua Liu; Xiao-Yu Yang; Jun-Jie Zhang
Journal:  J Ovarian Res       Date:  2019-04-22       Impact factor: 4.234

9.  Thrombospondin 2 expression is correlated with inhibition of angiogenesis and metastasis of colon cancer.

Authors:  T Tokunaga; M Nakamura; Y Oshika; Y Abe; Y Ozeki; Y Fukushima; H Hatanaka; S Sadahiro; H Kijima; T Tsuchida; H Yamazaki; N Tamaoki; Y Ueyama
Journal:  Br J Cancer       Date:  1999-01       Impact factor: 7.640

10.  TCEA3 Attenuates Gastric Cancer Growth by Apoptosis Induction.

Authors:  Jia Li; Yin Jin; Shuang Pan; Yina Chen; Kaijing Wang; Chunjing Lin; Shuqing Jin; Jiansheng Wu
Journal:  Med Sci Monit       Date:  2015-10-26
View more
  1 in total

1.  Bioinformatics analysis of potential Key lncRNA-miRNA-mRNA molecules as prognostic markers and important ceRNA axes in gastric cancer.

Authors:  Siqi Tang; Keyong Liao; Yongpeng Shi; Tingting Tang; Beibei Cui; Zunnan Huang
Journal:  Am J Cancer Res       Date:  2022-05-15       Impact factor: 5.942

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.