Literature DB >> 30948703

Identification of Key Genes and Circular RNAs in Human Gastric Cancer.

Shuhong Hao1, Junfeng Lv2, Qiwei Yang3, Ao Wang4, Zhaoyan Li4, Yuchen Guo5, Guizhen Zhang3,4.   

Abstract

BACKGROUND Globally, gastric cancer (GC) is the third most common source of cancer-associated mortality. The aim of this study was to identify key genes and circular RNAs (circRNAs) in GC diagnosis, prognosis, and therapy and to further explore the potential molecular mechanisms of GC. MATERIAL AND METHODS Differentially expressed genes (DEGs) and circRNAs (DE circRNAs) between GC tissues and adjacent non-tumor tissues were identified from 3 mRNA and 3 circRNA expression profiles. Functional analyses were performed, and protein-protein interaction (PPI) networks were constructed. The significant modules and key genes in the PPI networks were identified. Kaplan-Meier analysis was performed to evaluate the prognostic value of these key genes. Potential miRNA-binding sites of the DE circRNAs and target genes of these miRNAs were predicted and used to construct DE circRNA-miRNA-mRNA networks. RESULTS A total of 196 upregulated and 311 downregulated genes were identified in GC. The results of functional analysis showed that these DEGs were significantly enriched in a variety of functions and pathways, including extracellular matrix-related pathways. Ten hub genes (COL1A1, COL3A1, COL1A2, COL5A2, FN1, THBS1, COL5A1, SPARC, COL18A1, and COL11A1) were identified via PPI network analysis. Kaplan-Meier analysis revealed that 7 of these were associated with a poor overall survival in GC patients. Furthermore, we identified 2 DE circRNAs, hsa_circ_0000332 and hsa_circ_0021087. To reveal the potential molecular mechanisms of circRNAs in GC, DE circRNA-microRNA-mRNA networks were constructed. CONCLUSIONS Key candidate genes and circRNAs were identified, and novel PPI and circRNA-microRNA-mRNA networks in GC were constructed. These may provide useful information for the exploration of potential biomarkers and targets for the diagnosis, prognosis, and therapy of GC.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30948703      PMCID: PMC6463957          DOI: 10.12659/MSM.915382

Source DB:  PubMed          Journal:  Med Sci Monit        ISSN: 1234-1010


Background

Globally, gastric cancer (GC) is the third most common cause of cancer-associated mortality [1]. In the United States, an estimated 26 240 cases of gastric cancer were diagnosed in 2018, and 10 800 patients died of the disease [2]. Because of the lack of specific signs of early GC, most patients (>70%) are diagnosed with advanced-stage GC, after the best opportunity for treatment [3]. Although considerable advancements in the diagnosis and treatment of GC have been achieved, the prognosis of GC patients remains poor, particularly for those with advanced disease [4]. Therefore, a deeper understanding of the mechanisms involved in GC progression and identification of potential biomarkers and targets for the diagnosis, prognosis, and therapy of GC are urgently needed. Bioinformatics analyses, including the analysis of gene interaction networks, microarray expression profiles, and gene annotations, are being utilized as powerful tools for identifying potential diagnostic and treatment-relevant biomarkers of cancers [5]. For instance, by analyzing data from the Gene Expression Omnibus (GEO) database, Cao et al. identified 5 genes (COL1A2, COL1A1, COL4A1, THBS2, and ITGA5) as potential biomarkers and therapeutic targets for GC [6]. In addition, by analyzing data from GEO and The Cancer Genome Atlas (TCGA), Zhu et al. found that high expression of CDK1 is a prognostic factor for hepatocellular carcinoma (HCC), making it a potential therapeutic target and biomarker for the diagnosis of HCC [7]. In particular, the method of integrated bioinformatics analysis can be used to overcome inaccuracies in sequencing due to small sample sizes [8]. Circular RNAs (circRNAs) are a novel class of endogenous non-coding RNAs that form a covalently closed continuous loop by back-splicing events via exon or intron circularization [9,10]. Owing to the development of high-throughput sequencing, researchers have discovered thousands of circRNAs, and emerging evidence has shown that circRNAs are involved in the progression of oncogenesis, invasion, and metastasis by regulating microRNAs (miRNAs) [10]. However, at present, most studies involving circRNAs have been limited to the sequencing of a few samples [11] or exploring the biological function of a single circRNA [12]. To the best of our knowledge, no researchers have yet used integrated analysis of multiple datasets to investigate GC-related circRNAs. In this study, differentially expressed genes (DEGs) between human GC tissues and adjacent non-tumor tissues were identified via analysis of public GEO datasets. Next, to explore the roles of these genes, functional enrichment analyses and pathway enrichment analyses were performed. Then, protein–protein interaction (PPI) networks were successfully constructed. The key genes and significant modules in the networks were identified. Kaplan-Meier analysis was performed to evaluate the prognostic value of these key genes. Furthermore, 3 additional circRNA expression profiles were analyzed to identify differentially expressed circRNAs (DE circRNAs) between GC and adjacent non-tumor tissues. Finally, DE circRNA–miRNA–mRNA networks were constructed. The present study provides useful information for the exploration of potential biomarkers and targets for the diagnosis, prognosis, and therapy of GC.

Material and Methods

Microarray data source

Microarray data were obtained from the GEO database (), a free database containing microarray profiles and next-generation sequencing data. Three mRNA expression profiles (GSE19826, GSE54129, and GSE79973), including data from 133 GC tissues and 46 adjacent non-tumor tissues, were downloaded to identify DEGs, while 3 circRNA expression profiles (GSE83521, GSE89143, and GSE93541), including data from 12 GC samples and 12 normal gastric samples, were downloaded to select DE circRNAs. Details for each dataset are provided in Table 1.
Table 1

Expression profile information.

SeriesTypeFirst authorPublication yearCountryPlatformSampleNumber of samples (normal/tumor)
GSE19826mRNAWang Q2010ChinaGPL570Stomach27 (15/12)
GSE54129mRNALiu B2017ChinaGPL570Stomach132 (21/111)
GSE79973mRNAShao Q2016ChinaGPL570Stomach20 (10/10)
GSE83521circRNAZhang Y2017ChinaGPL19978Stomach12 (6/6)
GSE89143circRNAShao Y2017ChinaGPL19978Stomach6 (3/3)
GSE93541circRNAGuo J2017ChinaGPL19978Stomach6 (3/3)

Identification of DEGs and DE circRNAs

The GEO2R tool () was used to identify DEGs and DE circRNAs between GC and adjacent non-tumor tissues. Values of |log2 fold change| (|log2FC|) >1 and p<0.05 were set as cut-off criteria. Venn diagrams were constructed to determine intersections via VENNY 2.1 software () [13].

GO analysis and KEGG analysis

GO analysis is widely used to provide gene annotation terms [14], and the KEGG database is widely used for pathway enrichment analysis [15]. In the present study, an online tool, the Database for Annotation, Visualization and Integration Discovery (DAVID 6.8) (), was used to perform GO analysis [16]. KEGG analysis was performed using the online tool KOBAS 3.0 () [17], with 4 databases, including KEGG Pathway (), Reactome (), BioCyc (), and PANTHER (), were used in the analysis. A value of p<0.05 was selected as the threshold for significant GO and KEGG terms.

Construction of PPI and DE circRNA–miRNA–mRNA networks

Search Tool for the Retrieval of Interacting Genes (STRING) () was used to construct PPI networks [18]. A combined interaction score >0.5 was used as the cut-off criterion for the extraction of PPI pairs. Cytoscape software version 3.6.0 () was used for visualization of the PPI and DE circRNA–miRNA–mRNA networks [19]. The cytoHubba plug-in was used to screen hub genes, with a cut-off of degree ≥20. The MCODE plug-in was used to identify significant modules of the PPI networks in Cytoscape, with cut-off criteria of degree ≥5 and k-core ≥4.

Validation of gene expression

UALCAN (), an online tool used to analyze gene expression data from The Cancer Genome Atlas (TCGA) database (including 34 normal gastric tissues and 415 primary tumors), was used to validate the expression of hub genes in the PPI networks. P<0.05 was considered to indicate a statistically significant difference [20],

Survival analysis

The Kaplan-Meier plotter database (), which is an online tool used to evaluate the prognostic values of genes in breast, ovarian, lung, and gastric cancer patients, was applied to analyze the associations between the hub genes in the PPI networks and overall survival [21]. Differences in the survival rates were statistically analyzed between patients with high and low expression levels of hub genes. The hazard ratio (HR) and its 95% confidence intervals were calculated. P<0.05 was considered to indicate a statistically significant difference.

Prediction of circRNA and miRNA targets

The miRNA-binding sites of DE circRNAs were predicted using the Circular RNA Interactome online tool () [22]. Possible targets of miRNAs were predicted using the miRWalk 2.0 online tool (), with a “predicted target module” [23]. Genes that appeared in 4 (miRWalk, miRanda, miRDB, and Targetscan) of the 12 software programs from miRWalk v.2.0 were considered as possible target genes. Then, comparison of the list of potential target genes with the list of DEGs identified from the 3 gene expression profiles (GSE19826, GSE54129, and GSE79973) produced an intersection of genes used to construct the DE circRNA–miRNA–mRNA networks.

Results

Identification of DEGs in GC

Three mRNA expression profiles including data from GC and adjacent non-tumor tissues were obtained from the NCBI GEO database (Table 1). In total, 2949, 3675, and 3251 DEGs from the GSE19826, GSE54129, and GSE79973 datasets, respectively, were extracted using GEO2R and met the thresholds of |log2FC| >1.0 and p<0.05 (Supplementary Table 1, Figure 1). By comparing the DEGs from the 3 cohort profile datasets, an intersection of 507 common DEGs was obtained, including 196 genes that were upregulated and 311 that were downregulated in GC tissues compared to levels in adjacent non-tumor tissues (Figure 2). A list of all 507 DEGs is provided in Supplementary Table 2.
Figure 1

DEGs between GC and adjacent non-tumor tissues. (A) Volcano plot for DEGs in GSE19826. (B) Volcano plot for DEGs in GSE54129. (C) Volcano plot for DEGs in GSE79973. Red: log (FC) >1, p<0.05; green: log (FC) <1, p<0.05). DEG – differentially expressed gene; GC – gastric cancer; FC – fold change.

Figure 2

Venn diagram of DEGs in the 3 cohort profile datasets (GSE19826, GSE54129, and GSE79973). (A) Upregulated DEGs. (B) Downregulated DEGs. Different color areas represent different datasets. The overlapping areas are the common DEGs. DEG – differentially expressed gene.

GO analysis and KEGG analysis of DEGs

DEGs were functionally classified into the BP, CC, and MF categories. In the BP category, the top 3 most enriched terms were “collagen fibril organization”, “cell adhesion”, and “angiogenesis”. In the CC category, the top 3 most enriched terms were “extracellular space”, “proteinaceous extracellular matrix”, and “collagen trimer”. In the MF category, the top 3 most enriched terms were “extracellular matrix structural constituent”, “heparin binding”, and “extracellular matrix binding” (Supplementary Table 3, Figure 3A–3C). Moreover, the top 3 most enriched terms in the KEGG analysis were “extracellular matrix organization”, “degradation of the extracellular matrix”, and “collagen formation” (Supplementary Table 4, Figure 3D).
Figure 3

GO annotation and pathway enrichment analysis. (A) Top 10 terms in BP category. (B) Top 10 terms in CC category. (C) Top 10 terms in MF category. (D) Top 10 terms in pathway enrichment analysis. GO – gene ontology; DEG – differentially expressed gene; BP – biological process; CC – cellular component; MF – molecular function.

PPI network construction and modular analysis

A total of 253 DEGs (122 upregulated and 131 downregulated) among the 507 common DEGs were used to construct the PPI networks, which included 253 nodes and 588 edges (Figure 4). Using the cytoHubba plug-in, 10 genes – COL1A1, COL3A1, COL1A2, COL5A2, FN1, THBS1, COL5A1, SPARC, COL18A1, and COL11A1 – were confirmed as hub genes (Table 2, Figure 5A). All of the hub genes were upregulated in the GC samples compared to levels in normal gastric samples. We further verified the expression of the 10 hub genes in the TCGA stomach adenocarcinoma database using UALCAN. The results showed that the expression levels of 9 of the 10 genes (all except THBS1), were significantly higher in primary tumors than those in the normal stomach tissues of GC patients from TCGA, suggesting that our results are highly credible (Supplementary Figure 1). Using the MCODE plug-in, 3 significant modules were screened from the PPI networks (Figure 5B–5D). KEGG enrichment analysis showed that Module 1 was mainly associated with collagen metabolism, including collagen biosynthesis and modifying enzymes, collagen formation, collagen degradation, and assembly of collagen fibrils. Module 2 was mainly associated with platelet function, including platelet degranulation; response to elevated platelet cytosolic Ca2+; and platelet activation, signaling, and aggregation. Module 3 was mainly associated with chemical carcinogenesis, – O linked glycosylation, metabolic pathways, and biological oxidation (Figure 5E, Supplementary Table 5).
Figure 4

PPI networks of the DEGs. The network contains 253 nodes and 588 edges. Network nodes represent proteins (shown with gene names). The color of each node denotes the expression of genes in the GC samples compared to that in non-tumor samples (pink represents upregulated and blue represents downregulated). PPI – protein–protein interaction; DEG – differentially expressed gene.

Table 2

Hub genes in the protein–protein interaction networks identified by cytoHubba.

Gene symbolOfficial full nameDegreeExpression in GC
COL1A1Collagen type I alpha 1 chain32Up-regulation
COL3A1Collagen type III alpha 1 chain32Up-regulation
COL1A2Collagen type I alpha 2 chain31Up-regulation
COL5A2Collagen type V alpha 2 chain23Up-regulation
FN1Fibronectin 123Up-regulation
THBS1Thrombospondin 122Up-regulation
COL5A1Collagen type V alpha 1 chain22Up-regulation
SPARCSecreted protein acidic and cysteine rich22Up-regulation
COL18A1Collagen type XVIII alpha 1 chain21Up-regulation
COL11A1Collagen type XI alpha 1 chain20Up-regulation

GC – gastric cancer.

Figure 5

Hub genes and module analysis of the PPI networks. (A) Hub genes identified by cytoHubba plug-in (interaction degree ≥20). Node color denotes interaction degree (red for high degree, orange for intermediate degree, and yellow for low degree). (B) Module 1, identified by MCODE plug-in, contains 16 genes. (C) Module 2 contains 7 genes. (D) Module 3 contains 11 genes. The color of each node denotes the expression of genes in the GC samples compared to that in normal gastric samples (pink for upregulation and blue for downregulation). (E) Pathway enrichment analysis of 3 significant modules in PPI networks. PPI, protein–protein interaction.

Survival analysis of hub genes

Using the Kaplan-Meier plotter database, the prognostic value of the 10 hub genes was evaluated in 876 GC patients. The results indicated that 7 out of the 10 hub genes – COL1A1, COL1A2, COL5A2, COL11A1, COL18A1, FN1, and SPARC – were related to the OS of GC patients. High expression of each of the 7 genes was associated with poor OS in GC patients (p<0.05). In contrast, the other 3 hub genes – COL3A1, COL5A2, and THBS1 – were not related to the OS of GC patients (p>0.05) (Figure 6).
Figure 6

Association of the 10 hub genes with the OS of GC patients, analyzed by Kaplan-Meier survival plots. (A–G) High expression of COL1A1, COL1A2, COL5A2, COL11A1, COL18A1, FN1, and SPARC was associated with poor OS of GC patients. (H–J) Expression of COL3A1, COL5A2, and THBS1 was not related to OS of GC patients. OS, overall survival; GC, gastric cancer.

Identification of DE circRNAs in GC

Three circRNA expression profiles including data from GC and adjacent non-tumor tissues were obtained from the GEO database (Table 1). In total, 150, 191, and 416 DE circRNAs from the GSE83521, GSE89143, and GSE93541 expression profiles, respectively, were extracted using GEO2R, meeting the thresholds of |log2FC| >1.0 and p<0.05 (Supplementary Table 6). By comparing the DE circRNAs from the 3 cohort profile datasets, we obtained 2 common DE circRNAs – hsa_circ_0000332 and hsa_circ_0021087 – both of which were downregulated in GC tissues compared to levels in adjacent non-tumor tissues (Figure 7A, 7B).
Figure 7

Venn diagrams of (A) common upregulated and (B) common downregulated DE circRNAs in the 3 cohort profile datasets (GSE83521, GSE89143, and GSE93541). Different colored areas represent different datasets. The overlapping areas indicate common DE circRNAs. (C) Predicted target genes overlapping DEGs identified from the gene expression profiles. DE circRNAs, differentially expressed circRNAs; DEGs, differentially expressed genes.

Prediction of targets for circRNAs and miRNAs and construction of DE circRNA–miRNA–mRNA networks

Using the Circular RNA Interactome online tool to predict potential miRNAs capable of binding to the 2 DE circRNAs, we obtained a total of 13 miRNAs, 2 of which were for hsa_circ_0000332, while the remaining 11 were for hsa_circ_0021087. Using the miRWalk 2.0 online tool to predict the targets of these 13 miRNAs, we obtained 1563 genes. Then, by comparing the 1563 predicted targets with the 507 common DEGs identified from the gene expression profiles, we found that none of the predicted targets of 5 of the 13 miRNAs were included in the list of DEGs, while there were 42 predicted targets of 8 of the 13 miRNAs that were present on the list of DEGs (Table 3, Figure 7C). Based on these results, we constructed circRNA–miRNA–mRNA networks with the 2 circRNAs, 8 miRNAs, and 42 target genes using Cytoscape 3.6.0 software (Figure 8). From the networks, we found that hsa_circ_0021087, which interacted with 7 miRNAs and indirectly interacted with 39 target genes, may play a more important circRNA role in the GC network.
Table 3

Prediction of targets for circRNAs and miRNAs.

DE circRNAsPredicted miRNAs binding to DE circRNAsPredicted targets for miRNAs overlapped with DEGs
hsa_circ_0000332hsa-miR-1292-5pNone
hsa-miR-370-3pCLIC6, NDOR1, PRLR
hsa_circ_0021087hsa-miR-1224-3pTFCP2L1
hsa-miR-1276COL4A2, FMNL3, FGD6, PAIP2B, GRAMD1C, SLC14A1, DUOX2
hsa-miR-1304-5pTHY1, DPYSL3, LY6E, FAM189A2, CMTM4, DNER, SYTL2, KIAA1324, SCNN1G, ACVR1C, SLC4A4, PRSS12
hsa-miR-184None
hsa-miR-450b-3pADAM12, TGM2, C4orf19, MAP7D2
hsa-miR-769-3pADAM12, C4orf19, LDLR
hsa-miR-517a-3pNone
hsa-miR-517c-3pNone
hsa-miR-623NRP2, ELOVL6, LIPH
hsa-miR-638None
hsa-miR-1827SERPINH1, EFEMP2, IGF2BP3, FGD6, THBS1, MXRA7, AKR7A3, METTL7A, SLC26A9, IQGAP2, KIAA1324, PLAC8, IL1RN

DE circRNAs – differentially expressed circRNAs.

Figure 8

DE circRNA–miRNA–mRNA networks. The network contains 52 nodes and 54 edges. The green diamond nodes represent circRNAs; yellow ellipsoidal nodes, miRNAs; orange square nodes, upregulated genes; blue square nodes, downregulated genes.

KEGG analysis of genes in the DE circRNA–miRNA–mRNA networks

KEGG analysis of the 42 genes in the DE circRNA–miRNA–mRNA networks was performed. The results indicated that these 42 genes were primarily enriched in the following terms: “signal transduction”, “extracellular matrix organization”, and “non-integrin membrane-ECM interactions” (Supplementary Table 7, Figure 9).
Figure 9

Pathway enrichment analysis of the 42 genes in the DE circRNA–miRNA–mRNA networks.

Discussion

Numerous studies have been conducted to understand the causes and underlying mechanisms of GC occurrence and development over the past several decades; however, the incidence and mortality of GC remain very high globally [1,4]. With the development of next-generation sequencing, large-scale high-throughput molecular profiling studies are now providing molecular insights into the complex nature of this disease [24]. The overall aim of this study was to identify key genes and circRNAs for GC diagnosis, prognosis, and therapy and to further explore the potential mechanisms of GC by integrated profiling analysis. Initially, 3 public GEO datasets were analyzed to identify DEGs between human GC tissues and adjacent non-tumor tissues. A total of 507 DEGs (311 upregulated and 196 downregulated) were identified. To further elucidate the underlying functions of these DEGs, GO analysis was performed. The results indicated several abnormally modified GO terms closely associated with cancer, such as cell adhesion, angiogenesis, negative regulation of angiogenesis, and regulation of cell growth. The results of KEGG analysis indicated that the most significantly enriched pathways were extracellular matrix (ECM)-related pathways, such as ECM organization, degradation of the ECM, and ECM proteoglycans. The ECM is a noncellular component that is composed of 2 major classes of molecules: glycosaminoglycans and fibrous proteins, which include collagen, elastin, fibronectin, and laminin [25]. As an important member of the tumor microenvironment, the ECM secreted by cancer and/or stromal cells affects the dissemination process through modulation of the interaction between cancer and endothelial cells and plays important roles in tumor angiogenesis, metastasis, and chemotherapy resistance [26]. Furthermore, other significantly enriched pathways, including integrin-related pathways (integrin cell surface interactions and integrin signaling pathway) and collagen-related pathways (collagen formation, collagen biosynthesis and modifying enzymes, collagen degradation, and assembly of collagen fibrils) are also related to the ECM. Integrins are a family of essential cell surface adhesion receptors with the ability to sense the content and stiffness of the surrounding ECM and transfer this information to cytoplasmic signaling pathways [27]. Collagen, as a key structural component of the ECM, is known to regulate cell migration, proliferation, differentiation, and survival by signaling through cell surface receptors, including integrins [28]. Thus, the results of the present study suggest that pathways associated with the ECM play an important role in the occurrence of GC and could be potential therapeutic targets. PPI networks were constructed, and the most closely related genes were identified: COL1A1, COL3A1, COL1A2, COL5A2, FN1, THBS1, COL5A1, SPARC, COL18A1, and COL11A1. Among these, COL1A1, COL3A1, COL1A2, COL5A2, COL5A1, COL18A1, and COL11A1 belong to the collagen family, which comprises 28 members (I–XXVIII) [29]. Each collagen consists of 3 polypeptide chains (α chains) numbered with Arabic numerals [29]. For instance, collagen type V α2 (COL5A2) encodes the pro-α2 chains of type V collagen, and collagen type III α1 (COL3A1) encodes the pro-α1 chains of type III collagen. As key structural components of the ECM, collagens have been found to be overexpressed in multiple cancers to provide a rigid matrix that facilitates tumor growth [25]. The expression of COL1A1 and COL1A2 has been confirmed by some researchers [30,31] to be upregulated in GC tissues compared to levels in normal tissues and may predict an adverse prognosis in GC patients [32]. Furthermore, Zhang et al. found that COL1A1 promotes CRC metastasis and may serve as an oncoprotein by regulating key genes in the WNT/PCP signaling pathway [33]. Ao et al. found that the silencing of COL1A2 inhibits the proliferation and promotes the apoptosis of GC cells through the PI3k-Akt signaling pathway [34]. Overexpression of COL3A1 has been confirmed in many cancers, including bladder cancer [35], glioblastoma [36], and GC [37]. Boguslawska et al. found that the expression of some genes related to adhesion and ECM remodeling, including COL1A1, COL5A1, and COL11A1, was upregulated in renal cell carcinoma compared with levels in controls [38]. COL5A1 is also reported to be involved in breast cancer [39], oral squamous cell carcinoma [40], lung adenocarcinoma [41], and GC [42]. COL5A2 has previously been found to be associated with the pathological processes of osteosarcoma [43], bladder cancer [44], and GC [45]. The role of COL18A1 is discrepant in different cancers. Fang et al. found that COL18A1 may be involved in bladder cancer progression, as its expression was increased with the progression of pathological stage [46]. However, a study by Qiu et al. found that the expression level of COL18A1 was negatively correlated with metastasis of melanoma, indicating that COL18A1 can inhibit the proliferation and metastasis of melanoma [47]. Nevertheless, there are few published investigations of the expression of COL18A1 in GC. Furthermore, other hub genes may additionally be involved in the metastasis of GC. Secreted protein acidic and rich in cysteine (SPARC) is regarded as a collagen chaperone because it binds to the procollagens and its existence is essential for collagen deposition in tissues [29,48]. SPARC overexpression has been demonstrated in some cancers, such as pancreatic cancer [49], esophageal squamous cell carcinoma [50], and GC [51]. In contrast, some studies have shown that SPARC expression is reduced in bladder cancer [52] and acute leukemia [53]. Fibronectin (FN) is a glycoprotein component of the ECM involved in cell adhesion, migration, metastasis, proliferation, and differentiation [54]. Recent evidence showed that FN is overexpressed in GC cells and that microRNA-200c can suppress the proliferation of GC cells by selectively downregulating FN expression [55]. Thrombospondin 1 (THBS1) is an extracellular glycoprotein that plays multiple roles in the ECM and cell–cell interactions [56]. Numerous studies have confirmed a role for THBS1 in cell invasion and migration. However, conflicting results have been obtained in different cell types [57]. In GC cells, Huang et al. demonstrated a marked overexpression of THBS1 when compared to levels in adjacent normal tissues. Nevertheless, THBS1 was correlated with well- and moderately-differentiated tumors, clearly indicating a less aggressive tumor biology [56]. In the present study, we also evaluated the prognostic value of the hub genes in GC using Kaplan-Meier analysis. The results showed that high expression of COL1A1, COL1A2, COL5A2, COL11A1, COL18A1, FN1, and SPARC was related to poor OS in patients with GC. These results suggest that these genes may provide novel tools for prognostication, as well as drug targets for GC patients. Additionally, the 3 most significant modules were extracted from the PPI network. Module 1 was mainly associated with the ECM and collagen, which have been thoroughly discussed above. Interestingly, we found that the significantly enriched pathways and most of the corresponding genes of Module 2 were mainly associated with platelets and hemostasis. It has been demonstrated that cancer increases the risk of thrombosis by 4-fold, indicating a close relationship between cancer and platelet activation [58]. Platelets induce cancer cell plasticity and result in the enhancement of extravasation and survival during the process of circulating tumor cell dissemination [58,59]. Similarly, tumor cells induce platelet aggregation, suggesting that the interaction between platelets and cancers is reciprocal [58,59]. Mikami et al. showed that platelets promoted the proliferation of GC cells [60]. Hiroto found that platelets contribute to creating a favorable microenvironment for resistance to chemotherapy [61]. Increasing evidence has shown that circRNAs are abundant and stable RNAs in mammalian cells and are important regulators of some biological processes related to cancer, including cell invasion and metastasis [62]. The most well-known function of circRNAs is their ability to “sponge up” miRNAs, thus serving as competitive inhibitors and suppressing the ability of the miRNA to bind its mRNA targets [63]. However, at present, the biological functions of most circRNAs are unclear, and studies of the underlying functional pathways and the co-expression networks among circRNAs, miRNAs, and mRNAs in GC are limited. In the present study, 3 circRNA expression profiles were analyzed to identify DE circRNAs between human GC and adjacent non-tumor tissues. Two DE circRNAs were identified, including hsa_circ_0000332 (111 nt), whose parent gene is pyruvate carboxylase (PC), located at chr11:66626276–66626387, and hsa_circ_0021087 (340 nt), whose parent gene is LIM domain only 1 (LMO1), located at chr11:8248521–8252051. Investigations of the biological functions of these 2 circRNAs have not been reported. Additionally, to reveal the comprehensive circRNA-related regulatory network in GC, DE circRNA–miRNA–mRNA networks were constructed, showing that hsa_circ_0021087 in particular may play a more critical circRNA role in GC. The construction of circRNA-related regulation networks in GC may provide valuable information for exploration of the potential mechanisms of circRNAs in GC. The present study was limited by the fact that all results were derived from bioinformatics analysis and lacked corresponding experiments. Functional research is necessary to explore the potential mechanisms of the DEGs and DE circRNAs in GC. The roles of DEGs and DE circRNAs in prognosis and therapy require further investigation in large clinical samples.

Conclusions

Using integrated bioinformatical analysis, we identified 507 DEGs in GC. PPI network analysis identified 10 hub genes: COL1A1, COL3A1, COL1A2, COL5A2, FN1, THBS1, COL5A1, SPARC, COL18A1, and COL11A1. Kaplan-Meier analysis revealed that 7 of the hub genes were associated with poor OS in GC patients. GO and KEGG analysis indicated that these genes are enriched in a variety of functions and pathways, among which ECM-related pathways may be the most important in GC. Furthermore, we identified 2 DE circRNAs: hsa_circ_0000332 and hsa_circ_0021087. To reveal the comprehensive circRNA-related regulatory network in GC, DE circRNA–miRNA–mRNA networks were constructed. Although further validation of the present results is required, this study provides useful information for exploration of potential biomarkers and targets for the diagnosis, prognosis, and therapy of GC. (A–J) Validation of the expression of 10 hub genes in the TCGA database. STAD – stomach adenocarcinoma. Number of differentially expressed genes in each dataset. DEGs – differentially expressed genes. Common differentially expressed genes among all three gene expression profiles. DEGs – differentially expressed genes. Gene ontology annotation of differentially expressed genes in gastric cancer. BP – biological process; CC – cellular component; MF – molecular function. Significantly enriched pathway terms of differentially expressed genes in gastric cancer. Significantly enriched pathway terms of three significant modules in protein–protein interaction networks. Number of differentially expressed circRNAs from each expression profile dataset. DE circRNAs – differentially expressed circRNAs. Significantly enriched pathway terms of genes in the DE circRNA–miRNA–mRNA networks.
Supplementary Table 1.

Number of differentially expressed genes in each dataset.

Up-regulationDown-regulationTotal
GSE1982688220672949
GSE54129168319923675
GSE7997399122603251
Common DEGs196311507

DEGs – differentially expressed genes.

Supplementary Table 2.

Common differentially expressed genes among all three gene expression profiles.

Common DEGsGene symbol
Up-regulated DEGsCOL1A, COL5A1, COL1A1, INHBA, COL6A3, MFAP2, SPARC, SERPINH1, COL8A1, HOXA13, FAP, COL18A1, COL5A2, COL3A1, CDH11, EDNRA, COL6A2, PRRX1, VCAN, HOXA10, SERPINE2, ADAMTS2, SULF1, FN1, NID2, PDGFRB, BGN, COL12A1, SEPT11, THY1, RAB31, THBS2, COL4A1, COL4A2, LGALS1, TIMP1, FBN1, IGFBP4, BICC1, CTSK, APOC1, HEYL, TGFBI, P4HA3, HOXB7, PLXDC2, BMP1, RCN3, COL10A1, GPNMB, TMEM158, ADAMTS12, GUCY1A3, PDLIM7, ACTN1, SFRP4, EFEMP2, SERPINF1, FSTL1, ASPN, IGFBP7, CLEC11A, AMIGO2, SALL4, SFRP2, CLDN1, CTHRC1, CST1, CDH3, FNDC1, PGF, SPHK1, ALDH1A3, WISP1, GUCY1B3, HES4, MGP, DOK5, C3, RARRES1, FLNA, CEBPB, SULF2, NRP1, ELOVL5, ISLR, LEF1, ZAK, PMEPA1, AEBP1, TNFRSF11B, SVIL, SPP1, GDPD5, OLFML2B, LOX, EMILIN1, PLEKHO1, HTRA1, GPR161, DGKH, ANTXR1, GXYLT2, PLAU, FSCN1, SRPX2, IGF2BP3, EPHB2, ANOS1, CEMIP, FBXO32, COMP, COL11A1, THBS4, PLEKHG2, E2F3, ADAM12, DNM1, ITGBL1, MFGE8, RIN3, NRP2, CTSL, CPXM1, ENG, NRK, MMP11, S100A10, DPYSL3, FMNL3, MYO1B, FKBP10, CST4, KLHL23, SPOCK1, FGD6, CCDC80, TNFSF4, BAG2, MLLT11, THBS1, TGM2, PLPPR4, CST2, ITGA5, MIR100HG, PLA2G7, VAC14, ZNF423, CRISPLD1, GINS4, NT5DC2, MMP7, FXYD5, GREM1, FOXC2, CLDN3, FAM83D, FJX1, OLR1, DUXAP10, KLK10, HS3ST3A1, ADAMTS6, SLCO2B1, LY6E, BOC, AHNAK2, HOXC10, SNX10, BUB1, DTL, FOXC1, IL32, MYL9, CXCL8, NOX4, CAP2, WNT2, ETV6, LILRB1, PDE3A, RHOJ, TREM1, SERPINE1, YBX2, TPM2, STC2, MXRA7, TREM2, PLEKHA4, MEOX2, CXCL9, SLC1A3, TAGLN, HOXC6
Down-regulated DEGsMUC5AC, C8orf49, ZNF57, UBL3, MAGI1, AKR7A3, PSAPL1, MFSD4A, GATA4, SMIM6, ALDH6A1, DPCR1, TRIM50, NFE2L2, CLIC6, KLK11, FAM107B, TMPRSS2, MUC6, METTL7A, RASSF6, VSIG1, SGPP2, RAB27A, CWH43, WIPF3, ELOVL6, SVIP, PELI2, C1orf116, FMO5, ADTRP, HOMER2, SHROOM3, SMIM5, ELL2, AQP4, EPN3, CYSTM1, PPFIBP2, PBLD, FA2H, RNASE4, CAPN9, ATP11B, BEX5, C1orf210, FAM189A2, VSIG2, PTER, RDH12, CA2, CXCL17, PXMP2, PLPP5, SLC16A7, TPCN2, KCNE2, CLDN18, TMEM220, DGKD, CMTM4, SLC41A2, KCNJ15, SIDT2, ERO1B, DNER, ANKRD22, DHRS7, PAIP2B, CYP3A5, MYZAP, FBXL13, SSTR1, FGD4, FAM20A, CA9, ALAD, RAB27B, SOSTDC1, PDGFD, LIPH, TMED6, ALDH3A1, UBE2QL1, TMEM92, MECOM, ARHGEF28, RNASE1, SULT1B1, PDIA2, CPA2, ADAM28, VSTM2A, ADGRG2, PLLP, TMEM171, TMEM116, KLHDC7A, LDHD, ADRB2, SH3BGRL2, TFF1, EPB41L4B, PGC, IRS4, CNTN3, B4GALNT3, ATP4A, ARHGEF37, GCNT2, PPID, SGK2, CCKBR, MAL, ATP4B, RASSF10, SLC26A9, CYP3A43, HPGD, NEDD4L, NECTIN3, PTPRZ1, AKR1C3, IGFBP2, FAM46C, SHISA6, MUM1L1, HMGN2P46, FBP2, ANG, KIAA1804, SPINK2, GRAMD1C, SCGB2A1, GUCA2B, C16orf89, GKN1, AKR1B10, KCNJ16, PPP1R36, GATA6-AS1, ALDOB, SST, PCAT18, LIPF, GATA5, TMEM125, NTN4, ENPP6, GPT2, BCL2L15, TFF2, IQGAP2, PTPRN2, SYTL2, RPS6KA6, KIAA1324, PSCA, OASL, GABRB3, ASPA, SCNN1G, SLC7A8, SMPD3, LYPD6B, CYP2C18, ALDH1A1, PLAC8, AHCYL2, GIF, ASCL1, ESRRG, RAB11B-AS1, CHGA, PIGR, ACVR1C, NOSTRIN, SPINK7, ADH7,TSPAN12, ACKR4, GKN2, NRG4, PKIB, CAPN13, RNF128, NKX2-3, ATP8A1, ZBTB7C, SH3RF2, SCNN1B, GDPD1, SYTL5, TEX9, UGT2B15, HMGN5, B3GNT6, SULT2A1, GOLM1, PIWIL2, ACER2, CYP2C9, SOWAHA, ST6GALNAC1, GSTA4, ODAM, PRR15L, TMEM266, SCGN, SLC28A2, TMEM27, SELENBP1, ANKRD29, ATP13A4, HAPLN1, MAMDC2, C4orf19, REG1A, UPK1B, SCN3A, AMPD1, SLC4A4, FAM3B, SMIM24, ANXA10, IL1R2, AKR1C1, SLC14A1, HHIP, CTSE, LRRC66, KCNMB2, STYK1, GCNT1, NDOR1, IRX3, PRDM16, PER3, ZNF793-AS1, TCEA3, SCIN, NR3C2, DUSP4, SLC26A7, UNC5B-AS1, OSBPL7, XK, ARL14, FER1L4, MAP7D2, TMEM38A, ADAMTS15, CAPN8, TPH1, GATA6, SULT1C2, NQO1, KLF4, ITPKA, DDX60, SLC44A4, KBTBD12, RHBDL2, MUC1, KAZALD1, MST1L, DUOX2, EYA2, NEUROD1, TFCP2L1, PDZD3, PIK3C2G, RMST, JCHAIN, STK32B, HRASLS2, RFX6, TCAIM, GAS2, MRAP2, CYTIP, SERPINA4, CDHR3, GCKR, POU2AF1, RGS7, CYP2C19, HGD, SPINK1, PRSS12, PRLR, APOBEC1, TRPA1, GSTA1, NCKAP5, TRIQK, LRRC31, STX19, KRT20, BMP5, ATP2A3, LTF, IGH, ADH1C, IL1RN, LDLR, MGAM, TOX3, ABCC13, FCRL5

DEGs – differentially expressed genes.

Supplementary Table 3.

Gene ontology annotation of differentially expressed genes in gastric cancer.

TermIDP-valueCount
Top 10 terms in BP category
Collagen fibril organizationGO: 00301991.04E-1012
Cell adhesionGO: 00071559.74E-0719
AngiogenesisGO: 00015253.16E-0616
Endodermal cell differentiationGO: 00359875.32E-057
Extracellular matrix organizationGO: 00301981.15E-0410
Negative regulation of angiogenesisGO: 00165252.75E-048
Regulation of cell growthGO: 00015583.59E-047
Patterning of blood vesselsGO: 00015695.67E-046
Embryonic limb morphogenesisGO: 00303260.0013116
Cellular response to BMP stimulusGO: 00717730.0014394
Top 10 terms in CC category
Extracellular spaceGO: 00056154.06E-2079
Proteinaceous extracellular matrixGO: 00055782.82E-1530
Collagen trimerGO: 00055811.53E-1015
Extracellular exosomeGO: 00700623.76E-10104
Extracellular matrixGO: 00310121.98E-0615
Basement membraneGO: 00056045.98E-0610
Cell surfaceGO: 00099861.81E-0422
Extracellular regionGO: 00055760.00365825
FibrilGO: 00432050.0039473
Endoplasmic reticulumGO: 00057830.00596224
Top 10 terms in MF category
Extracellular matrix structural constituentGO: 00052012.84E-0811
Heparin bindingGO: 00082017.82E-0715
Extracellular matrix bindingGO: 00508401.76E-057
Calcium ion bindingGO: 00055095.56E-0533
Integrin bindingGO: 00051787.75E-057
Protein bindingGO: 00055150.00168613
Serine-type endopeptidase inhibitor activityGO: 00048670.0018649
Aldehyde dehydrogenase (NAD) activityGO: 00040290.0054474
Collagen bindingGO: 00055180.0066764
Identical protein bindingGO: 00428020.00994910

BP – biological process; CC – cellular component; MF – molecular function.

Supplementary Table 4.

Significantly enriched pathway terms of differentially expressed genes in gastric cancer.

TermDatabaseIDP-valueCount
Extracellular matrix organizationReactomeR-HSA-14742442.06E-3345
Degradation of the extracellular matrixReactomeR-HSA-14742287.14E-2226
Collagen formationReactomeR-HSA-14742902.36E-1820
Collagen biosynthesis and modifying enzymesReactomeR-HSA-16508149.37E-1818
Collagen degradationReactomeR-HSA-14424909.00E-1717
Assembly of collagen fibrilsReactomeR-HSA-20220901.69E-1616
Integrin cell surface interactionsReactomeR-HSA-2160833.51E-1618
ECM proteoglycansReactomeR-HSA-30001787.22E-1617
Chemical carcinogenesisKEGG PATHWAYhsa052046.76E-1315
Integrin signalling pathwayPANTHERP000349.35E-1319
Supplementary Table 5.

Significantly enriched pathway terms of three significant modules in protein–protein interaction networks.

TermDatabaseIDP-valueCount
Module 1
Collagen biosynthesis and modifying enzymesReactomeR-HSA-16508142.76E-4416
Collagen formationReactomeR-HSA-14742901.42E-4216
Collagen degradationReactomeR-HSA-14424904.31E-3714
Extracellular matrix organizationReactomeR-HSA-14742441.01E-3416
Assembly of collagen fibrilsReactomeR-HSA-20220901.42E-3413
Module 2
Platelet degranulationReactomeR-HSA-1146089.13E-156
Response to elevated platelet cytosolic Ca2+ReactomeR-HSA-760051.14E-146
HemostasisReactomeR-HSA-1095821.98E-137
Platelet activation, signaling and aggregationReactomeR-HSA-760028.57E-136
Extracellular matrix organizationReactomeR-HSA-14742444.51E-105
Module 3
O-linked glycosylation of mucinsReactomeR-HSA-9137096.69E-125
Chemical carcinogenesisKEGG PATHWAYhsa052042.04E-115
O-linked glycosylationReactomeR-HSA-51731058.07E-115
Metabolic pathwaysKEGG PATHWAYhsa011001.42E-108
Biological oxidationsReactomeR-HSA-2118591.49E-095
Supplementary Table 6.

Number of differentially expressed circRNAs from each expression profile dataset.

Up-regulationDown-regulationTotal
GSE835218070150
GSE891433188191
GSE93541215201416
Common DE circRNAs022

DE circRNAs – differentially expressed circRNAs.

Supplementary Table 7.

Significantly enriched pathway terms of genes in the DE circRNA–miRNA–mRNA networks.

TermDatabaseIDP-valueCount
Signal transductionReactomeR-HSA-1625820.00018810
Extracellular matrix organizationReactomeR-HSA-14742440.0002624
Non-integrin membrane-ECM interactionsReactomeR-HSA-30001710.0019122
Collagen biosynthesis and modifying enzymesReactomeR-HSA-16508140.0024382
Bile secretionKEGG PATHWAYhsa049760.0027242
SLC-mediated transmembrane transportReactomeR-HSA-4254070.0028543
ECM-receptor interactionKEGG PATHWAYhsa045120.0035872
TGF-beta signaling pathwayKEGG PATHWAYhsa043500.0037562
Integrin cell surface interactionsReactomeR-HSA-2160830.0038422
Collagen formationReactomeR-HSA-14742900.0041052
  10 in total

1.  The Effect of Long Non-Coding RNA (lncRNA) HCP5 on Regulating Epithelial-Mesenchymal Transition (EMT)-Related Markers in Gastric Carcinoma Is Partially Reversed by miR-27b-3p.

Authors:  Shudong Chen; Chenglei Ren; Hui Zheng; Xianchun Sun; Jundi Dai
Journal:  Med Sci Monit       Date:  2020-05-01

2.  Identification of the potential biomarkers in patients with glioma: a weighted gene co-expression network analysis.

Authors:  Ting-Yu Chen; Yang Liu; Liang Chen; Jie Luo; Chao Zhang; Xian-Feng Shen
Journal:  Carcinogenesis       Date:  2020-07-10       Impact factor: 4.944

3.  Silencing circZFR inhibits the proliferation, migration and invasion of human renal carcinoma cells by regulating miR-206.

Authors:  Mi Wang; Yisheng Gao; Jie Liu
Journal:  Onco Targets Ther       Date:  2019-09-13       Impact factor: 4.147

4.  ICAM1 Regulates the Development of Gastric Cancer and May Be a Potential Biomarker for the Early Diagnosis and Prognosis of Gastric Cancer.

Authors:  Songda Chen; Shan Pan; Huijie Wu; Jingyuan Zhou; Yueli Huang; Shuai Wang; Aiqun Liu
Journal:  Cancer Manag Res       Date:  2020-03-02       Impact factor: 3.989

5.  Identifying of biomarkers associated with gastric cancer based on 11 topological analysis methods of CytoHubba.

Authors:  Hua Ma; Zhihui He; Jing Chen; Xu Zhang; Pingping Song
Journal:  Sci Rep       Date:  2021-01-14       Impact factor: 4.379

6.  Circular RNA circ_0020123 Promotes Non-Small Cell Lung Cancer Progression Through miR-384/TRIM44 Axis.

Authors:  Qingshan Ma; Baogang Huai; Yuting Liu; Zhongyao Jia; Qilong Zhao
Journal:  Cancer Manag Res       Date:  2021-01-07       Impact factor: 3.989

7.  Identification of Potential Bisphenol A (BPA) Exposure Biomarkers in Ovarian Cancer.

Authors:  Aeman Zahra; Qiduo Dong; Marcia Hall; Jeyarooban Jeyaneethi; Elisabete Silva; Emmanouil Karteris; Cristina Sisu
Journal:  J Clin Med       Date:  2021-05-05       Impact factor: 4.241

8.  Biglycan as a potential diagnostic and prognostic biomarker in multiple human cancers.

Authors:  Shou-Feng Zhao; Xue-Jing Yin; Wen-Ju Zhao; Le-Cui Liu; Zhi-Peng Wang
Journal:  Oncol Lett       Date:  2020-01-08       Impact factor: 2.967

9.  CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

Authors:  Mengting Niu; Quan Zou; Chen Lin
Journal:  PLoS Comput Biol       Date:  2022-01-20       Impact factor: 4.475

10.  Circular RNA circLMO1 Suppresses Cervical Cancer Growth and Metastasis by Triggering miR-4291/ACSL4-Mediated Ferroptosis.

Authors:  Rongying Ou; Shun Lu; Luhui Wang; Yebo Wang; Mingfen Lv; Tian Li; Yunsheng Xu; Jieqiang Lu; Ren-Shan Ge
Journal:  Front Oncol       Date:  2022-03-07       Impact factor: 6.244

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.