Literature DB >> 30106150

Novel prognostic biomarkers of gastric cancer based on gene expression microarray: COL12A1, GSTA3, FGA and FGG.

Shijie Duan¹, Baocheng Gong¹, Pengliang Wang¹, Hanwei Huang¹, Lei Luo¹, Funan Liu¹.

Abstract

Gastric cancer (GC) is the fifth most common malignancy and the third leading cause of cancer‑associated mortality in the world. However, its mechanisms of occurrence and development have not been clearly elucidated. Furthermore, there is no effective tumor marker for GC. Using DNA microarray analysis, the present study revealed genetic alterations, screened out core genes as novel markers and discovered pathways for potential therapeutic targets. Differentially expressed genes (DEGs) between GC and adjacent normal tissues were identified, followed by pathway enrichment analysis of DEGs. Next, the protein‑protein interaction (PPI) network of DEGs was built and visualized. Analyses of modules in the PPI network were then performed to identify the functional core genes. Finally, survival analysis of core genes was conducted. A total of 256 genes were identified as DEGs between the GC samples and normal samples, including 169 downregulated and 87 upregulated genes. Through Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis, the present study identified a total of 143 GO terms and 21 pathways. Six clusters of functional modules were identified, and the genes associated with these modules were screened out as the functional core genes. Certain core genes, including collagen type 12 α1 chain (COL12A1), glutathione S‑transferase α3 (GSTA3), fibrinogen α chain (FGA) and fibrinogen γ chain (FGG), were the first reported to be associated with GC. Survival analysis suggested that these four genes, COL12A1 (P=0.002), GSTA3 (P=3.4x10‑6), FGA (P=0.00075) and FGG (P=1.4x10‑5), were significant poor prognostic factors and therefore, potential targets to improve diagnosis, optimize chemotherapy and predict prognostic outcomes.

Entities: Chemical Disease Gene Mutation Species

Mesh：

Substances：

Year: 2018 PMID： 30106150 PMCID： PMC6131538 DOI： 10.3892/mmr.2018.9368

Source DB: PubMed Journal: Mol Med Rep ISSN： 1791-2997 Impact factor: 2.952

Introduction

Gastric cancer (GC) is the fifth most common malignancy in the world, especially in developing countries (1). GC has been the leading cause of cancer death worldwide until the mid-1990s, after which the occurrence has been substantially declining. However, certain countries in Eastern Asia, such as China, Korea and Japan, remain as highly endemic areas (2). In China, the 5-year survival rate of GC has improved 16% for patients diagnosed during 1995–2009 (3). However, GC is still the third leading cause of cancer death in the world and its prognosis is relatively poor (1). In recent years, a growing number of researchers devoted to GC research and achieved considerable achievements. In the previous studies, various molecular mechanisms and biomarkers related to GC have been identified. Through a retrospective study, Pectasides et al (4) found that using carcinoembryonic antigen (CEA), carbohydrate antigen 19-9 (CA 19-9) and carbohydrate antigen 50 (CA-50) as GC biomarkers had significant value in early detection of recurrence and monitoring progression. Ara et al (5), discovered that non-canonical Wnt signaling pathway contributed to GC progression, which may serve as a new potential therapeutic target for GC. Despite of these advances, the mechanisms of GC occurrence and development are yet to be clearly elaborated. Furthermore, the biomarkers commonly used for GC diagnosis and treatment, including carbohydrate antigen 125 (CA125), CA19-9 and CEA, have significant sensitivity and specificity issues (6). As such, further research on revealing genetic alterations, identification of new biomarkers and exploration of pathways associated with GC is critically needed. DNA microarray analysis is a rather powerful method to screen out cancer-related genes as novel diagnostic and prognostic markers and disclose genetic alterations of cancer evolution and progression. For example, Yang et al (7) found that Chromogranin A (CHGA) and Thy-1 cell surface antigen (THY1) were novel biomarkers in the diagnosis of cancer via DNA microarray analysis. Similarly, Sun et al (8) identified the COL family as promising prognostic markers for GC. Often used in combination with DNA microarray analysis, gene ontology (GO) enrichment analysis is a strategy to characterize the function categories affected by cancer and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis is an approach used to find out biologic pathways enriched in cancer. For example, with the above analysis methods, Hu et al (9) confirmed that several pathways, including focal adhesion, ECM-receptor interactions and the metabolism of xenobiotics by cytochrome P450, were associated with the progression of GC. Another important technique, modules analysis, is more crucial to study the specific behavior of modules and identify functional genes as cancer biomarkers compared with genes with straightforward high interaction degree (10). In the present study, we selected expression profile (GSE79973) to perform DNA microarray analysis to reveal genetic alterations, identify new biomarkers and explore novel pathways associated with GC. In addition to biomarkers previously reported, we identified four novel biomarkers, COL12A1, GSTA3, FGA, and FGG, as significant poor prognostic factors. Most of the enriched GO terms and KEGG pathways of those genes are related to GC, such as chemical carcinogenesis, metabolism of xenobiotics by cytochrome P450, ECM-receptor interaction, focal adhesion, and platelet activation. These findings may provide insights on GC occurrence and development, as well as potential therapeutic targets for future research.

Materials and methods

Microarray data

To account for tumor heterogeneity, we conducted DNA microarray analysis with the new gene profiles, GSE79973, to identify genes as novel biomarkers. In the present study, the expression profiles associated with GC, GSE79973, were downloaded from GEO (www.ncbi.nlm.nih.gov/geo/), which is a public functional genomic data repository. GSE79973 containing 20 tissues (10 pairs of GC tissues and adjacent non-tumor tissues) were obtained from Zhejiang Provincial People's Hospital, Zhejiang, China. The platform was based on GPL570 [HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0.

Preprocess microarray data

The Microarray Data was preprocessed by affy package in R (11). The purpose of this step was to filter out unwanted noise of the raw microarray and ensure background and data were standardized (12). Furthermore, for these genes that corresponded with multiple probes, we used the average expression values of those probes as the expression value of each gene. Furthermore, to visualize the difference before and after normalization, we constructed box plots of raw and normalization data.

Identification of the DEGs

Using LIMMA package in R, we collected a data list from GSE79973 (13). We then screened out these DEGs by means of t-test with P-value <0.05 and |log2(fold change) |>2. The heatmap and volcano map of these DEGs were constructed using pheatmap package in R and gplots package in R, respectively.

GO terms and KEGG pathways of DEGs

To elaborate how DEGs affected the GC cells, GO and KEGG pathway enrichment analyses of DEGs were conducted using DAVID (david.abcc.ncifcrf.gov/). The DAVID database includes a total of four modules (functional annotation, gene functional classification, gene ID conversion and gene name batch viewer). The database for annotation was used in the presnt study. The GO terms and pathways of DEGs with P<0.05 and at least five genes were screened out as significant function annotation of DEGs.

PPI network analysis of the DEGs

To study the molecular interactions between DEGs, we built the PPI network using the STRING (www.string-db.org/) database (14). The PPI network includes direct (physical) and indirect (functional) associations and stem from computational prediction, knowledge transfer between organisms, and interactions aggregated from other (primary) databases. Using the STRING database, we could obtain certain integrated scores of interactions among DEGs and select out the genes whose integrated scores were bigger than 0.4 (the default threshold in the STRING database). The PPI networks were then visualized by the Cytoscape software (15).

Sub-network modeling analysis of PPI networks

In PPI networks, genes in the same module typically show the same or similar function and work together to implement their biological function. To visualize the network and identify the modules in the network, MCODE plug-in on the Cytospace software (www.cytoscape.org/) was used. The parameters were set as follows: Degree cutoff ≥2 (degrees of each node in module were at least larger than 2), K-core ≥2 (subgraphs of each node in module were at least more than 2). After that, the GO enrichment analysis was performed using DAVID and the functional core genes of subsets were selected in each module.

Survival analysis of core genes

The survival analysis of core genes was conducted by Kaplan-Meier plotter (KM plotter, www.kmplot.com), whose gene expression data as well as relapse free and overall survival information are downloaded from the well-known public database, including GEO (Affymetrix microarrays only), EGA and TCGA. The Kaplan Meier (KM) plotter is capable of assessing the effect of 54,675 genes on survival using 10,461 cancer samples, 1,065 of which are GC patients with a mean follow-up of 69/40/49/33 months. Based on the KM plotter on the webpage, the hazard ratio (HR) with 95% confidence intervals and log rank P-values were calculated and the curves were generated.

Results

Data preprocessing

Genes with systematic bias among original data were removed after preprocessing using the Affy package in R software. The expression data of genes before and after normalization are shown in Fig. 1. The black lines in each of the boxes represent the medians of each dataset. The black lines are shown at almost the same level in the box plots, indicating a significant effect of standardization.

Figure 1.

Normalized expression value data. The black line in each box represents the median of each set of data, which determined the degree of standardization of data through its distribution. (A) Box plots of expressed value data prior to normalization. (B) Box plots of expressed value data following normalization. The gene expression value is presented on the y-axis and the samples on the x-axis.

Get the DEGs

Using the R package LIMMA, we identified the DEGs between the GC and normal samples. A total of 256 DEGs in the data list were identified from GSE79973, including 169 down-regulated and 87 up-regulated genes. We ranked these DEGs by P-value and constructed the heatmap (Fig. 2) as well as volcano map (Fig. 3). Based on the heatmap and volcano map, the gene expressions of these DEGs exhibit significant differences.

Figure 2.

Heatmap of the DEGs. Overview of the associations between DEGs from normal and GC tissues. Each row represents the tissues (10 GC and 10 normal). Red coloration indicated 10 GC tissues. Blue coloration indicated 10 adjacent normal tissues. Each column represents the gene IDs. Log ratio scale bar for the Treeview color change was also presented. The red in the log ratio scale bar indicates highly expressed genes in GC tissues when compared with the normal tissues, while the blue represents the low expression of genes. The number in each block with its color in the log ratio scale bar represents the expression level of each gene in different samples. GC, gastric cancer; DEGs, differentially expressed genes.

Figure 3.

Volcano plot overview of the distribution of DEGs. Red spots indicate upregulated genes, while green spots indicate downregulated genes. Black spots represent genes that were not DEGs. The-log10 (adj. P-value) on the x-axis is the P-value following correction and the log2FC value on the y-axis represents the fold change of genes. DEGs, differentially expressed genes.

GO term enrichment analysis of the DEGs

Using the DAVID and pathway enrichment analysis of the DEGs, we identified a total of 100 GO terms and 16 pathways. The top 10 enriched GO terms of the DEGs based on the P-values are listed (Table I). As shown in Table I, DEGs were significantly enriched in biological processes (BP), including extracellular matrix organization, skeletal system development, cell adhesion and xenobiotic metabolic process. For molecular function (MF), DEGs were significantly enriched in extracellular matrix structural constituent and extracellular matrix binding. In addition, GO cell component (CC) analysis indicated that the DEGs were significantly enriched in extracellular space, extracellular region, proteinaceous extracellular matrix and collagen trimer.

Table I.

Top 10 enriched GO terms, which were sorted by P-value in ascending order.

Category	GO ID	GO name	P-value	Gene number
CC	GO:0005615	Extracellular space	1.07×10⁻¹⁵	55
CC	GO:0005576	Extracellular region	4.88×10⁻¹⁵	59
BP	GO:0030198	Extracellular matrix organization	1.99×10⁻¹¹	19
CC	GO:0005578	Proteinaceous extracellular matrix	3.87×10⁻⁹	19
CC	GO:0005581	Collagen trimer	1.12×10⁻⁸	12
BP	GO:0001501	Skeletal system development	9.95×10⁻⁹	14
BP	GO:0007155	Cell adhesion	2.95×10⁻⁸	23
MF	GO:0005201	Extracellular matrix structural constituent	9.78×10⁻⁷	9
BP	GO:0006805	Xenobiotic metabolic process	3.84×10⁻⁵	8
MF	GO:0050840	Extracellular matrix binding	2.09×10⁻⁴	5

GO, Gene Ontology; BP, biological process; CC, cellular component; MF, molecular function.

KEGG pathway analysis of the DEGs

A total of 16 enriched pathways with P<0.05 were shown in Table II. The first enriched pathway, chemical carcinogenesis, was directly related to cancer and all the others have been reported to play an important role in cancer progression via certain biological processes, such as protein digestion and absorption, metabolism of xenobiotics by cytochrome P450, ECM-receptor interaction, focal adhesion, metabolic pathways, and platelet activation.

Table II.

KEGG pathways enriched in DEGs.

Category	Pathway name	Gene number	P-value
KEGG_PATHWAY	hsa05204:Chemical carcinogenesis	11	1.65×10⁻⁷
KEGG_PATHWAY	hsa04974:Protein digestion and absorption	11	4.12×10⁻⁷
KEGG_PATHWAY	hsa00980:Metabolism of xenobiotics by cytochrome P450	10	9.02×10⁻⁷
KEGG_PATHWAY	hsa04512:ECM-receptor interaction	10	3.59×10⁻⁶
KEGG_PATHWAY	hsa00982:Drug metabolism-cytochrome P450	9	4.86×10⁻⁶
KEGG_PATHWAY	hsa00830:Retinol metabolism	7	3.10×10⁻⁴
KEGG_PATHWAY	hsa04971:Gastric acid secretion	7	5.83×10⁻⁴
KEGG_PATHWAY	hsa00010:Glycolysis/Gluconeogenesis	6	0.0027
KEGG_PATHWAY	hsa04510:Focal adhesion	10	0.0027
KEGG_PATHWAY	hsa04978:Mineral absorption	5	0.0041
KEGG_PATHWAY	hsa00350:Tyrosine metabolism	4	0.0134
KEGG_PATHWAY	hsa05146:Amoebiasis	6	0.0177
KEGG_PATHWAY	hsa00071:Fatty acid degradation	4	0.0247
KEGG_PATHWAY	hsa01100:Metabolic pathways	26	0.0379
KEGG_PATHWAY	hsa04611:Platelet activation	6	0.0383
KEGG_PATHWAY	hsa04972:Pancreatic secretion	5	0.0440

DEGs, differentially expressed genes; KEGG, Kyoto Encyclopedia of Genes and Genomes.

PPI network of DEGs

The whole PPI network shown in Fig. 4 contained 116 nodes and 197 edges. The nodes represent DEGs with higher integrated scores of interactions and the edges represent interactions among the DEGs. The genes with interaction degree ≥5 were shown in Table III. COL1A2, COL2A1, COL11A1 and SPARC, whose corresponding degree was beyond 10, were hub genes in the PPI network and closely related to cancer.

Figure 4.

PPI network of DEGs. The network contained 116 nodes and 197 edges. The 39 red nodes were the upregulated genes and the 76 green nodes were the downregulated genes. The one pink node indicates a gene that was automatically generated and was potential associated with DEGs. The edges represented the correlation between two genes. PPI, protein-protein interaction; DEGs, differentially expressed genes.

Table III.

Core genes with corresponding degree ≥5.

Gene	Degree
COL1A2	15
COL2A1	15
COL11A1	12
SPARC	11
COL5A2	9
FN1	9
IL8	9
TIMP1	9
ALDH3A1	8
ATP4A	8
COL6A3	8
GSTA1	8
GSTA3	8
THBS2	8
ADH1A	7
ADH1B	7
ADH7	7
ADIPOQ	7
ALDOB	7
COL10A1	7
COL12A1	7
COL8A1	7
TNFRSF11B	7
BGN	6
CYP2C19	6
CYP2C9	6
PLA2G1B	6
AKR1C1	5
ASPN	5
CHGA	5
COMP	5

Modules in PPI network

Six modules were identified in this study, including 8, 6, 3, 3, 3 and 3 genes, respectively (Fig. 5). The top three GO terms of each modules are shown in Table IV. Interestingly, the GO terms of these modules were mainly associated with collagen trimer, (S)-limonene 7-monooxygenase activity, extracellular matrix organization, multicellular organism development, glutathione derivative biosynthetic process and fibrinogen complex, respectively. Among the core genes involved in these modules, COL12A1, GSTA3, FGA and FGG were firstly reported to be associated with GC.

Figure 5.

Functional modules involved in the PPI network. The red nodes represent the upregulated genes and the green nodes represent the downregulated genes. The edges indicate the correlation between two genes. (A) Module A was comprised of 8 nodes and 28 edges. (B) Module B was comprised of 6 nodes and 9 edges. (C) Module C was comprised of 3 nodes and 3 edges. (D) Module D was comprised of 3 nodes and 3 edges. (E) Module E was comprised of 3 nodes and 3 edges. (F) Module F was comprised of 3 nodes and 3 edges. PPI, protein-protein interaction.

Table IV.

Top three GO terms in each module.

GOID	GO name	P-value	Genes ID
Module A
0005581	Collagen trimer	2.49×10⁻⁹	COL6A3, COL1A2, COL2A1, COL8A1, COL10A1
0030199	Collagen fibril organization	1.17×10⁻⁷	COL1A2, COL2A1, COL5A2, COL11A1
0005201	Extracellular matrix structural constituent	1.19×10⁻⁷	COL1A2, COL2A1, COL5A2, COL11A1
Module B
0018676	(S)-limonene 7-monooxygenase activity	4.74×10⁻⁴	CYP2C19, CYP2C9
0018675	Limonene 6-monooxygenase activity	4.74×10⁻⁴	CYP2C19, CYP2C9
0052741	(R)-limonene 6-monooxygenase activity	4.74×10⁻⁴	CYP2C19, CYP2C9
Module C
0030198	Extracellular matrix organization	1.36×10⁻⁴	TNFRSF11B, SPARC, SPP1
0050840	Extracellular matrix binding	0.0031	SPARC, SPP1
0005615	Extracellular space	0.0055	TNFRSF11B, SPARC, SPP1
Module D
0007275	Multicellular organism development	9.61×10⁻⁴	WNT2, SFRP4, WIF1
0017147	Wnt-protein binding	0.0037	SFRP4, WIF1
0030178	Negative regulation of Wnt signaling pathway	0.0062	SFRP4, WIF1
Module E
1901687	Glutathione derivative biosynthetic process	0.0026	GSTA1, GSTA3
0004364	Glutathione transferase activity	0.0041	GSTA1, GSTA3
0006749	Glutathione metabolic process	0.0067	GSTA1, GSTA3
Module F
0005577	Fibrinogen complex	2.17×10⁻⁷	FGG, FGA, FN1
0031093	Platelet α granule lumen	8.94×10⁻⁶	FGG, FGA, FN1
0002576	Platelet degranulation	3.73×10⁻⁵	FGG, FGA, FN1

GO, Gene Ontology.

Kaplan-Meier survival curves

The Kaplan-Meier (KM) survival curves are shown in Fig. 6. Using the KM plotter platform, the GC patients were divided into two groups, with low and high expression of the four core genes, respectively. We found that each of the four core genes, COL12A1 [HR 1.4 (1.13–1.74) P=0.002], GSTA3 [HR 1.64 (1.33–2.02) P=3.4e-06], FGA [HR 1.36 (1.14–1.63) P=0.00075] and FGG [HR 1.45 (1.23–1.72) P=1.4e-05], was a poor prognostic factor of overall survival.

Figure 6.

Kaplan-Meier survival curves of four genes generated from Kaplan-Meier plotter. Curves of (A) COL12A1, (B) GSTA3, (C) FGA and (D) FGG. COL12A1, collagen type 12 α1 chain; GSTA3, glutathione S-transferase α3; FGA, fibrinogen α chain; FGG, fibrinogen γ chain; HR, hazard ratio; CI, confidence interval.

Discussion

GC, as one of the most malignant tumor, was a typical heterogeneous cancer with molecular complexity and heterogeneity (16,17). Recently, a substantial number of effort have been paid in exploring the underwent molecular mechanism of GC and several genetic and epigenetic alterations have been identified as biomarkers with effect on diagnosis, treatment, stratification and prognosis (18–20). However, with the technological improvement, its heterogeneity was found to be more complex than previous imagined, regarding genomic instability, differentially expressed genes (DEGs), genetic variations, epigenetic heterogeneity, protein heterogeneity and so on (21,22). The heterogeneity of GC presented some complex biological characteristics, such as recurrence and metastasis of cancer, sensitivity to adjuvant therapy, and so on (23–25). Therefore, these identified biomarkers still were ineffective and inconstant, which slow their clinical application (17,26). Even so, it still should not be neglected that exploring novel biomarkers was helpful to further construct the molecular network of GC, improve our understanding on heterogeneity, and identify more meaningful genetic subtypes related to individual characteristics, such as prognosis, sensitivity of adjuvant therapy and so on (21). In our study, a total of six modules were identified. These modules were analyzed with these DEGs that met with the default parameters. And the expression data of these DEGs were identified based on the gene profiles (GSE79973). Therefore, as a result of the tumor heterogeneity, these modules were associated with this gene profiles. Interestingly, from the function annotation of these modules, on the one hand, these modules were enriched in some common cancer signaling pathway such as PI3K-Akt signaling pathway, Wnt signaling pathway and so on (27,28). On the other hand, these modules also were enriched in the processes, such as gastric acid secretion, protein digestion and absorption and so on, which were specific in the GC. We also found that certain well-known gastric prognosis factors, such as some non-canonical Wnt signaling genes and PICT1, were not included in the identified modules (29). In fact, these well-known gastric prognosis factors were identified as DEGs in our study. But they were not included in further module analysis for their expression level without meeting with the default parameter. This issue resulted from the tumor heterogeneity. It is the heterogeneity that restricts these well-known prognostic factors as sites of targeted therapy, for example trastuzumab for the GC patients with HER-2 positive (30). The trastuzumab has been a standard treatment strategy for the GC patients with HER-2 positive (31). However, only a small number of patients can benefit from it, which mainly results from HER-2 in a fraction of GC patients (~7–34%) being overexpression due to the heterogeneity (32). Even so, the tumor heterogeneity makes it possible to explore some novel genes related to prognosis of patients with GC as the candidates of biomarkers. These novel genes may not be as important as these well-known prognostic genes. But It is helpful to construct tumor molecular network and provide more accurate individualized treatment plan. Remarkably, we identified four GC-associated functional core genes, COL12A1 in Module A, GSTA3 in Module E and FGA, FGG in Module F, and the level of these genes correlate with poor prognosis. COL12A1 was reported to be related to several types of cancers, such as subungual exostosis, ovarian, breast and colon cancer, which suggest that COL12A1 may be a new potential cancer biomarker (33–36). As a member of the FACIT (fibril associated collagens with interrupted triple helices) collagen family, COL12A1, together with COL6A3, COL8A1, COL1A2, COL5A2, COL10A1, COL11A1 and COL2A1, has the molecular function of extracellular matrix structural constituent. These FACIT members take part in the biological processes of collagen fibril organization and construct the cellular component of collagen trimer. Although COL12A is expressed in a variety of tumor tissues, its exact function remains poorly understood. Januchowski et al (34) found that COL12A1 played a role in the drug resistance of cancer cells and tumor progression. Based on our observed GO terms and pathways in module A (Fig. 5), COL12A1 appear to participate in pathways of protein digestion and absorption as well as focal adsorption, which may allow tumor cells to invade into the surrounding microenvironment. In addition, high expression of COL12A1 is associated with poor prognosis. Thus, COL12A1 may be a new potential marker and prognostic factor of GC and its pathways may be potential treatment targets. GSTA3 was identified to be a down-regulated gene in our study. Together with GSTA1 and CYP2C18, GSTA3 constructed the Module E (Fig. 5). They interact with each other and participate in chemical carcinogenesis, glutathione metabolism and drug metabolism-cytochrome P450. GSTA3 and GSTA1 encode a superfamily of glutathione S-transferases (GST), which are involved in biotransformation of toxic xenobiotics and endobiotics, as well as arachidonic acid metabolism via conjugation with reduced glutathione (GSH) (37). The expression level of GST are associated with the resistance of cells to toxic chemicals, such as carcinogens, antitumor drugs, environmental pollutants, and products of oxidative stress (38). Therefore, GSTA3 may be a potential marker in selecting chemotherapy drug (39). Further research on the association of these molecules is needed to help us better understand the exact role of GSTA3 in GC regulation. FGA and FGG encode the alpha and gamma components of fibrinogen, which are constituent parts of blood-borne glycoprotein (40). They were reported to differentially express in many types of cancer, such as prostate cancer, lung cancer, hepatocellular carcinoma, and pancreatic cancer (40–43). Plasma fibrinogen levels before surgery together with histological grade and lymph node involvement have been defined as independent prognostic factors in patients with colorectal cancer (44). Ghezzi et al (45), reported that plasma fibrinogen level might be a potential index to predict prognosis, improve the early diagnosis of endometrial cancer and optimize the treatment schedule. The possible mechanisms behind the association of plasma fibrinogen level with cancer include (1) the soluble form of fibrinogen may serve as the bridging molecule between tumor cells and host cells; (2) tumor cells and platelets may form large aggregates, which prevent tumor cells from being attacked by the innate immune system and (3) fibrinogen is essential to the sustained adherence of tumor cells to the endothelia of target tissues (46–48). In GC, fibrinogen plays a key role in hematogenous and lymphatic metastasis of cancer cells through spontaneous metastasis, facilitating the stable adhesion and/or survival of metastatic emboli after tumor cell intravasation (49). The plasma fibrinogen level on the prognosis was based on staging of tumor with worse prognosis in T2 GC as well as lymphatic and hematogenous metastasis, however, not in T3/T4 GC (50). However, they were directly reported to be associated with GC for the first time in this study. There still existed debate in expression level about these genes in tissues of cancer (51–53). In our study, FGA and FGG were identified to be down-regulated genes, while their high expression indicated poor prognosis for patients with GC. Besides tumor, many other factors may influence the alteration of expression level of FGG and FGA, such as acute-phase reactant and binding sites in tissues (40,54). Furthermore, levels of fibrinogen genes (FGA and FGG) changed following a circadian rhythms pattern and this additional level of fibrinogen transcriptional regulation has not yet been characterized (55). Considering pulmonary infarction as one of the most serious complications after GC surgery and the relationships between plasma fibrinogen level and GC, we should take it into consideration whether postoperative patients need a routine prophylactic anticoagulation. As evaluation indexes of blood coagulation state, FGG and FAG are potential biomarkers to improve diagnosis, optimize treatment and predict prognosis. We have a lot of limitations in the universality and applicability of these novel biomarkers for the observational nature of this article. Our study was conducted based on the gene profiles, GSE79973, which contained 10 pairs of GC tissues and adjacent non-tumor tissues. The association of the identified genes and GC, their value to predict prognosis and molecular interactions among these genes were not verified in clinical samples by experiments. For above reasons, the further experiments and studies are needed to measure the identified genes in clinical practice and investigate the biological mechanisms of the interactions among these identified genes. In conclusion, COL12A1, GSTA3, FGA and FGG have been identified to be associated with GC in this study and were selected as the core genes. Through GO and pathway enrichment of modules, we identified the functions and pathways of these core genes. Furthermore, based on survival analysis, the four genes were found to be significant poor prognostic factors. These core genes might be potential markers to improve diagnosis, optimize chemotherapy plan and predict prognosis. In addition, the pathways related to these genes may be potential therapeutic targets for GC. We plan to embark on validating the potential functions and pathways of these genes in our future studies.

55 in total

1. Spontaneous hematogenous and lymphatic metastasis, but not primary tumor growth or angiogenesis, is diminished in fibrinogen-deficient mice.

Authors: Joseph S Palumbo; Jill M Potter; Lisa S Kaplan; Kathryn Talmage; David G Jackson; Jay L Degen
Journal: Cancer Res Date: 2002-12-01 Impact factor: 12.701

2. Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors: Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal: Genome Res Date: 2003-11 Impact factor: 9.043

3. CEA, CA 19-9, and CA-50 in monitoring gastric carcinoma.

Authors: D Pectasides; A Mylonakis; M Kostopoulou; M Papadopoulou; D Triantafillis; J Varthalitis; M Dimitriades; A Athanassiou
Journal: Am J Clin Oncol Date: 1997-08 Impact factor: 2.339

4. Global surveillance of cancer survival 1995-2009: analysis of individual data for 25,676,887 patients from 279 population-based registries in 67 countries (CONCORD-2).

Authors: Claudia Allemani; Hannah K Weir; Helena Carreira; Rhea Harewood; Devon Spika; Xiao-Si Wang; Finian Bannon; Jane V Ahn; Christopher J Johnson; Audrey Bonaventure; Rafael Marcos-Gragera; Charles Stiller; Gulnar Azevedo e Silva; Wan-Qing Chen; Olufemi J Ogunbiyi; Bernard Rachet; Matthew J Soeberg; Hui You; Tomohiro Matsuda; Magdalena Bielska-Lasota; Hans Storm; Thomas C Tucker; Michel P Coleman
Journal: Lancet Date: 2014-11-26 Impact factor: 79.321

5. Novel biomarker candidates for gastric cancer.

Authors: Sanghwa Yang; Hyun Cheol Chung
Journal: Oncol Rep Date: 2008-03 Impact factor: 3.906

Review 6. Perspectives on new biomarkers in gastric cancer: diagnostic and prognostic applications.

Authors: Danilo do Rosário Pinheiro; Wallax Augusto Silva Ferreira; Mariceli Baia Leão Barros; Mariana Diniz Araújo; Symara Rodrigues-Antunes; Bárbara do Nascimento Borges
Journal: World J Gastroenterol Date: 2014-09-07 Impact factor: 5.742

Review 7. Genetics/genomics/proteomics of gastric adenocarcinoma.

Authors: Dushant S Uppal; Steven M Powell
Journal: Gastroenterol Clin North Am Date: 2013-03-05 Impact factor: 3.806

8. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012.

Authors: Jacques Ferlay; Isabelle Soerjomataram; Rajesh Dikshit; Sultan Eser; Colin Mathers; Marise Rebelo; Donald Maxwell Parkin; David Forman; Freddie Bray
Journal: Int J Cancer Date: 2014-10-09 Impact factor: 7.396

9. Identification of key genes associated with gastric cancer based on DNA microarray data.

Authors: Hui Sun
Journal: Oncol Lett Date: 2015-11-17 Impact factor: 2.967

10. Integrative approach for differentially overexpressed genes in gastric cancer by combining large-scale gene expression profiling and network analysis.

Authors: A Takeno; I Takemasa; Y Doki; M Yamasaki; H Miyata; S Takiguchi; Y Fujiwara; K Matsubara; M Monden
Journal: Br J Cancer Date: 2008-09-30 Impact factor: 7.640

18 in total

1. A tumour-resident Lgr5⁺ stem-cell-like pool drives the establishment and progression of advanced gastric cancers.

Authors: A Fatehullah; Y Terakado; S Sagiraju; T L Tan; T Sheng; S H Tan; K Murakami; Y Swathi; N Ang; R Rajarethinam; T Ming; P Tan; B Lee; N Barker
Journal: Nat Cell Biol Date: 2021-12-02 Impact factor: 28.824

2. FN1 encoding fibronectin as a pivotal signaling gene for therapeutic intervention against pancreatic cancer.

Authors: Gayathri Ashok; Sravan Kumar Miryala; Megha Treesa Saju; Anand Anbarasu; Sudha Ramaiah
Journal: Mol Genet Genomics Date: 2022-08-18 Impact factor: 2.980

3. A model of twenty-three metabolic-related genes predicting overall survival for lung adenocarcinoma.

Authors: Zhenyu Zhao; Boxue He; Qidong Cai; Pengfei Zhang; Xiong Peng; Yuqian Zhang; Hui Xie; Xiang Wang
Journal: PeerJ Date: 2020-09-24 Impact factor: 2.984

4. Integrated bioinformatics analysis of expression and gene regulation network of COL12A1 in colorectal cancer.

Authors: Yibin Wu; Ye Xu
Journal: Cancer Med Date: 2020-05-01 Impact factor: 4.452

5. Proteomics profiling of plasma exosomes in epithelial ovarian cancer: A potential role in the coagulation cascade, diagnosis and prognosis.

Authors: Wei Zhang; Xiaoxuan Ou; Xiaohua Wu
Journal: Int J Oncol Date: 2019-03-07 Impact factor: 5.650

6. FGG promotes migration and invasion in hepatocellular carcinoma cells through activating epithelial to mesenchymal transition.

Authors: Xiang Zhang; Fei Wang; Yanbing Huang; Kun Ke; Bixing Zhao; Lihong Chen; Naishun Liao; Lei Wang; Qin Li; Xiaolong Liu; Yingchao Wang; Jingfeng Liu
Journal: Cancer Manag Res Date: 2019-02-19 Impact factor: 3.989

7. Exploring the Molecular Mechanism of the Drug-Treated Breast Cancer Based on Gene Expression Microarray.

Authors: Ali Mohamed Alshabi; Ibrahim Ahmed Shaikh; Chanabasayya Vastrad
Journal: Biomolecules Date: 2019-07-15

8. Gene Expression Analysis Platform (GEAP): A highly customizable, fast, versatile and ready-to-use microarray analysis platform.

Authors: Itamar José Guimarães Nunes; Mariana Recamonde-Mendoza; Bruno César Feltes
Journal: Genet Mol Biol Date: 2021-12-17 Impact factor: 1.771

9. Biomarkers of tumor invasiveness in proteomics (Review).

Authors: Daniel L Pouliquen; Alice Boissard; Olivier Coqueret; Catherine Guette
Journal: Int J Oncol Date: 2020-05-28 Impact factor: 5.650

10. Identification of Key Pathways and Genes in Obesity Using Bioinformatics Analysis and Molecular Docking Studies.

Authors: Harish Joshi; Basavaraj Vastrad; Nidhi Joshi; Chanabasayya Vastrad; Anandkumar Tengli; Iranna Kotturshetti
Journal: Front Endocrinol (Lausanne) Date: 2021-06-24 Impact factor: 5.555