Literature DB >> 35117805

Nomograms combined with SERPINE1-related module genes predict overall and recurrence-free survival after curative resection of gastric cancer: a study based on TCGA and GEO data.

Xing-Chuan Li1,2, Song Wang3, Jia-Rui Zhu4, Yu-Ping Wang1,2, Yong-Ning Zhou1,2.   

Abstract

BACKGROUND: Serpin peptidase inhibitor, clade E, member 1 (SERPINE1) has been investigated as an oncogene and potential biomarker in several cancers, including gastric cancer (GC). This study aimed to investigate SERPINE1 expression and its diagnostic and prognostic value by analyzing data from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases.
METHODS: A meta-analysis was performed to investigate SERPINE1 expression levels in GC tissues and adjacent normal tissues. Gene set enrichment, multi experiment matrix (MEM), and protein-protein interaction (PPI) network analyses were performed to identify the most enriched signaling pathways and SERPINE1-related module genes. A Cox regression model was used to develop a nomogram that was able to predict the overall survival (OS) and recurrence-free survival (RFS) of individual patients.
RESULTS: Meta-analyses revealed an elevated trend in SERPINE1 expression levels in TCGA [standard mean difference (SMD) =0.95; 95% confidence interval (CI), 0.53-1.36; P<0.001]. The diagnostic meta-analysis results indicated that the area under the curve (AUC) of the summary receiver operating characteristic (SROC) was 0.80 (95% CI, 0.77-0.84). The factors identified to predict OS were age ≥60 years [hazard ratio (HR), 2.14; 95% CI, 1.45-3.16; P<0.01], R2 margins (HR, 2.70; 95% CI, 1.41-5.14; P<0.05), lymph node-positive proportion (HR, 3.38; 95% CI, 2.03-5.63; P<0.001), patient tumor status (HR, 3.33; 95% CI, 2.28-4.87; P<0.001), and OS risk score (HR, 2.72; 95% CI, 1.82-4.05; P<0.05). The following variables were associated with RFS: male sex (HR, 2.55; 95% CI, 1.46-4.45; P<0.01), R2 margins (HR, 13.08; 95% CI, 4.26-40.15; P<0.001), lymph node-positive proportion (HR, 2.55; 95% CI, 1.20-5.45; P<0.05), and RFS risk score (HR, 2.70; 95% CI, 1.82-4.06; P<0.001). The discriminative ability of the final model for OS and RFS was assessed using C statistics (0.755 for OS and 0.745 for RFS).
CONCLUSIONS: SERPINE1 was upregulated in GC, showed a high diagnostic value, and was associated with poorer OS and RFS. The OS and RFS risk for an individual patient could be estimated using these nomograms, which could lead to individualized therapeutic choices. 2020 Translational Cancer Research. All rights reserved.

Entities:  

Keywords:  Computational biology; meta-analysis; nomograms; plasminogen activator inhibitor-1 (PAI-1); stomach neoplasms

Year:  2020        PMID: 35117805      PMCID: PMC8798744          DOI: 10.21037/tcr-20-818

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


Introduction

Gastric cancer (GC) is the fourth most common malignancy and ranks as the second leading cause of cancer death worldwide (1). The highest GC incidence and mortality rates occur in East Asia, especially in China. Like other cancers, prognosis is mainly dependent upon tumor stage. Unfortunately, most GC patients are diagnosed at an advanced stage and the 5-year survival rate is significantly lower than that of patients diagnosed at an early stage (2). Although various biomarkers including carcinoembryonic antigen (CEA), alpha-fetoprotein (AFP), cancer antigen 125 (CA125), and carbohydrate antigen 199 (CA199) have been used in clinical practice, their reliability in the identification of early stage GC remains unsatisfactory (3). Therefore, the identification of reliable biomarkers related to tumor diagnosis, treatment, and prognostic evaluation is urgently needed. Serpin peptidase inhibitor, clade E, member 1 (SERPINE1), also known as endothelial plasminogen activator inhibitor (PAI), serpin E1, PLANH1, and PAI-1, encodes PAI-1, which is a primary member of the serpin superfamily and functions as a principal inhibitor of tissue plasminogen activator (tPA) and urokinase plasminogen activator (uPA). Although previous studies have mainly focused on the role of the SERPINE1 gene expression product PAI-1 in thrombosis, vascular diseases, obesity, and metabolic syndrome, accumulating evidence has highlighted the role of SERPINE1 in cancer progression (4). SERPINE1 has been identified as a key gene associated with prognosis by integrated bioinformatics analysis (5). SERPINE1 is generally accepted to not only play a key role in oncogenesis but also to serve as a new prognostic factor in certain cancers including breast cancer and head and neck squamous cell carcinoma (6,7). However, the molecular mechanism of SERPINE1 in GC, especially the vital signaling pathways involved in GC development, remains unclear. Furthermore, although surgical resection is a GC treatment, patients have a high risk of local relapse or distant metastasis after gastrectomy (8). Therefore, accurate data on the prognosis of postoperative GC patients are critical for treating physicians when making decisions regarding adjuvant treatment and follow-up frequency. Although the American Joint Committee on Cancer (AJCC) tumor-node-metastases (TNM) system, which has been widely used in clinical practice, may be helpful for the general prediction of GC survival, its use as a risk stratification system may not be suitable for predicting the survival and recurrence of an individual patient. The development of a reliable predictive model that incorporates factors associated with survival and recurrence based on postoperative clinicopathologic data combined with biological markers is urgently needed. A nomogram that can be widely and easily used could not only provide individualized, evidence-based, and highly accurate risk estimations, but could also aid in management-related decision making. Currently, microarray technology combined with bioinformatics analysis has provided an opportunity to comprehensively analyze the changes in gene transcription and posttranscriptional regulation during GC development and progression. Therefore, a meta-analysis was performed to evaluate SERPINE1 expression in GC and normal gastric tissues based on the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases. Furthermore, SERPINE1-related biological pathways involved in GC were detected using gene set enrichment analysis (GSEA) and multi experiment matrix (MEM) analysis. A nomogram combined with SERPINE1-related module genes was established to effectively predict the overall survival (OS) and recurrence-free survival (RFS) of patients after GC resection.

Methods

SERPINE1 expression profile mining

The gene expression data of gastric adenocarcinoma and corresponding clinical information were downloaded from the official TCGA website (http://cancergenome.nih.gov) in August 2019. These data included the SERPINE1 expression levels from 343 GC tissues and 30 tumor-adjacent normal control tissues. SERPINE1 values were carefully checked for each sample and values below single counts were treated as missing values. Gene expression level was normalized using the EdgeR package in R (version 3.6.1) and log2-transformed for further analysis. The clinical parameters of GC patients that were relevant to SERPINE1 were extracted and included age at the initial pathologic diagnosis, sex, anatomic location (cardia, fundus, antrum, or gastroesophageal junction), histologic grade [defined as poorly (G1), moderately (G2), or well-differentiated (G3)], resection margin status [negative (R0), microscopically positive (R1), or positive to the naked eye (R2)], lymph node-positive rate (defined as the number of lymph nodes that were positive by hematoxylin and eosin (HE) staining/the number of examined lymph nodes), patient tumor status (with tumor or tumor-free), and TNM stage. The relationship between SERPINE1 and the clinicopathological parameters in GC were determined based on TCGA database data. Then, the clinical diagnostic value of SERPINE1 was analyzed using a receiver operating characteristic (ROC) curve.

Meta-analysis

To strengthen the reliability of the results, all included datasets were combined to perform a meta-analysis using STATA 12.0 (STATA Corp., College Station, TX, USA). We screened GC microarray datasets from the GEO database (http://www.ncbi.nlm.nih.gov/gds/) up until August 2019 to perform a meta-analysis. The following keywords were used: gastric, GC, gastric carcinoma, stomach adenocarcinoma, SERPINE1, PAI, and PAI-1. Eligible microarrays were included if they met the following standards: (I) each dataset included GC tissues and peritumoral tissues and more than 10 samples were included in the study; (II) the expression profiling data of SERPINE1 from the GC case and their paired tumor-adjacent tissues controls were provided or could be calculated; and (III) the study subjects were human. Datasets with expression profiling data from animals or cell lines, or with no SERPINE1 expression profiling data were excluded. The expression data were log2-transformed. The SERPINE1 expression mean value, standard deviation (SD), and sample size of the tumor and control groups were calculated using SPSS version 24.0 (IBM Corp., Armonk, NY, USA). Continuous outcomes obtained from GEO datasets were estimated as the standard mean difference (SMD) with a 95% confidence interval (CI). Effect sizes were pooled using a random- or fixed-effects model. Heterogeneity across studies was assessed with I2; when I2<50%, a fixed-effects model was used and when I2≥50%, a random-effects model was selected. The number of true-positives (tps), true-negatives (tns), false-positives (fps), and false-negatives (fns) was extracted from the following basic formulae: or To calculate the incidence. A P value <0.05 was considered indicative of a statistically significant difference.

Gene set enrichment analysis

To identify the potential Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways underlying the influence of SERPINE1 expression on GC prognosis, GSEA was performed to detect the potential differentially expressed SERPINE1 KEGG pathways SERPINE1 between the high expression and low expression groups. The number of gene set permutations was 1,000 times for each analysis. SERPINE1 expression level SERPINE1was considered a phenotype label. Gene sets with a nominal P value <0.05 and a false discovery rate (FDR) <0.05 were considered significantly enriched.

Genes co-expressed with SERPINE1

Adler developed the MEM query engine (https://biit.cs.ut.ee/mem/) that detects co-expressed genes in large platform-specific microarray collections (9). MEM was used to identify genes that were co-expressed with SERPINE1 in large platform-specific microarray collections. First, SERPINE1 was input as a single query gene that acted as the template pattern for the co-expression search. Two probe sets were linked to the gene; the first probe set was chosen for further analysis. Current (24.02.12) was selected as the search database and H. sapiens was chosen as the organism filter. The other parameters were set as follows: distance measure, Pearson correlation distance; rank aggregation method, beta MEM method was used to obtain P values for selected ranks; set output limit, 3,000; gene filters, remove unknown genes and ambiguous genes; and dataset filter, 0.9 was set as the StDev threshold for query genes.

SERPINE1-related module screening from the protein-protein interaction (PPI) network and gene ontology (GO) annotation analysis

To investigate the central interactions between SERPINE1 and other genes enriched in overlapping KEGG pathways, a PPI network was constructed using the STRING online tool (https://string-db.org). The resulting network contained a subset of proteins that physically interacted with at least one other list member. Cytoscape was used to visualize this network, and the Molecular Complex Detection (MCODE) algorithm was then applied to this network to identify the SERPINE1-related module. GO enrichment analysis was conducted using R software to reveal the function of SERPINE1-related module genes. To examine the potential prognostic value of the module genes, the UALCAN online tool (http://ualcan.path.uab.edu/analysis.html) was then used to investigate the influence of SERPINE1-related module genes on the OS of GC patients. According to univariate survival analysis, module genes with P<0.05 were considered candidate prognostic module genes and were included in the multivariate Cox proportional hazards regression. To identify independent predictors that significantly contributed to OS or RFS, we used the lowest value of the Akaike information criterion (AIC) with respect to module gene selection and the established MRS (module gene risk score) values. The risk score of each patient was calculated to predict the OS and RFS of GC patients and the regression coefficients of the multivariate Cox regression model were used to weight the expression level of each module gene in the prognostic classifier: In order to investigate the relationship between risk scores and survival, patients were divided into high-risk and low-risk groups according to the optimum cut-off values obtained from X-tile plots version 3.6.1 (X-TILE, Yale University School of Medicine, New Haven, CT, USA).

Statistical analysis

The mean ± SD was calculated using SPSS to estimate the SERPINE1 expression level in each dataset. SERPINE1 expression was compared between normal gastric tissues and GC by Student’s t-test. A Student’s t-test was also used to evaluate the relationships between SERPINE1 expression and clinicopathological parameters. One-way analysis of variance (ANOVA) was used to compare mean values among subgroups. A ROC curve was generated to evaluate the diagnostic value of SERPINE1 expression using SPSS, and the area under the curve (AUC) was calculated to evaluate the diagnostic value. Patients were divided into two groups (high and low SERPINE1 expression) according to the threshold value identified from the ROC curve. Survival curves were plotted using the Kaplan-Meier method and compared using the log-rank test. A multivariate Cox proportional hazards regression model was used to identify the independent prognostic factors for OS. Univariate and multivariate Cox proportional hazards regression analyses were performed using R software (v.3.6.1). The Kaplan-Meier method was used to compare the survival between high- and low-SERPINE1 expression patients. The hazard ratio (HR) and 95% CI were calculated to identify protective factors (HR <1) or risk factors (HR >1). A correlation matrix was used to evaluate all variables for collinearity and interaction between terms; no significant collinearity or interactions were found. All variables significantly associated with OS were candidates for stepwise multivariate analysis. A nomogram was formulated based on multivariate Cox regression analysis results using the RMS package of R version 3.6.1 (http://www.r-project.org/). Nomogram predictive performance was measured by C statistics and calibration with 1,000 bootstrap samples to decrease the overfit bias (10). The net reclassification improvement (NRI) was calculated to estimate the overall improvement in the reclassification of patients between the two models using the nricens package in R (parameters: t0, 1,095 days; nIter, 1,000). Egger’s test was performed for all datasets to assess publication bias (11-16). In all analyses, P<0.05 was considered statistically significant. Data analysis was conducted from August 1 to October 24, 2019.

Results

SERPINE1 was overexpressed in GC tissues

As shown in , TCGA SERPINE1 expression data analysis revealed that SERPINE1 was significantly overexpressed in GC (11.99±1.52) compared with adjacent, nontumor tissue samples (9.47±1.65, P<0.001). SERPINE1 expression level SERPINE1 in stage T2/T3/T4 GC tissues was significantly higher than that in stage T1 tissues (P<0.001), and the expression level of SERPINE1 in deceased patients was significantly higher than that in surviving patients (P<0.001). These results suggested that SERPINE1 was overexpressed in GC and related to both T stage and survival.
Table 1

Expression of SERPINE1 in GC based on TCGA database

Clinicopathological featureNSERPINE1 expression (log2)T or F valueP value
Tissue type–8.6430.000*
   Normal309.47±1.65
   GC34311.99±1.52
Age0.1380.089
   ≤6011012.01±1.53
   >6023311.98±1.52
Sex0.7680.443
   Female12712.07±1.55
   Male21611.94±1.50
Histologic grade2.9740.052
   G1811.08±2.03
   G212811.82±1.50
   G320012.12±1.49
Anatomic location0.8750.454
   Antrum12311.85±1.50
   Cardia4512.26±1.70
   Fundus12212.04±1.35
Gastroesophageal junction3611.95±1.80
Resection margin1.7330.179
   R027411.90±1.51
   R11112.73±1.91
   R21412.19±1.42
T stage6.2670.000*
   T11910.57±1.99
   T27412.02±1.38
   T315712.00±1.54
   T48512.19±1.36
N stage0.8410.472
   N010211.83±1.55
   N19011.95±1.47
   N27212.17±1.62
   N36511.99±1.51
M stage–0.0890.929
   M031811.98±1.52
   M12311.96±1.57
TNM stage1.6810.171
   I5111.53±1.74
   II10512.04±1.51
   III13912.07±1.44
   IV3512.03±1.49
Survival status3.9330.000*
   Dead13412.37±1.61
   Alive18611.71±1.39
Recurrence1.5770.116
   Yes6012.24±1.48
   No20511.88±1.53

* indicate the clinical variables are related to SERPINE1 expression. SERPINE1 expression values are expressed as the mean ± SD. GC, gastric cancer; TCGA, The Cancer Genome Atlas; N, number; T, Student’s t-test; F, one-way ANOVA; ANOVA, analysis of variance; TNM, tumor-node-metastases; SD, standard deviation.

* indicate the clinical variables are related to SERPINE1 expression. SERPINE1 expression values are expressed as the mean ± SD. GC, gastric cancer; TCGA, The Cancer Genome Atlas; N, number; T, Student’s t-test; F, one-way ANOVA; ANOVA, analysis of variance; TNM, tumor-node-metastases; SD, standard deviation. In addition to evaluating the diagnostic value of SERPINE1, we generated a ROC curve using TCGA expression data from GC patients and healthy individuals (). The ROC AUC was 0.876, which was indicative of a high diagnostic value. Subgroup analysis showed the diagnostic value of SERPINE1 expression in different GC stages, with AUC values of 0.800, 0.878, 0.891, and 0.897 for stages I, II, III, and IV, respectively ().
Figure 1

Diagnosis value of SERPINE1 expression in GC. (A) ROC curve for SERPINE1 expression in normal gastric tissue and GC; (B,C,D,E) subgroup analysis for stage I, II, III, and IV GC. GC, gastric cancer; ROC, receiver operating characteristic; AUC, area under the curve.

Diagnosis value of SERPINE1 expression in GC. (A) ROC curve for SERPINE1 expression in normal gastric tissue and GC; (B,C,D,E) subgroup analysis for stage I, II, III, and IV GC. GC, gastric cancer; ROC, receiver operating characteristic; AUC, area under the curve. To strengthen the reliability of the results, a meta-analysis of GEO and TCGA database data was performed. The GEO dataset included in the following meta-analysis is summarized in . In total, 631 GC and 314 normal (tumor-adjacent tissues) samples were included. A significant difference was identified in SERPINE1 expression SERPINE1 between GC and normal tissues and the heterogeneity among the individual datasets was high (I2=80.5%, P<0.001; ); thus, a random-effects model was selected. The pooled SMD of the seven studies was 0.95 (95% CI, 0.53–1.36). This result further suggested that SERPINE1 was overexpressed in GC tissues. Publication bias assessment yielded a value of P=0.189. This result suggested that publication bias was absent in the current study.
Table 2

Characteristics of SERPINE1 gene expression profiling datasets obtained from GEO

AccessionPlatformCountrySubmission yearNumber of normal samplesSERPINE1 expression (log2) of normal samplesNumber of tumor samplesSERPINE1 expression (log2) of tumor samples
GSE2685GPL80Japan200585.59±0.75125.79±0.69
GSE19826GPL570China2010128.17±1.03128.89±0.92
GSE27342GPL5175USA2011806.75±1.96807.56±2.55
GSE29272GPL96USA20111347.10±0.611348.11±1.12
GSE56807GPL5175China201455.87±0.6957.69±1.33
GSE63089GPL5175China2014456.59±1.07457.71±1.19

SERPINE1 expression values are expressed as the mean ± SD. GEO, Gene Expression Omnibus; SD, standard deviation.

Figure 2

Meta-analysis of SERPINE1 as a GC biomarker based on GEO and TCGA datasets. (A) Forest plot of studies evaluating SMD of SERPINE1 expression between GC and control groups (random-effects model); (B) the SROC curve for the diagnostic accuracy assessment of SERPINE1 in GC; (C) pre- and post-test probability of the included studies; (D) publication bias of the included studies. 1/root (ESS) indicated the inverse root of ESS. Each circle represented an included study. GC, gastric cancer; GEO, Gene Expression Omnibus; TCGA, The Cancer Genome Atlas; SMD, standard mean difference; SROC, summary receiver operating characteristic; ESS, effective sample sizes; CI, confidence interval; SENS, sensitivity; SPEC, specificity; AUC, area under the curve.

SERPINE1 expression values are expressed as the mean ± SD. GEO, Gene Expression Omnibus; SD, standard deviation. Meta-analysis of SERPINE1 as a GC biomarker based on GEO and TCGA datasets. (A) Forest plot of studies evaluating SMD of SERPINE1 expression between GC and control groups (random-effects model); (B) the SROC curve for the diagnostic accuracy assessment of SERPINE1 in GC; (C) pre- and post-test probability of the included studies; (D) publication bias of the included studies. 1/root (ESS) indicated the inverse root of ESS. Each circle represented an included study. GC, gastric cancer; GEO, Gene Expression Omnibus; TCGA, The Cancer Genome Atlas; SMD, standard mean difference; SROC, summary receiver operating characteristic; ESS, effective sample sizes; CI, confidence interval; SENS, sensitivity; SPEC, specificity; AUC, area under the curve. SERPINE1 showed a surprising diagnostic value in TCGA dataset. To further identify the prognostic value of SERPINE1, a diagnostic meta-analysis was performed. As shown in , the AUC of the summary ROC (SROC) was 0.80 (0.77–0.84), which indicated that SERPINE1 had a moderate diagnostic value in GC. The pooled sensitivity and specificity of SERPINE1 was 0.69 (0.60–0.77) and 0.78 (0.70–0.84), respectively. In addition, the DLR-positive and DLR-negative values were 3.08 (2.22–4.27) and 0.40 (0.30–0.53), respectively. The diagnostic score and odds ratio were 2.04 (1.51–2.57) and 7.69 (4.52–13.09), respectively. The pretest probability was 20% when the positive and negative pretest probabilities were 44% and 9% (), respectively. Additionally, no significant publication bias was found (P=0.821, ).

Prognostic value of SERPINE1 in GC

We further assessed the relationship between SERPINE1 expression and GC patient survival. Our data suggested that GC patients with high SERPINE1 expression had poorer OS and RFS than those with low SERPINE1 expression ().
Figure 3

Kaplan-Meier curve for SERPINE1 expression in TCGA GC cohort. (A) GC patients with high SERPINE1 expression (n=163) had a poorer OS than those with low SERPINE1 expression (n=157); (B) GC patients with high SERPINE1 expression had a poorer RFS than those with low SERPINE1 expression. TCGA, The Cancer Genome Atlas; GC, gastric cancer; OS, overall survival; RFS, recurrence-free survival.

Kaplan-Meier curve for SERPINE1 expression in TCGA GC cohort. (A) GC patients with high SERPINE1 expression (n=163) had a poorer OS than those with low SERPINE1 expression (n=157); (B) GC patients with high SERPINE1 expression had a poorer RFS than those with low SERPINE1 expression. TCGA, The Cancer Genome Atlas; GC, gastric cancer; OS, overall survival; RFS, recurrence-free survival.

SERPINE1-related signaling pathways based on GSEA

To identify the signaling pathways engaged in GC, we performed a GSEA to compare the low- and high-SERPINE1 expression data sets. GSEA revealed significant differences (FDR <0.05, nominal P value <0.05) in the enrichment of the Molecular Signature Database (MSigDB) collection (c2.cp.kegg.v7.0 symbols). As shown in , we selected a total of 42 significantly enriched signaling pathways. The top four differentially enriched pathways in the SERPINE1-high expression phenotype group were the focal adhesion, extracellular matrix (ECM) receptor interaction, leukocyte transendothelial migration, and cytokine-cytokine receptor interaction signaling pathways, indicating the potential role of SERPINE1 in GC development ().
Table S1

GSEA KEGG pathway enrichment in the SERPINE1-high expression phenotype group

KEGG pathwaySizeNESNOM P valueFDR q value
Focal adhesion1992.500.0000.000
ECM receptor interaction832.430.0000.000
Leukocyte transendothelial migration1152.350.0000.000
Cytokine receptor interaction2442.190.0000.001
NOD like receptor signaling pathway622.120.0000.001
Regulation of actin cytoskeleton2102.100.0000.003
Pathways in cancer3252.100.0000.002
Bladder cancer422.090.0000.002
Axon guidance1292.090.0000.002
MAPK signaling pathway2662.070.0000.003
Prion diseases352.070.0000.002
Leishmania infection692.050.0000.003
Hematopoietic cell lineage832.040.0020.003
Chemokine signaling pathway1852.040.0000.003
Cell adhesion molecules cams1302.010.0020.004
Glycosaminoglycan biosynthesis chondroitin sulfate221.970.0000.006
Glycosaminoglycan biosynthesis heparan sulfate261.970.0020.006
TGF beta signaling pathway851.970.0000.006
Renal cell carcinoma701.970.0000.005
Complement and coagulation cascades681.960.0000.006
Jak stat signaling pathway1401.960.0000.006
Toll like receptor signaling pathway901.890.0060.012
Natural killer cell mediated cytotoxicity1191.890.0080.011
Dilated cardiomyopathy901.890.0080.012
Neurotrophin signaling pathway1261.850.0040.016
Melanoma711.840.0000.018
Hypertrophic cardiomyopathy (HCM)831.820.0080.020
Pancreatic cancer701.820.0060.020
Small cell lung cancer841.820.0080.020
Glycosaminoglycan biosynthesis keratan sulfate151.810.0040.021
Gap junction871.780.0020.027
Glycosaminoglycan degradation211.780.0080.027
Fc gamma r mediated phagocytosis951.770.0060.028
Epithelial cell signaling in helicobacter pylori infection681.750.0020.032
mTOR signaling pathway511.750.0140.033
Arrhythmogenic right ventricular cardiomyopathy741.740.0150.034
Glycosphingolipid biosynthesis ganglio series151.740.0100.034
Hedgehog signaling pathway561.720.0130.038
Graft versus host disease371.710.0300.042
Endocytosis1801.690.0040.047
Acute myeloid leukemia571.680.0100.050
Chronic myeloid leukemia731.670.0250.049

Gene sets with NOM P values <0.05 and FDR q values <0.25 were considered significantly enriched. GSEA, gene set enrichment analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes; NES, normalized enrichment score; NOM, nominal; FDR, false discovery rate.

Figure 4

Enrichment plots from GSEA. GSEA results showing the focal adhesion (A), ECM receptor interaction (B), leukocyte transendothelial migration (C), and cytokine-cytokine receptor interaction (D) signaling pathways that were differentially enriched in the SERPINE1 high SERPINE1 expression phenotype group. GSEA, gene set enrichment analysis; ECM, extracellular matrix.

Enrichment plots from GSEA. GSEA results showing the focal adhesion (A), ECM receptor interaction (B), leukocyte transendothelial migration (C), and cytokine-cytokine receptor interaction (D) signaling pathways that were differentially enriched in the SERPINE1 high SERPINE1 expression phenotype group. GSEA, gene set enrichment analysis; ECM, extracellular matrix.

Genes co-expressed with SERPINE1 and bioinformatics analysis

A total of 1,769 genes that were co-expressed with SERPINE1 were extracted from the MEM database. To investigate the pathways of SERPINE1 and its co-expressed genes, 1,769 co-expressed genes were selected and subjected to in silico analysis using the STRING online database. KEGG pathway enrichment analysis revealed a significant enrichment of SERPINE1 co-expressed genes in a total of 200 pathways (). To more accurately identify SERPINE1-involved KEGG pathways, the pathways extracted from the GSEA and SERPINE1 co-expressed genes in KEGG functional annotation were overlapped and 23 pathways were identified for further analysis (). A total of 1,401 genes were identified as GSEA gene set members involved in the 23 overlapping pathways.
Table S2

KEGG pathways enriched by genes MEM co-expressed with SERPINE1

KEGG pathwaysDescriptionCountGene set countFDR
hsa04010MAPK signaling pathway2742934.62E–170
hsa05200Pathways in cancer3255152.12E–167
hsa04060Cytokine-cytokine receptor interaction2362637.22E–143
hsa04810Regulatiin cytoskeleton1982055.42E–123
hsa04151PI3K-Akt signaling pathway2263481.23E–115
hsa04510Focal adhesion1871973.71E–115
hsa04144Endocytosis1812423.53E–99
hsa04062Chemokine signaling pathway1551811.75E–90
hsa04014Ras signaling pathway1672282.51E–90
hsa04015Rap1 signaling pathway1492032.01E–80
hsa04630Jak-STAT signaling pathway1331602.52E–76
hsa04514Cell adhesion molecules (CAMs)1271393.33E–76
hsa04722Neurotrophin signaling pathway1121168.87E–69
hsa04670Leukocyte transendothelial migration1091123.56E–67
hsa05165Human papillomavirus infection1573176.93E–67
hsa04650Natural killer cell mediated cytotoxicity1091243.65E–64
hsa05167Kaposi's sarcoma-associated herpesvirus infection1231833.72E–63
hsa05205Proteoglycans in cancer1251951.67E–62
hsa05166HTLV-I infection1342504.58E–60
hsa05145Toxoplasmosis971091.93E–57
hsa04921Oxytocin signaling pathway1041491.76E–54
hsa05414Dilated cardiomyopathy (DCM)87884.92E–54
hsa04666Fc gamma R-mediated phagocytosis85895.30E–52
hsa05410Hypertrophic cardiomyopathy (HCM)81811.40E–50
hsa05418Fluid shear stress and atherosclerosis951331.99E–50
hsa04660T cell receptor signaling pathway86992.05E–50
hsa04659Th17 cell differentiation861021.03E–49
hsa05222Small cell lung cancer83921.48E–49
hsa04380Osteoclast differentiation911244.61E–49
hsa05161Hepatitis B951421.04E–48
hsa04611Platelet activation901231.79E–48
hsa04350TGF-beta signaling pathway79832.18E–48
hsa05169Epstein-Barr virus infection1051942.12E–47
hsa05152Tuberculosis1001723.01E–47
hsa04218Cellular senescence961565.86E–47
hsa04933AGE-RAGE signaling pathway in diabetic complications81981.74E–46
hsa05220Chronic myeloid leukemia73765.82E–45
hsa05226Gastric cancer (GC)911471.07E–44
hsa04512ECM-receptor interaction74811.41E–44
hsa04668TNF signaling pathway811082.66E–44
hsa04072Phospholipase D signaling pathway891451.58E–43
hsa05164Influenza A941681.97E–43
hsa05212Pancreatic cancer70746.80E–43
hsa04610Complement and coagulation cascades71789.24E–43
hsa05206MicroRNAs in cancer881494.35E–42
hsa05218Melanoma68721.12E–41
hsa04926Relaxin signaling pathway831301.31E–41
hsa04261Adrenergic signaling in cardiomyocytes851391.50E–41
hsa05160Hepatitis C831311.92E–41
hsa01522Endocrine resistance73951.52E–40
hsa05140Leishmaniasis66701.78E–40
hsa01521EGFR tyrosine kinase inhibitor resistance68783.17E–40
hsa05211Renal cell carcinoma65683.95E–40
hsa05142Chagas disease (American trypanosomiasis)741014.00E–40
hsa04012ErbB signaling pathway69836.43E–40
hsa04912GnRH signaling pathway70881.25E–39
hsa05215Prostate cancer72972.43E–39
hsa05203Viral carcinogenesis911835.06E–39
hsa04024cAMP signaling pathway9351959.80E–39
hsa04068FoxO signaling pathway791301.39E–38
hsa05214Glioma63681.98E–38
hsa05162Measles791334.51E–38
hsa04530Tight junction861678.16E–38
hsa05223Non-small cell lung cancer61663.30E–37
hsa04658Th1 and Th2 cell differentiation67883.50E–37
hsa04640Hematopoietic cell lineage68949.61E–37
hsa05412Arrhythmogenic right ventricular cardiomyopathy (ARVC)62721.30E–36
hsa05224Breast cancer7971478.59E–36
hsa05210Colorectal cancer64852.29E–35
hsa04664Fc epsilon RI signaling pathway59672.94E–35
hsa05146Amoebiasis66943.80E–35
hsa05133Pertussis60741.80E–34
hsa04370VEGF signaling pathway55598.52E–34
hsa05168Herpes simplex infection831819.79E–34
hsa04620Toll-like receptor signaling pathway661021.30E–33
hsa05132Salmonella infection61843.77E–33
hsa05231Choline metabolism in cancer64988.48E–33
hsa04210Apoptosis721351.45E–32
hsa04657IL-17 signaling pathway62922.28E–32
hsa04621NOD-like receptor signaling pathway781662.50E–32
hsa04064NF-kappa B signaling pathway61932.16E–31
hsa04910Insulin signaling pathway701342.85E–31
hsa04662B cell receptor signaling pathway55715.20E–31
hsa04750Inflammatory mediator regulation of TRP channels60928.32E–31
hsa05100Bacterial invasion of epithelial cells55728.39E–31
hsa04270Vascular smooth muscle contraction661191.05E–30
hsa04917Prolactin signaling pathway54691.23E–30
hsa05321Inflammatory bowel disease (IBD)52621.50E–30
hsa04066HIF-1 signaling pathway61981.66E–30
hsa05416Viral myocarditis50562.75E–30
hsa04371Apelin signaling pathway681335.17E–30
hsa05131Shigellosis51631.72E–29
hsa04071Sphingolipid signaling pathway631165.45E–29
hsa05225Hepatocellular carcinoma721631.20E–28
hsa04550Signaling pathways regulating pluripotency of stem cells671381.40E–28
hsa05213Endometrial cancer48584.00E–28
hsa04360Axon guidance731734.60E–28
hsa04022cGMP-PKG signaling pathway701601.08E–27
hsa04217Necroptosis691551.15E–27
hsa04915Estrogen signaling pathway641333.43E–27
hsa05323Rheumatoid arthritis53847.06E–27
hsa04520Adherens junction49713.42E–26
hsa04390Hippo signaling pathway661524.97E–26
hsa04213Longevity regulating pathway—multiple species46618.08E–26
hsa05150Staphylococcus aureus infection43511.49E–25
hsa04725Cholinergic synapse561111.05E–24
hsa05219Bladder cancer39411.42E–24
hsa04720Long-term potentiation45642.13E–24
hsa05221Acute myeloid leukemia45665.28E–24
hsa04672Intestinal immune network for IgA production39447.89E–24
hsa05332Graft-versus-host disease36362.90E–23
hsa04020Calcium signaling pathway6691796.70E–23
hsa04919Thyroid hormone signaling pathway541159.56E–23
hsa04934Cushing’s syndrome601535.42E–22
hsa04114Oocyte meiosis531166.34E–22
hsa05330Allograft rejection34358.83E–22
hsa04914Progesterone-mediated oocyte maturation48941.58E–21
hsa04211Longevity regulating pathway46885.39E–21
hsa04931Insulin resistance491072.15E–20
hsa04145Phagosome541454.24E–19
hsa04110Cell cycle501234.78E–19
hsa04540Gap junction43875.30E–19
hsa04940Type I diabetes mellitus32407.16E–19
hsa05120Epithelial cell signaling in Helicobacter pylori infection38661.26E–18
hsa05320Autoimmune thyroid disease34491.28E–18
hsa05144Malaria33473.22E–18
hsa04140Autophagy—animal491253.46E–18
hsa04920Adipocytokine signaling pathway38693.81E–18
hsa05202Transcriptional misregulation in cancer561696.67E–18
hsa04612Antigen processing and presentation37666.76E–18
hsa04932Non-alcoholic fatty liver disease (NAFLD)521491.74E–17
hsa04728Dopaminergic synapse481283.11E–17
hsa05134Legionellosis33546.37E–17
hsa05322Systemic lupus erythematosus41941.05E–16
hsa05230Central carbon metabolism in cancer35651.36E–16
hsa04923Regulatiolysis in adipocytes32532.43E–16
hsa05020Prion diseases27332.61E–16
hsa04730Long-term depression33606.32E–16
hsa04310Wnt signaling pathway481431.03E–15
hsa04916Melanogenesis40981.45E–15
hsa05014Amyotrophic lateral sclerosis (ALS)29501.35E–14
hsa04930Type II diabetes mellitus28461.58E–14
hsa01524Platinum drug resistance33701.88E–14
hsa04260Cardiac muscle contraction34762.47E–14
hsa04713Circadian entrainment37933.18E–14
hsa04971Gastric acid secretion33723.46E–14
hsa04150mTOR signaling pathway461484.10E–14
hsa04724Glutamatergic synapse401124.95E–14
hsa05031Amphetamine addiction31659.03E–14
hsa04925Aldosterone synthesis and secretion36931.34E–13
hsa05216Thyroid cancer24374.37E–13
hsa05130Pathogenic Escherichia coli infection27531.11E–12
hsa04726Serotonergic synapse371122.99E–12
hsa04972Pancreatic secretion34953.81E–12
hsa04115p53 signaling pathway29685.02E–12
hsa04622RIG-I-like receptor signaling pathway29708.79E–12
hsa05310Asthma20281.14E–11
hsa04961Endocrine and other factor-regulated calcium reabsorption24471.95E–11
hsa04913Ovarian steroidogenesis24493.79E–11
hsa04924Renin secretion26631.13E–10
hsa00592Alpha-linolenic acid metabolism18251.14E–10
hsa04922Glucagon signaling pathway321001.72E–10
hsa04152AMPK signaling pathway351201.94E–10
hsa04911Insulin secretion29842.90E–10
hsa00565Ether lipid metabolism22463.50E–10
hsa04927Cortisol synthesis and secretion25634.88E–10
hsa00591Linoleic acid metabolism18296.37E–10
hsa05034Alcoholism371428.22E–10
hsa05032Morphine addiction29911.31E–09
hsa04714Thermogenesis472283.40E–09
hsa04215Apoptosis—multiple species17317.73E–09
hsa04723Retrograde endocannabinoid signaling351481.95E–08
hsa05143African trypanosomiasis17342.20E–08
hsa04727GABAergic synapse26883.38E–08
hsa04970Salivary secretion25868.09E–08
hsa04960Aldosterone-regulated sodium reabsorption16372.73E–07
hsa04137Mitophagy—animal20635.19E–07
hsa05340Primary immunodeficiency15371.24E–06
hsa00590Arachidonic acid metabolism19611.27E–06
hsa04070Phosphatidylinositol signaling system24971.71E–06
hsa04918Thyroid hormone synthesis20733.47E–06
hsa04120Ubiquitin mediated proteolysis281344.13E–06
hsa04975Fat digestion and absorption14398.69E–06
hsa05010Alzheimer’s disease311681.15E–05
hsa00564Glycerophospholipid metabolism22961.29E–05
hsa04340Hedgehog signaling pathway14463.97E–05
hsa05030Cocaine addiction14497.05E–05
hsa04976Bile secretion17717.89E–05
hsa04962Vasopressin-regulated water reabsorption13449.59E–05
hsa04974Protein digestion and absorption19901.30E–04
hsa04710Circadian rhythm10302.80E–04
hsa04721Synaptic vesicle cycle14614.90E–04
hsa04973Carbohydrate digestion and absorption11427.80E–04
hsa00562Inositol phosphate metabolism15738.30E–04
hsa05110Vibrio cholerae infection11480.0020
hsa01523Antifolate resistance8310.0045
hsa05217Basal cell carcinoma12630.0047
hsa04141Protein processing in endoplasmic reticulum221610.0065
hsa04744Phototransduction6260.0211
hsa05016Huntington’s disease221930.0362

KEGG, Kyoto Encyclopedia of Genes and Genomes; MEM, multi experiment matrix; FDR, false discovery rate.

Table 3

GSEA and MEM overlapped KEGG pathway

KEGG pathwaysDescriptionCountGene set countFDR
hsa04510Focal adhesion691972.46E–16
hsa04810Regulatiin cytoskeleton542054.40E–09
hsa04512ECM-receptor interaction30818.47E–08
hsa04010MAPK signaling pathway602936.53E–07
hsa04144Endocytosis522421.25E–06
hsa04621NOD-like receptor signaling pathway371663.09E–05
hsa05222Small cell lung cancer25926.03E–05
hsa05212Pancreatic cancer22746.39E–05
hsa05220Chronic myeloid leukemia21762.10E–04
hsa04140Autophagy - animal271250.0006
hsa04060Cytokine-cytokine receptor interaction442639.10E–04
hsa05410Hypertrophic cardiomyopathy (HCM)19810.0023
hsa05211Renal cell carcinoma17680.0024
hsa05219Bladder cancer12410.0046
hsa04630Jak-STAT signaling pathway281600.0057
hsa04350TGF-beta signaling pathway18830.0057
hsa04610Complement and coagulation cascades17780.0069
hsa04722Neurotrophin signaling pathway221160.0070
hsa04666Fc gamma R-mediated phagocytosis18890.0095
hsa04670Leukocyte transendothelial migration211120.0095
hsa05414Dilated cardiomyopathy (DCM)17880.0153
hsa04514Cell adhesion molecules (CAMs)231390.0191
hsa04650Natural killer cell mediated cytotoxicity201240.0351

GSEA, gene set enrichment analysis; MEM, multi experiment matrix; KEGG, Kyoto Encyclopedia of Genes and Genomes; FDR, false discovery rate.

GSEA, gene set enrichment analysis; MEM, multi experiment matrix; KEGG, Kyoto Encyclopedia of Genes and Genomes; FDR, false discovery rate. Utilizing the MCODE algorithm, 60 genes involved in the SERPINE1-related module were identified (). According to GO enrichment analysis, these 60 genes were mainly enriched in ‘platelet degranulation’, ‘ECM organization’, and ‘extracellular structure organization’ in the biological process (BP) category; ‘platelet alpha granule lumen’, ‘platelet alpha granule lumen’, and ‘secretory granule lumen’ in the cellular component (CC) category; and ‘ECM structural constituent’, ‘cell adhesion molecule binding’, and ‘integrin binding’ in the molecular function (MF) category. The PI3K-Akt, Ras, and MAPK signaling pathways were the most enriched KEGG terms. GO functional annotations of the KEGG pathway enrichment results are shown in and the top 10 significantly enriched terms for SERPINE1-related module genes are provided for each category.
Figure 5

The PPI network of the SERPINE1-related module genes. The PPI network was constructed online via STRING and those genes were chosen for further analysis. Network nodes represent proteins and edges represent protein-protein associations. PPI, protein-protein interaction.

Figure 6

Function analysis of SERPINE1-related module genes. (A) The top 10 significantly enriched GO categories of SERPINE1-related module genes; (B) the top 10 significantly enriched KEGG signaling pathways of SERPINE1-related module genes. GO, gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.

The PPI network of the SERPINE1-related module genes. The PPI network was constructed online via STRING and those genes were chosen for further analysis. Network nodes represent proteins and edges represent protein-protein associations. PPI, protein-protein interaction. Function analysis of SERPINE1-related module genes. (A) The top 10 significantly enriched GO categories of SERPINE1-related module genes; (B) the top 10 significantly enriched KEGG signaling pathways of SERPINE1-related module genes. GO, gene ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Identification of the prognostic module genes and construction of the SERPINE1-related module genes prognostic risk model

Investigation of the influence of module genes on the OS of GC patients using the UALCAN online tool showed that 15 SERPINE1-related module genes (LAMA4, PROS1, LEFTY2, A2M, THBS1, FN1, SERPING1, PAK3, LAMA2, TGFB1, VWF, F8, F5, ARHGEF6, and ACTN2) affected the OS of GC patients. Kaplan-Meier analysis showed that eight SERPINE1-related module genes (F13A1, PROS1, LEFTY2, SERPING1, PAK3, TGFB1, VEGFB, and VEGFC) were associated with GC RFS. These genes were subsequently entered into a multivariate Cox regression analysis. To identify the best predictors that significantly contributed to patient OS and RFS, we used the lowest AIC value for variable selection to build prognostic classifiers that consisted of five genes (LAMA4, PAK3, TGFB1, ARHGEF6, and SERPING1) for OS and two genes (VEGFB and LEFTY2) for RFS. We developed risk score formulas to predict patient survival: We then calculated the risk scores for all GC patients using these two formulas. Additionally, by using Pearson’s correlation analysis in the GEPIA online database, SERPINE1 expression was found to be correlated with the expression of SERPINE1-related module genes included in the Cox regression model with the following findings: TGFB1 (r=0.37; P<0.0001), LAMA4 (r=0.22; P<0.0001), PAK3 (r=0.13; P<0.01), ARHGEF6 (r=0.29; P<0.05), SERPING1 (r=0.28; P<0.0001), VEGFB (r=0.14; P<0.0001), and LEFTY2 (r=0.2; P<0.0001) ().
Figure S1

Correlation analysis between SERPINE1 and SERPINE1-related module genes included in the Cox regression model using Pearson’s correlation based on TCGA database. (A) LAMA4, (B) ARHGEF6, (C) TGFB1, (D) PAK3, (E) SERPING1, (F) LEFTY2, and (G) VEGFB. TCGA, The Cancer Genome Atlas.

X-tile plots were used to obtain the optimum cutoff values for OS (3.5) and RFS (7.5) risk scores. Patients with a higher risk score generally had poorer survival than those with a lower risk score. Kaplan-Meier survival analysis demonstrated that patients with high-risk scores had a shorter OS and RFS than those with low-risk scores ().
Figure 7

Kaplan-Meier curves demonstrating patient survival after resection for GC according to risk score based on SERPINE1-related module genes prognostic classifiers. (A) GC patients with high risk score had a poorer OS than those with low risk score; (B) GC patients with high risk score had a poorer RFS than those with low risk score. GC, gastric cancer; OS, overall survival; RFS, recurrence-free survival.

Kaplan-Meier curves demonstrating patient survival after resection for GC according to risk score based on SERPINE1-related module genes prognostic classifiers. (A) GC patients with high risk score had a poorer OS than those with low risk score; (B) GC patients with high risk score had a poorer RFS than those with low risk score. GC, gastric cancer; OS, overall survival; RFS, recurrence-free survival.

Using a univariate and multivariate Cox proportional hazards regression model to identify OS and RFS predictors

All variables listed in were used for univariate and multivariate Cox proportional hazards regression analysis. A Cox proportional hazards regression model with backward stepwise selection using the AIC from the Cox proportional hazards regression model showed the following five OS-associated variables: age, resection margins, lymph node-positive proportion, patient tumor status, and risk score (). In multivariable analysis, age ≥60 years (HR, 2.14; 95% CI, 1.45–3.16; P<0.01), R2 margins (HR, 2.70; 95% CI, 1.41–5.14; P<0.05), lymph node-positive proportion (HR, 3.38; 95% CI, 2.03–5.63; P<0.001), patient tumor status (HR, 3.33; 95% CI, 2.28–4.87; P<0.001), and OS risk score (HR, 2.72; 95% CI, 1.82–4.05; P<0.05) were independently associated with OS. Male sex (HR, 2.55; 95% CI, 1.46–4.45; P<0.01), R2 margins (HR, 13.08; 95% CI, 4.26–40.15; P<0.001), lymph node-positive proportion (HR, 2.55; 95% CI, 1.20–5.45; P<0.05), and RFS risk score (HR, 2.70; 95% CI, 1.82–4.06; P<0.001) were independently associated with RFS ().
Table 4

Cox proportional hazards regression model showing the association of variables with OS

VariablesUnivariate analysisMultivariate analysis
HR (95% CI)P valueHR (95% CI)P value
Factors selected
   Age, y
      <601 (Reference)NA1 (Reference)NA
      ≥601.61 (1.21–2.23)0.0183*2.14 (1.45–3.16)0.0013*
   Resection margin
      R01 (Reference)NA1 (Reference)NA
      R12.25 (1.17–4.31)0.0407*1.20 (0.59–2.44)0.6734
      R27.39 (4.31–12.69)<0.0001*2.70 (1.41–5.14)0.0115*
   Lymph node positive proportion4.31 (2.77–6.71)<0.0001*3.38 (2.03–5.63)<0.0001*
   Patient tumor status
      Tumor free1 (Reference)NA1 (Reference)NA
      With tumor4.92 (3.47–6.98)<0.0001*3.33 (2.28–4.87)<0.0001*
      Risk score1.74 (1.32–2.30)<0.0010*2.72 (1.82–4.05)<0.0001*
Factors not selected
   Sex
      Female1 (Reference)NANANA
      Male1.26 (0.93–1.71)0.0207*NANA
   Histologic grade
      G11 (Reference)NANANA
      G21.22 (0.37–4.01)0.781NANA
      G31.54 (0.47–4.99)0.549NANA
   Tumor anatomic site
      Antrum1 (Reference)NANANA
      Cardia1.04 (0.68–1.58)0.8790NANA
      Fundus0.81 (0.58–1.14)0.316NANA
   Gastroesophageal junction0.73 (0.42–1.26)0.346NANA
   TNM stage
      I/II1 (Reference)NA
      III/IV2.01 (1.48–2.74)<0.0002*
   T stage
      T1/T21 (Reference)NA
      T3/T41.64 (1.15–2.35)0.0224*
   N stage
      N0/N11 (Reference)NA
      N2/N31.56 (1.17–2.09)0.0109*
   M stage
      M01 (Reference)NA
      M12.12 (1.31–3.44)0.0103*
   SERPINE1 expression1.26 (1.14–1.38)0.0001*

* indicate P<0.05. OS, overall survival; HR, hazard ratio; CI, confidence interval; NA, not applicable; TNM, tumor-node-metastases.

Table 5

Cox proportional hazards regression model showing the association of variables with RFS

VariablesUnivariate analysisMultivariate analysis
HR (95% CI)P valueHR (95% CI)P value
Factors selected
   Sex
      Female1 (Reference)NANANA
      Male1.98 (1.21–3.24)0.0220*2.55 (1.46–4.45)0.0060*
   Resection margin
      R01 (Reference)NA1 (Reference)NA
      R11.24 (0.38–4.08)0.76800.67 (0.20–2.28)0.5953
      R28.21 (3.03–22.25)0.0005*13.08 (4.26–40.15)0.0002*
   Lymph node positive proportion3.94 (1.98–7.82)0.0010*2.55 (1.20–5.45)<0.0417*
   Risk score, RFS2.67 (1.90–3.75)<0.0001*2.70 (1.82–4.06)<0.0001*
Factors not selected
   Age, y
      <601 (Reference)NANANA
      ≥600.69 (0.45–1.07)0.1617NANA
   Histologic grade
      G1/G21 (Reference)NANANA
      G32.02 (1.25–3.27)0.0158*NANA
   Tumor anatomic site
      Antrum1 (Reference)NANANA
      Cardia1.42 (0.79–2.56)0.3300NANA
      Fundus0.63 (0.37–1.08)0.1603NANA
   Gastroesophageal junction0.91 (0.44–1.86)0.8194NANA
   TNM stage
      I/II1 (Reference)NA
      III/IV0.96 (0.63–1.47)0.8686
   T stage
      T1/T21 (Reference)NA
      T3/T40.75 (0.48–1.16)0.2783
   N stage
      N0/N11 (Reference)NA
      N2/N31.39 (0.91–2.13)0.2041
   M stage
      M01 (Reference)NA
      M11.43 (0.61–3.36)0.4910
   SERPINE1 expression1.20 (1.04–1.38)0.0384*

* indicate P<0.05; RFS, recurrence-free survival; HR, hazard ratio; CI, confidence interval; NA, not applicable; TNM, tumor-node-metastases.

* indicate P<0.05. OS, overall survival; HR, hazard ratio; CI, confidence interval; NA, not applicable; TNM, tumor-node-metastases. * indicate P<0.05; RFS, recurrence-free survival; HR, hazard ratio; CI, confidence interval; NA, not applicable; TNM, tumor-node-metastases.

Nomograms and model performance

Nomograms to predict GC patient OS and RFS are shown in . The nomogram to predict OS was created based on the following five independent prognostic factors: age (<60 or ≥60 years), resection margins (R0, R1, or R2), patient tumor status (tumor-free or with tumor), lymph node-positive proportion, and risk score. The nomogram to predict RFS was created based on the following four independent prognostic factors: sex (female or male), resection margins (R0, R1, or R2), lymph node-positive proportion, and RFS risk score. A higher total number of points based on the sum of the number of points assigned to each factor in the nomograms was associated with a poorer prognosis. The discriminative ability of the final model for OS and RFS was assessed using C statistics (0.755 for OS and 0.745 for RFS). Model accuracy and potential overfit were assessed by bootstrap validation with 1,000 re-samplings. The 60-sample bootstrapped calibration plots for the prediction of 3-year OS and RFS are presented in . Predictive accuracy for OS was compared between the proposed nomogram and the nomogram based on the conventional staging system constructed using the prognostic factors of age (<60 or ≥60 years) and TNM stage (T1/T2, T3/T4). The C statistics of the proposed nomogram were greater than those of the TNM stage nomogram (0.755 vs. 0.617). The calculated NRI was 0.48 (95% CI, 0.23–0.96), which indicated that the performance of the new model was better than that of the TNM stage model for predicting OS.
Figure 8

Nomogram for predicting OS in GC patients after surgery. OS, overall survival; GC, gastric cancer.

Figure 9

Nomogram for predicting RFS in GC patients after surgery. RFS, recurrence-free survival; GC, gastric cancer.

Figure 10

Calibration plot comparing predicted and actual survival probabilities at the 3-year follow-up. The 60-sample bootstrapped calibration plot for 3-year OS (A) and RFS (B) prediction is shown. The 45-degree line represents the ideal fit; rhombuses represent nomogram-predicted probabilities; crosses represent the bootstrap-corrected estimates; and error bars represent the 95% CIs of these estimates. OS, overall survival; RFS, recurrence-free survival; CI, confidence interval.

Nomogram for predicting OS in GC patients after surgery. OS, overall survival; GC, gastric cancer. Nomogram for predicting RFS in GC patients after surgery. RFS, recurrence-free survival; GC, gastric cancer. Calibration plot comparing predicted and actual survival probabilities at the 3-year follow-up. The 60-sample bootstrapped calibration plot for 3-year OS (A) and RFS (B) prediction is shown. The 45-degree line represents the ideal fit; rhombuses represent nomogram-predicted probabilities; crosses represent the bootstrap-corrected estimates; and error bars represent the 95% CIs of these estimates. OS, overall survival; RFS, recurrence-free survival; CI, confidence interval.

Discussion

In the current study, we found that SERPINE1 was significantly upregulated in GC tissues compared to normal or adjacent normal tissues based on the meta-analysis of TCGA and GEO datasets. Moreover, high SERPINE1 expression was associated with GC T stage and survival status. Univariate Cox regression analyses indicated that SERPINE1 expression was associated with prognosis and may therefore be a potentially useful biomarker for GC prognosis and diagnosis and a potential therapeutic target. Meta-analysis confirmed the diagnostic value of SERPINE1 in GC. Similarly, Sakakibara et al. found that SERPINE1 overexpression is significantly associated with malignancy in GC (17). A meta-analysis of 22 studies that included 1,966 patients revealed that high SERPINE1 expression is associated with a short OS (18). Furthermore, Nishioka et al. reported that SERPINE1 RNA interference (RNAi) suppresses GC metastasis in vivo (19). These conclusions are consistent with those of our study and demonstrate the prognostic value and potential therapeutic roles of SERPINE1. Interestingly, SERPINE1 showed surprising diagnostic value in TCGA data; for healthy individuals the AUC was 0.876 and the AUC values were 0.800, 0.878, 0.891, and 0.897 for stages I, II, III, and IV GC patients, respectively. In the diagnostic meta-analysis, 631 GC and 314 controls were included from the GEO and TCGA databases. The meta-analysis was performed to evaluate the accuracy of SERPINE1 for GC detection. The combined AUC was 0.80, which was indicative of moderate diagnostic accuracy. The combined values of the sensitivity (0.69) and specificity (0.78) showed the accuracy of SERPINE1 for GC detection. However, there were some limitations to our meta-analysis. Heterogeneity (I2=80.5%) was unavoidable, partly because of the different platforms that were used. Furthermore, different races also contributed to heterogeneity. Because SERPINE1 is not the only factor with diagnostic value for GC, combining SERPINE1 with other specific markers for GC diagnosis might further improve diagnostic accuracy. The molecular mechanisms underlying the differential expression of SERPINE1 and its potential prognostic impact on GC are still poorly understood. The current study improved our understanding of the relationship between SERPINE1 and GC. In the current study, functional annotation based on GSEA and MEM SERPINE1 co-expression analysis showed that SERPINE1 the three most significant pathways associated with the high SERPINE1 expression phenotype were the PI3K-Akt, Ras, and MAPK signaling pathways; this indicated that SERPINE1 and related module genes might promote GC cell growth and metastasis, and result in poorer survival via the PI3K-Akt, Ras, and MAPK pathways. Accumulating evidence shows that the activation of these pathways plays a critical role in promoting GC progression and metastasis (20-22). The creation of a reliable and practicable nomogram for predicting GC OS and recurrence is both clinically valuable and challenging to create. GC is a highly malignant tumor, with up to 18.4% of patients with R0 resections for node-negative GC experiencing recurrence after surgical resection (23). The results from a large sample and multicenter cohort of Chinese patients indicated that 60.8% of patients experienced recurrence after curative resection for GC from 1986 to 2013 (24). Accurate prognostication for GC after surgery is vital, not only for informing patients about their risk of recurrence and prognosis, but also for selecting patients for further adjuvant treatment. Recent studies on clinical measurement models of GC have shown that a nomogram with the TNM staging system combined with other variables is better than that of the TNM staging system alone (25,26). Consistently, our results showed that the proposed nomogram provided more accurate OS prediction for GC patients than the AJCC TNM-based nomogram Although the accuracy and discrimination of a model with one biomarker may be limited, a model established on the basis of module genes could likely provide more accurate and reliable prognostic predictions for GC patients. Therefore, we proposed a signature comprising these SERPINE1-related module genes that could be independent factors affecting OS and RFS in GC patients. Studies have shown that resection margins and lymph node-positive proportions are independent prognostic factors for GC and that patients with positive margins and higher lymph node-positive proportions have a poor prognosis (27,28). Accordingly, our results showed that these two factors were independent prognostic factors for OS and RFS in GC. Limitations to the current study included the following: First, our study is a retrospective study and therefore has inherent defects such as selection bias. Second, GC development is a complex process and all kinds of clinical factors, such as treatment details, should be considered to clarify the key role of SERPINE1 in GC development; however, this kind of information is lacking or inconsistently available in public databases. Third, our nomograms were internally validated using bootstrap validation and lack external validation. Future studies are urgently needed to externally validate the proposed nomograms and other essential factors based on treatment strategies should be incorporated. Finally, the current study was based on TCGA data mining; therefore, the protein level of SERPINE1 expression could not be directly evaluated, and the SERPINE1 mechanisms involved in GC development could not be clearly illustrated. The signaling pathways involved in SERPINE1 upregulation SERPINE1 in GC patients need to be verified by in vivo and in vitro experiments.

Conclusions

This study comprehensively analyzed the expression of SERPINE1 in patients with GC and evaluated the potential clinical value of SERPINE1 expression by performing a meta-analysis of data from GEO and TCGA databases. Bioinformatics analysis identified the possible functional mechanisms of SERPINE1 expression that facilitate GC onset and development as being regulated through the PI3K-Akt, Ras, and MAPK pathways. Finally, a nomogram based on SERPINE1-related module genes provided a more accurate OS prediction for GC patients than the AJCC TNM-based nomogram. These findings must be validated in multicenter clinical trials.
  28 in total

1.  Towards better clinical prediction models: seven steps for development and an ABCD for validation.

Authors:  Ewout W Steyerberg; Yvonne Vergouwe
Journal:  Eur Heart J       Date:  2014-06-04       Impact factor: 29.983

2.  A five-miRNA signature predicts survival in gastric cancer using bioinformatics analysis.

Authors:  Ziqiang Zhang; Yuanqiang Dong; Jin Hua; Hongyuan Xue; Jian Hu; Tao Jiang; Liubin Shi; Jianjun Du
Journal:  Gene       Date:  2019-03-05       Impact factor: 3.688

3.  Prognostic value of surgical margin status in gastric cancer patients.

Authors:  Yuexiang Liang; Xuewei Ding; Xiaona Wang; Baogui Wang; Jingyu Deng; Li Zhang; Han Liang
Journal:  ANZ J Surg       Date:  2014-01-20       Impact factor: 1.872

4.  PAI-1 expression levels in gastric cancers are closely correlated to those in corresponding normal tissues.

Authors:  Takumi Sakakibara; Kenji Hibi; Masahiko Koike; Michitaka Fujiwara; Yasuhiro Kodera; Katsuki Ito; Akimasa Nakao
Journal:  Hepatogastroenterology       Date:  2008 Jul-Aug

5.  Early Gastric Cancer: Trends in Incidence, Management, and Survival in a Well-Defined French Population.

Authors:  Nicolas Chapelle; Anne-Marie Bouvier; Sylvain Manfredi; Antoine Drouillard; Come Lepage; Jean Faivre; Valerie Jooste
Journal:  Ann Surg Oncol       Date:  2016-05-23       Impact factor: 5.344

6.  Altered expression of hypoxia-inducible factor-1α (HIF-1α) and its regulatory genes in gastric cancer tissues.

Authors:  Jihan Wang; Zhaohui Ni; Zipeng Duan; Guoqing Wang; Fan Li
Journal:  PLoS One       Date:  2014-06-13       Impact factor: 3.240

7.  The patterns and timing of recurrence after curative resection for gastric cancer in China.

Authors:  Dan Liu; Ming Lu; Jian Li; Zuyao Yang; Qi Feng; Menglong Zhou; Zhen Zhang; Lin Shen
Journal:  World J Surg Oncol       Date:  2016-12-08       Impact factor: 2.754

8.  K-ras-ERK1/2 down-regulates H2A.XY142ph through WSTF to promote the progress of gastric cancer.

Authors:  Chao Dong; Jing Sun; Sha Ma; Guoying Zhang
Journal:  BMC Cancer       Date:  2019-05-31       Impact factor: 4.638

9.  The expression of the PI3K/AKT/mTOR pathway in gastric cancer and its role in gastric cancer prognosis.

Authors:  Jieer Ying; Qi Xu; Bixia Liu; Gu Zhang; Lei Chen; Hongming Pan
Journal:  Onco Targets Ther       Date:  2015-09-01       Impact factor: 4.147

10.  Genome-wide identification of a novel miRNA-based signature to predict recurrence in patients with gastric cancer.

Authors:  Yongmei Yang; Ailin Qu; Rui Zhao; Mengmeng Hua; Xin Zhang; Zhaogang Dong; Guixi Zheng; Hongwei Pan; Hongchun Wang; Xiaoyun Yang; Yi Zhang
Journal:  Mol Oncol       Date:  2018-10-10       Impact factor: 6.603

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.