Zhigang Qiu1, Haitao Jiang2, Kun Ju3, Xichun Liu1. 1. Department of Gastrointestinal Surgery, The Affiliated Hospital of Qingdao University, Qingdao, China. 2. Department of Oncology, Qingdao Municipal Hospital, Qingdao, China. 3. Department of Emergency, The Affiliated Hospital of Qingdao University, Qingdao, China.
Abstract
Objective: This study aimed to develop an RNA-binding protein (RBP)-based signature for risk stratification and guiding clinical therapy in gastric cancer. Methods: Based on survival-related RBPs, an RBP-based signature was established by LASSO regression analysis in TCGA dataset. Kaplan-Meier curves were drawn between high- and low-risk groups. The predictive efficacy of this signature was assessed via ROCs at 1-, 3-, and 5-year survival. Its generalizability was verified in an external dataset. Following adjustment with other clinicopathological characteristics, the independency of survival prediction was evaluated via multivariate Cox regression and subgroup analyses. GSEA was utilized in identifying activated pathways in two groups. Stromal score, immune score, tumor purity, and infiltration levels of 22 immune cells were determined in each sample via the ESTIMATE and CIBERSORT algorithms. The sensitivity to chemotherapy drugs was assessed through the GDSC database. Results: Data showed that patients with high risk exhibited unfavorable clinical outcomes than those with low risk. This signature possessed good performance in predicting 1-, 3-, and 5-year survival and can be independently predictive of patients' survival. Calcium, ECM receptor interaction, and focal adhesion were highly enriched in high-risk samples. High-risk samples presented increased stromal and immune scores and reduced tumor purity. Moreover, this signature presented close relationships with immune infiltrations. Low-risk specimens were more sensitive to sorafenib, gefitinib, vinorelbine, and gemcitabine than high-risk specimens. Conclusion: This RBP-based signature may be a promising tool for predicting clinical outcomes and guiding clinical therapy in gastric cancer.
Objective: This study aimed to develop an RNA-binding protein (RBP)-based signature for risk stratification and guiding clinical therapy in gastric cancer. Methods: Based on survival-related RBPs, an RBP-based signature was established by LASSO regression analysis in TCGA dataset. Kaplan-Meier curves were drawn between high- and low-risk groups. The predictive efficacy of this signature was assessed via ROCs at 1-, 3-, and 5-year survival. Its generalizability was verified in an external dataset. Following adjustment with other clinicopathological characteristics, the independency of survival prediction was evaluated via multivariate Cox regression and subgroup analyses. GSEA was utilized in identifying activated pathways in two groups. Stromal score, immune score, tumor purity, and infiltration levels of 22 immune cells were determined in each sample via the ESTIMATE and CIBERSORT algorithms. The sensitivity to chemotherapy drugs was assessed through the GDSC database. Results: Data showed that patients with high risk exhibited unfavorable clinical outcomes than those with low risk. This signature possessed good performance in predicting 1-, 3-, and 5-year survival and can be independently predictive of patients' survival. Calcium, ECM receptor interaction, and focal adhesion were highly enriched in high-risk samples. High-risk samples presented increased stromal and immune scores and reduced tumor purity. Moreover, this signature presented close relationships with immune infiltrations. Low-risk specimens were more sensitive to sorafenib, gefitinib, vinorelbine, and gemcitabine than high-risk specimens. Conclusion: This RBP-based signature may be a promising tool for predicting clinical outcomes and guiding clinical therapy in gastric cancer.
Gastric cancer ranks fifth in incidence and third mortality among global cancers (1, 2). Patients are diagnosed in histology following endoscopic biopsy and staged by computed tomography, endoscopic ultrasound, positron emission tomography, or laparoscopy (3). This cancer is a highly heterogeneous disease at the molecular and phenotypic levels. Subjects diagnosed by the same TNM stage and treated by similar therapeutic regimens present varied prognoses, emphasizing that TNM stage by itself cannot provide complete prognostic information (4). Endoscopic surgery is a primary therapeutic method for early subjects. Nevertheless, most patients are diagnosed at an advanced stage, who have missed the optimal time for surgery. Despite adjuvant chemotherapy, immunotherapy, and targeted therapy, advanced subjects' median survival time is <1 year (5). Hence, innovative strategies are required for boosting risk stratification as well as predictive accuracy of clinical outcomes.RNA-binding proteins (RBPs), a type of protein, may be interacted with a variety of RNAs. At present, 1,542 humanRBP genes have been found, which participate in posttranscriptional modulation such as RNA splicing, polyadenylation, editing, modification, and translation (6–8). Aberrant expression of RBPs may induce progress of various malignancies, including gastric cancer (9, 10). RBPs have been detected to widely express in tumor cells, thereby affecting the translation of mRNAs into proteins and carcinogenesis processes (11). Increasing evidence has highlighted clinicopathologic implication of immune microenvironment in survival outcomes and therapeutic efficacy in gastric cancer (12). Recent findings have found that RBPs may affect immune microenvironment across different cancer types (13). For example, RBP SORBS2 inhibits metastatic colonization of ovarian cancer through enhancing stability of tumor-suppressive immunomodulatory transcripts (13). In-depth understanding of the roles of RBPs will offer innovative ideas for immunotherapy of gastric cancer. Previously, Huang et al. (14) proposed a 6-RBP signature that predicted the survival of hepatocellular carcinoma with high accuracy. Li et al. (15) developed a 9-RBP signature with accurate predictive efficacy for lung squamous cell carcinomapatients' prognosis. However, there is still lack of gene signature based on RBPs for gastric cancer. Furthermore, the relationships between RBPs and immune microenvironment are required for further analysis. Here, this work developed and verified an RBP-based model that exhibited a good performance in predicting patients' survival and was significantly associated with immune microenvironment using public datasets.
Materials and Methods
Gastric Cancer Datasets Acquiring and Preprocessing
Transcriptome FPKM RNA-seq profiles of gastric cancer were retrieved from the Cancer Genome Atlas (TCGA; https://tcga-data.nci.nih.gov/tcga/). Meanwhile, clinical information, including age, gender, grade, stage, TNM, and overall survival, was acquired by the UCSC Xena (https://xena.ucsc.edu/). The details are listed in Supplementary Table 1. After excluding samples with survival time of 0, 350 cases of gastric cancer specimens were retained as a training set. FPKM values were converted to TPM values for normalization (16). In the Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/gds/) repository, the GSE84437 dataset was obtained on the GPL6947 platform (17). A total of 431 samples with survival time >0 were utilized as a validation set. The clinical information is shown in Supplementary Table 2. Based on a previous published study, 1,542 RBPs were retrieved (Supplementary Table 3) (18).
Establishment and Validation of a Prognostic RNA-Binding Protein Gene Signature
Univariate Cox regression analyses were employed for analyzing associations between RBPs and clinical outcomes of gastric cancer. Prognosis-related RBPs with p < 0.05 were retained. Then, least absolute shrinkage and selection operator (LASSO) regression analyses were adopted to acquire key prognostic RBPs (19). The risk scores of subjects were determined following the formula: risk score = Σ expression level of genei
* βi. β represents the regression coefficient of genei. Then, the median value was utilized as the cutoff value. Subjects were separated into high- and low-risk subgroups. Utilizing Kaplan–Meier curves, survival probability between the two groups was compared by log-rank test. Receiver operating characteristic curves (ROCs) for 1-, 3-, and 5-year survival were conducted via the ROC package in R. Area under the curve (AUC) was then determined. With the same cutoff value, predictive efficacy of the RBP gene signature was validated in the verification set.
Protein–Protein Interaction Analysis
Functional associations between prognosis-related RBPs were predicted by the STRING online database (http://www.bork.embl-heidelberg.de/STRING/) (20).
Univariate and Multivariate Cox Regression Analyses
To analyze the relationships between clinical factors (age, gender, grade, stage, TNM, and risk score) and survival, univariate Cox regression analyses were carried out in the training and verification sets, separately. The independency of survival prediction of clinical factors was evaluated via multivariate Cox regression analyses. Hazard ratio (HR), 95% confidence interval (CI), and p values were calculated, respectively.
Subgroup Analyses
Patients were separated into different subgroups on the basis of different clinicopathological characteristics, including age (>65 and ≤ 65), gender (female and male), grade (grades 1–2 and grade 3), stage (stages I–II and stages III–IV), T (T1–2 and T3–4), N (N0 and N1–3), and M (M0 and M1). Kaplan–Meier curves followed by log-rank test were presented between high- and low-risk subjects in above subgroups.
Pathway Enrichment Analysis
The gene set enrichment analysis (GSEA) 4.0.3 software was utilized in identifying activated signaling pathways in high- and low-risk subgroups (21, 22). This study retrieved the hallmark gene set (h.all.v6.0.symbol.gmt) from the Molecular Signatures Database as a reference gene set. Enriched pathways were screened according to nominal p < 0.05 and adjusted p < 0.05.
Estimation of Stromal Score, Immune Score, and Tumor Purity
Stromal score, immune score, and tumor purity for each specimen were evaluated via the Estimation of STromal and Immune cells in MAlignant Tumour tissues using Expression data (ESTIMATE) algorithm (23). The differences in stromal score, immune score, and tumor purity between the two subgroups were compared through the Wilcoxon rank-sum test. Kaplan–Meier curves were conducted for estimating survival differences between different subgroups, as follows: high vs. low stromal score, high vs. low immune score, and high and low tumor purity.
Assessment of Immune Cell Infiltration
The infiltration levels of 22 immune cell types were quantified in gastric cancer specimens utilizing the Cell type Identification by Estimating Relative Subsets of RNA Transcripts (CIBERSORT) algorithm as well as the LM22 gene sets containing 547 markers (24). The comparison of immune cell types between the high- and low-risk groups was carried out through the Wilcoxon rank-sum test.
Estimation of Immune Checkpoint Expression
The expression levels of 47 immune checkpoints were estimated in gastric cancer samples. Their expression was compared in the high- and low-risk groups by the Wilcoxon rank-sum test.
Drug Sensitivity Assessment
The sensitivity to different chemotherapy drugs for each sample was estimated through the Genomics of Drug Sensitivity in Cancer (GDSC; https://www.cancerrxgene.org/) database (25). The calculation of half maximal inhibitory concentration (IC50) was achieved through the pRRophetic package in R (26).
Construction a Prognostic Nomogram Model
A nomogram model construction was achieved by the rms package as well as the survival package in R. This nomogram contained independent prognostic factors. Calibration curves were then depicted for evaluation of the predictive potency for 1-, 3-, and 5-year clinical outcomes of this nomogram.
Statistical Analyses
All analyses were achieved by available packages in R language 3.4.1 (http://www.R-project.org). Comparisons between the two groups were performed by the Wilcoxon rank-sum test or Student' t-test. Values of p < 0.05 indicated statistical significance.
Results
Construction of a Prognostic Signature for Gastric Cancer
Herein, 350 gastric cancer specimens were employed as the training set. Totally, 58 RBPs exhibited significant associations with survival of gastric cancerpatients (Table 1). To avoid data overfitting, coexpressed RBPs were eliminated through LASSO regression analyses (Figures 1A,B). Consequently, 33 key RBPs were retained for establishment of a prognostic signature. We determined the risk scores of all subjects. Table 2 listed the regression coefficients of these key RBPs. Then, these subjects were separated into high- and low-risk groups (Figure 1C). In Figure 1D, the number of dead patients in the high-risk group was significantly higher than that in the low-risk group. The difference in survival between groups was compared in depth. Figure 1E displayed that subjects with high risk often experienced more unfavorable survival time than those with low risk (p = 1.033e−14). Following confirmation by ROCs, the AUCs for 1-, 3-, and 5-year clinical outcomes were separately 0.779, 0.759, and 0.788 (Figure 1F). These data were indicative of the predictive potential of the signature. To observe the interactions between 33 key RBPs, we constructed a PPI network. In Figure 1G, 14 key RBPs had mutual regulation.
Table 1
Univariate Cox regression analyses of survival-related RBPs in gastric cancer.
A gene signature for predicting the survival of gastric cancer patients. (A) The optimal parameter (λ) is where the vertical dotted line is. (B) Regression coefficients for RNA-binding proteins (RBPs) when determining the optimal parameter (λ). Each curve represents the change in trajectory of coefficients for each independent variable, and the ordinate indicates the coefficient values. (C) The distribution of risk scores for all subjects. The vertical dotted line corresponds to the median value of risk scores. (D) Survival status for all subjects ranked by risk scores. (E) Kaplan–Meier curves of overall survival between high- and low-risk subjects. (F) Receiver operating characteristic curves (ROCs) of 1-, 3-, and 5-year survival. (G) A protein–protein interaction (PPI) network based on the 33 key RBP genes.
Table 2
The coefficients of 33 key RBPs in the gene signature.
Gene
Coef
Gene
Coef
Gene
Coef
MSI2
−0.41056215
RNASE3
0.148036367
PCF11
−0.586573788
METTL2B
−0.105183273
HEXIM2
−0.068963889
TTF2
−0.001441001
DYNLL1
0.181641481
TRIM25
−0.151111951
TLR7
0.013281286
BOLL
0.511182302
ZFP36
0.118338975
PABPC5
−0.00140872
PWP2
0.311152879
PURG
−0.02930587
NR0B1
0.319435254
MRPL4
−0.322619794
TSEN54
0.161142621
RPS4Y2
0.301449471
QTRT1
−0.062291791
ISY1
−0.196715792
ADAT3
−0.255476366
SMAD5
0.420293951
PPAN
0.301063239
BICC1
−0.002450685
ADARB1
0.096350867
RNASE13
1.413824833
RPS23
0.121943685
PEG10
0.025434693
FTO
0.364676671
IFIT1
0.036295595
RNASE1
0.147258292
FAM98C
−0.099287095
NOVA1
−0.173519708
Coef, coefficients.
Univariate Cox regression analyses of survival-related RBPs in gastric cancer.RBPs, RNA-binding proteins; HR, hazard ratio; HR.95L, lower 95% confidence interval; HR.95H, higher 95% confidence interval.A gene signature for predicting the survival of gastric cancerpatients. (A) The optimal parameter (λ) is where the vertical dotted line is. (B) Regression coefficients for RNA-binding proteins (RBPs) when determining the optimal parameter (λ). Each curve represents the change in trajectory of coefficients for each independent variable, and the ordinate indicates the coefficient values. (C) The distribution of risk scores for all subjects. The vertical dotted line corresponds to the median value of risk scores. (D) Survival status for all subjects ranked by risk scores. (E) Kaplan–Meier curves of overall survival between high- and low-risk subjects. (F) Receiver operating characteristic curves (ROCs) of 1-, 3-, and 5-year survival. (G) A protein–protein interaction (PPI) network based on the 33 key RBP genes.The coefficients of 33 key RBPs in the gene signature.Coef, coefficients.
Verification of the Prognostic Signature in an External Dataset
We further evaluated the generalizability of the signature in the GSE84437 dataset. With the same cutoff value, subjects were separated into high- and low-risk subgroups (Figure 2A). Compared with the low-risk group, there were more patients with dead status in the high-risk group (Figure 2B). Those with high risk presented worse survival time than those with low risk (p = 7.208e−10; Figure 2C). The AUCs of 1-, 3-, and 5-year clinical outcomes were separately 0.647, 0.645, and 0.669, which was suggestive that this signature might be used in predicting the patients' survival (Figure 2D).
Figure 2
Assessment of this prognostic signature in the verification dataset. (A) The ranking of risk scores, and (B) the distribution of survival status among gastric cancer samples. The vertical dotted line corresponds to the cutoff of risk scores. (C) Kaplan–Meier curves of clinical outcomes concerning high- and low-risk subjects. (D) ROCs of 1-, 3-, and 5-year survival.
Assessment of this prognostic signature in the verification dataset. (A) The ranking of risk scores, and (B) the distribution of survival status among gastric cancer samples. The vertical dotted line corresponds to the cutoff of risk scores. (C) Kaplan–Meier curves of clinical outcomes concerning high- and low-risk subjects. (D) ROCs of 1-, 3-, and 5-year survival.
The Signature as an Independent Prognostic Factor for Gastric Cancer
In the training set, our univariate Cox regression analyses were indicative that risk score presented a significant correlation with gastric cancer prognosis [p < 0.001; HR (95% CI): 1.355 (1.269–1.447)] in Figure 3A. Moreover, age (p = 0.033; HR (95%CI): 1.021 (1.002–1.042)], stage (p = 0.002; HR (95% CI): 1.465 (1.154–1.861)], and N [p = 0.022; HR (95%CI): 1.235 (1.031–1.478)] were also correlated to gastric cancer prognoses. Above were risk factors of gastric cancer. Following multivariate Cox regression analyses, age [p < 0.001; HR (95% CI): 1.039 (1.018–1.061)], stage [p = 0.020; HR (95% CI): 1.578 (1.074–2.318], and this gene signature [p < 0.001; HR (95% CI): 1.437 (1.335–1.547)] independently predicted the patients' survival (Figure 3B). We further verified the independency of the signature in predicting prognosis in an external dataset. Data showed that age [p = 0.003; HR (95% CI): 1.019 (1.006–1.032)], T [p < 0.001; HR (95% CI): 1.729 (1.369–2.184)], N (p < 0.001; HR (95% CI): 1.669 (1.421–1.959)], and this gene signature [p < 0.001; HR (95% CI): 1.914 (1.615–2.269)] were risk factors of gastric cancer (Figure 3C). By confirmation of multivariate Cox regression analyses, age [p < 0.001; HR (95% CI): 1.022 (1.009–1.034)], T [p < 0.001; HR (95% CI): 1.560 (1.221–1.994)], N [p < 0.001; HR (95% CI): 1.459 (1.237–1.721)], and this gene signature [p < 0.001; HR (95% CI): 1.724 (1.449–2.050)] were independently predictive of clinical outcomes (Figure 3D). Collectively, this signature was an independent risk factor of gastric cancer.
Figure 3
The signature as an independent risk factor of gastric cancer. (A) Univariate and (B) multivariate Cox regression analyses of age, gender, grade, stage, TNM, as well as signature for gastric cancer in the training set. (C) Univariate and (D) multivariate Cox regression analyses for age, sex, T, N, and risk score for gastric cancer in the verification set.
The signature as an independent risk factor of gastric cancer. (A) Univariate and (B) multivariate Cox regression analyses of age, gender, grade, stage, TNM, as well as signature for gastric cancer in the training set. (C) Univariate and (D) multivariate Cox regression analyses for age, sex, T, N, and risk score for gastric cancer in the verification set.
Subgroup Analysis of the Signature in Predicting Gastric Cancer Patients' Survival
Subgroup analysis was presented to assess whether the signature was accurately predictive of patients' clinical outcomes in the training set. Data indicated that subjects with high risk were indicative of more unfavorable survival in comparison with those with low risk in different subgroups according to age (>65 and ≤ 65; Figures 4A,B), gender (female and male; Figures 4C,D), grade (G1–2 and G3; Figures 4E,F), stage (stages I–II and III–IV; Figures 4G,H), T (T1–2 and T3–4; Figures 4I,J), N (N0 and N1–3; Figures 4K,L), as well as M (M0 and M1; Figures 4M,N).
Figure 4
Subgroup analysis of the signature in predicting gastric cancer patients' survival. Kaplan–Meier curves for high- and low-risk subjects in different subgroups including (A) age >65, (B) age ≤ 65; (C) female, (D) male; (E) grades 1–2, (F) grade 3; (G) stages I–II, (H) stages III–IV; (I) T1–2, (J) T3–4; (K) N0, (L) N1–3; (M) M0, (N) M1.
Subgroup analysis of the signature in predicting gastric cancerpatients' survival. Kaplan–Meier curves for high- and low-risk subjects in different subgroups including (A) age >65, (B) age ≤ 65; (C) female, (D) male; (E) grades 1–2, (F) grade 3; (G) stages I–II, (H) stages III–IV; (I) T1–2, (J) T3–4; (K) N0, (L) N1–3; (M) M0, (N) M1.
Signaling Pathways Involved in High- and Low-Risk Subgroups
This study evaluated the signaling pathways enriched by high- and low-risk samples via the GSEA in depth. Data indicated that calcium signaling pathway, ECM receptor interaction, and focal adhesion were highly enriched in high-risk samples (Figure 5A). In Figure 5B, base excision repair, cell cycle, DNA replication, mismatch repair, P53 signaling pathway, as well as spliceosome were highly enriched in low-risk specimens.
Figure 5
Gene set enrichment analysis (GSEA) for signaling pathways involved in high- and low-risk groups. (A) Highly enriched pathways in high-risk gastric cancer samples. (B) Highly enriched pathways in low-risk specimens.
Gene set enrichment analysis (GSEA) for signaling pathways involved in high- and low-risk groups. (A) Highly enriched pathways in high-risk gastric cancer samples. (B) Highly enriched pathways in low-risk specimens.
Association Between This Signature and Tumor Microenvironment
This study next probed the association between this signature and tumor microenvironment in the training set. In Figure 6A, high-risk specimens presented an increased stromal score (p = 6.9e−11) and immune score (p = 0.0029) than low-risk subjects. Meanwhile, high-risk subjects displayed distinctly lowered tumor purity in comparison with those with low risk (p = 4.2e−07). Prognostic values of stromal score, immune score, and tumor purity were then evaluated in gastric cancer. As a result, high stromal score was distinctly related to poorer prognosis than low stromal score (p = 0.014; Figure 6B). No significant difference in survival was found between high- and low-immune score groups (p = 0.126; Figure 6C). Moreover, high tumor purity was distinctly associated with prolonged survival duration (p = 0.045; Figure 6D). We further assessed whether the signature was in association with immune cell infiltrations in gastric cancer tissues from TCGA dataset. Data were indicative that subjects with high risk exhibited increased infiltration levels of T-cell CD4 memory resting (p < 0.01), monocytes (p < 0.05), macrophages M2 (p < 0.01), and mast cells resting (p < 0.05) in Figure 6E. Moreover, high-risk subjects had reduced infiltration levels of T-cell CD4 memory activated as well as T-cell follicular helper (both p < 0.01) than those with low risk. Furthermore, we found that high risk was characterized by increased expression of immune checkpoints including BTLA (p < 0.01), BTNL2 (p < 0.05), CD200 (p < 0.001), CD200R1 (p < 0.05), CD27 (p < 0.05), CD276 (p < 0.05), CD28 (p < 0.01), CD40 (p < 0.01), CD40LG (p < 0.01), CD44CD40 (p < 0.05), CD48 (p < 0.01), CD40 (p < 0.01), CD86 (p < 0.001), HAVCR2 (p < 0.01), LAIR1 (p < 0.01), NRP1 (p < 0.001), PDCD1LG2 (p < 0.001), TMIGD2 (p < 0.05), TNFSF14 (p < 0.05), TNFSF18 (p < 0.001), TNFSF4 (p < 0.001), and VSIR (p < 0.05; Figure 6F).
Figure 6
Association between this signature and tumor microenvironment in the training set. (A) Violin plots for stromal score, immune score, and tumor purity of high- and low-risk specimens via the ESTIMATE. (B) Kaplan–Meier curves of high- and low-stromal score groups. (C) Kaplan–Meier curves of high- and low-immune score groups. (D) Kaplan–Meier curves of high- and low-tumor purity groups. (E) Box plots for infiltration levels of immune cells in high- and low-risk samples through the CIBERSORT. (F) Box plots for expression of immune checkpoints in high- and low-risk samples. Ns, not significant. *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001.
Association between this signature and tumor microenvironment in the training set. (A) Violin plots for stromal score, immune score, and tumor purity of high- and low-risk specimens via the ESTIMATE. (B) Kaplan–Meier curves of high- and low-stromal score groups. (C) Kaplan–Meier curves of high- and low-immune score groups. (D) Kaplan–Meier curves of high- and low-tumor purity groups. (E) Box plots for infiltration levels of immune cells in high- and low-risk samples through the CIBERSORT. (F) Box plots for expression of immune checkpoints in high- and low-risk samples. Ns, not significant. *p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001.
Correlation Between This Signature and Drug Sensitivity
We further evaluated the sensitivity to chemotherapy drugs between high- and low-risk groups. Our data were indicative of increased IC50 values of sorafenib (p = 5.23–05; Figure 7A), gefitinib (p = 0.011; Figure 7B), vinorelbine (p = 0.006; Figure 7C), and gemcitabine (p = 0.011; Figure 7D) in specimens with high risk than those with low risk. Hence, low-risk specimens were more sensitive to sorafenib, gefitinib, vinorelbine, and gemcitabine than high-risk specimens.
Figure 7
Correlation between this signature and drug sensitivity. Box plots for estimated IC50 of (A) sorafenib, (B) gefitinib, (C) vinorelbine, and (D) gemcitabine between high- and low-risk gastric cancer specimens.
Correlation between this signature and drug sensitivity. Box plots for estimated IC50 of (A) sorafenib, (B) gefitinib, (C) vinorelbine, and (D) gemcitabine between high- and low-risk gastric cancer specimens.
Establishment of a Nomogram Integrating Age, Stage, and Risk Score
To personally predict the prognosis of each subject, a nomogram was established via integrating age, stage, and gene signature, which could be predictive of 1-, 3-, and 5-year survival probability (Figure 8A). Through confirmation of these calibration curves, 1-, 3-, and 5-year clinical outcomes by this nomogram exhibited high consistency with actual clinical outcomes for gastric cancer subjects in the training set (Figures 8B–D).
Figure 8
Establishment of a nomogram for gastric cancer in the training set. (A) A nomogram integrating age, stage, as well as signature in prediction of 1-, 3-, and 5-year outcomes in gastric cancer. The calibration curves for the relationships between (B) 1-, (C) 3-, and (D) 5-year survival predicted by this nomogram and actual clinical outcomes.
Establishment of a nomogram for gastric cancer in the training set. (A) A nomogram integrating age, stage, as well as signature in prediction of 1-, 3-, and 5-year outcomes in gastric cancer. The calibration curves for the relationships between (B) 1-, (C) 3-, and (D) 5-year survival predicted by this nomogram and actual clinical outcomes.
Discussion
This study developed an RBP-based signature in the prediction of gastric cancerpatients' survival. Subjects with high risk presented an unacceptable clinical outcome. Following verification, this signature was independently predictive of prognosis of patients. Moreover, it was distinctly related to immune microenvironment and sensitivity to chemotherapy drugs. Hence, this RBP-based signature may be a promising tool for predicting clinical outcomes and guiding clinical therapy in gastric cancer.The molecular heterogeneity features between high- and low-risk patients were further analyzed. We found that calcium signaling pathway, ECM receptor interaction, and focal adhesion were highly activated in high-risk samples. Previously, calcium facilitates gastric carcinoma progress through calcium-sensing receptor as well as TRPV4 (27). Furthermore, VPAC1 and TRPV4 channels may accelerate gastric cancer progress by relying on calcium (28). The ECM receptor contributes to carcinogenesis, progress, and unfavorable survival in gastric cancer (29). Focal adhesion-related proteins are independently predictive of pessimistic clinical outcomes in gastric cancer (30). Meanwhile, activation of base excision repair, cell cycle, DNA replication, mismatch repair, P53 signaling pathway, as well as spliceosome was detected in low-risk specimens. The clinical implications of DNA repair like base excision repair and mismatch repair have been confirmed in gastric cancer (31). Deregulation of p53 pathway induces malignant biological properties for gastric cancer cells (32).Immune cell ingredients contribute to gastric cancer initiation and progression. Moreover, immune escape exerts a critical role in tumorigenesis. Immune infiltration levels distinctly affect patients' survival. Tumor immune microenvironment that contains stromal and immune cells exhibits an association with immunotherapy response (5). Immune cells are correlated with tumor invasion and metastases. Stromal cells present close relationships with tumor growth, progression, response to chemotherapy, as well as recurrence. This study demonstrated that high-risk subjects had increased immune and stromal scores than those with low risk. Consistently, Mao et al. (33) found that subjects with high stromal scores presented unfavorable clinical outcomes. At present, novel immunotherapies like anti-PD-1 and anti-PD-L1 have been applied in gastric cancer. Nevertheless, only a minority of subjects benefit from immunotherapies. The compositions in the immune microenvironment are key determinants for prognoses and response to immunotherapies (34). Herein, this study comprehensively analyzed the correlations between immune cell infiltrations and this signature via the CIBERSORT algorithm. High-risk subjects presented increased infiltration levels of T-cell CD4 memory resting, monocytes, macrophage M2, and mast cells resting, and had reduced infiltration levels of T-cell CD4 memory activated as well as T-cell follicular helper than those with low risk. Moreover, we found that high risk was characterized by increased expression of immune checkpoints including BTLA that was expressed in B and T lymphocytes, BTNL2 that was expressed in antigen-processing and presentation cells, CD200 that was mainly expressed in B and T lymphocytes, CD200R1 that was expressed in myeloid lineage cells, CD27 that was expressed in T cells, CD276 that was expressed in cancer cells, CD28 that was expressed in T cells, CD40 that was expressed in antigen-presenting cells, CD40LG that was expressed in T cells, CD44 that was expressed in T cells, CD48 that was expressed in lymphocytes and dendritic cells, CD86 that was expressed in antigen-presenting cells, HAVCR2 that was expressed in T cells, LAIR1 that was expressed in natural killer cells, T cells, and B cells, NRP1 that was expressed in cancer cells, PDCD1LG2 that was expressed in T cells and dendritic cells, TMIGD2 that was expressed in T cells, TNFSF14 that was expressed in T cells, TNFSF18 that was expressed in T cells, TNFSF4 that was expressed in T cells, and VSIR that was expressed in T cells. These data were indicative of this signature being closely related to immunotherapy.For advanced subjects, surgical resection followed by auxiliary chemotherapy is a major therapeutic strategy. In recent years, a few clinical trials of postoperative chemotherapy have been launched in gastric cancer (35–37). Miserably, response to chemotherapy is relatively low on account of tumor heterogeneity (38). Our data indicated that subjects with low risk were more sensitive to sorafenib, gefitinib, vinorelbine, and gemcitabine than those with high risk. This RBP-based signature seems to be considered as a classification tool for making individualized therapeutic decisions. Furthermore, a nomogram was then developed for individualized clinical outcome prediction. This model also showed good predictive performance for 1-, 3-, and 5-year survival.A few disadvantages of this study need to be pointed out. First, this was a retrospective study according to public datasets. In our future studies, we will present prospective multicenter clinical trials for validation of this RBP signature in predicting gastric cancerpatients' survival. Second, activated signal pathways in high- and low-risk subgroups should be verified in further basic experiments. In future research, the molecular mechanisms of RBPs will be observed in gastric cancer. Furthermore, we will further validate the relationships of RBPs with immune microenvironment of gastric cancer, which could be used for guiding immunotherapy in clinical practice.
Conclusion
This study developed and externally verified an independent RBP-based signature in the prediction of gastric cancerpatients' survival. This signature was closely related to tumor microenvironment and chemosensitivity, assisting in the expanding of the applications of immunotherapy and chemotherapy. A nomogram integrating this signature, age, and stage could offer individualized prediction of prognosis. Thus, this RBP signature may represent a prognostic stratification tool for gastric cancer.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Author Contributions
XL conceived and designed the study. ZQ conducted most of the experiments and data analysis, and wrote the manuscript. HJ and KJ participated in collecting the data and helped draft the manuscript. All authors reviewed and approved the manuscript.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Authors: Christian von Mering; Martijn Huynen; Daniel Jaeggi; Steffen Schmidt; Peer Bork; Berend Snel Journal: Nucleic Acids Res Date: 2003-01-01 Impact factor: 16.971
Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205
Authors: Charles S Fuchs; Donna Niedzwiecki; Harvey J Mamon; Joel E Tepper; Xing Ye; Richard S Swanson; Peter C Enzinger; Daniel G Haller; Tomislav Dragovich; Steven R Alberts; Georg A Bjarnason; Christopher G Willett; Leonard L Gunderson; Richard M Goldberg; Alan P Venook; David Ilson; Eileen O'Reilly; Kristen Ciombor; David J Berg; Jeffrey Meyerhardt; Robert J Mayer Journal: J Clin Oncol Date: 2017-10-04 Impact factor: 44.544
Authors: Aaron M Newman; Chih Long Liu; Michael R Green; Andrew J Gentles; Weiguo Feng; Yue Xu; Chuong D Hoang; Maximilian Diehn; Ash A Alizadeh Journal: Nat Methods Date: 2015-03-30 Impact factor: 28.547
Authors: J Ferlay; M Colombet; I Soerjomataram; C Mathers; D M Parkin; M Piñeros; A Znaor; F Bray Journal: Int J Cancer Date: 2018-12-06 Impact factor: 7.396