| Literature DB >> 29520031 |
Sai Ge1,2, Xia Xia1, Chen Ding1,3, Bei Zhen1, Quan Zhou1, Jinwen Feng1,4, Jiajia Yuan1,2, Rui Chen5, Yumei Li5, Zhongqi Ge5, Jiafu Ji2, Lianhai Zhang2, Jiayuan Wang2, Zhongwu Li2, Yumei Lai2, Ying Hu2, Yanyan Li2, Yilin Li2, Jing Gao2, Lin Chen6, Jianming Xu7, Chunchao Zhang8, Sung Yun Jung8, Jong Min Choi8, Antrix Jain8, Mingwei Liu1, Lei Song1, Wanlin Liu1, Gaigai Guo1, Tongqing Gong1, Yin Huang1, Yang Qiu1, Wenwen Huang1,2, Tieliu Shi4, Weimin Zhu1, Yi Wang1,8, Fuchu He9,10, Lin Shen11,12, Jun Qin13,14,15.
Abstract
The diffuse-type gastric cancer (DGC) is a subtype of gastric cancer with the worst prognosis and few treatment options. Here we present a dataset from 84 DGC patients, composed of a proteome of 11,340 gene products and mutation information of 274 cancer driver genes covering paired tumor and nearby tissue. DGC can be classified into three subtypes (PX1-3) based on the altered proteome alone. PX1 and PX2 exhibit dysregulation in the cell cycle and PX2 features an additional EMT process; PX3 is enriched in immune response proteins, has the worst survival, and is insensitive to chemotherapy. Data analysis revealed four major vulnerabilities in DGC that may be targeted for treatment, and allowed the nomination of potential immunotherapy targets for DGC patients, particularly for those in PX3. This dataset provides a rich resource for information and knowledge mining toward altered signaling pathways in DGC and demonstrates the benefit of proteomic analysis in cancer molecular subtyping.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29520031 PMCID: PMC5843664 DOI: 10.1038/s41467-018-03121-2
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1A summary of proteomic and genomic analysis of diffuse-type gastric cancer. a Proteomic datasets filtered at different levels for various statistical analyses. D1: all gene products (GPs) identified in 168 samples (84 patients) at 1% protein level FDR; D2: high confidence unique proteins identified with ≥2 unique peptides with ion score >20 and peptide FDR <1% that were used for identifying differentially expressed proteins (DEPs) between tumors and nearby tissues; D3: GPs in the medium to high abundance range (FOT >10−5 in at least one case), D4: GPs identified in more than one-sixth of the samples; D5: common GPs in the medium to high abundance range (FOT >10-5) in D4, D6, differentially expressed proteins. b Cumulative number of proteins identified as a function of patient numbers. c Subcellular distribution of DGC proteins annotated with Gene Ontology. d Targeted exome sequencing was performed. Genes with non-silent variants in at least 4 patients (5%) were depicted on the OncoPrint. Bars on top and to the right of the graph show the number of non-synonymous mutations in each patient and gene, respectively
Fig. 2Proteomic features of the diffuse-type gastric cancer. a Top ranked pathways that are significantly altered in tumors as compared with nearby tissues. b Significantly decreased gastric function proteins and gastric mucosa signature proteins in tumors and nearby tissues. The y-axis represents log10 (FOT) +5. *P < 0.05, **P < 0.01, *** P < 0.001 (Wilcoxon rank-sum test), ΔInsufficient sample size. c Box plots of log10 transformed T/N ratios of stomach-specific and not stomach-specific proteins; whiskers show the 1.5-fold IQR, P-value was calculated by Wilcoxon rank-sum test. d Common proteins up-regulated in majority of tumors that were classified in the four functional categories. (% in parenthesis denotes percentage of tumor samples that exhibit >3-fold change in 84 patients). e Altered proteins in the WNT, NOTCH, TGF, and INF pathways. Differentially expressed proteins in tumors (T/N >3) were boxed with red color. The number in parenthesis is the percentage of samples with overexpression when the protein was detected
Fig. 3Molecular subtyping of DGC based on altered proteomes and their correlations with clinical outcomes. a The number of differentially expressed proteins in each cluster. b Major signaling pathways enriched in PX1, PX2, and PX3. c A heat map of selected proteins representing major altered signaling pathways across 84 patients in PX1–3. d Left: the association of molecular subtypes with overall survival of stage III and IV patients (Kaplan–Meier analysis, P-value from log-rank test); Right: the association of adjuvant chemotherapy with overall survival of stage III and IV patients in PX3 (Kaplan–Meier analysis, P-value from log-rank test)
Univariate and multivariate analysis of overall survival in 82 patients
| Variable | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | |||
| Agea | 1.037 (1.001–1.076) |
| 1.033 (0.994–1.074) | 0.102 |
| Gender | ||||
| Male (51) | 1.0 | |||
| Female (31) | 1.122 (0.478–2.638) | 0.791 | ||
| Adjuvant chemotherapy | ||||
| Without (21) | 1.0 | 1.0 | ||
| Withb (61) |
|
| 0.834 (0.282–2.466) | 0.742 |
| Tumor site | ||||
| Cardia, GEJ (20) | 1.0 | |||
| Body (33) | 1.405 (0.439–4.501) | 0.567 | ||
| Antrum (29) | 1.497 (0.450–4.986) | 0.511 | ||
| Clinical stagea | ||||
| (Ib to IV) |
|
|
|
|
| Wild type (45) | 1.0 | |||
| Mutant (37) | 1.407 (0.608–3.260) | 0.423 | ||
| Profiling cluster | ||||
| PX1 (17) | 1.0 | 1.0 | ||
| PX2 (32) | 2.186 (0.453–10.560) | 0.330 | 4.280 (0.731–25.052) | 0.107 |
| PX3 (34) | 4.221 (0.950–18.760) | 0.058 |
|
|
| |
|
|
|
|
GEJ, gastroesophageal junction; HR, hazard ratio; CI, confidence interval
aContinuous variable
bPatients proceed through at least one cycle of adjuvant chemotherapy. Significant data are emphasized in bold
Fig. 4Correlation of genomic mutations and protein expressions. a Differentially mutated genes and their pathways in PX1–3, *P < 0.05, **P < 0.01 (Fisher’s exact test). b Protein expression status of selected mutated genes. c Differential protein expression of mutated genes in the major cancer driver pathways. The left panel summarizes the correlation of altered protein expression and gene mutations of selected proteins in major oncogenic and tumor suppressive pathways; numbers in parenthesis represent number of patients that differential protein expression were detected, number of mutations detected, and number of cases where both were detected, respectively. Right panel shows differential protein expression and gene mutations in each patient. Large dots depict changes in protein expression, and small dots depict a variety of gene mutations. d Altered expression of tumor proteins associated with CDH1 mutations. Boxplots show protein expression levels of CDH1-mutated and CDH1-wild-type patients (*P < 0.05; **P < 0.01; ***P < 0.001, Wilcoxon rank-sum test). Whiskers show the 1.5-fold IQR
Fig. 5Nominating potential druggable proteins for DGC. a The association between protein expression (T/N ratio) and overall survival of genomic data discovered drug targets. The ln (Hazard ratio) (x-axis) and log10(P-value) (y-axis) were calculated from Cox proportional hazards regression analysis. Ln (Hazard ratio) >4 or <−4 were plotted with 4 or −4. Large dots depict proteins overexpressed in tumors, and small dots depict proteins that are not overexpressed in tumors. b The association between protein expression (T/N ratio) and overall survival of proteomic data discovered drug targets. A list of 23 proteins overexpressed in tumors with log-rank P-value < 0.05 and ln (hazard ratio) >0 is included in the box on the right. c The association between protein expression of CD300A, CYBA and IDO1 and overall survival (Kaplan–Meier analysis, P-value from log-rank test, high means T/N >median value). d Cellular localization and signaling pathways of nominated drug target candidates. The nominated druggable protein candidates are depicted in red
Fig. 6The protein landscape of cancer immunotherapy in DGC. a Multiple receptor–ligand pairs found in antigen-presenting cells (APCs) and T cells (figure adapted from ref.[45] and ref.[46]). Shade of the each of the three blocks represents the percentage of patients with the protein overexpressed in PX1–3, respectively, and the number above the block denotes the total percentage of all patients with the protein overexpressed. b Expressions of 19 immunotherapy targets in clinical development. The y-axis of the box plots at the bottom represents log10 (FOT ×105) of each protein; the number in parenthesis in the x-axis represents number of patients with FOT >10−5. T1–3 tumor tissues of PX1–3; N1–3 nearby tissues of PX1–3. The box plots show the median, 25th and 75th percentile values (horizontal bar, bottom and top bounds of the box), and whiskers show the 1.5-fold IQR. c Comparison of various parameters among three clusters. Patients in PX1 (blue), PX2 (orange), and PX3 (red) are ordered by mutation numbers. *P < 0.05, **P < 0.01, *** P < 0.001 (Chi-square test), IDO immunohistochemistry staining of representative examples (bottom, shown at ×100 magnifications, scale bar: 100 μm), their IHC scores, the corresponding FOT ×105 values obtained from protein profiling, and IDO1 protein expression measured by western blot are shown. IT-TILs intratumoral tumor-infiltrating lymphocytes, CD8+T-high CD8-positive T-cell number ≥298 per μm2 (see Supplementary Figure 6d for details), MSI-H microsatellite instability high, EBV Epstein–Barr virus status; IDO1/ARG1-high ≥upper quartile of all detected values. d H&E and CD8 IHC staining in representative examples in PX1–3, shown at ×100 magnification (scale bar: 200 μm)*P < 0.05, **P < 0.01, *** P < 0.001 (Chi-square test),