| Literature DB >> 29245927 |
Hidekazu Yoshie1,2, Anna S Sedukhina1, Kimino Minagawa1, Keiko Oda1, Shigeko Ohnuma3, Nobuyuki Yanagisawa3, Ichiro Maeda3, Masayuki Takagi3, Hiroya Kudo2, Ryuto Nakazawa2, Hideo Sasaki2, Toshio Kumai1, Tatsuya Chikaraishi2, Ko Sato1.
Abstract
Biomarker-driven cancer therapy has met with significant clinical success. Identification of a biomarker implicated in a malignant phenotype and linked to poor clinical outcome is required if we are to develop these types of therapies. A subset of prostate adenocarcinoma (PACa) cases are treatment-resistant, making them an attractive target for such an approach. To identify target molecules implicated in shorter survival of patients with PACa, we established a bioinformatics-to-clinic sequential analysis approach, beginning with 2-step in silico analysis of a TCGA dataset for localized PACa. The effect of candidate genes identified by in silico analysis on survival was then assessed using biopsy specimens taken at the time of initial diagnosis of localized and metastatic PACa. We identified PEG10 as a candidate biomarker. Data from clinical samples suggested that increased expression of PEG10 at the time of initial diagnosis was linked to shorter survival time. Interestingly, PEG10 overexpression also correlated with expression of chromogranin A and synaptophysin, markers for neuroendocrine prostate cancer, a type of treatment-resistant prostate cancer. These results indicate that PEG10 is a novel biomarker for shorter survival of patients with PACa. Also, PEG10 expression at the time of initial diagnosis may predict focal neuroendocrine differentiation of PACa. Thus, PEG10 may be an attractive target for biomarker-driven cancer therapy. Thus, bioinformatics-to-clinic sequential analysis is a valid tool for identifying targets for precision oncology.Entities:
Keywords: PEG10; bioinformatics; neuroendocrine prostate cancer; precision oncology; prostate cancer
Year: 2017 PMID: 29245927 PMCID: PMC5725118 DOI: 10.18632/oncotarget.20448
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Baseline characteristics of TCGA database (Gleason score > 8)
| Total cases | Total | 1y | 3y | 5y | |||
|---|---|---|---|---|---|---|---|
| <1 | ≥1 | <3 | ≥3 | <5 | ≥5 | ||
| 201 | 24 | 102 | 48 | 45 | 57 | 15 | |
| Age | |||||||
| Average | 62.4 | 63.3 | 62.3 | 62.7 | 62.3 | 62.4 | 61.5 |
| Range | 44-78 | 53-72 | 44-76 | 46-78 | 46-71 | 46-78 | 46-70 |
| T category | |||||||
| T2b | 3 | 2 | 1 | 1 | |||
| T2c | 24 | 1 | 17 | 3 | 8 | 3 | 3 |
| T3a | 62 | 5 | 31 | 14 | 12 | 18 | 5 |
| T3b | 103 | 16 | 47 | 29 | 22 | 34 | 6 |
| T4 | 7 | 2 | 3 | 2 | 2 | 2 | |
| NA | 2 | 2 | |||||
| Gleason score | |||||||
| 3+5 | 7 | 1 | 6 | 1 | 3 | 1 | 2 |
| 4+4 | 50 | 3 | 33 | 9 | 10 | 10 | 3 |
| 5+3 | 7 | 6 | 4 | 1 | |||
| 4+5 | 97 | 13 | 40 | 26 | 23 | 34 | 8 |
| 5+4 | 37 | 7 | 15 | 11 | 4 | 11 | 1 |
| 5+5 | 3 | 2 | 1 | 1 | 1 | ||
| Tissue | |||||||
| Prostate | 201 | 24 | 102 | 48 | 45 | 57 | 15 |
| Histology | |||||||
| Adenocarcinoma | 145 | 17 | 72 | 38 | 30 | 43 | 11 |
| Aca mixed subtype | 2 | 1 | 1 | 1 | 1 | ||
| Mucinous ca | 1 | ||||||
| Signet ring cell ca | 1 | 1 | 1 | ||||
| Infiltrating duct ca | 8 | 1 | 3 | 2 | 1 | 2 | 1 |
| Acinar cell ca | 44 | 5 | 25 | 7 | 13 | 6 | 3 |
Adenocarcinoma: Adenocarcinoma NOS
Aca: Adenocarcinoma
Ca: carcinoma
Figure 1Schematic showing the bioinformatics-to-clinic sequential analysis method
Bioinformatics-to-clinic sequential analysis from transcriptome data.
Figure 2Alterations in gene expression in patients with different prognoses
MA plot showing gene expression in three different settings: relapse-free survival at 1 year (A), 3 years (B), or 5 years (C). Pink dots denote differentially expressed genes (defined as FDR <0.05). G1 and G2 indicate cohorts with longer and shorter life spans, respectively.
Differentially expressed genes overlapped
| Gene symbol | 1 year | 3 year | 5 year | |||
|---|---|---|---|---|---|---|
| LogFC | FDR | LogFC | FDR | LogFC | FDR | |
| CDH17 | 1.60 | 1.82E-02 | 1.97 | 5.29E-04 | 2.86 | 1.45E-02 |
| CDH20 | 4.20 | 1.01E-26 | 3.14 | 9.04E-07 | 3.63 | 1.39E-02 |
| CHRNB2 | 3.08 | 1.94E-13 | 2.72 | 2.97E-06 | 2.94 | 4.58E-02 |
| ELAVL3 | 2.40 | 5.03E-08 | 1.64 | 2.71E-02 | 2.99 | 1.48E-02 |
| GDAP1L1 | 3.16 | 6.78E-16 | 2.75 | 2.19E-06 | 2.96 | 4.34E-02 |
| GRIA4 | 3.91 | 3.37E-12 | 4.78 | 2.11E-08 | 4.78 | 2.59E-02 |
| PEG10 | 2.80 | 4.62E-13 | 2.52 | 2.06E-06 | 2.86 | 2.18E-02 |
| SYT5 | 3.08 | 1.72E-08 | 3.24 | 2.22E-06 | 3.66 | 3.37E-02 |
FDR: false discovery rate
LogFC: logarithmic fold change
Top 40 genes affecting hazard ratio
| Gene symbol | HR (95%CI) | p-value |
|---|---|---|
| NRN1L | 4.468 (2.085 – 9.574) | 1.18E-04 |
| NDUFB11 | 4.304 (1.785 – 10.38) | 1.16E-03 |
| ING4 | 4.034 (2.217 – 7.337) | 4.91E-06 |
| ZBTB8B | 4.022 (1.823 – 8.875) | 5.66E-04 |
| GPR35 | 3.993 (1.826 – 8.731) | 5.22E-04 |
| B4GALT2 | 3.912 (1.797 – 8.52) | 5.92E-04 |
| C14ORF178 | 3.881 (1.838 – 8.193) | 3.75E-04 |
| SARS | 3.761 (1.868 – 7.574) | 2.08E-04 |
| ACADVL | 3.759 (1.744 – 8.103) | 7.26E-04 |
| RPS29 | 3.725 (1.567 – 8.852) | 2.90E-03 |
| CER1 | 3.701 (1.666 – 8.224) | 1.32E-03 |
| HOXC8 | 3.625 (1.522 – 8.632) | 3.63E-03 |
| NPPC | 3.571 (1.723 – 7.402) | 6.21E-04 |
| PMVK | 3.545 (1.487 – 8.447) | 4.29E-03 |
| LAMTOR5 | 3.522 (1.727 – 7.18) | 5.33E-04 |
| NEURL1 | 3.434 (1.531 – 7.705) | 2.77E-03 |
| RBPJL | 3.433 (1.811 – 6.507) | 1.57E-04 |
| BAG1 | 3.389 (1.56 – 7.358) | 2.04E-03 |
| RNASEH2C | 3.388 (1.419 – 8.088) | 6.00E-03 |
| RWDD1 | 3.362 (1.41 – 8.017) | 6.24E-03 |
| OR8S1 | 3.351 (1.511 – 7.436) | 2.94E-03 |
| MRPS5 | 3.334 (1.399 – 7.944) | 6.57E-03 |
| PRMT1 | 3.316 (1.704 – 6.45) | 4.15E-04 |
| SIGLEC12 | 3.312 (1.618 – 6.776) | 1.05E-03 |
| WDR83OS | 3.294 (1.467 – 7.398) | 3.87E-03 |
| MRPL46 | 3.279 (1.593 – 6.752) | 1.27E-03 |
| COX14 | 3.240 (1.528 – 6.869) | 2.17E-03 |
| ZNF581 | 3.220 (1.437 – 7.213) | 4.49E-03 |
| AUNIP | 3.215 (1.819 – 5.681) | 5.82E-05 |
| PARL | 3.183 (1.548 – 6.544) | 1.64E-03 |
| CTDNEP1 | 3.178 (1.424 – 7.096) | 4.78E-03 |
| CIB2 | 3.177 (1.543 – 6.54) | 1.71E-03 |
| PAF1 | 3.146 (1.478 – 6.697) | 2.95E-03 |
| NUDT2 | 3.126 (1.461 – 6.692) | 3.33E-03 |
| COQ10A | 3.124 (1.218 – 8.012) | 1.78E-02 |
| B4GALT7 | 3.104 (1.464 – 6.582) | 3.13E-03 |
| CTNS | 3.097 (1.406 – 6.823) | 5.03E-03 |
| SNX21 | 3.089 (1.44 – 6.628) | 3.78E-03 |
| PEG10 | 3.084 (1.397 – 6.812) | 5.34E-03 |
| NEIL2 | 3.069 (1.485 – 6.341) | 2.46E-03 |
HR: hazard ratio
CI: confidence interval
Figure 3Effect of PEG10 expression on survival of PACa patients
(A and B) Relapse-free survival curves for prostate cancer patients in the TCGA dataset (A) and the GSE21032 dataset (B). The definition of a high PEG10 expression is a z-score ≥1. (C and D) Representative images of PEG10 expression: high expression (C) and low expression (D). (E–G) Relapse-free survival curves for the clinical cohorts with localized PACa (E) and metastatic PACa (F). (G) Relapse-free survival for total PACa (G). Each population was divided into two groups (high and low expressing subpopulations), according to median expression of PEG10 protein.
Baseline characteristics of GSE21032 database (Gleason score ≥ 8)
| Total cases | 21 |
|---|---|
| Age | |
| Average | 59.7 |
| Range | 46-71 |
| T category | |
| T2a | 1 |
| T2b | 1 |
| T2c | 1 |
| T3a | 6 |
| T3b | 7 |
| T3c | 2 |
| T4 | 3 |
| Gleason score | |
| 3+5 | 1 |
| 4+4 | 8 |
| 5+3 | 2 |
| 4+5 | 10 |
Baseline characteristics of clinical samples
| Age | Median range | 72.055 - 86 |
|---|---|---|
| T category | T2a | 6 |
| T2b | 30 | |
| T2c | 36 | |
| T3a | 16 | |
| T3b | 11 | |
| T4 | 6 | |
| NA | 7 | |
| Gleason score | 3 + 5 | 4 |
| 5 + 3 | 3 | |
| 4 + 4 | 55 | |
| 4 + 5 | 29 | |
| 5 + 4 | 14 | |
| 5 + 5 | 7 | |
| Histology | Adenocarcinoma NOS | 112 |
| Treatment | RP | 29 |
| RT | 45 | |
| HT | 38 |
NA: not available
RP: radical prostatectomy
RT: radiation therapy
HT: hormonal therapy
Multivariate analysis of PEG10 expression on RFS
| Unadjusted | Adjusted | |||
|---|---|---|---|---|
| HR (95%CI) | p-value | HR (95%CI) | p-value | |
| PEG10 | 3.51 (1.84-6.70) | *** 1.44E-04 | 2.50 (1.27-4.93) | ** 8.11E-03 |
| Age | 0.99 (0.95-1.04) | 6.79E-01 | 9.71 (0.92-1.02) | 2.44E-01 |
| iPSA | 1.00 (1.00-1.00) | * 2.40E-02 | 1.00 (0.99-1.00) | 8.82E-01 |
| Treatment | 6.37 (3.44-11.77) | *** 3.58E-09 | 1.51 (0.27-844.23) | 1.87E-01 |
| Stage | 2.49 (1.80-3.46) | *** 4.97E-08 | 0.61 (0.08-4.80) | 6.52E-01 |
HR: hazard ratio
CI: confidence interval
iPSA: initial PSA
Treatment includes localized treatment (radical prostatectomy and radiation therapy) and hormonal therapy.
Age, iPSA and stage were evaluated as a continuous variable.
Figure 4A link between PEG10 expression and neuroendocrine differentiation
(A and B) Representative images showing CGA expression: high (A) and low (B). (C) Relapse-free survival curves for the clinical cohort with localized PACa. The definition of high CGA expression is a CGA-positive population ≥1%. (D) Dot plot showing the correlation between PEG10 expression and CGA expression. (E and F) Representative images showing synaptophysin expression: high (E) and low (F). (G) Dot plot showing the correlation between PEG10 expression and synaptophysin expression.