| Literature DB >> 30024105 |
Yanzhi Jiang1,2,3,4, Wenjuan Mei2,3,4,5, Yan Gu2,3,4, Xiaozeng Lin2,3,4, Lizhi He6, Hui Zeng2,3,4,7, Fengxiang Wei8, Xinhong Wan8, Huixiang Yang1, Pierre Major9, Damu Tang2,3,4.
Abstract
We report here numerous novel genes and multiple new signatures which robustly predict prostate cancer (PC) recurrence. We extracted 696 differentially expressed genes relative to a reported PC signature from the TCGA dataset (n = 492) and built a 15-gene signature (SigMuc1NW) using Elastic-net with 10-fold cross-validation through analyzing their expressions at 1.5 standard deviation/SD below and 2 SD above a population mean. SigMuc1NW predicts biochemical recurrence (BCR) following surgery with 56.4% sensitivity, 72.6% specificity, and 63.24 median months disease free (MMDF) (P = 1.12e-12). The prediction accuracy is improved with the use of SigMuc1NW's cutpoint (P = 3e-15) and is further enhanced (sensitivity 67%, specificity 75.7%, MMDF 45.2, P = 0) when all 15 genes were analyzed through their cutpoints instead of their SDs. These genes individually associate with BCR using either SD or cutpoint as the cutoff points. Eight of 15 genes are individual risk factors after adjusting for age at diagnosis, Gleason score, surgical margin, and tumor stage. Eleven of 15 genes are novel to PC. SigMuc1NW discriminates BCR with time-dependent AUC (tAUC) values of 76.6% at 11.5 months (76.6%-11.5 m), 73.8%-22.3 m, 78.5%-32.1 m, and 76.4%-48.4 m. SigMuc1NW is correlated with adverse features of PC, high Gleason scores (odds ratio/OR 1.48, P < 2e-16), and advanced tumor stages (OR 1.33, P = 4.37e-13). SigMuc1NW remains an independent risk factor of BCR (HR 2.44, 95% CI 1.53-3.87, P = 1.62e-4) after adjusting for age at diagnosis, Gleason score, surgical margin, and tumor stage. In an independent PC (MSKCC) cohort (n = 140), these 15 genes were altered in PC vs normal tissue, metastatic PCs vs primary PCs, and recurrent PCs vs nonrecurrent PCs. Importantly, a 10-gene subsignature SigMuc1NW1 predicts BCR in MSKCC (P = 3.11e-15) and TCGA (P = 3.13e-12); SigMuc1NW1 discriminates BCR at 18.4 m with tAUC as 82.5%. Collectively, our analyses support SigMuc1NW as a novel and robust signature in predicting BCR of PC.Entities:
Keywords: MUC1; biomarkers; prostate cancer; prostate cancer recurrence
Mesh:
Year: 2018 PMID: 30024105 PMCID: PMC6120243 DOI: 10.1002/1878-0261.12359
Source DB: PubMed Journal: Mol Oncol ISSN: 1574-7891 Impact factor: 6.603
Figure 1Construction of a 15‐gene signature. (A) Strategy used to produce the signature. The TCGA Provisional dataset within cBioPortal has 492 prostate cancers with gene expression profiled by RNA sequencing. The cohort was first divided into two populations: one (n = 100) positive for a 9‐gene signature derived from a MUC1 genomic network (Lin et al., 2017) and another (n = 392) negative for the signature. From these two populations, 696 differentially expressed genes (DEGs) were identified based on the mean mRNA expression and q < 0.001. These DEGs consist of 461 downregulated genes and 218 upregulated genes. For the downregulated genes, we have assigned tumors with gene expression at 1.5 SD (standard deviation) lower than a reference population mean (−1.5 SD); for those upregulated genes, we have located PCs with these gene expression at 2 SD above the population mean. We then performed model‐building using regularization‐coupled covariate selection of these 696 DEGs for their impact on BCR using the Elastic‐net penalty in the R glmnet package (Fig S1 for a typical selection), which resulted in a 15‐gene signature (SigMuc1NW). (B) PCs of the TCGA cohort with −1.5 SD (SLCP2A1 and CGNL1) and 2 SD expression are shown using OncoPrint (top gray illustration) and clustered (bottom color image). The disease‐free status is also included. The illustration was generated using tools provided by cBioPortal.
The component genes of SigMuc1NW
| Gene | Locus | Name | Role in PC/other tumorigenesis | References |
|---|---|---|---|---|
| SLCO2A1 | 3q22.1‐q22.2 | Solute carrier organic anion transporter family member 2A1 | Unknown/inactivation of it facilitates color cancer formation | Guda |
| CGNL1 | 15q21.3 | Cingulin like 1 | Unknown/unknown | NA |
| SUPV3L1 | 10q22.1 | Suv3 like RNA helicase | Unknown/unknown | NA |
| TATDN2 | 3p25.3 | TatD DNase domain containing 2 | Unknown/unknown | NA |
| MGAT4B | 5q35.3 | Mannosyl (alpha‐1,3‐)‐glycoprotein β‐1,4‐N‐acetylglucosaminyltransferase, isozyme B | Unknown/upregulation in murine hepatocellular carcinoma | Blomme |
| VAV2 | 9q34.2 | Vav guanine nucleotide exchange factor 2 | An androgen receptor (AR) coactivator; enhancing AR signaling in PC/ | Magani |
| SLC25A33 | 1p36.22 | Solute carrier family 25 member 33 | Unknown/a mitochondrial UTP carrier; contributing to IGF‐induced cell growth | Lyons |
| MCCC1 | 3q27.1 | Methylcrotonyl‐CoA carboxylase 1 | Unknown/gain of function was reported in oral squamous cell carcinoma | Ribeiro |
| ASNS | 7q21.3 | Asparagine synthetase | Contributing to CRPC/ | Sircar |
| CASKIN1 | 16p13.3 | CASK interacting protein 1 | Unknown/unknown | NA |
| DNMT3B | 20q11.21 | DNA methyltransferase 3 beta | Likely facilitating CRPC/ | Gravina |
| AURKA | 20q13.2 | Aurora kinase A | Contributing to CRPC/ | Mosquera |
| OIP5 | 15q15.1 | Opa interacting protein 5 | Unknown/a cancer testis antigen detected in colorectal cancer | Tarnowski |
| CTHRC1 | 8q22.3 | Collagen triple helix repeat containing 1 | Unknown/promoting tumorigenesis in multiple cancer types | Ke |
| GOLGA7B | 10q24.2 | Golgin A7 family member B | Unknown/unknown | NA |
−1.5 SD downregulated genes.
2 SD upregulated genes.
NA: not available.
Association of the component genes of SigMuc1NW with PC recurrencea
| Genes | Coef | HR | 95% CI |
|
|---|---|---|---|---|
| SLCO2A1 | 1.5813 | 4.861 | 1.763–13.4 | 0.00225 |
| CGNL1 | 0.9902 | 2.692 | 1.546–4.686 | 0.000464 |
| SUPV3L1 | 0.8437 | 2.325 | 1.168–4.629 | 0.0163 |
| TATDN2 | 1.3132 | 3.718 | 1.855–7.45 | 0.000213 |
| MGAT4B | 1.5178 | 4.562 | 2.245–9.272 | 2.73e‐5 |
| VAV2 | 1.1027 | 3.012 | 1.671–5.429 | 0.000244 |
| SLC25A33 | 1.096 | 2.992 | 1.55–5.777 | 0.00109 |
| MCCC1 | 0.8336 | 2.302 | 1.322–4.007 | 0.00321 |
| ASNS | 1.3456 | 3.84 | 2.064–7.145 | 2.15e‐5 |
| CASKIN1 | 1.0286 | 2.797 | 1.55–5.047 | 0.000636 |
| DNMT3B | 1.2919 | 3.64 | 1.928–6.87 | 6.73e‐5 |
| AURKA | 1.0966 | 2.994 | 1.692–5.298 | 0.000166 |
| OIP5 | 1.365 | 3.914 | 2.022–7.576 | 5.13e‐5 |
| CTHRC1 | 0.7981 | 2.221 | 1.15–4.289 | 0.0174 |
| GOLGA7B | 2.0406 | 7.695 | 2.388–24.79 | 0.00063 |
Univariate Cox analysis was performed using the TCGA Provisional cohort (n = 492).
Cox coefficient.
Hazard ratio.
Confidence interval.
Gene expression was < −1.5 SD of the reference population mean.
Gene expression was at > 2 SD of the reference population mean.
*P < 0.05; **P < 0.01; ***P < 0.001.
Figure 2SigMuc1NW is associated with reductions in disease‐free survival (DFS) and overall survival (OS) in patients with PC. The TCGA Provisional cohort was used in these analyses. (A) The effect of SigMuc1NW on DFS. MDF: months disease free; MS: months survival; MMDF: median months disease free; NA: not available as MMDF being not reached. Numbers of patient at risk at the start of the indicated follow‐up period were included. (B) The impact of SigMuc1NW on OS. MMS: median months survival. Kaplan–Meier and log‐rank test were performed using the R survival Package.
Figure 3SigMuc1NW scores effectively stratify PCs with a high risk of recurrence. (A) All tumors within the TCGA Provisional cohort were scored for SigMuc1NW (see Results for details). The scores were analyzed for discrimination of tumors with high risk of recurrence using tROC. AUC at the indicated period of time (tAUC) along with the status of disease recurrence are indicated. DF: disease free. (B) The cutpoint of SigMuc1NW scores for effectively separating PCs with high risk of recurrence from low risk PCs was estimated (Fig S4 for details), followed by assigning binary codes to tumors based on the cutpoint (see Results for details). The effects of cutpoint on DFS of the patients in the TCGA cohort were then determined. (C, D) The effects of Mean and Q3 scores of SigMuc1NW on BCR in PC patients in the TCGA Provisional cohort. Kaplan–Meier and log‐rank test were performed using the R survival Package. The vertical dot line shows MMDF. The color dot curves are for 95% CI.
Univariate and multivariate Cox analysis of SigMuc1NW for PC recurrence
| Factors | Univariate Cox analysis | Multivariate Cox analysis | Multivariate Cox analysis | ||||||
|---|---|---|---|---|---|---|---|---|---|
| HR | 95% CI |
| HR | 95% CI |
| HR | 95% CI |
| |
| Sig | 4.16 | 2.74–6.36 | 5.54e‐11 | 2.44 | 1.53–3.87 | 1.62e‐4 | NA | NA | NA |
| Cutpoint | 4.6 | 3.03–6.97 | 6.44e‐13 | NA | NA | NA | 2.67 | 1.70–4.20 | 2.05e‐5 |
| Age | 1.03 | 0.99–1.06 | 0.0981 | 0.999 | 0.97–1.03 | 0.9711 | 1.001 | 0.97–1.03 | 0.9756 |
| GS | 2.19 | 1.76–2.72 | 1.49e‐12 | 1.62 | 1.25‐2.11 | 2.71e‐4 | 1.62 | 1.25–2.10 | 2.86e–4 |
| SMargin | 2.25 | 1.48–3.41 | 0.000137 | 1.25 | 0.79–1.98 | 0.3306 | 1.28 | 0.81–2.02 | 0.2976 |
| TumStge | 3.68 | 2.08–6.51 | 8.19e‐6 | 1.82 | 0.97–3.40 | 0.0614 | 1.82 | 0.96–3.45 | 0.0668 |
SigMuc1NW.
SigMuc1NW‐derived cutpoint.
Age at diagnosis.
Radical prostatectomy Gleason score.
Surgical margin.
Tumor stages (0 for ≤ T2; 1 for T3 and T4).
HR, hazard ratio; CI, confidence interval; NA, not available.
*P < 0.05.
SigMuc1NWa component genes defined at their cutpoints associate with BCR
| Genes | Cutpoint |
| Coef |
|
|---|---|---|---|---|
| SLCO2A1 | 497.3292 | 0.09128 | 0.7967 | 0.00499 |
| CGNL1 | 3066.229 | 0.004126 | 0.7966 | 0.000372 |
| SUPV3L1 | 545.8928 | 0.007953 | 0.7992 | 0.000187 |
| TATDN2 | 1756.057 | 0.002471 | 0.8731 | 8.48e‐5 |
| MGAT4B | 1818.718 | 6.389e‐5 | 1.0331 | 2.61e‐6 |
| VAV2 | 1489.06 | 0.000547 | 0.9402 | 9.94e‐6 |
| SLC25A33 | 297.5508 | 0.2522 | 0.8503 | 0.0218 |
| MCCC1 | 1233.159 | 0.001077 | 1.0179 | 1.2e‐5 |
| ASNS | 1041.086 | 0.01123 | 1.0544 | 0.000109 |
| CASKIN1 | 106.4046 | 0.02646 | 0.7006 | 0.00125 |
| DNMT3B | 61.4086 | 0.008576 | 0.9082 | 0.000175 |
| AURKA | 81.1249 | 3.807e‐5 | 1.0223 | 1.12e‐6 |
| OIP5 | 16.4317 | 4.237e‐7 | 1.242 | 2.64e‐8 |
| CTHRC1 | 180.8622 | 0.01389 | 0.7608 | 0.000537 |
| GOLGA7B | 23.2022 | 0.01249* | 0.7623 | 0.000581 |
RNA sequencing data of SigMuc1NW's component genes were retrieved from the TCGA Provisional dataset (cBioPortal).
Cutpoint was estimated using Maximally Selected Rank Statistics in R.
Coefficient to BCR was determined using univariate Cox proportion hazard analysis.
PH assumption was at P < 0.05.
*P < 0.05; **P < 0.01; ***P < 0.001.
Figure 4All 15 component genes are significantly associated with PC recurrence and the formulation of three subsignatures. The mRNA expression data for the 15 genes were retrieved from the TCGA Provisional dataset (cBioPortal). Individual cutpoints were derived, and binary codes were assigned to all tumors. The hazard ratio (HR) of PC recurrence for all individual genes was determined using the univariate Cox proportional hazards (PH) mode. The PH assumption was evaluated and confirmed. These analyses were carried out using the R survival package. Individual HR, the 95% CI, and P‐value are included. The inclusion of component genes in SigCut1, SigCut2, and SigCut3 were shown, which was based on the P‐values.
Univariate and multivariate Cox analysis of SigMuc1NW component genes defined at cutpoint for PC recurrence
| Factors | Univariate Cox analysis | Multivariate Cox analysis | ||||
|---|---|---|---|---|---|---|
| HR | 95% CI |
| HR | 95% CI |
| |
| Age | 1.03 | 0.99–1.06 | 0.0981 | NS | ||
| GS | 2.19 | 1.76–2.72 | 1.49e‐12 | 1.71–1.89 | (1.32–1.46)–(2.20‐2.41) | 4.48e‐7 |
| SMargin | 2.25 | 1.48–3.41 | 0.000137 | NS | ||
| TumStge | 3.68 | 2.08–6.51 | 8.19e‐6 | 1.62–2.07 | (0.85–1.08)‐(3.08–3.96) | 0.0272 |
| SLCO2A1 | 2.22 | 1.27–3.87 | 0.00499 | 1.82 | 1.04–3.19 | 0.0369 |
| SUPV3L1 | 2.22 | 1.46–3.38 | 1.87e‐4 | 2.08 | 1.36–3.19 | 7.98e‐4 |
| TATDN2 | 2.39 | 1.55–3.70 | 8.48e‐5 | 2.15 | 1.37–3.37 | 8.35e‐4 |
| MGAT4B | 2.81 | 1.83–4.32 | 2.61e‐6 | 1.77 | 1.23–2.78 | 0.0128 |
| VAV2 | 2.56 | 1.69–3.89 | 9.94e‐6 | 1.93 | 1.26–2.95 | 0.0024 |
| SLC25A33 | 2.34 | 1.13–4.84 | 0.0218 | 2.25 | 1.08–4.67 | 0.0297 |
| ASNS | 2.87 | 1.68–4.90 | 1.09e‐4 | 1.91 | 1.09–3.36 | 0.0239 |
| OIP5 | 3.46 | 2.24–5.36 | 2.64e‐8 | 1.94 | 1.20–3.12 | 0.00638 |
Age at diagnosis.
Radical prostatectomy Gleason score.
Surgical margin.
Tumor stages (0 for ≤ T2; 1 for T3 and T4).
Not significant.
Range of HR, 95% CI, and P‐values resulted from multivariate Cox analysis with the individual genes.
The P‐values for SLCO2A1 (P = 0.0749), MGAT4B (P = 0.0891), ASNS (P = 0.0917), and OIP5 (P = 0.139).
hThe P‐values for SUPV3L1 (P = 0.0431*), TATDN2 (P = 0.0272*), VAV2 (P = 0.0364*), and SLC25A33 (P = 0.0334*).
HR, hazard ratio; CI, confidence interval.
Figure 5Analyses of SigCut1, SigCut2, and SigCut3 for their association with reductions in DFS. The TCGA Provisional dataset was used here. (A) All tumors were scored for SigCut1, SigCut2, and SigCut3 using the respective Cox coefficient. Time‐dependent AUCs for individual signature at the current follow‐up period and the corresponding recurrent status are shown. (B‐D) The associations of SigCut1, SigCut2, and SigCut3 with BCR. (E) The Q1, Median, Cutpoint, and Q3 scores of SigCut3 were analyzed for the stratification of PC with high risk of recurrence. The number of risk individuals at the indicated follow‐up period is included. The multiple Kaplan–Meier curves and log‐rank test were performed using the R survival package.
Figure 6Alterations in the expression of the component genes in an independent PC population. Gene expression data determined by microarray were extracted from the MSKCC dataset (Robinson et al., 2015) within cBioPortal. The mRNA levels in normal and PC tissues (A), in primary PC and metastatic PC (B), and in nonrecurrent and recurrent PC (C) were determined. The number of cases used in the comparisons is indicated. Means ± SD are graphed. Statistical analyses were performed using Student's t‐test (2‐tailed). *P < 0.05, **P < 0.01, and ***P < 0.001.
Cutpoint and Cox coefficients of SigMuc1NW component genes in the MSKCC cohorta
| Genes | Cutpoint |
| Coef | HR | 95% CI |
|
|---|---|---|---|---|---|---|
| SLCO2A1 | 8.155098 | 0.7073 | 0.6364 | 1.89 | 0.7835–4.558 | 0.157 |
| CGNL1 | 10.02132 | 0.004758 | 1.4679 | 4.34 | 2.084–9.038 | 8.8e‐5 |
| SUPV3L1 | 7.655546 | 0.7029 | −0.6931 | 0.5 | 0.2277–1.098 | 0.0841 |
| TATDN2 | 7.755133 | 0.969 | −0.5149 | 0.5976 | 0.2476–1.442 | 0.252 |
| MGAT4B | 8.536576 | 0.01469 | 1.3245 | 3.76 | 1.833–7.712 | 0.000302 |
| VAV2 | 7.801308 | 0.2076 | 0.8258 | 2.284 | 1.184–4.405 | 0.0138 |
| SLC25A33 | 8.653056 | 1 | 0.4752 | 1.608 | 0.6248–4.14 | 0.325 |
| MCCC1 | 7.789343 | 0.2982 | −1.0768 | 0.3407 | 0.1467–0.7911 | 0.0122 |
| ASNS | 7.946625 | 0.01918 | 1.1815 | 3.259 | 1.567–6.78 | 0.00157 |
| CASKIN1 | 8.142854 | 0.04935 | 1.0985 | 3 | 1.529–5.886 | 0.0014 |
| DNMT3B | 7.199673 | 0.06077 | 1.0373 | 2.822 | 1.385–5.749 | 0.00428 |
| AURKA | 7.215284 | 0.03781 | 1.0552 | 2.873 | 1.435–5.75 | 0.00288 |
| OIP5 | 6.026397 | 0.05557 | 0.9789 | 2.662 | 1.374–5.156 | 0.00372 |
| CTHRC1 | 7.827664 | 0.0001814 | 1.631 | 5.109 | 2.4–10.88 | 2.33e‐5 |
| GOLGA7B | 7.534541 | 0.1695 | 1.1095 | 3.033 | 1.371–6.71 | 0.00617 |
Microarray data of SigMuc1NW's component genes were retrieved from the MSKCC dataset (cBioPortal).
Cutpoint was estimated using Maximally Selected Rank Statistics in R.
Coefficient to BCR was determined using univariate Cox proportion hazard analysis.
*P < 0.05; **P < 0.01; ***P < 0.001.
Figure 7SigMuc1NW1 robustly predicts PC recurrent in an independent PC dataset. The follow‐up data along with mRNA expression data for all 15 genes were retrieved from the MSKCC dataset (Robinson et al., 2015). SigMuc1NW1 was formed using 10 genes (see Results for details). Time‐dependent AUCs were derived (A). The stratification of PC with increased risk of recurrence was analyzed using the cutpoint (B), Q1 (C), Median (D), and Q3 (E) scores of SigCut1NW1. Numbers of risk individuals at the current follow‐up period are also included.
Figure 8SigMuc1NW1 significantly correlates with reductions in DFS and OS in PC patients. The analyses were performed using the TCGA Provisional dataset. SigMuc1NW1 gene expression was based on the SD levels. Kaplan–Meier curve and log‐rank test were performed using tools provided by cBioPortal.