| Literature DB >> 24394557 |
Z Peng1, L Skoog2, H Hellborg3, G Jonstam4, I-L Wingmo5, M Hjälm-Eriksson6, U Harmenberg6, G C Cedermark6, K Andersson7, L Ahrlund-Richter8, S Pramana9, Y Pawitan9, M Nistér2, S Nilsson6, C Li6.
Abstract
BACKGROUND: This study aimed to identify biomarkers for estimating the overall and prostate cancer (PCa)-specific survival in PCa patients at diagnosis.Entities:
Mesh:
Year: 2014 PMID: 24394557 PMCID: PMC3921673 DOI: 10.1038/pcan.2013.57
Source DB: PubMed Journal: Prostate Cancer Prostatic Dis ISSN: 1365-7852 Impact factor: 5.554
Figure 1Outline of a stepwise gene selection process. (a) Identification of 641 embryonic stem cell gene predictors (ESCGPs) by bioinformatic analysis. Previously published data sets of whole-genome complementary DNA microarrays derived from five human ESC lines and 115 human normal tissues from various organs were retrieved from the Stanford Microarray Database (SMD). After a data-centering process, a sub-data set with expression profile of 24 361 genes in the ESC lines was isolated from the combined whole data set. A single-class significance analysis of microarray (SAM) was performed and a SAM plot was generated. The 328 genes with the highest expression levels and 313 genes with the lowest expression levels were identified, in total 641 ESCGPs. (b) Identification of 258 ESCGPs in prostate cancer (PCa). PCa ESCGPs were identified by matching the list of the 641 ESCGPs and the list of 5513 genes published by Lapointe et al.[9] When clustering the 112 PCa tissue samples and comparing the cluster results when using all 5513 genes and when using only the 258 ESCGPs present in the data set, nearly identical results were obtained. Sample labeling: PL, lymph node metastasis; PN, normal prostate tissue; PT, prostate tumor. Three cases (marked green) were placed in different classification positions and two cases (purple) were consistently misclassified. (c) Selection of important candidate ESCGPs for clinical survival correlation. Of 258 PCa ESCGPs, 34 genes were selected by their high-ranking order in the SAM analysis identifying significant genes for the subtype classification or for the discriminating between tumor and normal samples. Of these 34 ESCGPs, 19 were selected based on their markedly different expression patterns and robust performances in RT-PCR reactions (Supplementary Figure S1). The 19 ESCGPs and the 5 reported genes were included in the optimization of the 4-plex qPCR method using RNAs from PCa cell lines. (d) Identification of the ESCGP signature in Subset 1. After the 4-plex qPCR optimization, the method was used to analyze 36 fresh–frozen fine-needle aspiration (FNA) biopsies taken from PCa patients (Subset 1). RNAs could be extracted in 28 biopsies. A series of cluster analyses using different gene combinations revealed that the ESCGP signature VGLL3, IGFBP3 and F3 classified Subset 1 samples into three subtypes with strong survival correlations. The level of gene expression increases from blue to red, whereas the delta Ct value decreases from blue to red. Gray areas represent missing qPCR data.
Characteristics of the patients
| FNA biopsies, n | 36 | 65 | 88 | 189 |
| Overall | 35 | 64 | 86 | 185 |
| Death due to prostate cancer | 13 | 40 | 45 | 98 |
| Death due to other causes | 19 | 21 | 25 | 65 |
| Alive | 3 | 3 | 16 | 22 |
| Missing | 1 | 1 | 2 | 4 |
| Survival (years), median (range) | 7.7 (0.1–17.8) | 4.0 (0.2–15.7) | 4.3 (0.2–15.1) | 4.3 (0.1–17.8) |
| Age (years), mean±s.d. | 70.4±7.8 | 72.1±8.7 | 73.8±8.9 | 72.6±8.7 |
| Missing, n | 1 | 1 | 2 | 4 |
| ⩽20 | 9 (32.1) | 16 (31.2) | 23 (28.7) | 48 (29.8) |
| > 20 and⩽50 | 9 (32.1) | 14 (26.4) | 23 (28.7) | 46 (28.6) |
| > 50 | 10 (35.7) | 23 (43.4) | 34 (42.5) | 67 (41.6) |
| Missing | 8 | 12 | 8 | 28 |
| Localized | 19 (59.4) | 27 (45.8) | 33 (39.3) | 79 (45.1) |
| Advanced | 13 (40.6) | 32 (54.2) | 51 (60.7) | 96 (54.9) |
| Missing | 4 | 6 | 4 | 14 |
| Well/moderately | 22 (61.1) | 31 (50.0) | 33 (37.9) | 86 (46.5) |
| Poorly | 14 (38.9) | 31 (50.0) | 54 (62.1) | 99 (53.5) |
| Missing | 0 | 3 | 1 | 4 |
| Never treated | 6 (19.4) | 2 (3.3) | 4 (4.9) | 12 (7.0) |
| Hormone, orchiectomy | 19 (61.3) | 53 (88.3) | 62 (76.5) | 134 (77.9) |
| Radiation | 5 (16.1) | 2 (3.3) | 11 (13.6) | 18 (10.5) |
| Radical prostatectomy | 1 (3.2) | 3 (5.0) | 4 (4.9) | 8 (4.7) |
| Missing | 5 | 5 | 7 | 17 |
Abbreviations: FNA, fine-needle aspiration; WHO, World Health Organization.
Advanced clinical stage was defined as T⩾T3 or N1 or M1 or PSA >100 ng ml−1.
PSA levels in serum were measured at the time of diagnosis (before treatment).
Localized clinical stage was defined as T
Number of patients' samples for gene expression profiling
| N | ||||
|---|---|---|---|---|
| Total | 36 | 65 | 88 | 189 |
| CTGF | 36 | — | 67 | 103 |
| FBP1 | 36 | — | 46 | 82 |
| EGR1 | 26 | 65 | 88 | 179 |
| CYR61 | 36 | — | 46 | 82 |
| WNT5B | 36 | — | 56 | 92 |
| LRP4 | 28 | — | — | 28 |
| CDH1 | 36 | — | — | 36 |
| BASP1 | 28 | 65 | 88 | 181 |
| PTN | 28 | — | — | 28 |
| COL12A1 | 28 | 64 | 88 | 180 |
| VGLL3 | 28 | 40 | 88 | 156 |
| METTL7A | 36 | — | — | 36 |
| F3 | 28 | — | 67 | 95 |
| GREM1 | 36 | — | — | 36 |
| ERBB3 | 36 | — | 56 | 92 |
| LRNN1 | 36 | 62 | 88 | 186 |
| THBS1 | 28 | — | — | 28 |
| IGFBP3 | 26 | 59 | 88 | 173 |
| WNT11 | 28 | 65 | 88 | 181 |
| c-MAF-a | 26 | 64 | 88 | 178 |
| c-MAF-b | 26 | — | 46 | 72 |
| AZGP1 | — | 63 | 88 | 151 |
| AMACR | — | 63 | 88 | 151 |
| MUC1 | — | 58 | 88 | 146 |
| EZH2 | — | 59 | 88 | 147 |
| The ESCGP signature | 28 | — | 67 | 95 |
Abbreviation: ESCGP, embryonic stem cell gene predictors.
The number of samples varies between genes because not all genes were profiled across all samples.
The ESCGP signature includes the expression levels of VGLL3, IGFBP3 and F3.
Cox proportional hazards analysis of ESCGPs and various clinical parameters (univariate analysis)
| n | N | |||||
|---|---|---|---|---|---|---|
| P | P | |||||
| ⩽50 | 94 (58%) | 161 | 1.00 (reference) | 1.00 (reference) | ||
| >50 | 67 (42%) | 161 | 2.34 (1.65–3.31) | <0.0001 | 2.61 (1.68–4.05) | <0.0001 |
| Well/moderately | 85 (47%) | 181 | 1.00 (reference) | 1.00 (reference) | ||
| Poorly | 96 (53%) | 181 | 1.59 (1.16–2.18) | 0.004 | 1.94 (1.28–2.94) | 0.002 |
| Localized | 79 (45%) | 175 | 1.00 (reference) | 1.00 (reference) | ||
| Advanced | 96 (55%) | 175 | 1.70 (1.23–2.35) | 0.001 | 2.20 (1.44–3.38) | <0.0001 |
| Age | 185 | 1.04 (1.02–1.06) | <0.0001 | 1.03 (1.00–1.05) | 0.035 | |
| PSA (ng ml−1) | 161 | 1.00 (1.00–1.00) | 0.005 | 1.00 (1.00–1.00) | 0.004 | |
| F3 | 92 | 1.11 (1.04–1.17) | 0.001 | 1.14 (1.06–1.22) | <0.0001 | |
| WNT5B | 89 | 1.14 (1.04–1.25) | 0.004 | 1.26 (1.11–1.42) | <0.0001 | |
| VGLL3 | 152 | 1.09 (1.04–1.15) | <0.0001 | 1.08 (1.02–1.15) | 0.014 | |
| c-MAF-a | 174 | 1.09 (1.02–1.16) | 0.008 | 1.09 (1.01–1.19) | 0.036 | |
| CTGF | 100 | 1.13 (1.03–1.23) | 0.008 | 1.15 (1.02–1.29) | 0.023 | |
| IGFBP3 | 169 | 1.05 (0.99–1.12) | 0.078 | 1.10 (1.02–1.18) | 0.013 | |
| c-MAF-b | 69 | 1.13 (0.96–1.33) | 0.134 | 1.28 (1.04–1.57) | 0.019 | |
| EZH2 | 144 | 0.93 (0.83–1.04) | 0.208 | 0.85 (0.74–0.97) | 0.018 | |
| AMACR | 148 | 1.09 (1.02–1.16) | 0.009 | 1.08 (1.00–1.17) | 0.049 | |
| MUC1 | 143 | 1.07 (1.01–1.14) | 0.025 | 1.06 (0.99–1.15) | 0.109 | |
Abbreviations: CI, confidence interval; ESCGP, embryonic stem cell gene predictors; PCa, prostate cancer; WHO, World Health Organization.
The number of samples varies between ESCGPs because not all ESCGPs were profiled across all samples.
Localized clinical stage was defined as T
Age was modeled as a continuous variable. The hazard ratio is for each 1.0 year increase in age.
PSA was modeled as a continuous variable. The hazard ratio is for each 1.0 ng ml−1 PSA increase in serum.
The centered delta Ct value for gene expression was modeled as a continuous variable. It is inversely correlated to the gene's expression level. The hazard ratio is for each increase of 1.0 unit in centered delta Ct value.
Cox proportional hazards analysis of the ESCGP signature and various clinical parameters (univariate and multivariate analyses)
| n | N | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| P | P | P | P | |||||||
| Group 3 | 26 (30%) | 87 | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | ||||
| Group 2 | 32 (37%) | 87 | 3.45 (1.79–6.66) | <0.0001 | 2.51 (1.21–5.21) | 0.013 | 3.99 (1.65–9.64) | 0.002 | 2.96 (1.11–7.87) | 0.030 |
| Group 1 | 29 (33%) | 87 | 5.86 (2.91–11.78) | <0.0001 | 4.77 (2.27–10.01) | <0.0001 | 7.67 (3.04–19.36) | <0.0001 | 7.12 (2.56–19.85) | <0.0001 |
| ⩽50 | 48 (55%) | 87 | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | ||||
| >50 | 39 (45%) | 87 | 2.93 (1.76–4.86) | <0.0001 | 2.09 (1.10–3.94) | 0.023 | 3.33 (1.73–6.41) | <0.0001 | 1.76 (0.77–4.03) | 0.183 |
| Well/moderately | 35 (40%) | 87 | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | ||||
| Poorly | 52 (60%) | 87 | 1.65 (1.03–2.66) | 0.039 | 1.17 (0.69–2.00) | 0.556 | 1.93 (1.04–3.57) | 0.038 | 1.20 (0.61–2.39) | 0.596 |
| Localized | 37 (43%) | 87 | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | 1.00 (reference) | ||||
| Advanced | 50 (57%) | 87 | 2.13 (1.32–3.45) | 0.002 | 1.68 (0.91–3.08) | 0.097 | 3.87 (1.94–7.70) | <0.0001 | 3.62 (1.55–8.45) | 0.003 |
| Age | 87 | 1.06 (1.03–1.09) | <0.0001 | 1.03 (1.00–1.06) | 0.048 | 1.06 (1.02–1.10) | 0.003 | 1.03 (0.99–1.08) | 0.108 | |
Abbreviations: CI, confidence interval; ESCGP, embryonic stem cell gene predictors; PCa, prostate cancer; WHO, World Health Organization.
Eighty-seven out of the 95 clustered samples had all clinical information including age at diagnosis, PSA value, WHO tumor grade and clinical stage. Univariate and multivariate analyses included these 87 samples.
The ESCGP signature includes the expression levels of VGLL3, IGFBP3 and F3, and classified samples into three tumor subtypes (Group 1, Group 2 and Group 3) by Cluster analysis (Figure 2a). It was modeled as a non-continuous variable with three categories according to the tumor subtype.
Localized clinical stage was defined as T
Age was modeled as a continuous variable. The hazard ratio is for each 1.0 year increase in age.
Figure 2Clear survival difference according to tumor subtypes classification based on the embryonic stem cell gene predictor (ESCGP) signature (VGLL3, IGFBP3 and F3). Data were available for evaluation of the ESCGP signature for 95 of the 189 patients. (a) Fine-needle aspiration (FNA) samples from the 95 patients were used to create three tumor subtypes (group 1, red tree; group 2, yellow tree; group 3, blue tree) according to the ESCGP signature. The expression data was evaluated using the unsupervised hierarchical clustering method and the gene median-centered delta Ct values; the results were visualized using Treeview software. The gene expression level increases from blue to red, whereas the delta Ct value decreases from blue to red. Missing data are represented by the gray color. The clinical parameters of each patient are marked by various squares. Empty squares represent a longer survival period, lower PSA level, localized PCa clinical disease stage and a well or moderately differentiated tumor grade. Squares with various fill colors represent a shorter survival period, higher PSA level, advanced clinical disease stage and poorly differentiated tumor grade. (b–d) The overall, PCa-specific and non-PCa-specific survival analyses of the three subtypes were presented by Kaplan–Meier curves. X and Y axis presents actual time as diagnosis and survival rate, respectively. The P-values for differences between each of the three tumor subtypes were calculated using a log-rank test, and the P-values marked with stars represent statistical significance (P-value<0.05). Besides the most significant difference between subtypes 1 and 3 shown in the figure, the other P-values between each two subtypes were P1–2=0.063, P2–3<0.001 (b); P1–2=0.063, P2–3<0.001 (c); P1–2=0.523, P2–3=0.070 (d).
Figure 3Survival difference between the three tumor subtypes classified according to the embryonic stem cell gene predictor (ESCGP) signature in patients primarily treated with castration therapy. Of the 95 patients shown in Figure 2, 65 received castration therapy as their primary treatment. Within this group, clear survival differences could still be observed according to the three tumor subtypes classified based on the ESCGP signature. The overall (upper panel), PCa-specific (middle panel) and non-PCa-specific (lower panel) survival analyses of the three subtypes are shown by the Kaplan–Meier curves. The P-values for differences between each of the three tumor subtypes were calculated using a log-rank test. Besides the most significant difference between subtypes 1 and 3 shown in the figure, the other P-values between each two subtypes were P1–2=0.037*, P2–3=0.001* (a); P1–2=0.009*, P2–3=0.006* (b); P1–2=0.955, P2–3=0.076 (c). The overall survival rates at 5 years of follow-up were 13.6%, 36.0% and 77.8% for groups 1, 2 and 3, respectively.
Analysis of classification error of kNN model performance.
| P | ||||||
|---|---|---|---|---|---|---|
| RND | Random numbers | 3.85 | 3.18 | |||
| 1 | Age, WHO, CS, log-PSA [1,9,3,9] | 2.97 | 2.14 | 3.44 | 2.93 | 0.2300 |
| 2 | IGFBP3, VGLL3, F3, min(3G), max(3G) [1,9,1,3,1] | 2.82 | 2.36 | 3.14 | 2.71 | 0.0373 |
| 3 | Age, log-PSA, IGFBP3, VGLL3, F3, min(3G), max(3G) [1,3,1,9,3,3,1] | 2.85 | 2.05 | 2.69 | 2.34 | 0.0024 |
| 4 | Age, WHO, CS, log-PSA, IGFBP3, VGLL3, F3, min(3G), max(3G) [1,9,9,3,1,9,1,3,1] | 2.81 | 1.76 | 2.72 | 2.41 | 0.0038 |
Abbreviations: CS, clinical stage; 3G, IGFBP3, VGLL3, F3; kNN, k-nearest neighbor; WHO, World Health Organization tumor grading.
Figure 4Receiver operating characteristic (ROC) curves for 5-year survival prediction. Prediction of survival time was modeled using a parametric model based on the assumption of the Weibull distribution. ROC curves at 5-year survival prediction show the sensitivity and the specificity of survival prediction. Overall (upper panel), PCa-specific (middle panel) and non-PCa-specific survival (bottom panel) predictions at 5 years were determined by the clinical parameters alone (black lines), and by both clinical parameters and the tumor subtypes classified by embryonic stem cell gene predictor (ESCGP) signature (red lines). The area under the curve (AUC) values of overall, PCa-specific and non-PCa-specific survival predictions were all increased by adding ESCGP signature. Positive predictive value (PPV) and negative predictive value (NPV) both increased.