| Literature DB >> 18498629 |
Sherene Loi1, Benjamin Haibe-Kains, Christine Desmedt, Pratyaksha Wirapati, Françoise Lallemand, Andrew M Tutt, Cheryl Gillet, Paul Ellis, Kenneth Ryder, James F Reid, Maria G Daidone, Marco A Pierotti, Els Mjj Berns, Maurice Phm Jansen, John A Foekens, Mauro Delorenzi, Gianluca Bontempi, Martine J Piccart, Christos Sotiriou.
Abstract
BACKGROUND: Estrogen receptor positive (ER+) breast cancers (BC) are heterogeneous with regard to their clinical behavior and response to therapies. The ER is currently the best predictor of response to the anti-estrogen agent tamoxifen, yet up to 30-40% of ER+BC will relapse despite tamoxifen treatment. New prognostic biomarkers and further biological understanding of tamoxifen resistance are required. We used gene expression profiling to develop an outcome-based predictor using a training set of 255 ER+ BC samples from women treated with adjuvant tamoxifen monotherapy. We used clusters of highly correlated genes to develop our predictor to facilitate both signature stability and biological interpretation. Independent validation was performed using 362 tamoxifen-treated ER+ BC samples obtained from multiple institutions and treated with tamoxifen only in the adjuvant and metastatic settings.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18498629 PMCID: PMC2423197 DOI: 10.1186/1471-2164-9-239
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Overview of the analysis design. (a) First part of the analysis including quality controls, normalization, preliminary clustering performed on the untreated dataset, computation of the cluster centroids on the tamoxifen treated dataset, and estimation of signature stability with regards to signature size, using cross-validation. (b) Second part of the analysis including the classifier development, performance assessment by cross-validation and performance assessment on independent validation data sets.
Figure 2Signature stability. Signature stability demonstrating frequency of selection for the various clusters in multiple 10-fold cross-validations.
Performance of the classifier. Performance of the 13 clusters classifier algorithm re-training and validation on the separate institutional populations using both leave-one-out and multiple 10-fold cross-validations.
| DMFS at 3 years | DMFS at 5 years | DMFS at 10 years | ||||
| OXFT (99/19) | KIT/GUYT(156/48) | 2.17 (1.2–3.91) | <0.00001 | 91% | 87% | 79% |
| KIT (69/20) | OXFT/GUYT (186/47) | 4.07 (2.23–7.41) | <0.00001 | 96% | 92% | 88% |
| GUYT (87/28) | OXFT/KIT (168/39) | 5.93 (3.0–11.75) | <0.00001 | 93% | 89% | 82% |
| KIT/GUYT (156/48) | OXFT (99/19) | 14.59 (5.38–39.5) | <0.00001 | 97% | 94% | 91% |
| OXFT/GUYT (186/47) | KIT (69/20) | 3.44 (1.36–8.67) | 0.005 | 96% | 92% | 84% |
| OXFT/KIT(168/39) | GUYT (87/28) | 2.23 (1.05–4.71) | 0.03 | 96% | 92% | 84% |
| 3.86 (2.32–6.41) | <0.0001 | 94% | 91% | 84% | ||
| 3.23 (2.66–3.84) | <0.0001 | 94% | 90% | 83% | ||
# as estimated by Kaplan Meier survival curves.
Patients samples obtained from: OXFT: John Radcliffe Hospital, Oxford, UK; KIT Uppsala University hospital, Uppsala, Sweden; GUYT Guys hospital, London, UK.
• Reported results.
Figure 3Survival curves for training set. Kaplan Meier curves for the binary classification computed using leave-one-out cross-validation on the tamoxifen-treated dataset (n = 255). The two survival curves were significantly different according to the log rank test (p < 0.0001).
Cox regression analysis. Univariate and multivariate Cox regression analysis for time to distant metastases in 255 patients.
| Hazard ratio (95%CI) | p | Hazard ratio (95%CI) | p | |
| Histological grade (1 vs. 2 vs. 3) | 3.14 (1.37–7.17) | 0.007 | 0.94 (0.36–2.42) | 0.9 |
| Tumor size (≤ 20 mm vs. ≥ 20 mm) | 2.18 (1.27–3.75) | 0.005 | 1.58(0.83–3) | 0.2 |
| Nodal status (positive vs. negative) | 1.62 (0.95–2.79) | 0.08 | 1.30 (0.71–2.37) | 0.4 |
| ER high vs. low expression | 0.86 (1.18–0.522) | 0.5 | 0.97 (0.56–1.7) | 0.9 |
| PgR high vs. low expression | 0.42 (0.25–0.7) | 0.0007 | 0.49 (0.26–0.9) | 0.02 |
| HER2 high vs. low expression | 0.88 (0.55–1.42) | 0.6 | 0.66 (0.37–1.18) | 0.2 |
| 13 cluster gene classifier* | 3.86 (2.32–6.41) | <0.0001 | 3.26 (1.76–6.05) | 0.0002 |
#Multivariate model contained included 210 patients due to missing values, stratified by population.
*Binary classification using leave-one-out cross-validation.
**Age was not included in the model as 92% of patients were ≥ 50 years of age.
ER: estrogen receptor status represented by ESR1 Affymetrix probe set 205225_at.
PgR: progesterone receptor status represented by PGR Affymetrix probe set 208305_at.
HER2: represented by ERBB2 Affymetrix probe set 216836_s_at.
For ER, PgR and HER2, high vs. low expression groups was defined by generating groups at the median value.
Mapping. Number of probe sets able to be mapped across datasets during independent validations.
| 79 | 7 | 7 | 3 | 1 | 2 |
| 148 | 45 | 45 | 23 | 23 | 15 |
| 112 | 14 | 14 | 5 | 4 | 9 |
| 120 | 38 | 38 | 18 | 15 | 16 |
| 375 | 8 | 8 | 4 | 3 | 3 |
| 201 | 17 | 17 | 5 | 8 | 6 |
| 521 | 19 | 19 | 10 | 8 | 5 |
| 784 | 7 | 7 | 1 | 1 | 1 |
| 859 | 7 | 7 | 4 | 2 | 2 |
| 360 | 14 | 14 | 4 | 2 | 0 |
| 231 | 26 | 26 | 8 | 4 | 9 |
| 110 | 30 | 30 | 13 | 3 | 10 |
| 337 | 7 | 7 | 4 | 1 | 1 |
| 239 (100%) | 239 (100%) | 102 (42.6%) | 75 (31.4%) | 73 (30.5%) | |
Figure 4External validation of the classifier. (a) Kaplan Meier curves for the GUYT2 dataset. The two survival curves were significantly different according to the log rank test (p = 0.03). (b) Forest plots of hazard ratios obtained from the three independent validation datasets.
Functional analysis. Functional analysis of the 13 clusters from the gene signature (for full gene list [see Additional file 2b]).
| 79 | Cancer Inflammatory disease Cell cycle | Lipid metabolism Molecular transport | cAMP mediated signaling | 3 |
| 148 | Cancer Immune response | Cell cycle | 1 carbon pool by folate | 28 |
| 112 | Gene expression | Gene expression Protein synthesis | EGF signaling | 10 |
| 120 | Cell cycle Cellular movement | Cell cycle | G2/M checkpoint | 28 |
| 375 | Cellular movement, inflammatory disease | Carbohydrate metabolism | TGF-beta signaling | 4 |
| 201 | DNA recombination and repair | Cell cycle | G1/S checkpoint | 11 |
| 521 | Cell cycle | Cell cycle | G1/S checkpoint | 14 |
| 784 | Cell death | Cell morphology Cellular development | IL4 signaling | 3 |
| 859 | Gene expression | Cell morphology | None given | 4 |
| 360 | DNA recombination and repair | Cell cycle | None given | 6 |
| 231 | Cell to cell signaling and interaction | Embryonic development | IGF1 pathway | 13 |
| 110 | Cell death Cellular development | Cancer Inflammation | PDGF signaling | 14 |
| 337 | Cell morphology | Cellular function and maintenance | None given | 1 |
* note that clusters (pclust) often contained probe sets that represented the same gene