| Literature DB >> 22752290 |
A Prat1, J S Parker, C Fan, C M Perou.
Abstract
It has recently been proposed that a three-gene model (SCMGENE) that measures ESR1, ERBB2, and AURKA identifies the major breast cancer intrinsic subtypes and provides robust discrimination for clinical use in a manner very similar to a 50-gene subtype predictor (PAM50). However, the clinical relevance of both predictors was not fully explored, which is needed given that a ~30 % discordance rate between these two predictors was observed. Using the same datasets and subtype calls provided by Haibe-Kains and colleagues, we compared the SCMGENE assignments and the research-based PAM50 assignments in terms of their ability to (1) predict patient outcome, (2) predict pathological complete response (pCR) after anthracycline/taxane-based chemotherapy, and (3) capture the main biological diversity displayed by all genes from a microarray. In terms of survival predictions, both assays provided independent prognostic information from each other and beyond the data provided by standard clinical-pathological variables; however, the amount of prognostic information was found to be significantly greater with the PAM50 assay than the SCMGENE assay. In terms of chemotherapy response, the PAM50 assay was the only assay to provide independent predictive information of pCR in multivariate models. Finally, compared to the SCMGENE predictor, the PAM50 assay explained a significantly greater amount of gene expression diversity as captured by the two main principal components of the breast cancer microarray data. Our results show that classification of the major and clinically relevant molecular subtypes of breast cancer are best captured using larger gene panels.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22752290 PMCID: PMC3413822 DOI: 10.1007/s10549-012-2143-0
Source DB: PubMed Journal: Breast Cancer Res Treat ISSN: 0167-6806 Impact factor: 4.872
Distant metastasis-free survival Cox proportional hazards models of primary breast cancer patients
| Variables | Univariate analysis | Multivariate analysis | ||||||
|---|---|---|---|---|---|---|---|---|
| HR | Lower 95 % | Upper 95 % |
| HR | Lower 95 % | Upper 95 % |
| |
| Age (cont. variable) | 0.989 | 0.983 | 0.996 | 0.003 | 0.996 | 0.988 | 1.003 | 0.257 |
| Node status | 1.176 | 0.851 | 0.992 | 0.063 | 1.695 | 1.315 | 2.184 | <0.001 |
| Tumor size T2–T4 versus T0–T1 | 1.305 | 1.104 | 1.541 | 0.002 | 1.242 | 1.042 | 1.480 | 0.015 |
| Treatment (yes vs. no) | 0.973 | 0.845 | 1.121 | 0.707 | 0.547 | 0.428 | 0.700 | <0.001 |
| PAM50 | ||||||||
| Luminal A | 1.0 | – | – | – | 1.0 | – | – | – |
| Luminal B | 1.797 | 1.503 | 2.149 | <0.001 | 2.041 | 1.578 | 2.641 | <0.001 |
| HER2-E | 2.677 | 2.120 | 3.380 | <0.001 | 1.648 | 1.073 | 2.530 | 0.023 |
| Basal-like | 2.144 | 1.737 | 2.647 | <0.001 | 1.312 | 0.812 | 2.121 | 0.268 |
| Normal-like | 1.073 | 0.670 | 1.718 | 0.769 | 1.024 | 0.572 | 1.835 | 0.936 |
| Three-gene signature | ||||||||
| ER+/HER2−/Low Prolif | 1.0 | – | – | – | 1.0 | – | – | – |
| ER+/HER2−/High Prolif | 1.852 | 1.531 | 2.241 | <0.001 | 1.153 | 0.882 | 1.508 | 0.297 |
| HER2+ | 2.785 | 2.196 | 3.533 | <0.001 | 1.588 | 1.053 | 2.395 | 0.028 |
| ER−/HER2− | 2.536 | 2.041 | 3.150 | <0.001 | 1.762 | 1.095 | 2.835 | 0.020 |
HER2-E HER2-enriched, Prolif proliferation, HR hazard ratio
Fig. 1Distant metastasis-free survival likelihood ratio statistics of subtypes defined by the PAM50 or the SCMGENE predictors, after accounting for clinical–pathological variables (age at diagnosis, nodal status, treatment and tumor size). Models were first conditioned on one predictor and the clinical–pathological variables, and then the significance of the other was tested. (A – B) Entire combined dataset (n = 2,008), (C–D) subset of patients that did not receive adjuvant systemic therapy (n = 994), (E–F) subset of patients with HR+ tumors that received adjuvant tamoxifen-only (n = 491). Similar results are obtained if a term for dataset is included in the model
pCR logistic regression models of the MAQC2 (GSE20194) [21] neoadjuvant breast cancer dataset
| Variables |
| pCR rate (%) | Univariate analysis | Multivariate analysis | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| OR | Lower 95 % | Upper 95 % |
| OR | Lower 95 % | Upper 95 % |
| |||
| Age (cont. variable) | – | – | 1.0 | 0.95 | 1.01 | 0.169 | – | – | – | – |
| Tumor size | ||||||||||
| T0–T1 | 23 | 35 | 1.0 | – | – | – | 1.0 | – | – | – |
| T2–T4 | 203 | 19 | 2.3 | 0.92 | 5.86 | 0.076 | 0.4 | 0.13 | 1.23 | 0.111 |
| PAM50 | ||||||||||
| Luminal A | 66 | 3 | 1.0 | – | – | – | 1.0 | – | – | – |
| Luminal B | 66 | 9 | 3.2 | 0.62 | 16.47 | 0.164 | 5.2 | 0.68 | 37.97 | 0.108 |
| HER2-E | 28 | 46 | 23.5 | 5.25 | 105.36 | <0.001 | 12.5 | 1.46 | 145.68 | 0.030 |
| Basal-like | 59 | 42 | 27.7 | 5.65 | 136.18 | <0.001 | 25.3 | 2.64 | 255.95 | 0.005 |
| Normal-like | 7 | 0 | 0.0 | 0.00 | – | 0.988 | 0.0 | 0.00 | – | 0.988 |
| Three-gene signature | ||||||||||
| ER+/HER2−/Low Prolif | 52 | 4 | 1.0 | – | – | – | 1.0 | – | – | – |
| ER+/HER2−/High Prolif | 85 | 8 | 2.2 | 0.45 | 11.23 | 0.325 | 0.6 | 0.08 | 4.62 | 0.633 |
| HER2+ | 24 | 50 | 25.0 | 4.93 | 126.80 | <0.001 | 3.9 | 0.34 | 46.46 | 0.275 |
| ER−/HER2− | 65 | 38 | 15.6 | 3.49 | 69.93 | <0.001 | 0.9 | 0.09 | 9.97 | 0.954 |
HER2-E HER2-enriched, Prolif proliferation, OR odds ratio
Fig. 2PC1 and PC2 loading plots of 3,316 samples using 18 Affymetrix and Agilent-based datasets taken from Haibe-Kains et al. [6]. Samples colored based on the a SCMGENE calls, or b PAM50 subtype calls. PC1 and PC2 R 2 values obtained from simple linear regression models are shown. Only datasets with >50 % and <90 % ER+ tumors were included in this analysis. Blue Luminal A or ER+/HER2−/Low Proliferative, light blue Luminal B or ER+/HER2−/High Proliferative, pink HER2-enriched or HER2+, red Basal-like or ER−/HER2−, green normal-like, black normal breast samples (only present in the UNC337 dataset [29]). For the UNC337 dataset, we colored samples based on the subtype calls obtained after median centering as shown in Supplemental Fig. 1