| Literature DB >> 21785460 |
M Guedj1, L Marisa, A de Reynies, B Orsetti, R Schiappa, F Bibeau, G MacGrogan, F Lerebours, P Finetti, M Longy, P Bertheau, F Bertrand, F Bonnet, A L Martin, J P Feugeas, I Bièche, J Lehmann-Che, R Lidereau, D Birnbaum, F Bertucci, H de Thé, C Theillet.
Abstract
The current histoclinical breast cancer classification is simple but imprecise. Several molecular classifications of breast cancers based on expression profiling have been proposed as alternatives. However, their reliability and clinical utility have been repeatedly questioned, notably because most of them were derived from relatively small initial patient populations. We analyzed the transcriptomes of 537 breast tumors using three unsupervised classification methods. A core subset of 355 tumors was assigned to six clusters by all three methods. These six subgroups overlapped with previously defined molecular classes of breast cancer, but also showed important differences, notably the absence of an ERBB2 subgroup and the division of the large luminal ER+ group into four subgroups, two of them being highly proliferative. Of the six subgroups, four were ER+/PR+/AR+, one was ER-/PR-/AR+ and one was triple negative (AR-/ER-/PR-). ERBB2-amplified tumors were split between the ER-/PR-/AR+ subgroup and the highly proliferative ER+ LumC subgroup. Importantly, each of these six molecular subgroups showed specific copy-number alterations. Gene expression changes were correlated to specific signaling pathways. Each of these six subgroups showed very significant differences in tumor grade, metastatic sites, relapse-free survival or response to chemotherapy. All these findings were validated on large external datasets including more than 3000 tumors. Our data thus indicate that these six molecular subgroups represent well-defined clinico-biological entities of breast cancer. Their identification should facilitate the detection of novel prognostic factors or therapeutical targets in breast cancer.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21785460 PMCID: PMC3307061 DOI: 10.1038/onc.2011.301
Source DB: PubMed Journal: Oncogene ISSN: 0950-9232 Impact factor: 9.867
Figure 1Breast tumor classification according to the CIT classification into six subgroups of tumors. (a) Heatmap representing the expression of the 256 genes (nine clusters of genes represented by vertical color bars on the left of the heatmap) through the six groups. (b) Principal component analysis (PCA) of the samples of the coreset according to the 256 gene signature. The first principal component (PC1) represents the combined expression of the three transversal clusters (ER, AR and cell cycle), the second component (PC2) differentiates LumB and NormL. (c) Distribution of mean expression levels of the three transversal gene clusters (ER, AR and Cell Cycle) over the six main molecular subgroups. (d) Comparison of the CIT classification with those obtained using the Sorlie, Hu, Parker and Jönsson systems.
Molecular subgroups show differential correlation to breast cancer clinico-biological parameters and different sites of metastatic relapse
| Total | 53 | 39 | 48 | 66 | 61 | 88 | |
| ER+ (IHC) | 1.00E–50 | 5 (10%) | 1 (3%) | 37 (84%) | 63 (98%) | 58 (97%) | 81 (93%) |
| ER− (IHC) | 46 (90%) | 35 (97%) | 7 (16%) | 1 (2%) | 2 (3%) | 6 (7%) | |
| ER+ (EXP) | 6.00E–68 | 3 (6%) | 2 (5%) | 48 (100%) | 66 (100%) | 61 (100%) | 87 (99%) |
| ER− (EXP) | 50 (94%) | 37 (95%) | 0 (0%) | 0 (0%) | 0 (0%) | 1 (1%) | |
| PR+ (IHC) | 2.00E–25 | 4 (8%) | 1 (3%) | 25 (54%) | 43 (67%) | 53 (88%) | 62 (71%) |
| PR− (IHC) | 48 (92%) | 34 (97%) | 21 (46%) | 21 (33%) | 7 (12%) | 25 (29%) | |
| PR+ (EXP) | 1.00E–37 | 5 (9%) | 5 (13%) | 32 (67%) | 47 (71%) | 58 (95%) | 85 (97%) |
| PR− (EXP) | 48 (91%) | 34 (87%) | 16 (33%) | 19 (29%) | 3 (5%) | 3 (3%) | |
| ERBB2+ (IHC) | 9.00E–19 | 3 (7%) | 19 (68%) | 10 (26%) | 5 (11%) | 0 (0%) | 0 (0%) |
| ERBB2− (IHC) | 43 (93%) | 9 (32%) | 28 (74%) | 41 (89%) | 37 (100%) | 74 (100%) | |
| ERBB2+ (EXP) | 4.00E–31 | 2 (4%) | 29 (74%) | 20 (42%) | 2 (3%) | 0 (0%) | 5 (6%) |
| ERBB2− (EXP) | 51 (96%) | 10 (26%) | 28 (58%) | 64 (97%) | 61 (100%) | 83 (94%) | |
| AR+ (EXP) | 2.00E–57 | 2 (4%) | 32 (82%) | 47 (98%) | 63 (95%) | 61 (100%) | 88 (100%) |
| AR− (EXP) | 51 (96%) | 7 (18%) | 1 (2%) | 3 (5%) | 0 (0%) | 0 (0%) | |
| P53mut | 1.00E–15 | 29 (83%) | 13 (72%) | 24 (69%) | 5 (16%) | 1 (4%) | 1 (5%) |
| P53wt | 6 (17%) | 5 (28%) | 11 (31%) | 27 (84%) | 27 (96%) | 21 (95%) | |
| Ductal | 0.05 | 51 (98%) | 32 (84%) | 39 (87%) | 54 (84%) | 50 (83%) | 61 (77%) |
| Lobular | 0.004 | 1 (2%) | 1 (3%) | 3 (7%) | 3 (5%) | 5 (8%) | 15 (19%) |
| Other | 0.1 | 0 (0%) | 5 (13%) | 3 (7%) | 7 (11%) | 5 (8%) | 3 (4%) |
| SBR Grade 1 | 8.00E–11 | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 7 (12%) | 23 (27%) |
| SBR Grade 2 | 2.00E–13 | 6 (11%) | 8 (21%) | 21 (47%) | 38 (58%) | 44 (77%) | 53 (62%) |
| SBR Grade 3 | 4.00E–26 | 47 (89%) | 30 (79%) | 24 (53%) | 28 (42%) | 6 (11%) | 9 (11%) |
| Age (median) | 4.00E–07 | 50 | 56 | 54 | 57 | 62 | 52 |
| MR 5year | 0.001 | 17 (36%) | 14 (38%) | 11 (34%) | 15 (26%) | 9 (20%) | 6 (8%) |
| MR 15year | 0.01 | 17 (36%) | 14 (38%) | 13 (41%) | 18 (32%) | 10 (22%) | 11 (15%) |
| Bones | 0.01 | 4 (24%) | 8 (57%) | 7 (54%) | 14 (78%) | 7 (70%) | 9 (82%) |
| Brain | 0.06 | 5 (29%) | 3 (21%) | 1 (8%) | 0 (0%) | 0 (0%) | 2 (18%) |
| Liver | 0.7 | 5 (29%) | 6 (43%) | 7 (54%) | 8 (44%) | 3 (30%) | 3 (27%) |
| Lung | 0.9 | 6 (35%) | 4 (29%) | 6 (46%) | 8 (44%) | 3 (30%) | 4 (36%) |
| Other | 0.1 | 4 (24%) | 1 (7%) | 7 (54%) | 8 (44%) | 3 (30%) | 3 (27%) |
Abbreviations: CIT, Cartes d'Identité des Tumeurs program; MR, metastasis relapse.
Expression of ER, PR and ERBB2/HER2 were determined by immunohistochemistry as well as by RNA expression (for greater details see Supplementary Methods). TP53 mutation status was determined by the yeast functional assay (Supplementary Methods). P-values for qualitative variables (ER, PR, ERBB2/HER2, TP53 mutation, histological type, SBR grading) result from a Fisher exact test. P-values for quantitative variables (median age) result from an analysis of variance. MR was determined 5 and 15 years after surgery. Frequency of MR in a subgroup was calculated as the ratio of MR with the total number of MR. For each subgroup, percentages of MR in a given site are determined by the number of MR in this site over the whole number of MR in the subgroup. MR may occur at more than one site; hence, the sum of percentages may not equate 100.
Figure 2Breast cancer molecular subgroups show distinctly different disease outcome. Kaplan–Meier curves shown in this figure represent disease-free survival with metastatic relapse as an end point. (a, b) show survival curves in the CIT and validation set, respectively. Abrupt breaks in some curves of (a) are related to small numbers of patients with long-term follow-up in these subgroups. These appear smoothed out in (b) because of greater numbers in the validation set.
Figure 3Molecular subgroups show differential activation of major signaling pathways: correlations between a given pathway and a subgroup are indicated by color boxes. Red boxes show upregulation of the pathway, green downregulation. Up or downregulation was deduced using KEGGanim tool where relative expression measures are projected in the related KEGG pathway interaction graph. Pathways showing no clear direction of regulation were excluded.
Figure 4Breast cancer molecular subgroups present different copy-number change (CNC) profiles. CNC profiles were established using genome-wide array-CGH on a 488 breast tumor dataset and subsequently stratified according to the CIT classification. Panel a shows frequency of gains (vertical bars going up) or losses (bars going down) at a given location on the genome. Graphs from top to bottom correspond to profiles of the whole CIT breast cancer set and each of the six molecular subgroups. Panel b represents regions of CNC correlating to a specific subgroup. Specific genomic regions for the whole CIT set are the ones for which the proportion of alterations (in gain or loss) exceeded 20%. Subgroup-specific regions are those that present significant increase in proportion (at a 0.1 FDR level) in a given subgroup tested against all others. Bars represent P-values after a standard logarithmic transformation.
Figure 5Principal component analysis (PCA) of the CIT coreset expression profiles based on a meta-signature comparing normal mammary epithelial cell subpopulations. A 163 gene signature was produced by comparing different normal mammary cell contingents from three independent studies (GSE16997, GSE18931, GSE11395) and used in a PCA. Samples from the CIT coreset (panel a) and normal mammary gland samples (panel b) from GSE16997 were projected in the two first principal components in the upper and lower panel, respectively.
Prognostic significance of the CIT classification
| P | P | n | P | P | n | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| CIT (ref=normL) | LumA | 1.66 | 0.84–3.30 | 0.15 | 1.8 × 10−5 | 426 | 1.66 | 0.78–3.52 | 2.5 × 10−4 | 6.4 × 10−6 | 371 |
| Other | 3.16 | 1.82–5.48 | 4.3 × 10−5 | 426 | 2.99 | 1.62–5.51 | |||||
| ER (ref=Pos) | Negative | 1.85 | 1.22–2.81 | 0.003 | 0.003 | 426 | 1.19 | 0.72–1.97 | 0.5 | ||
| ERBB2 (ref=Neg) | Positive | 1.18 | 0.74–1.9 | 0.49 | 0.49 | 426 | 0.89 | 0.52–1.5 | 0.66 | ||
| N (ref=0) | 1+ | 1.43 | 0.85–2.38 | 0.18 | 0.17 | 373 | 1.55 | 0.92–2.63 | 0.1 | ||
| T (ref=[0,1]) | >1 | 2.08 | 1.3–3.31 | 0.0021 | 0.0016 | 422 | 2.21 | 1.3–3.76 | 0.003 | ||
| SBR (ref=1) | 2 | 2.92 | 0.91–9.36 | 0.07 | 3 × 10−4 | 418 | |||||
| 3 | 5.19 | 1.63–16.53 | 0.005 | 418 | |||||||
| Chemotherapy adjuvant (ref=No) | Yes | 1.09 | 0.73–1.62 | 0.67 | 0.67 | 378 | |||||
| Hormononal adjuvant (ref=No) | Yes | 0.64 | 0.44–0.94 | 0.02 | 0.02 | 375 | |||||
| CIT (ref=NormL) | LumA | 1.66 | 0.84–3.30 | 1.8 × 10−5 | 426 | 2.0 | 0.74–5.33 | 3.7 × 10−1 | 426 | ||
| Other | 3.16 | 1.82–5.48 | 426 | 1.8 | 0.63–5.06 | ||||||
| Sorlie (ref=NormL) | LumA | 1.37 | 0.74–2.52 | 2.4 × 10−3 | 426 | 1.9 | 0.7–5.03 | 5.0 × 10−1 | |||
| Other | 2.29 | 1.31–4.00 | 426 | 1.4 | 0.57–3.29 | ||||||
| Hu (ref=LumA) | NormL | 1.67 | 0.86–3.25 | 9.6 × 10−5 | 426 | 2.7 | 0.98–7.35 | 4.2 × 10−1 | |||
| Other | 2.88 | 1.69–4.93 | 426 | 1.6 | 0.72–3.73 | ||||||
| Parker (ref=LumA) | NormL | 1.43 | 0.74–2.75 | 3.5 × 10−3 | 426 | 1.25 | 0.49–3.18 | 3.5 × 10−1 | |||
| Other | 2.26 | 1.34–3.81 | 426 | 0.8 | 0.36–1.79 | ||||||
| GGI (ref=Low risk) | High risk | 2.51 | 1.60–3.93 | 3.4 × 10−5 | 426 | 1.0 | 0.43–2.47 | 8.0 × 10−1 | |||
| Van′t Veer (ref=Low risk) | High risk | 2.93 | 2.00–4.27 | 5.9 × 10−9 | 426 | 2.8 | 1.53–5.1 | 3.4 × 10−3 | |||
Abbreviations: CI, confidence-interval; CIT, our classification; HR, hazard ratio.
Relative risk was calculated taking metastatic relapse as an endpoint and compared with that of (a) clinical parameters and (b) of three molecular classifiers (Sorlie, Hu, Parker) and two prognostic signature (GGI, Van′t Veer). The dataset comprised 426 patients from the CIT discovery set for which MFS information was available. Complete clinical information was available in 371 cases explaining the smaller numbers in the multivariate analysis on prognostic factors. Prognostic significance was assessed by applying a Cox model. Columns refer to the HR, the 95% CI and the P-values for both univariate and multivariate models.
Differential response to chemotherapy according to molecular subgroups of the CIT classification
| n | P | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Hess | T/FAC | 125 | pCR | 17 (68%) | 11 (32%) | 3 (7%) | 0 (0%) | 2.6 × 10−9 | |
| no pCR | 8 (32%) | 23 (68%) | 41 (93%) | 22 (100%) | |||||
| CIT | EC | 58 | pCR | 8 (53%) | 6 (46%) | 2 (7%) | 0 (0%) | 1.6 × 10−3 | |
| no pCR | 7 (47%) | 7 (54%) | 25 (93%) | 3 (100%) | |||||
| Bonnefoi | FEC | 66 | pCR | 16 (43%) | 7 (41%) | 5 (42%) | NS | ||
| no pCR | 21 (57%) | 10 (59%) | 7 (58%) | ||||||
| TET | 58 | pCR | 17 (45%) | 6 (35%) | 3 (100%) | 0.11 | |||
| no pCR | 21 (55%) | 11 (65%) | 0 (0%) | ||||||
| Total | 307 | pCR | 58 (50%) | 30 (37%) | 13 (15%) | 0 (0%) | 4.3 × 10−10 | ||
| no pCR | 57 (50%) | 51 (63%) | 73 (85%) | 25 (100%) | |||||
| | n | n | P | ||||||
| ER | ER− | 307 | 4.5 | 2.5–8.4 | 2.1 × 10−08 | 291 | 1.6 | 0.67–4.2 | 0.28 |
| Grade | Grade3 | 291 | 3.2 | 1.8–5.8 | 3.6 × 10−05 | 1.9 | 1–3.5 | 0.04 | |
| CIT molecular classification | BasL/mApo | 307 | 6.1 | 3.1–13 | 7.0 × 10−10 | 3.8 | 1.3–11 | 0.01 | |
Abbreviations: CIT, our classification; NS, not significant; pCR, pathological complete response.
Table 3a shows the correlation between pCR and CIT molecular subgroups. pCR and absence of response (no pCR) to chemotherapy were analyzed in three clinical trials (Hess , Bonnefoi , CIT set). Owing to the small number of data, four main subgroups and two intermediate subgroups were combined into two groups: (LumB; LumC; LumB/C) and (NormL; LumA; NormL/LumA). Treatment description: EC, six cycles of a dose-dense regimen of 75 mg/m2 epirubicin and 1200 mg/m2 cyclophosphamide, given every 14 days; T/FAC, 24 weeks of sequential paclitaxel and fluorouracil-doxorubicin-cyclophosphamide; FEC, fluorouracil, epirubicin, and cyclophosphamide for six cycles; TET, docetaxel for three cycles followed by epirubicin plus docetaxel for three cycles. Correlations were calculated using Fisher exact test. Table 3b shows uni- and multivariate analyses of factors predictive of pCR in the three pooled datasets. Univariate analysis was done using the Fisher exact test and multivariate analysis by logistic regression.