| Literature DB >> 28356166 |
Miriam Ragle Aure1,2, Valeria Vitelli3, Sandra Jernström1,2, Surendra Kumar1,2,4, Marit Krohn1,2, Eldri U Due1,2, Tonje Husby Haukaas2,5, Suvi-Katri Leivonen6, Hans Kristian Moen Vollan1,2, Torben Lüders2,4, Einar Rødland7, Charles J Vaske8, Wei Zhao9, Elen K Møller1,2, Silje Nord1,2, Guro F Giskeødegård5, Tone Frost Bathen2,5, Carlos Caldas10,11, Trine Tramm12, Jan Alsner12, Jens Overgaard12, Jürgen Geisler13,14, Ida R K Bukholm14,15, Bjørn Naume2,16, Ellen Schlichting2,17, Torill Sauer2,14,18, Gordon B Mills9, Rolf Kåresen2,14,17, Gunhild M Mælandsmo2,7, Ole Christian Lingjærde2,19,20, Arnoldo Frigessi3,21, Vessela N Kristensen1,2,4, Anne-Lise Børresen-Dale1,2, Kristine K Sahlberg22,23,24.
Abstract
BACKGROUND: Breast cancer is a heterogeneous disease at the clinical and molecular level. In this study we integrate classifications extracted from five different molecular levels in order to identify integrated subtypes.Entities:
Keywords: Breast cancer; Consensus clustering; Integration; Luminal A; MicroRNA
Mesh:
Substances:
Year: 2017 PMID: 28356166 PMCID: PMC5372339 DOI: 10.1186/s13058-017-0812-y
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Fig. 1The seven input levels for integrative clustering and association with clinical/molecular classifications. Original subtypes/clusters of each input level and corresponding PAM50 gene expression subtype, estrogen receptor (ER), progesterone receptor (PR) and epidermal growth factor receptor 2 (HER2) status, and TP53 and PIK3CA mutation status of the tumor samples. The tumor samples are sorted in the following order: molecular level, PAM50 gene expression subtype, ER, PR, HER2, TP53 and PIK3CA status. a PAM50 gene expression subtypes (n = 377). b Reverse-phase protein array (RPPA) subtypes (n = 173). c Complex arm aberration index (CAAI) subtypes (n = 349). 0 no CAAI events, 1 one CAAI event, 2 at least two CAAI events. d miRNA clusters (n = 423). e Metabolic clusters (n = 233). f Integrated clusters (IntClust; n = 291). g Pathway recognition algorithm using data integration on genomic models (PARADIGM) clusters (n = 312). Lum luminal, Pos positive, Neg negative, Mut mutant, Wt wild-type, NA not applicable
Fig. 2Cluster-of-clusters analysis (COCA) identifies six major groups based on seven molecular input levels. Consensus clustering was used to cluster 419 primary breast cancers in the Oslo2 study. The six resulting COCA clusters are numbered and the corresponding PAM50 subtype indicated (top). Heatmap representation of the subtypes/clusters independently defined: PAM50 mRNA subtypes, reverse-phase protein array (RPPA) expression subtypes, complex arm aberration index (CAAI) subtypes based on copy numbers, miRNA clusters, metabolic clusters, pathway recognition algorithm using data integration on genomic models (PARADIGM) clusters and integrated clusters (IntClust). Colored bar indicates membership of a subtype/cluster type, white indicates no membership to a given subtype and gray represents data not available (NA). The rows in the heatmap are ordered according to clustering. Clinical annotation of the tumors is shown (bottom). HER2 human epidermal growth factor receptor 2, Lum luminal, Mut mutant, WT wild-type
Top five molecular subtype levels ranked according to correlation to each COCA cluster
| COCA cluster 1 | COCA cluster 2 | COCA cluster 3 | COCA cluster 4 | COCA cluster 5 | COCA cluster 6 |
|---|---|---|---|---|---|
| miRNA2 ( | PAM50 - LumB ( | IntClust10 ( | PAM50 - LumA ( | miRNA4 ( | PAM50 - HER2 ( |
| PAM50 - LumA (0.23) | PARADIGM1 ( | PAM50 - Basal ( | miRNA1 ( | PAM50 - Normal ( | IntClust5 ( |
| PARADIGM6 (0.15) | miRNA1 ( | PARADIGM2 ( | RPPA - Luminal ( | RPPA - ReacI (0.25) | RPPA - HER2 ( |
| IntClust4 (0.12) | IntClust1 (0.28) | RPPA - Basal ( | IntClust3 (0.29) | Metabolic2 (0.15) | CAAI3 (0.28) |
| PARADIGM5 (0.12) | IntClust9 (0.27) | miRNA3 ( | PARADIGM4 (0.27) | miRNA3 (0.13) | PARADIGM2 (0.28) |
Values in parentheses are the Pearson correlation values. Correlation ≥0.3 (p < 0.001 for all) is indicated in bold font. COCA cluster-of-clusters analysis, LumA luminal A, PARADIGM pathway recognition algorithm using data integration on genomic model, IntClust integrated clusters, RPPA reverse-phase protein array, CAAI complex arm aberration index
Fig. 3Correlation between cluster-of-clusters analysis (COCA) clusters and molecular input levels. Pearson correlation coefficient (y-axis) calculated between each molecular subtype level and each COCA cluster (x-axis) by coding membership to a cluster as 1 and 0 otherwise. Each panel represents one molecular input level to the COCA analysis. RPPA reverse-phase protein array, CAAI complex arm aberration index, PARADIGM pathway recognition algorithm using data integration on genomic models, IntClust integrated clusters, HER2 human epidermal growth factor receptor 2, Lum luminal
Fig. 4miRNA expression separates luminal A tumors into clusters with different outcomes. Top panel Luminal A tumors in The Cancer Genome Atlas (TCGA), Molecular Taxonomy of Breast Cancer International Consortium (METABRIC), Danish Breast Cancer Cooperative Group (DBCG) and the Oslo Micrometastasis cohort (Micma) breast cancer cohorts were clustered based on the expression of selected miRNAs using Pearson correlation and complete linkage (patients in columns and miRNAs in rows). Bottom panel Kaplan-Meier survival curves for the red and blue clusters in the top panel. The p-values are from log-rank tests (METABRIC p-value was adjusted for hospital site and DBCG p-value was adjusted for radiation therapy and lymph node status). Dashed lines indicate confidence intervals for the survival curves