| Literature DB >> 24327800 |
Mads Thomassen1, Qihua Tan, Mark Burton, Torben A Kruse.
Abstract
BACKGROUND: Breast tumors have been described by molecular subtypes characterized by pervasively different gene expression profiles. The subtypes are associated with different clinical parameters and origin of precursor cells. However, the biological pathways and chromosomal aberrations that differ between the subgroups are less well characterized. The molecular subtypes are associated with different risk of metastatic recurrence of the disease. Nevertheless, the performance of these overall patterns to predict outcome is far from optimal, suggesting that biological mechanisms that extend beyond the subgroups impact metastasis.Entities:
Keywords: breast cancer; gene expression; metastasis; microarray; molecular subtypes; pathway analysis
Year: 2013 PMID: 24327800 PMCID: PMC3855200 DOI: 10.4137/CIN.S12840
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Datasets and patients included in the study.
| Data set | Country | Platform | #genes | Outcome | #pt with follow up | #sub typed samples |
|---|---|---|---|---|---|---|
| Amsterdam1 | The Nederlands | Agilent | 24451 | Distant metastasis | 295 | 287 |
| Rotterdam2 | The Nederlands | 133A | 22283 | Distant metastasis | 286 | 277 |
| Stockholm32 | Sweden | 133A+B | 44982 | Death from BC | 159 | 152 |
| Uppsala33 | Sweden | 133A+B | 44982 | Death from BC | 236 | 229 |
| Bild34 | US | 95av2 | 12625 | Metastasis | 158 | 153 |
| Hu8 | 3 Agilent | 22K | Relapse | 105 | 105 | |
| Training total | 1239 | 1203 | ||||
| Mainz9 | Germany | 133A | 22283 | Distant metastasis | 200 | 191 |
| TRANSBIG-S3 | UK | 133A | 22283 | Distant metastasis | 147 | 0 |
| Test total | 347 | 191 | ||||
| Total | 1586 | 1394 |
Figure 1Kaplan–Meyer plot of the five molecular subtypes in seven datasets, including 1,394 tumor samples. Molecular subtypes were determined by the single-sample prediction method reported by Hu et al.8 Hazard ratios were calculated by fitting a Cox model to the data for comparing each group to basal-like tumors. The P-value was calculated using the log-rank test.
Differentially expressed chromosomal regions.
| Basal | ERBB2 | Lum A | Lum B | Normal |
|---|---|---|---|---|
| 1P34, 1P32, 1P22 | 2P11, 2Q37 | 1P33, 1P31, 1Q41 | 4P16 | |
| 3P22, 3P21, 3P14 | 2P25, 2P12 | 1Q23, 1Q41, | ||
Note: Gene sets in bold were also significant in test data set (Mainz).
Differentially expressed pathway gene sets.
| Basal | ERBB2 | Lum A | Lum B | Normal |
|---|---|---|---|---|
| BIOSYNTHESIS_OF_ | AMINO ACID | FATTY_ACID_METABOLISM | ||
| TEL | ||||
Notes: Gene sets in bold were also significant in test data set (Mainz). The number of redundant pathway gene sets is given in brackets.
Figure 2Kaplan–Meyer plots of the 5q14 and TNFR2 groups in basal-like tumors. The training set consisted of 280 basal-like tumors (A and E) and 959 nonbasal-like tumors (B and F). The test set contained 41 basal-like tumors (C and G) and 159 nonbasal-like tumors (D and H). The score is calculated for each sample as the mean standardized expression values for all genes in the gene set The low score group was defined as the third of samples with the lowest score for the indicated gene set, and the high score group was defined as the third of the samples with the highest score. The middle third was excluded from the analysis. The P-values were calculated using the log-rank test.
Abbreviation: TNFR, tumor necrosis factor receptor.
Figure 3Kaplan–Meyer plots of 5q33 and IL-12 groups in ERBB2 tumors. The training set consisted of 196 tumors with high ERBB2 expression (A and E) and 1,043 tumors with low ERBB2 expression (B and F). The test set contained 70 tumors with the highest ERBB2 expression (C and G) and 277 tumors with lowest ERBB2 expression (D and H). The score is calculated for each sample as the mean standardized expression values for all genes in the gene set. The low score group was defined as the third of samples with lowest score for the indicated gene set, and the high score group was defined as the third of the samples with the highest score. The middle third was excluded from the analysis. The P-values were calculated using the log-rank test.
Abbreviation: IL, interleukin.
GSEA meta-analysis within subtypes and survival analysis in training sets.
| Direction in metastasis | Region or pathway | Mean ranking value | Max ranking value | P-value | fdr | Survival analysis P-value |
|---|---|---|---|---|---|---|
| Basal | ||||||
| Up | 5q14 | 26.8 | 188 | 0.0004 | 0.05 | 0.007 |
| 8q24 | 28.2 | 188 | 0.0005 | 0.05 | 0.17 | |
| Down | TNFR2 | 290 | 347 | 0.0002 | 0.04 | 0.009 |
| ERBB2 | ||||||
| Up | 16Q22 | 27.4 | 188 | 0.0004 | 0.05 | 0.15 |
| Xq22 | 28.4 | 188 | 0.0005 | 0.05 | 0.98 | |
| Fructose_mannose | 32.5 | 347 | 0.00004 | 0.01 | 0.32 | |
| Down | 5q33 | 165 | 188 | 0.0002 | 0.04 | 0.08 |
| IL-12 | 303 | 347 | 0.0002 | 0.04 | 0.004 | |
| GPCRDB | 313 | 347 | 0.00006 | 0.02 | 0.15 | |