| Literature DB >> 18662380 |
Pratyaksha Wirapati1, Christos Sotiriou, Susanne Kunkel, Pierre Farmer, Sylvain Pradervand, Benjamin Haibe-Kains, Christine Desmedt, Michail Ignatiadis, Thierry Sengstag, Frédéric Schütz, Darlene R Goldstein, Martine Piccart, Mauro Delorenzi.
Abstract
INTRODUCTION: Breast cancer subtyping and prognosis have been studied extensively by gene expression profiling, resulting in disparate signatures with little overlap in their constituent genes. Although a previous study demonstrated a prognostic concordance among gene expression signatures, it was limited to only one dataset and did not fully elucidate how the different genes were related to one another nor did it examine the contribution of well-known biological processes of breast cancer tumorigenesis to their prognostic performance.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18662380 PMCID: PMC2575538 DOI: 10.1186/bcr2124
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Publicly available gene expression data from breast cancer studies
| Dataset symbol | Number of arrays | Institution | Reference(s) | Platform | Data source | Number of GeneIDs |
| Genomic platforms | ||||||
| NKI | 337 | Nederlands Kanker Instituut (Amsterdam, The Netherlands) | [ | Agilent | Author's website | 13,120 |
| EMC | 286 | Erasmus Medical Center (Rotterdam, The Netherlands) | [ | Affymetrix U133A | GEO: GSE2034 | 11,837 |
| UPP | 249 | Karolinksa Institute (Uppsala, Sweden) | [ | Affymetrix U133A,B | GEO: GSE4922 | 15,684 |
| STOCK | 159 | Karolinska Institute (Stockholm, Sweden) | [ | Affymetrix U133A,B | GEO: GSE1456 | 15,684 |
| DUKE | 171 | Duke University (Durham, NC, USA) | [ | Affymetrix U95Av2 | Author's website | 8,149 |
| UCSF | 161 + 8 | University of California at San Francisco (USA) | [ | cDNA | Author's website | 6,178 |
| UNC | 143 + 10 | University of North Carolina (Chapel Hill, NC, USA) | [ | Agilent HuA1 | Author's website | 13,784 |
| NCH | 135 | Nottingham City Hospital (Nottingham, UK) | [ | Agilent HuA1 | AE: E-UCON-1 | 13,784 |
| STNO | 115 + 7 | Stanford University (Palo Alto, CA, USA)/Norwegian Radium Hospital (Oslo, Norway) | [ | cDNA | Author's website | 5,614 |
| JRH1 | 99 | John Radcliffe Hospital (Oxford, UK) | [ | cDNA | Journal's website | 4,112 |
| JRH2 | 61 | John Radcliffe Hospital | [ | Affymetrix U133A | GEO: GSE2990 | 11,837 |
| MGH | 60 | Massachusetts General Hospital (Boston, MA, USA) | [ | Agilent | GEO: GSE1379 | 11,421 |
| expO | 239 | International Genomic Consortium | [ | Affymetrix U133v2 | GEO: GSE2109 | 16,634 |
| TGIF1 | 49 | EORTC trial 10994 | [ | Affymetrix U133A | GEO: GSE1561 | 11,837 |
| BWH | 40 + 7 | Brigham and Women's Hospital (Boston, MA, USA) | [ | Affymetrix U133v2 | GEO: GSE3744 | 16,634 |
| Small diagnostic platforms | ||||||
| TRANSBIG | 253 | TRANSBIG Consortium | [ | Agilent | AE: E-TABM-77 | 1,052 |
| EMC2 | 180 | Erasmus Medical Center | [ | Affymetrix (custom) | GSE3453 | 86 |
| HPAZ | 96 | Hospital La Paz (Madrid, Spain) | [ | RT-PCR | Appendix of [ | 61 |
| Total | 2,865 = 2,833 carcinomas + 32 nonmalignant breast tissues | Number of the union of all GeneIDs: | 17,198 | |||
| Number of GeneIDs common to genomic platforms: | 1,963 | |||||
Datasets UNC, STNO, UCSF, and BWH include a small number of normal breast or fibroadenoma tissue samples. AE, ArrayExpress (accession); Affymetrix, Affymetrix, Inc., Santa Clara, CA, USA; Agilent, Agilent Technologies, Inc., Santa Clara, CA, USA; EORTC, European Organization for Research and Treatment of Cancer; GEO, Gene Expression Omnibus (accession); RT-PCR, reverse transcription-polymerase chain reaction.
Figure 1Breast tumor characterization using module scores. (a) Joint distribution between the estrogen and ERBB2 amplification scores in example datasets. Clusters are identified by Gaussian mixture models with three components. The ellipses correspond to the 95% cumulative probability around the cluster centers. The clusters are designated as tumor types ER-/ERBB2-, HER2+, and ER+/HER2-. HER2+ tumors show intermediate estrogen scores. (b) Dot histograms showing dependence of proliferation score on the subtypes. The median and quartiles for each group are shown by the box plot. ER-/ERBB2- and HER2+ tumors show high proliferation scores, whereas ER+/HER2- tumors show a wide range of proliferation scores. The distributions of the intrinsic subtypes (colored dots), BRCA1 mutations, and p53 mutations are shown in datasets where they are available. ER, estrogen receptor.
Figure 2Survival analysis of groups based on module scores. Kaplan-Meier analysis for distant relapse-free survival (DRFS) of systemically untreated (a) and treated (b) patient groups. The ER+ subgroup is split into ER+/HER2-/L and ER+/HER2-/H (low and high proliferation, respectively). Vertical bars on the curves are 95% confidence intervals for the Kaplan-Meier survival estimates. Forest plot representation of the 5-year survival estimates and hazard ratios for DRFS of individual datasets in the systemically untreated (c) and treated (d) populations. The length of horizontal bars and the width of the diamonds of the 'Total' correspond to 95% confidence intervals. Missing bars are unavailable data. Multivariate analysis representation in which all the variables are available in systemically untreated (e) and treated (f) patients. ER, estrogen receptor; HR, hazard ratio.
Prognostic signatures
| Signature symbol | Reference | Associated variables in gene selection procedure | Number of genes | |
| Original probes | Mapped to geneID | |||
| ONC-16 | [ | Biological knowledge; refined by patient outcome | 16 | 16 |
| NKI-70 | [ | Patient outcome | 70 | 52 |
| EMC-76 | [ | Patient outcome, stratified by estrogen receptor status | 60 + 16 | 48 + 12 |
| NCH-70 | [ | Patient outcome | 70 | 69 |
| CON-52 | [ | Patient outcome, consensus | 52 | 50 |
| p53-32 | [ | p53 mutation | 32 | 19 |
| CSR | [ | Fibroblast core serum response | 512 | 457 |
| GGI-128 | [ | Histological grade | 128 | 98 |
| CCYC | [ | Periodic expression in cell cycle progression | NA | 126 |
NA, not applicable.
Figure 3Signature comparison. The prognostic performance of the signatures is compared by the forest plots of hazard ratio and plotted as vertical color bars for comparison. Most signatures show similar performance. Prognostic performance for distant relapse-free survival (DRFS) of the signatures using partial signatures containing only proliferation genes in the untreated (a) and treated (c) populations. The performance of most signatures is not degraded; in fact, it is improved for p53-32. Prognostic performance for DRFS of the signatures using partial signatures containing nonproliferation genes in the untreated (b) and treated (d) populations.
Figure 4Patient classifications made by example signatures applied to representative datasets, showing that the different signatures are essentially detecting as low-risk the low-proliferation subset of ER+/ERBB2- tumors. ER, estrogen receptor.