| Literature DB >> 21082037 |
M C Abba1, E Lacunza, M Butti, C M Aldaz.
Abstract
In this review we provide a systematic analysis of transcriptomic signatures derived from 42 breast cancer gene expression studies, in an effort to identify the most relevant breast cancer biomarkers using a meta-analysis method. Meta-data revealed a set of 117 genes that were the most commonly affected ranging from 12% to 36% of overlap among breast cancer gene expression studies. Data mining analysis of transcripts and protein-protein interactions of these commonly modulated genes indicate three functional modules significantly affected among signatures, one module related with the response to steroid hormone stimulus, and two modules related to the cell cycle. Analysis of a publicly available gene expression data showed that the obtained meta-signature is capable of predicting overall survival (P < 0.0001) and relapse-free survival (P < 0.0001) in patients with early-stage breast carcinomas. In addition, the identified meta-signature improves breast cancer patient stratification independently of traditional prognostic factors in a multivariate Cox proportional-hazards analysis.Entities:
Keywords: biomarkers; breast cancer; gene expression signatures
Year: 2010 PMID: 21082037 PMCID: PMC2978930 DOI: 10.4137/BMI.S5740
Source DB: PubMed Journal: Biomark Insights ISSN: 1177-2719
Figure 1.Overlap beween gene identifiers across 42 breast cancer gene expression signatures. A) Heatmap representation of 946 genes overlapping in more than one gene expression signature. B) Heatmap representation of 117 genes overlapping in at least 5 out of 42 gene expression signatures analyzed. Easch row is a gene and each column is a breast cancer gene expression signature. Presence of a gene is indicated by a blue box, and absence is white.
Figure 2.Hierarchical clustering analysis of the 42 breast cancer gene expression studies, classified them in four groups: the intrinsic subtypes, response to chemotherapy, stromal/extracellular matrix (ECM) and signatures enriched in cell cycle genes. It can clearly be seen that related signatures such us intrinsic subtype and ER-alpha status on the one hand, or stromal and extracellular matrix signatures on the other hand, have a large overlap relative to other gene expression signatures.
Figure 3.Data mining analysis of the gene expression meta-signature. A) Gene ontology (GO) classification of the 117 gene list meta-signature with specific gene ontology annotations based on biological processes or molecular function terms. B) Graph of protein-protein interactions among the 117 gene expression metasignature generated using the STRING database. In the network: links between proteins means the various interactions data supporting the network, colored by evidence type.
Figure 4.Cross-validation of the gene meta-signature with a single data set of 295 breast cancer samples and integration with 4 pronostic or predictive gene expression signatures. A) Meta-signature hierarchical clustering, cluster 1 (blue), cluster 2 (pink), cluster 3 (orange). Gene ontology clustering (left of the graph), the green bar indicates genes related to steroid hormone stimulus, the blue bar indicates cell cycle related genes. B) Intrinsic Subtype signature. C) Poor prognosis signature: good prognosis (blue), poor prognosis (orange); Recurrence score: high (orange); intermediate (blue light), low recurrence score (blue) and Wound response: activated (orange); quiescent (blue). D) Clinicopathological data. Estrogen Receptor (ER) status: positive (black) and negative (white). Histological grade: high (black); moderate (grey) and low grade (light grey). Lymph node (LN) status: negative (white); 1–3 positive (grey); >3 positive (black). Tumor size: ≤2 cm (grey) and >2 cm (black).
Figure 5.Kaplan–Meier curves of overall and relapse-free survival among the 295 early-stage breast cancer patients obtained from van de Vijver et al study (2002) according to the meta-signature (A and B), Intrinsic Subtypes (C and D), Poor Prognosis Signature (E and F), Recurrence Score (G and H) and Wound Response (I and J).
Multivariate Cox proportional hazard analysis of standard clinical prognosis factors with the gene expression meta-signature predictor.
| Age, per decade | 0.68 (0.46–1.01) | 0.056 | ||
| ER status | 0.64 (0.39–1.05) | 0.076 | 0.93 (0.60–1.45) | 0.763 |
| Tumor grage 2 vs. 1 | 1.66 (0.88–3.13) | 0.120 | ||
| Tumor grade 3 vs. 1 | 2.69 (0.90–8.01) | 0.076 | 1.37 (0.70–2.68) | 0.358 |
| Size | ||||
| Lymph node 1–3 (+) vs. 0 | 0.95 (0.44–2.05) | 0.904 | 1.13 (0.63–2.04) | 0.683 |
| Lymph node > 3 (+) vs. 0 | 1.77 (0.75–4.21) | 0.195 | ||
| Treatment | 0.83 (0.39–1.76) | 0.622 | ||
| Meta-signature | ||||
Notes:
Size was a binary variable (0 = diameter of 2 cm or less, 1 = greater than 2 cm.), age was a continuous variable formatted as decade-years. Hazard ratio for meta-signature was calculated comparing the clusters 2 and 3 relative to cluster 1. Variables found to be significant (P < 0.05) in the Cox proportional hazard model are shown in bold.
Most highly up-regulated transcripts from meta-siganture gene list in van de Vijver et al 2002 data set.
| 3169 | 518.77 | Cluster 1 | |
| 2625 | 167.42 | Cluster 1 | |
| 2099 | 152.04 | Cluster 1 | |
| 7494 | 141.47 | Cluster 1 | |
| 1058 | 243.97 | Cluster 3 | |
| 699 | 234.41 | Cluster 3 | |
| 55355 | 230.73 | Cluster 3 | |
| 9133 | 227.29 | Cluster 3 | |
| 11004 | 213.76 | Cluster 3 | |
| 10112 | 202.88 | Cluster 3 | |
| 991 | 187.72 | Cluster 3 | |
| 9055 | 183.64 | Cluster 3 | |
| 55165 | 182.11 | Cluster 3 | |
| 2305 | 169.99 | Cluster 3 | |
| 11065 | 167.93 | Cluster 3 | |
| 55143 | 167.46 | Cluster 3 | |
| 3833 | 161.72 | Cluster 3 | |
| 6790 | 155.97 | Cluster 3 | |
| 7272 | 149.15 | Cluster 3 | |
| 55839 | 147.54 | Cluster 3 | |
| 4085 | 141.72 | Cluster 3 | |
| 332 | 141.26 | Cluster 3 | |
| 9928 | 136.10 | Cluster 3 | |
| 890 | 130.65 | Cluster 3 | |
| 9232 | 126.93 | Cluster 3 | |
| 9833 | 194.99 | Cluster 3 | |
| 1503 | 145.28 | Cluster 3 | |
| 2146 | 132.67 | Cluster 3 | |
| 6648 | 127.96 | Cluster 3 | |
| 9319 | 126.41 | Cluster 3 |