| Literature DB >> 17428335 |
James E Korkola1, Ekaterina Blaveri, Sandy DeVries, Dan H Moore, E Shelley Hwang, Yunn-Yi Chen, Anne L H Estep, Karen L Chew, Ronald H Jensen, Frederic M Waldman.
Abstract
BACKGROUND: Breast cancer is a heterogeneous disease, presenting with a wide range of histologic, clinical, and genetic features. Microarray technology has shown promise in predicting outcome in these patients.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17428335 PMCID: PMC1855059 DOI: 10.1186/1471-2407-7-61
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Figure 1Unsupervised hierarchical clustering of 162 breast tumor samples based on ~4,000 genes with greatest variation. Tumor samples (left) are color coded according to immunohistochemical staining for estrogen receptor (ER negative, green; ER positive, red; reduction mammoplasties, cyan; ER status unknown, black). Gene clusters (top) are those related to estrogen receptor (ESR1), proliferation-associated genes, ERBB2 related genes, and genes found in a common chromosomal region of 17q21-24 (red represents high relative expression, green low relative expression). The black bar indicates samples that clustered tightly with normal specimens.
Figure 2Relationship between PAM, SAM, Correlation, and overlapping gene sets. Degree of overlap between the 142 (PAM), 49 (SAM), and 49 (correlation) gene sets which results in the 21 gene set.
Classification Rates and Survival Differences for identified gene sets
| PAM | 142 | 71% | < 0.00001 |
| Correlation | 49 | 75% | < 0.00001 |
| Overlap | 21 | 69% | < 0.005 |
1 Number of genes identified from each of the predictive sets
2 Leave-one-out cross-validation classification rate using gene re-sampling based on known outcome of 55 tumor samples
3 Kaplan-Meier log rank survival difference using disease-free survival time based on good and poor predicted groups from leave-one-out classification
Figure 3Kaplan-Meier survival curves for UCSF breast tumors based on our predictive gene sets. Surviving fraction of predicted good (solid lines) and poor (dashed lines) groups is shown. A) separation based on 21 gene set (P < 0.0001). B) separation based on ER positive (dashed line) vs. ER negative (solid line) (P = 0.18). C) separation based on grade 1+2 (solid line) vs. grade 3 (dashed line) (P = 0.19). D) separation based on node positive (dashed line) vs. node negative (solid line) (P = 0.005).
Classification Rates by Clinical Subtype1
| PAM | 53% | 78% | 88% | 55% | 100% | 71% | 60% |
| Correlation | 60% | 80% | 92% | 59% | 100% | 83% | 55% |
| Overlap | 60% | 72% | 84% | 59% | 86% | 71% | 65% |
1 Correct leave-one-out cross-validation classification rates with gene re-sampling into good and poor outcome groups in 55 tumors
Classification Rates within Treatment Subgroups1
| PAM | 67% | 76% | 56% | 86% | 75% | 67% |
| Correlation | 73% | 76% | 59% | 89% | 72% | 73% |
| Overlap | 67% | 72% | 59% | 79% | 75% | 60% |
1 Correct leave-one-out cross-validation classification rates with gene re-sampling into good and poor outcome groups in 55 tumors
Figure 4Kaplan-Meier survival curves based on the 21 gene set in independent data sets. A) van't Veer et al. data set; B) Sotiriou et al. data set. Surviving fraction for predicted good (solid lines) and poor (dashed lines) groups in the validation tumor sets are shown (P < 0.0005).