| Literature DB >> 18714348 |
Katherine J Martin1, Denis R Patrick, Mina J Bissell, Marcia V Fournier.
Abstract
BACKGROUND: One of the major tenets in breast cancer research is that early detection is vital for patient survival by increasing treatment options. To that end, we have previously used a novel unsupervised approach to identify a set of genes whose expression predicts prognosis of breast cancer patients. The predictive genes were selected in a well-defined three dimensional (3D) cell culture model of non-malignant human mammary epithelial cell morphogenesis as down-regulated during breast epithelial cell acinar formation and cell cycle arrest. Here we examine the ability of this gene signature (3D-signature) to predict prognosis in three independent breast cancer microarray datasets having 295, 286, and 118 samples, respectively. METHODS ANDEntities:
Mesh:
Substances:
Year: 2008 PMID: 18714348 PMCID: PMC2500166 DOI: 10.1371/journal.pone.0002994
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The 22 gene 3D signature predicts survival in the microarray datasets of Wang, et al., and Sorlie, et al.
The 22 gene signature and unsupervised hierarchical clustering grouped breast cancer patients to accurately reflect overall relapse or survival when analyzed by the method of Kaplan and Meier. A. Hierarchical cluster analysis of the dataset of Wang, et al. The pattern of expression of the 22 genes selected by the 3D assay are shown for the 286 breast cancer patients of Wang, et al. Genes and samples were organized by using hierarchical clustering. The two major clusters in the sample dimension (red cluster and yellow cluster), were found by using survival analysis to distinguish between good and poor prognosis patients (p<0.0001). B. Kaplan-Meier curves for the red and yellow clusters of the hierarchical diagram of panel A. The endpoint recorded for this dataset was relapse, measured in months. C. Hierarchical cluster analysis of Sorlie, et al. dataset. The pattern of expression of the 15 of 22 genes with probes on the Stanford microarrays and with data available for at least 40% of patients are shown for the 121 breast cancer patients reported by Sorlie, et al. Expression was organized by hierarchical clustering. The two major clusters in the sample dimension (red cluster and yellow cluster), were found by using survival analysis to distinguish between good and poor prognosis patients (p = 0.00447). D. Kaplan-Meier curves for the red and yellow clusters of the hierarchical diagram of panel C. The endpoint recorded for this dataset was death, measured in months.
Figure 2Kaplan-Meier curves of the individual genes that accurately predicted patient prognosis (p<0.05).
A. Results for individual genes in the dataset of Wang, et al., using patient relapse as the endpoint. B. Results for individual genes in the dataset of Sorlie, et al., using patient survival as the endpoint.
Kaplan-Meier p-values for the 22 individual 3D signature genes in the Wang, Sorlie, and van de Vijver patient datasets.
| Gene | All patients | ER + patients | ER− patients | ||||||
| Wang | Sorlie | Van de Vijver | Wang | Sorlie | Van de Vijver | Wang | Sorlie | Van de Vijver | |
|
| |||||||||
| ASPM |
| - | ns | ns | - | ns | ns | - | ns |
| AURKA |
|
|
|
| 0.067 | ns | ns | ns | ns |
| CDKN3 | ns |
|
| ns | 0.095 | ns | ns | ns | ns |
| CEP55 |
|
|
| ns |
| ns | ns | ns | ns |
| CKS2 |
| ns |
| ns | ns | ns | ns | ns | ns |
| DUSP4 |
| ns |
| 0.085 | ns | ns | ns | ns | ns |
| NCAPG | ns | - | - | 0.087 | - | - | ns | - | - |
| RRM2 | ns |
|
| ns |
| 0.068 | ns | ns | ns |
| TUBG1 | ns | ns |
| ns | ns | ns | 0.059 | ns | ns |
|
| |||||||||
| ACTB | ns | - |
| 0.079 | - | ns | ns | - |
|
| ACTN1 |
| ns |
| ns | ns | ns | ns | ns | ns |
| EPHA2 |
| ns |
|
| ns |
| ns | ns | ns |
| FGFBP1 | ns | ns | ns | ns | ns |
| ns | ns | 0.059 |
| FOXM1 | ns |
|
| ns | ns | ns | ns | ns |
|
| SERPINE2 | ns | ns | - | ns | ns | - | 0.092 |
| - |
| TNFRSF6B | ns | ns | ns | 0.076 | ns | ns | ns | ns | 0.071 |
| ZWILCH | ns | ns |
| ns | ns | ns | ns | ns | ns |
|
| |||||||||
| ODC1 | ns | ns | ns | ns | ns | ns | ns | ns | ns |
|
| |||||||||
| EIF4A1 | ns | ns | ns | 0.095 | ns | ns | ns | ns | ns |
| TRIP13 |
|
|
| ns | ns | ns | ns | ns | ns |
| VRK1 |
|
| ns |
|
| ns | ns | ns | ns |
|
| |||||||||
| C1QDC1 | ns | ns | - | ns | ns | - | ns | ns | - |
Data previously reported (Fournier et al., Cancer Research 2007).
ns = not significant; - = no data; bold = p<0.05.
ER association of the 22 individual 3D signature genes in three patient datasets (Welch t-test p values with false positive multigene correction).
| Gene | Wang | Sorlie | van de Vijver |
|
| |||
| ASPM | 3.1e-9 | - | 9.2e-5 |
| AURKA | 3.6e-8 | 0.018 | 1.2e-8 |
| CDKN3 | 3.8e-6 | ns | 0.024 |
| CEP55 | 4.5e-10 | ns | 6.9e-10 |
| CKS2 | 0.0011 | 3.8e-9 | 0.0017 |
| DUSP4 | 5.3e-7 | 0.044 | 6.1e-9 |
| NCAPG | 2.4e-5 | - | - |
| RRM2 | 7.6e-12 | 0.049 | 2.7e-9 |
| TUBG1 | ns | 0.018 | ns |
|
| |||
| ACTB | 0.018 | - | 5.8e-11 |
| ACTN1 | 2.4e-5 | ns | 0.027 |
| EPHA2 | 1.7e-9 | 0.028 | 1.2e-10 |
| FGFBP1 | 1.6e-6 | - | 0.00069 |
| FOXM1 | 1.7e-9 | 0.025 | 3.5e-9 |
| SERPINE2 | 5.1e-5 | ns | - |
| TNFRSF6B | 0.0030 | ns | 0.0017 |
| ZWILCH | 0.0081 | 0.00018 | 0.0015 |
|
| |||
| ODC1 | 6.7e-11 | ns | 5.8e-11 |
|
| |||
| EIF4A1 | 0.018 | 0.018 | ns |
| TRIP13 | 9.0e-9 | 0.0042 | 1.2e-10 |
| VRK1 | 0.0061 | 4.2e-5 | ns |
|
| |||
| C1QDC1 | ns | 0.00094 | - |
Univariable and multivariable proportional-hazards analysis of survival risk for three large independent testing sets*.
| Univariable analysis | Multivariable analysis | |||
| Hazard ratio (95% CI) | p | Hazard ratio (95% CI) | p | |
|
| ||||
| ER positive vs negative | 0.31 (0.20 to 0.49) | <0.0001 | 0.50 (0.31 to 0.80) | 0.0044 |
| 3D signature | 5.52 (2.98 to 10.22) | <0.0001 | 4.45 (2.35 to 8.43) | <0.0001 |
|
| ||||
| ER positive vs negative | 1.00 (0.65 to 1.54) | 0.99 | 1.25 (0.80 to 1.95) | 0.32 |
| 3D signature | 2.40 (1.60 to 3.60) | <0.0001 | 2.51 (1.66 to 3.80) | <0.0001 |
|
| ||||
| ER positive vs negative | 0.69 (0.40 to 1.20) | 0.19 | 0.79 (0.44 to 1.39) | 0.41 |
| 3D signature | 1.89 (1.13 to 3.17) | 0.016 | 1.51 (0.88 to 2.58) | 0.13 |
Results for the datasets of van de Vijver, et al., and Wang, et al., represent 10 year Hazard Ratios (95%CI). Results for the dataset of Sorlie, et al. were calculated using all available data, which included 5 years of follow up. The endpoint for the van de Vijver analysis was overall survival and for the Wang and Sorlie analyses were relapse.
Multivariable analysis accounted for ER status and the 3D signature.
CI, confidence interval.
Multivariable proportional-hazards analysis of 10 year survival risk*.
| Hazard ratio (95% CI) | p | |
| Age (per 10 year increment) | 0.62 (0.44 to 0.88) | 0.008 |
| Tumor diameter (per cm) | 1.33 (1.04 to 1.69) | 0.023 |
| ER (positive vs negative) | 0.55 (0.34 to 0.90) | 0.018 |
| Lymph node status (per positive node) | 1.07 (0.96 to 1.20) | 0.234 |
| Chemotherapy | 0.69 (0.38 to 1.26) | 0.234 |
| Mastectomy | 1.05 (0.63 to 1.73) | 0.864 |
| 3D signature | 4.43 (2.32 to 8.46) | <0.00001 |
Results were calculated using the dataset of van de Vijver, et al. using overall survival as the endpoint. Similar results were obtained for the same multivariable analysis using relapse as the endpoint, 3D signature Hazard ratio 3.3 (95% CI 2.0 to 5.3), p<0.0001.
CI, confidence interval.