| Literature DB >> 18755024 |
Andrew E Teschendorff1, Carlos Caldas.
Abstract
INTRODUCTION: Patients with primary operable oestrogen receptor (ER) negative (-) breast cancer account for about 30% of all cases and generally have a worse prognosis than ER-positive (+) patients. Nevertheless, a significant proportion of ER- cases have favourable outcomes and could potentially benefit from a less aggressive course of therapy. However, identification of such patients with a good prognosis remains difficult and at present is only possible through examining histopathological factors.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18755024 PMCID: PMC2575547 DOI: 10.1186/bcr2138
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Figure 1Heatmaps of seven-gene immune response-modules. Heatmaps of gene expression of the seven-gene immune response-module for the training and six test cohorts (red = high relative expression, green = low). Samples are clustered into two groups according to the partitioning around medoids algorithm [28] (purple = group overexpressing the immune response-module, yellow = group underexpressing the immune response-module). Clinical outcome as defined by a disease-specific death event (or distant metastasis if the former is not available) is also shown (black = poor, grey = good, white = missing data). Note that in some cases not all seven genes could be mapped to the external platform. C1QA = complement component 1, q subcomponent, A chain; HLA-F = major histocompatibility complex, class I, F; IGLC2 = immunoglobulin lambda constant 2; LY9 = lymphocyte antigen 9; TNFRSF17 = tumour necrosis factor receptor superfamily member 17; SPP1 = secreted phosphoprotein 1 (osteopontin); XCL2 = chemokine (C motif) ligand 2.
Figure 2The MDA and MDAhet classifier. Four two-dimensional projections of the seven-dimensional Mixture Discriminant Analysis (MDA) and Heterogeneous Mixture Discriminant Analysis (MDAhet) classifiers. Scatterplots show projections of the training expression data (183 oestrogen receptor negative samples) onto arbitrarily chosen two-dimensional subspaces spanned by the genes HLA-F and IGLC2, LY9 and TNFRSF17, SPP1 and XCL2, and IGLC2 and C1QA. Codings: black = poor outcome, grey = good outcome, triangle = training samples classified into the good prognosis subgroup defined by overexpression of seven-gene module 'good-up', circle = training samples not classified into 'good-up' group. In addition, the means and covariance-curves of the two Gaussians that approximate each of the poor (black ellipses) and good outcome (grey ellipses) classes are shown. C1QA = complement component 1, q subcomponent, A chain; HLA-F = major histocompatibility complex, class I, F; IGLC2 = immunoglobulin lambda constant 2; LY9 = lymphocyte antigen 9; TNFRSF17 = tumour necrosis factor receptor superfamily member 17; SPP1 = secreted phosphoprotein 1 (osteopontin); XCL2 = chemokine (C motif) ligand 2.
The Heterogeneous Mixture Discriminant Analysis (MDAhet) classifier.
| good-down | good-up | poor-down | poor-up | |
| HLA-F | -0.31 | 0.65 | -0.29 | 0.40 |
| IGLC2 | -0.56 | 0.98 | -0.46 | 0.68 |
| LY9 | -0.29 | 0.58 | -0.52 | 1.12 |
| TNFRSF17 | -0.41 | 0.97 | -0.58 | 0.59 |
| SPP1 | 0.01 | -0.38 | 0.47 | -0.57 |
| XCL2 | -0.36 | 0.67 | -0.41 | 0.58 |
| C1QA | -0.39 | 0.79 | -0.40 | 0.57 |
| 0.74 | 0.74 | 0.58 | 0.58 | |
| 0.31 | 0.28 | 0.32 | 0.09 |
Estimated mean expression profiles , covariance matrices and weights for the four subgroups, as estimated from the training set. Note that the optimal covariance matrices were all proportional to the identity matrix ∝ I and are thus summarised by a single value, the variance of expression of the corresponding cluster. C1QA, complement component 1, q subcomponent, A chain; HLA-F, major histocompatibility complex, class I, F; IGLC2, immunoglobulin lambda constant 2; LY9, lymphocyte antigen 9; TNFRSF17, tumour necrosis factor receptor superfamily member 17; SPP1, secreted phosphoprotein 1 (osteopontin); XCL2, chemokine (C motif) ligand 2.
Classification of test samples.
| Test cohort | Size | good-down | good-up | poor-down | poor-up |
| UPP | 34 | 4 | 14 | 16 | 0 |
| JRH-2 | 24 | 5 | 8 | 11 | 0 |
| CAL | 46 | 13 | 13 | 20 | 0 |
| Kreike | 97 | 18 | 35 | 41 | 3 |
| UNC248 | 85 | 28 | 28 | 28 | 1 |
| Loi | 40 | 8 | 13 | 19 | 0 |
Distribution of test samples into the four subclasses by the Heterogeneous Mixture Discriminant Analysis (MDAhet) classifier.
Performance measures of seven-gene Heterogeneous Mixture Discriminant Analysis (MDAhet) classifier
| Training set | Test | Sets | |||||
| Cohort | NKI2+EMC+NCH | UPP | JRH-2 | CAL | Kreike | UNC248 | Loi |
| Cohort size | 186 | 34 | 24 | 46 | 97 | 85 | 40 |
| Annotated | 183 | 31 | 24 | 46 | 71 | 80 | 34 |
| Good prognosis (%) | 59 | 81 | 75 | 67 | 76 | 74 | 76 |
| Poor prognosis (%) | 41 | 19 | 25 | 33 | 24 | 26 | 24 |
| Chemotherapy (%) | 0 | 0 | 0 | 67 | 0 | 66 | 0 |
| MDA | |||||||
| NPV (%) | 74 | 92 | 93 | 69 | 83 | 74 | 100 |
| PPV (%) | 55 | 28 | 56 | 35 | 29 | 27 | 40 |
| SE (%) | 69 | 83 | 83 | 53 | 71 | 38 | 100 |
| SP (%) | 61 | 48 | 78 | 52 | 44 | 63 | 54 |
| MDAhet | |||||||
| PPV (%) | 51 | 30 | 37 | 45 | 29 | 36 | 35 |
| SP (%) | 44 | 44 | 44 | 42 | 41 | 42 | 42 |
| PPV at 4 years (%) | 42 | 24 | 33 | 35 | 25 | 45 | 35 |
| SP at 4 years (%) | 44 | 42 | 43 | 37 | 40 | 45 | 43 |
| LN | |||||||
| NPV (%) | 61 | 84 | NA | 85 | NA | 85 | 76 |
| PPV (%) | 50 | 30 | NA | 46 | NA | 37 | 0a |
| SE (%) | 27 | 50 | NA | 80 | NA | 71 | 0a |
| SP (%) | 81 | 70 | NA | 55 | NA | 58 | 100a |
| NPV at 4 years (%) | 67 | 88 | NA | 90 | NA | 82 | 77 |
| PPV at 4 years (%) | 39 | 37 | NA | 38 | NA | 47 | 0a |
| SE at 4 years (%) | 25 | 84 | NA | 85 | NA | 69 | 0a |
| SP at 4 years (%) | 80 | 74 | NA | 53 | NA | 60 | 100a |
aLoi's cohort consists only of LN- samples. The table summarises performance indicators of the seven-gene MDAhet classifier and lymph node status (LN) across oestrogen receptor negative (ER-) training and test sets. For each cohort, we also give the number of tumours (cohort size), number of clinically annotated tumours (annotated), the percentage of good and poor prognosis patients (as defined by disease-specific death or distant metastasis event) and the percentage of patients treated with chemotherapy. NPV, PPV, SE and SP are evaluated at four years and at end of study. NPV, negative predictive value (precision for good prognosis); PPV, positive predictive value (precision for poor prognosis); SE, sensitivity; SP, specificity.
Stratified Cox-regression model of seven-gene Heterogeneous Mixture Discriminant Analysis (MDAhet) classifier
| Training set | Combined test set | |
| Annotated | 183 | 286 |
| MDAhet | 0.29 (0.16–0.56) | 0.15 (0.07–0.36) |
| LN | 1.31 (0.73–2.33) | 3.25 (1.61–6.58) |
| CT | NA | 0.68 (0.34–1.39) |
| LN+MDAhet | ||
| MDAhet | 0.29 (0.15–0.55) | 0.06 (0.01–0.27) |
| LN | 1.59 (0.81–3.11) | 3.68 (1.32–10.13) |
| CT+MDAhet | ||
| MDAhet | NA | 0.27 (0.15–0.48) |
| CT | NA | 0.76 (0.27–2.13) |
Stratified Cox-proportional hazards regression performance of the seven-gene MDAhet classifier, lymph node status (LN) and chemotherapy (CT) across oestrogen receptor-negative training and test sets, with strata defined by cohorts. For the univariate analysis, Hazard ratio (HR), 95% confidence intervals (CI) and LR-test p-value are given. In the multivariate models, p-values quoted are from the corresponding Wald test.
Figure 3Kaplan-Meier curves for MDAhet classifier. Kaplan-Meier survival curves for the three subclasses 'good-down' (light green), 'good-up' (dark green), 'poor-down' (blue), as predicted by the Heterogeneous Mixture Discriminant Analysis (MDAhet) classifier, in the training and combined test cohorts. The class 'poor-up' is not shown due to small sample size (Table 2). Hazard ratios (HR), 95% confience intervals (CI) and log-rank test p-values are given for the predicted 'good-up' class relative to the predicted poor prognostic classes, as given by a stratified Cox-regression model with strata defined by cohorts. The Kaplan-Meier curves for each subclass is shown separately for disease-specific survival (solid lines) and distant metastasis (broken lines).