| Literature DB >> 25983547 |
Christophe Zemmour1, François Bertucci2, Pascal Finetti3, Bernard Chetrit4, Daniel Birnbaum3, Thomas Filleron5, Jean-Marie Boher1.
Abstract
BACKGROUND: DNA microarray studies identified gene expression signatures predictive of metastatic relapse in early breast cancer. Standard feature selection procedures applied to reduce the set of predictive genes did not take into account the correlation between genes. In this paper, we studied the performances of three high-dimensional regression methods - CoxBoost, LASSO (Least Absolute Shrinkage and Selection Operator), and Elastic net - to identify prognostic signatures in patients with early breast cancer.Entities:
Keywords: boosting; breast cancer; cross-validation; genomics; metastasis
Year: 2015 PMID: 25983547 PMCID: PMC4426954 DOI: 10.4137/CIN.S17284
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Clinical characteristics in the three breast cancer datasets.
| CLINICAL CHARACTERISTICS | VAN’T VEER’S DATASET (n = 97) | VAN DE VIJVER’S DATASET (n = 151) | DESMEDT’S DATASET (n = 197) | |
|---|---|---|---|---|
| Patient’s age | ≥40 years | 74 (76%) | 115 (76%) | 161 (82%) |
| <40 years | 23 (24%) | 36 (24%) | 36 (18%) | |
| Pathological tumor size | <20 mm | 44 (45%) | 82 (54%) | 101 (51%) |
| ≥20 mm | 53 (55%) | 69 (46%) | 96 (49%) | |
| Pathological grade | 1–2 | 37 (38%) | 80 (53%) | 114 (58%) |
| 3 | 60 (62%) | 71 (47%) | 83 (42%) | |
| ER status | Positive | 72 (74%) | 110 (73%) | 136 (69%) |
| Negative | 25 (26%) | 41 (27%) | 61 (31%) | |
| HER2 status | Negative | 82 (84%) | 125 (83%) | 186 (94%) |
| Positive | 15 (16%) | 26 (17%) | 11 (6%) | |
| Metastatic relapse within 5 years | Yes | 46 (47%) | 40 (27%) | 36 (18%) |
| No | 51 (53%) | 111 (73%) | 161 (82%) |
Prognostic performances of the four classifiers in the van’t Veer’s training (n = 78) and internal validation (n = 19) sets.
| CLASSIFIER | GENES (N) | TRAINING SET (n = 78) | VALIDATION SET (n = 19) | ||||
|---|---|---|---|---|---|---|---|
| ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ||
| Amsterdam | 70 | 63/78 (81%) | 31/34 (91%) | 32/44 (73%) | 17/19 (90%) | 11/12 (92%) | 6/7 (86%) |
| CoxBoost | 19 | 73/78 (94%) | 31/34 (91%) | 42/44 (95%) | 15/19 (79%) | 9/12 (75%) | 6/7 (86%) |
| LASSO | 23 | 74/78 (95%) | 31/34 (91%) | 43/44 (98%) | 16/19 (84%) | 10/12 (83%) | 6/7 (86%) |
| Elastic net | 36 | 74/78 (95%) | 31/34 (91%) | 43/44 (98%) | 15/19 (79%) | 9/12 (75%) | 6/7 (86%) |
Prognostic performances of the four classifiers in the external validation sets.
| CLASSIFIER | GENES (N) | VAN DE VIJVER’S DATASET (n = 151) | DESMEDT’S DATASET (n = 197) | ||||
|---|---|---|---|---|---|---|---|
| ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ACCURACY (%) | SENSITIVITY (%) | SPECIFICITY (%) | ||
| Amsterdam | 70 | 92/151 (61%) | 36/40 (90%) | 56/111 (50%) | 89/197 (45%) | 35/36 (97%) | 54/161 (34%) |
| CoxBoost | 6 | 95/151 (63%) | 32/40 (80%) | 63/111 (57%) | 107/197 (54%) | 30/36 (83%) | 77/161 (48%) |
| LASSO | 18 | 97/151 (64%) | 31/40 (78%) | 66/111 (59%) | 111/197 (56%) | 27/36 (75%) | 84/161 (52%) |
| Elastic net | 29 | 100/151 (66%) | 31/40 (78%) | 69/111 (62%) | 116/197 (59%) | 30/36 (83%) | 86/161 (53%) |
Figure 1MFS curves for the four classifiers in the van de Vijver’s external validation set. Kaplan–Meier curves are shown for the Amsterdam 70-gene signature (top left), the LASSO classifier (top right), the CoxBoost classifier (bottom left), and the Elastic net classifier (bottom right). P-values are estimated using the log-rank test.
Multivariate analyses for 5-year MFS in the external validation sets.
| CLASSIFIER | PATIENT’S AGE (<40 YEARS) | PATHOLOGICAL TUMOR SIZE (≥20 mm) | PATHOLOGICAL TUMOR GRADE (3) | ER STATUS (NEGATIVE) | HER2 STATUS (POSITIVE) | SIGNATURE (PREDICTED POOR PROGNOSIS) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HAZARD RATIO [95% CI] | p | HAZARD RATIO [95% CI] | p | HAZARD RATIO [95% CI] | p | HAZARD RATIO [95% CI] | p | HAZARD RATIO [95% CI] | p | HAZARD RATIO [95% CI] | p | |
| Amsterdam 70-gene | 1.28 [0.66–2.48] | 0.47 | 3.27 [1.54–6.97] | 0.002 | 1.07 [0.47–2.4] | 0.88 | 0.96 [0.49–1.9] | 0.91 | 1.58 [0.77–3.24] | 0.22 | ||
| CoxBoost 6-gene | 1.39 [0.72–2.69] | 0.33 | 3.59 [1.71–7.5] | <0.001 | 1.66 [0.77–3.57] | 0.20 | 0.73 [0.35–1.54] | 0.41 | 2.24 [1.08–4.64] | 0.03 | ||
| LASSO 18-gene | 1.62 [0.83–3.16] | 0.16 | 3.31 [1.59–6.87] | 0.001 | 1.64 [0.76–3.57] | 0.21 | 0.74 [0.35–1.58] | 0.44 | 1.79 [0.87–3.68] | 0.12 | ||
| Elastic net 29-gene | 1.51 [0.78–2.93] | 0.23 | 3.19 [1.55–6.58] | 0.002 | 1.61 [0.75–3.46] | 0.23 | 0.76 [0.36–1.59] | 0.46 | 1.82 [0.88–3.75] | 0.10 | ||
| Amsterdam 70-gene | 1.43 [0.64–3.19] | 0.39 | 1 [0.51–1.96] | 0.99 | 1.92 [0.93–3.99] | 0.08 | 1.73 [0.86–3.49] | 0.12 | 2.47 [0.86–7.08] | 0.09 | ||
| CoxBoost 6-gene | 1.38 [0.62–3.07] | 0.43 | 0.89 [0.45–1.77] | 0.75 | 1.5 [0.7–3.22] | 0.30 | 1.27 [0.61–2.64] | 0.52 | 1.53 [0.52–4.5] | 0.44 | ||
| LASSO 18-gene | 1.31 [0.59–2.92] | 0.51 | 0.82 [0.41–1.65] | 0.58 | 1.91 [0.89–4.06] | 0.09 | 1.06 [0.47–2.39] | 0.88 | 1.91 [0.63–5.73] | 0.25 | ||
| Elastic net 29-gene | 1.26 [0.56–2.8] | 0.58 | 0.9 [0.45–1.8] | 0.76 | 1.59 [0.74–3.4] | 0.24 | 1.05 [0.5–2.22] | 0.89 | 1.67 [0.57–4.92] | 0.35 | ||
Figure 2MFS curves for the four classifiers in the Desmedt’s external validation set. Kaplan–Meier curves are shown for the Amsterdam 70-gene signature (top left), the LASSO classifier (top right), the CoxBoost classifier (bottom left), and the Elastic net classifier (bottom right). P-values are estimated using the log-rank test.
List of top 10 ontological terms of GO overrepresented (GSEA) for the four classifiers in the external validation sets.
| CoxBoost 6-GENE | LASSO 18-GENE | ELASTIC NET 29-GENE | AMSTERDAM 70-GENE |
|---|---|---|---|
| Chromatin | M_Phase_Of_Mitotic_Cell_Cycle | ||
| Magnesium_Ion_Binding | |||
| Interphase | |||
| Mitosis | |||
| Chromatin | |||
| Magnesium_Ion_Binding | M_Phase_Of_Mitotic_Cell_Cycle | Interphase_Of_Mitotic_Cell_Cycle | |
| DNA_Replication | Mitosis | M_Phase_Of_Mitotic_Cell_Cycle | Chromatin |
| Response_To_DNA_Damage_Stimulus | |||
| Microtubule_Cytoskeleton | |||
| Regulation_Of_Mitosis | DNA_Repair | ||
| Cell_Division | |||
| Regulation_Of_Cell_Cycle | |||
| Chromosome_Segregation | Cell_Cycle_Checkpoint_GO_0000075 | Cytokinesis | |
| Cell_Cycle_Checkpoint_GO_0000075 | Chromosome_Segregation | ||
Notes: The GO ontological terms are ordered from top to bottom by their increasing GSEA normalized enrichment score (NES). Those written in bold are common to the four classifiers.