| Literature DB >> 30083053 |
Charusheila Ramkumar1, Ljubomir Buturovic2, Sukriti Malpani1, Arun Kumar Attuluri1, Chetana Basavaraj1, Chandra Prakash1, Lekshmi Madhav1, Dinesh Chandra Doval3, Anurag Mehta4, Manjiri M Bakre1.
Abstract
Use of proteomic strategies to identify a risk classifier that estimates probability of distant recurrence in early-stage hormone receptor (HR)-positive breast cancer is relevant to physiological cellular function and therefore to intrinsic tumor biology. We used a 298-sample retrospective training set to develop an immunohistochemistry-based novel risk classifier called CanAssist-Breast (CAB) which combines 5 prognostically relevant biomarkers and 3 clinico-pathological parameters to arrive at probability of distant recurrence within 5 years from diagnosis. Five selected biomarkers, namely, CD44, ABCC4, ABCC11, N-cadherin, and pan-cadherin, were chosen based on their role in tumor metastasis. The chosen biomarkers represent the hallmarks of cancer and are distinct from other proliferation and gene expression-based prognostic signatures. The 3 clinico-pathological parameters integrated into the machine learning-based CAB algorithm are tumor size, tumor grade, and node status. These features are used to calculate a "CAB risk score" that classifies patients into low- or high-risk groups and predicts probability of distant recurrence in 5 years. Independent clinical validation of CAB in a retrospective study comprising 196 patients indicated that distant metastasis-free survival (DMFS) was significantly different in the 2 risk groups. The difference in DMFS between the low- and high-risk categories was 19% in the validation cohort (P = .0002). In multivariate analysis, CAB risk score was the most significant independent predictor of distant recurrence with a hazard ratio of 4.3 (P = .0003). CanAssist-Breast is a precise and unique machine learning-based proteomic risk-classifier that can assist in risk stratification of patients with early-stage HR+ breast cancer.Entities:
Keywords: early breast cancer; immunohistochemistry; machine learning; prognosis; recurrence risk classification
Year: 2018 PMID: 30083053 PMCID: PMC6066801 DOI: 10.1177/1177271918789100
Source DB: PubMed Journal: Biomark Insights ISSN: 1177-2719
The critical steps and associated biomarkers involved in cancer progression.
| Hallmark of cancer | Biomarker |
|---|---|
| Self-sufficiency in growth signals | Ki67, FOXA1, IFITM1, GATA3, c-Myc, IGFBP3, FOXP1, FOXP3 |
| Insensitivity to antigrowth signals | ABCG2, ABCC4, ABCC11, Nrf2, PI3K, Akt |
| Evading apoptosis | MAGE-A9, MAGE-A11, BAG1, Apaf1, BCL2 |
| Limitless replicative potential | CD44, CD24, SOX2, Oct3, NANOG, NESTIN, KLF4, ALDH1A1, CD133, CD90 (THY-1), CD15, CD61, hTERT |
| Sustained angiogenesis | HIF1α, HIF2α, XBP1, TIE2, FGF, ANG1, VEGFR1, VEGFR2, CXCR1, MMP8 |
| Tissue invasion and metastasis | P-cadherin, N-cadherin, E-cadherin, β-catenin, APC, EpCAM, FOXA1, KLK6, CxCR4, CD147, HSP70, Integrinb-6, EGFR |
The demographics and patient characteristics of the training and validation cohorts.
| Training cohort (n = 298) | Validation cohort (n = 196) | |||
|---|---|---|---|---|
| No. of samples | % of samples | No. of samples | % of samples | |
| Age | ||||
| <50 | 142 | 47.6 | 95 | 48.5 |
| >50 | 156 | 52.3 | 101 | 51.5 |
| Tumor size | ||||
| T1 | 41 | 13.7 | 23 | 11.7 |
| T2 | 230 | 77.1 | 160 | 81.6 |
| T3+T4 | 27 | 9.1 | 13 | 6.6 |
| Tumor grade | ||||
| Well differentiated | 22 | 7.3 | 18 | 9.2 |
| Moderately differentiated | 150 | 50.3 | 115 | 58.6 |
| Poorly differentiated | 126 | 42.2 | 63 | 32.1 |
| Node status | ||||
| N0 | 122 | 40.9 | 95 | 48.5 |
| N1 | 107 | 35.9 | 63 | 32.1 |
| N2+N3 | 69 | 23.1 | 38 | 19.4 |
The ranking of the top 5 biomarkers by Pearson correlation coefficient.
| Marker name | Pearson correlation coefficient |
|---|---|
| CD44 | 0.26 |
| Pan-cadherin | 0.24 |
| N-cadherin | 0.21 |
| ABCC11 | 0.20 |
| ABCC4 | 0.19 |
Figure 1.Correlation of the expression of the top 5 ranked biomarkers with DMFS in the training set. (A) Kaplan-Meier survival analysis of distant recurrence in the training cohort (n = 298) analyzed by low versus high staining of CD44 in the cell membrane. (B) Kaplan-Meier survival analysis of distant recurrence in the training cohort (n = 298) analyzed by low versus high staining of ABCC4 in the cell membrane. (C) Kaplan-Meier survival analysis of distant recurrence in the training cohort (n = 298) analyzed by low versus high staining of ABCC11 in the cell membrane. (D) Kaplan-Meier survival analysis of distant recurrence in the training cohort (n = 298) analyzed by low versus high staining of N-cadherin in the cell cytoplasm. (E) Kaplan-Meier survival analysis of distant recurrence in the training cohort (n = 298) analyzed by low versus high staining of pan-cadherin in the cell cytoplasm.
Figure 2.Classifier development. (A) Receiver operating graph analysis of various RBF-SVM classifiers that were tested. For each potential classifier, we plotted cross-validation sensitivity versus 1-specificity. The intersection of the red lines indicates the chosen classifier. (B) Receiver operating characteristic analysis of 10-fold nested cross-validation for distant metastasis-free survival by the chosen RBF-SVM classifier. RBF-SVM indicates Radial Basis Function-Support Vector Machine.
Figure 3.Risk classification by CAB. (A) Kaplan-Meier plot of distant recurrence in the training set (n = 298) stratified by CAB into low- or high-risk groups. (B) Kaplan-Meier survival analysis of distant recurrence in the validation set (n = 196) stratified by CAB into low- or high-risk groups. CAB indicates CanAssist-Breast.
Multivariate analysis of the CAB risk score and clinico-pathological parameters for distant metastasis-free survival in the training set.
| Covariate | Hazard ratio | 95% CI | |
|---|---|---|---|
| Age | 0.69 | .15 | 0.42–1.12 |
| ER+ | 1.49 | .19 | 0.81–2.72 |
| PR+ | 1.45 | .13 | 0.89–2.36 |
| Stage | 1.45 | .42 | 0.46–4.30 |
| CAB risk score | 6.82 | <.0001 | 3.92–11.84 |
Abbreviations: CAB, CanAssist-Breast; CI, confidence interval; ER, estrogen receptor; PR, progesterone receptor.
Multivariate analysis of the CAB risk score and clinico-pathological parameters for distant metastasis-free survival in the validation set.
| Covariate | Hazard ratio | 95% CI | |
|---|---|---|---|
| Age | 1.25 | .56 | 0.58–2.69 |
| ER+ | 0.59 | .36 | 0.19–1.81 |
| PR+ | 2.50 | .02 | 1.11–5.6 |
| Stage | 1.06 | .92 | 0.31–3.57 |
| CAB risk score | 4.37 | .0003 | 1.99–9.61 |
Abbreviations: CAB, CanAssist-Breast; CI, confidence interval; ER, estrogen receptor; PR, progesterone receptor.