| Literature DB >> 32641077 |
Amin Emad1,2,3, Tania Ray4, Tor W Jensen5,6, Meera Parat3, Rachael Natrajan7, Saurabh Sinha8,9,10, Partha S Ray11.
Abstract
BACKGROUND: Cancer cells are known to display varying degrees of metastatic propensity, but the molecular basis underlying such heterogeneity remains unclear. Our aims in this study were to (i) elucidate prognostic subtypes in primary tumors based on an epithelial-to-mesenchymal-to-amoeboid transition (EMAT) continuum that captures the heterogeneity of metastatic propensity and (ii) to more comprehensively define biologically informed subtypes predictive of breast cancer metastasis and survival in lymph node-negative (LNN) patients.Entities:
Keywords: Breast cancer subtypes; Epithelial-to-mesenchymal transition; Mesenchymal-to-amoeboid transition; Metastasis; Metastatic risk assessment
Mesh:
Substances:
Year: 2020 PMID: 32641077 PMCID: PMC7341640 DOI: 10.1186/s13058-020-01304-8
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Clinical and pathological characteristics of patient cohorts used to assess prognostic significance of EMAT subtypes
| Factor | METABRIC ( | GSE11121 ( | NKI295 ( |
|---|---|---|---|
| Age, years | |||
| < 50 | 20.8 | 23.2 | 84.1 |
| > 50 | 78.8 | 76.8 | 15.9 |
| Unknown | 0.4 | 100 | |
| Tumor size, cm | |||
| < 2 | 39.5 | 49.5 | 54.3 |
| 2–5 | 57.7 | 49 | 45.7 |
| > 5 | 2.3 | 1.5 | |
| Unknown | 0.5 | 0 | 0 |
| Tumor grade | |||
| Low | 8.5 | 14.5 | 22.5 |
| Intermediate | 42.3 | 68 | 30.5 |
| High | 42.3 | 17.5 | 47.0 |
| Unknown | 6.8 | 0 | 0 |
| ER | |||
| Negative | 17.3 | 23.2 | 27.8 |
| Positive | 78.8 | 76.8 | 72.2 |
| Unknown | 3.9 | 0 | 0 |
| PR | |||
| Negative | 43.8 | 41.8 | |
| Positive | 56.2 | 58.2 | |
| Unknown | 0 | 0 | 100 |
| HER2 | |||
| Negative | 80.2 | 89.0 | |
| Positive | 19.4 | 0.11 | |
| Unknown | 0.4 | 0 | 100 |
| Adjuvant therapy | |||
| Chemotherapy | 7.1 | N/A | 4.0 |
| Hormonal therapy | 47.9 | N/A | 4.0 |
| Radiation therapy | 54.4 | N/A | 40.4 |
| PAM50 status | |||
| Luminal A | 44.1 | 27 | 31.8 |
| Luminal B | 19.9 | 23.5 | 25.2 |
| HER2 | 7.5 | 12.5 | 16.6 |
| Basal-like | 15.3 | 17.5 | 18.5 |
| Normal-like | 13 | 18 | 7.9 |
| EMAT status | |||
| EMAT1 | 19.4 | 22.5 | 17.2 |
| EMAT2 | 45.7 | 31 | 35.1 |
| EMAT3 | 26 | 29.5 | 27.8 |
| EMAT4 | 8.9 | 17 | 19.9 |
ER estrogen receptor, HER2 human epidermal growth factor receptor 2, PR progesterone receptor
Fig. 1EMAT clusters and their characteristics. a EMAT clusters based on lymph node-negative METABRIC samples obtained using hierarchical clustering. The heatmap shows the normalized expression of EMAT genes (rows) in each sample (columns). Sample dendrogram colors are chosen to match those of Kaplan-Meier plots in c. b Characterization of samples based on similarity to hESC, PAM50 subtypes, ER, PR, and HER2 status, stage, grade, and type of treatment. Spearman’s rank correlation, scaled between 0 and 1 using min-max normalization, is used as the measure of similarity of samples to hESC, in which 0 and 1 represent least similar and most similar, respectively. c Kaplan-Meier plots corresponding to n = 4 clusters. The heatmap shows the relative ranking of the average expression of four biomarkers in each cluster compared to other clusters. Clusters EMAT2 and EMAT3 have very similar Kaplan-Meier survival curves (c), even though their gene expression profiles are distinct. EMAT4 has a worse survival outcome compared to other clusters (p = 0.05 against EMAT3, p = 0.01 against EMAT2 and p = 1.8E-6 against EMAT1). d The box plots show the distribution of hESC similarity of the samples in each cluster. The similarity is defined as the Spearman’s rank correlation (scaled between 0 and 1) between expression profiles of H1 hESC lines and each sample. The p-values (calculated using a one-sided t-test) show how significant the differences between two adjacent EMAT clusters are with respect to their similarity to hESC. The significance p-value for the cluster with the least similarity to hESC (EMAT1) and the cluster with the most similarity to hESC (EMAT4) is p = 1.7E−23
Univariable and multivariable analysis of 562 LNN breast cancer patients (METABRIC dataset [13]) used to examine prognostic value of EMAT subtype designation status for 10-year follow-up
| Univariable analysis | Multivariable (age, tumor | Multivariable (age, tumor | Multivariable (age, tumor | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | HR (95% CI) | HR (95% CI) | |||||||||
| Age | 562 | 0.70 | 1.00 (0.99–1.02) | 510 | 0.18 | 1.014 (0.99–1.03) | 521 | 0.21 | 1.01 (0.99–1.03) | 521 | 0.21 | 1.01 (0.99–1.03) |
| Tumor size (< 2, 2–5, > 5) | 559 | 0.02 | 1.67 (1.10–2.54) | 510 | 0.04 | 1.59 (1.04–2.45) | 521 | 0.03 | 1.60 (1.04–2.49) | 521 | 0.04 | 1.59 (1.03–2.47) |
| Tumor grade* | 524 | 0.20 | 1.26 (0.89–1.79) | 510 | 0.46 | 1.16 (0.78–1.71) | 521 | 0.45 | 1.16 (0.78–1.73) | 521 | 0.98 | 1.01 (0.69–1.47) |
| Any chemotherapy | 562 | 0.15 | 1.78 (0.82–3.87) | 510 | 0.17 | 1.86 (0.78–4.451) | 521 | 0.12 | 1.99 (0.84–4.77) | 521 | 0.10 | 2.07 (0.87–4.92) |
| Any hormonal therapy | 562 | 0.15 | 0.72 (0.46–1.12) | 510 | 0.06 | 0.62 (0.38–1.01) | 521 | 0.04 | 0.60 (0.37–0.97) | 521 | 0.05 | 0.62 (0.39–1.00) |
| Any radiation therapy | 562 | 0.20 | 0.76 (0.50–1.16) | 510 | 0.28 | 0.78 (0.50–1.22) | 521 | 0.24 | 0.77 (0.49–1.19) | 521 | 0.15 | 0.72 (0.46–1.12) |
| IHC status** | 538 | 0.03 | 1.23 (1.02–1.47) | 510 | 0.32 | 1.11 (0.90–1.37) | ||||||
| PAM50 status# | 562 | 0.12 | 1.14 (0.97–1.35) | 521 | 0.67 | 1.04 (0.86–1.27) | ||||||
| EMAT status## | 562 | 1E−4 | 1.65 (1.29–2.11) | 521 | 3E−4 | 1.64 (1.25–2.14) | ||||||
Two-sided p-values were based on χ2 or Fisher’s exact test, whenever appropriate
CI confidence interval, ER estrogen receptor, HER2 human epidermal growth factor receptor 2, HR hazard ratio, PR progesterone receptor, P p-value, n number of samples
*(Low - “1,” intermediate - “2,” high - “3”)
**(ER+HER2− “1,” ER+HER2+ “2,” ER−HER2+ “3,” ER−HER2− “4”)
#(Normal-like - “0,” LumA - “1,” LumB - “2,” HER2 - “3,” basal-like - “4”)
##(EMAT1 - “1,” EMAT2 - “2,” EMAT3 - “3,” EMAT4 - “4”)
Fig. 2Enrichment of EMAT clusters in other breast cancer subtypes and systematic comparison of their prognostic power using cross-validation. The heatmaps show the − log10 (p-value) of enrichment of EMAT clusters in different subtypes or clinical parameters (using a hypergeometric test). The scatter plots compare the performance (measured in C-index) of Cox regression predictions using EMAT cluster status and clinical parameters (y-axis) versus other predictors (x-axis) (see the “Methods” section for details). If there are more points above the diagonal line (and further away from it), it shows that the method represented on the y-axis outperforms the method represented on the x-axis. The p-values were calculated using a one-sided Wilcoxon signed rank test and represent the significance of the improvement obtained using EMAT cluster status and clinical parameters as compared to other predictors. The measure PIF shows the percent of times in which EMAT + clinical parameters provided a more accurate prediction compared to the baseline. a The heatmap shows enrichment of EMAT clusters by samples of different tumor sizes. The scatter plot shows performance of Cox regression predictions using EMAT + clinical parameters versus clinical parameters alone. b The heatmap shows enrichment of EMAT clusters by samples of different PAM50 molecular subtypes. The scatter plot shows performance of Cox regression predictions using EMAT + clinical parameters versus PAM50 subtypes + clinical parameters. c The heatmap shows enrichment of EMAT clusters by samples of different receptor status. The scatter plot shows performance of Cox regression predictions using EMAT + clinical parameters versus receptor status + clinical parameters. d The heatmaps show the distribution of patients in each PAM50 subtypes as well as different treatments in EMAT clusters
Fig. 3The Kaplan-Meier survival plots and biomarker status for EMAT subtypes of LNN breast cancer samples from the GSE11121 (a) and NKI295 (b) datasets using cross-dataset analysis. A centroid-based classifier trained on LNN METABRIC samples is used to assign EMAT subtype labels to each sample
Fig. 4Analysis of clusters obtained using eight TFs most under-expressed or over-expressed in each EMAT cluster. a Hierarchical clustering based on the expression of the eight identified TFs is used to cluster samples into four groups. The color bar at the bottom shows the true EMAT cluster label of each sample. b Concordance of clusters obtained using eight TFs with EMAT clusters based on the Jaccard index. c Kaplan-Meier survival plots for clusters obtained using the TFs