| Literature DB >> 32324791 |
Anna A Schönherz1,2,3, Julie Støve Bødker2, Alexander Schmitz2, Rasmus Froberg Brøndum2, Lasse Hjort Jakobsen1,2, Anne Stidsholt Roug1,2, Marianne T Severinsen1,2, Tarec C El-Galaly1,2, Paw Jensen2, Hans Erik Johnsen1,2, Martin Bøgsted1,2, Karen Dybkær1,2.
Abstract
Acute myeloid leukaemia (AML) is characterised by phenotypic heterogeneity, which we hypothesise is a consequence of deregulated differentiation with transcriptional reminiscence of the normal compartment or cell-of-origin. Here, we propose a classification system based on normal myeloid progenitor cell subset-associated gene signatures (MAGS) for individual assignments of AML subtypes. We generated a MAGS classifier including the progenitor compartments CD34+/CD38- for haematopoietic stem cells (HSCs), CD34+/CD38+/CD45RA- for megakaryocyte-erythroid progenitors (MEPs), and CD34+/CD38+/CD45RA+ for granulocytic-monocytic progenitors (GMPs) using regularised multinomial regression with three discrete outcomes and an elastic net penalty. The regularisation parameters were chosen by cross-validation, and MAGS assignment accuracy was validated in an independent data set (N = 38; accuracy = 0.79) of sorted normal myeloid subpopulations. The prognostic value of MAGS assignment was studied in two clinical cohorts (TCGA: N = 171; GSE6891: N = 520) and had a significant prognostic impact. Furthermore, multivariate Cox regression analysis using the MAGS subtype, FAB subtype, cytogenetics, molecular genetics, and age as explanatory variables showed independent prognostic value. Molecular characterisation of subtypes by differential gene expression analysis, gene set enrichment analysis, and mutation patterns indicated reduced proliferation and overrepresentation of RUNX1 and IDH2 mutations in the HSC subtype; increased proliferation and overrepresentation of CEBPA mutations in the MEP subtype; and innate immune activation and overrepresentation of WT1 mutations in the GMP subtype. We present a differentiation-dependent classification system for AML subtypes with distinct pathogenetic and prognostic importance that can help identify candidates poorly responding to combination chemotherapy and potentially guide alternative treatments.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32324791 PMCID: PMC7179860 DOI: 10.1371/journal.pone.0229593
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Principal component analysis (PCA) to illustrate variance between samples characterized by gene expression profiles of flow-sorted normal myeloid progenitor subsets.
(A) The PCA conducted for the meta-cohort of normal myeloid samples that were RMA normalized before combination and includes all myeloid subsets available. (B) The PCA conducted for the meta-cohort of normal myeloid samples that were combined prior to RMA normalization and includes all myeloid subsets available. (C) The PCA conducted for the meta-cohort described in (B) but limited to samples identified as early (HSC) and late (GMP, MEP) hematopoietic progenitors by FACS. (D) The PCA conducted for the meta-cohort described in (B) but limited to samples that were included in the training-cohort and median centered. Each data point represents the expression profile of one sample. In A-B, samples are color-coded based on their cohort identity; for C-D, samples are color-coded according to their myeloid progenitor subset identity (HSC, GMP, MEP). Samples with similar expression profiles will cluster together. Axis labels indicate the principal component (PC) plotted and the proportion of the variance explained by that PC. PCs are derived by orthogonal data transformation to reduce dimensionality and represent the directions of the data that explain a maximal amount of variation.
Fig 2Venn diagram of predictive genes included in the MAGS classification.
MAGS prediction accuracy assigning 100% (A) or 85% (B) of the samples to the defined MAGS subtypes HSC, GMP, MEP, and an additional UC subtype.
| 18 (69.2) | 4 (15.4) | 4 (15.4) | 8 (30.8) | |||
| 0 (0) | 5 (100) | 0 (0) | 0 (0) | |||
| 0 (0) | 0 (0) | 7 (100) | 0 (0) | |||
| 17 (65.4) | 3 (11.5) | 0 (0) | 6 (23.1) | 9 (34.6) | ||
| 0 (0) | 5 (100) | 0 (0) | 0 (0) | 0 (0) | ||
| 0 (0) | 0 (0) | 7 (100) | 0 (0) | 0 (0) | ||
The prediction accuracy was estimated in the validation-cohort (N = 38). Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors; MEP, megakaryocyte-erythroid, UC, unclassified.
Distributions and frequencies of assigned MAGS subtypes across two clinical cohorts: TCGA (N = 182) and GSE6891 (N = 520).
Two-sided Fishers exact tests were used to determine significantly different distributions across data sets (p = 0.99).
| Cohort | HSC (%) | MEP (%) | GMP (%) | UC (%) | Total (%) |
|---|---|---|---|---|---|
| TCGA | 50 (27.5) | 48 (26.4) | 56 (30.8) | 28 (15.4) | 182 (100) |
| GSE6891 | 146 (28.1) | 134 (25.8) | 162 (31.2) | 78 (15.0) | 520 (100) |
In total, 85% of clinical samples were assigned to MAGS subtypes (HSC, GMP, MEP), and 15% of each cohort was unclassified (UC).
Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors, MEP, megakaryocyte-erythroid progenitors; UC, unclassified samples (assignment frequency of UC = 15%)
Fig 3Prognostic validation of the assigned MAGS subtypes for (A) the TCGA-cohort (N = 171), (B) the GSE6892-cohort (N = 520), and (C) the associated meta-cohort (N = 691), using a frequency cut-off of 85% for MAGS assigned and 15% for samples assigned as unclassified.
Kaplan Meier survival curves were generated for overall survival and P-values were estimated using a log-rank test. Only samples with complete survival information were included and the number at risk per MAGS subtype are provided for each cohort.
Univariate Cox regression analysis for MAGS-assigned clinical cohorts (GSE6891 cohort: N = 520; TCGA-cohort: N = 171) and the corresponding AML meta-cohort (N = 691) that included both clinical cohorts.
Analyses were performed for overall survival. Samples with missing survival information (TCGA cohort: N = 11) were excluded. Table columns indicate the total sample size (N), the number of patients that died (Events) per MAGS subtype, the associated hazards ration (HR), the 95% lower and upper confidence intervals (CI), and the estimated p-value (P).
| N | Events | HR | 95% CI | P | |
|---|---|---|---|---|---|
| HSC | 48 | 40 | 1 | ||
| MEP | 44 | 22 | 0.44 | 0.26–0.74 | 1.90e-03 |
| GMP | 52 | 34 | 0.68 | 0.43–1.08 | 0.1 |
| UC | 27 | 14 | 0.37 | 0.20–0.69 | 1.63e-03 |
| HSC | 146 | 115 | 1 | ||
| MEP | 134 | 77 | 0.53 | 0.40–0.71 | 2.13e-05 |
| GMP | 162 | 101 | 0.63 | 0.48–0.83 | 7.86e-04 |
| UC | 78 | 52 | 0.70 | 0.51–0.98 | 0.03 |
| HSC | 194 | 155 | 1 | ||
| MEP | 178 | 99 | 0.51 | 0.40–0.65 | 1.46e-07 |
| GMP | 214 | 135 | 0.64 | 0.51–0.81 | 1.64e-04 |
| UC | 105 | 66 | 0.60 | 0.45–0.80 | 5.32e-04 |
a Analysis of the meta-cohort was corrected for potential batch effects by including the cohort (TCGA, GSE6891) as an independent explanatory variable.
Significance levels:
* ≤ 0.05;
** ≤0.01;
*** ≤ 0.001;
Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors; MEP, megakaryocyte-erythroid progenitors; UC, unclassified samples (assignment frequency of UC = 15%);
Cox regression analysis of potential confounding variables conducted for the TCGA (N = 122), GSE6891 (N = 439), and meta-cohort (N = 561).
Cohorts were limited to samples with complete records for all explanatory variables investigated. Results are shown for the (A) univariate Cox regression analysis per explanatory variable and (B) associated multivariate Cox regression analyses limited to confounding variables tested significant in univariate Cox regression analyses. Table columns are as described in Table 3. Analyses were performed for overall survival.
| 37 | 32 | 1 | 120 | 93 | 1 | 157 | 125 | 1 | ||||||||
| 26 | 15 | 0.523 | 0.282–0.971 | 108 | 56 | 0.491 | 0.352–0.685 | 134 | 71 | 0.486 | 0.363–0.650 | |||||
| 40 | 26 | 0.595 | 0.353–1.005 | 0.052 | 147 | 89 | 0.632 | 0.472–0.846 | 187 | 115 | 0.622 | 0.483–0.802 | ||||
| 19 | 9 | 0.309 | 0.146–0.652 | 64 | 40 | 0.666 | 0.460–0.965 | 83 | 49 | 0.580 | 0.417–0.807 | |||||
| 24 | 10 | 1 | 97 | 40 | 1 | NA | NA | NA | NA | NA | ||||||
| 60 | 42 | 2.636 | 1.319–5.268 | 247 | 160 | 1.900 | 1.341–2.690 | NA | NA | NA | NA | NA | ||||
| 35 | 27 | 2.828 | 1.362–5.872 | 85 | 69 | 3.222 | 2.173–4.778 | NA | NA | NA | NA | NA | ||||
| 3 | 3 | 8.159 | 2.188–30.431 | 10 | 9 | 3.801 | 1.839–7.857 | NA | NA | NA | NA | NA | ||||
| 13 | 9 | 1 | 16 | 13 | 1 | 29 | 22 | 1 | ||||||||
| 28 | 21 | 1.773 | 0.805–3.904 | 0.155 | 94 | 60 | 0.646 | 0.355–1.177 | 0.153 | 122 | 81 | 0.772 | 0.481–1.236 | 0.281 | ||
| 27 | 17 | 1.156 | 0.520–2.659 | 0.697 | 104 | 64 | 0.629 | 0.346–1.142 | 0.128 | 131 | 81 | 0.698 | 0.435–1.119 | 0.135 | ||
| 6 | 3 | 0.542 | 0.146–2.013 | 0.360 | 24 | 11 | 0.469 | 0.210–1.048 | 0.065 | 30 | 14 | 0.492 | 0.251–0.962 | |||
| 27 | 16 | 0.861 | 0.379–1.955 | 0.720 | 79 | 48 | 0.635 | 0.344–1.173 | 0.147 | 106 | 64 | 0.683 | 0.420–1.108 | 0.123 | ||
| NA | NA | NA | NA | NA | 5 | 2 | 0.325 | 0.073–1.444 | 0.140 | 5 | 2 | 0.330 | 0.077–1.403 | 0.133 | ||
| 14 | 10 | 1.957 | 0.785–4.879 | 0.150 | 103 | 72 | 0.780 | 0.432–1.407 | 0.408 | 117 | 82 | 0.846 | 0.528–1.356 | 0.488 | ||
| 3 | 3 | 2.452 | 0.649–9.272 | 0.186 | 6 | 3 | 0.415 | 0.118–1.456 | 0.170 | 9 | 6 | 0.702 | 0.284–1.731 | 0.442 | ||
| 3 | 3 | 2.311 | 0.615–8.679 | 0.215 | 0 | 0 | NA | NA | NA | 3 | 3 | 1.920 | 0.573–6.429 | 0.290 | ||
| 1 | 0 | 2.751 | NA | 0.996 | 8 | 5 | 0.869 | 0.310–2.438 | 0.790 | 9 | 5 | 0.865 | 0.328–2.286 | 0.771 | ||
| 119 | 80 | 1 | 406 | 262 | 1 | 525 | 342 | 1 | ||||||||
| 3 | 2 | 1.184 | 0.289–4.851 | 0.815 | 29 | 13 | 0.555 | 0.318–0.968 | 32 | 15 | 0.556 | 0.332–0.933 | ||||
| 121 | 81 | 1 | 315 | 187 | 1 | 436 | 268 | 1 | ||||||||
| 1 | 1 | 5.724 | 0.766–42.766 | 0.089 | 124 | 91 | 1.665 | 1.294–2.141 | 125 | 92 | 1.428 | 1.127–1.811 | ||||
| 107 | 71 | 1 | 390 | 254 | 1 | 497 | 325 | 1 | ||||||||
| 15 | 11 | 1.478 | 0.778–2.810 | 0.233 | 48 | 24 | 0.699 | 0.460–1.062 | 0.093 | 63 | 35 | 0.820 | 0.579–1.163 | 0.265 | ||
| 116 | 78 | 1 | 405 | 256 | 1 | 521 | 334 | 1 | ||||||||
| 6 | 4 | 0.772 | 0.282–2.114 | 0.614 | 31 | 19 | 0.901 | 0.565–1.436 | 0.661 | 37 | 23 | 0.863 | 0.565–1.316 | 0.493 | ||
| 113 | 78 | 1 | 400 | 254 | 1 | 513 | 332 | 1 | ||||||||
| 9 | 4 | 0.509 | 0.186–1.394 | 0.189 | 36 | 21 | 0.849 | 0.544–1.324 | 0.470 | 45 | 25 | 0.765 | 0.509–1.148 | 0.196 | ||
| 116 | 78 | 1 | 436 | 276 | 1 | 552 | 354 | 1 | ||||||||
| 6 | 4 | 0.669 | 0.244–1.836 | 0.435 | 3 | 2 | 1.028 | 0.256–4.132 | 0.970 | 9 | 6 | 1.031 | 0.460–2.311 | 0.941 | ||
| 109 | 73 | 1 | 303 | 198 | 1 | 412 | 271 | 1 | ||||||||
| 13 | 9 | 1.739 | 0.864–3.501 | 0.121 | 136 | 80 | 0.874 | 0.674–1.133 | 0.310 | 149 | 89 | 0.845 | 0.665–1.074 | 0.169 | ||
| 115 | 77 | 1 | 396 | 253 | 1 | 511 | 330 | 1 | ||||||||
| 7 | 5 | 1.435 | 0.578–3.563 | 0.436 | 42 | 25 | 0.826 | 0.548–1.246 | 0.362 | 49 | 30 | 0.843 | 0.580–1.225 | 0.370 | ||
| 122 | 82 | 1.002 | 0.998–1.006 | 0.402 | NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | ||
| 122 | 82 | 1.041 | 1.022–1.060 | 439 | 278 | 1.015 | 1.005–1.026 | 561 | 360 | 1.025 | 1.017–1.033 | |||||
| NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | 122 | 82 | 1 | ||||
| NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | 439 | 278 | 0.579 | 0.451–0.744 | |||
| 37 | 32 | 1 | 120 | 93 | 1 | 157 | 125 | 1 | ||||||||
| 26 | 15 | 0.781 | 0.404–1.511 | 0.464 | 108 | 56 | 0.600 | 0.419–0.862 | 134 | 71 | 0.524 | 0.374–0.734 | ||||
| 40 | 26 | 0.780 | 0.445–1.365 | 0.383 | 147 | 89 | 0.723 | 0.537–0.973 | 187 | 115 | 0.593 | 0.443–0.793 | ||||
| 19 | 9 | 0.467 | 0.213–1.025 | 64 | 40 | 0.742 | 0.508–1.084 | 0.123 | 83 | 49 | 0.561 | 0.398–0.791 | ||||
| 24 | 10 | 1 | 97 | 40 | 1 | - | - | - | - | - | ||||||
| 60 | 42 | 1.996 | 0.964–4.129 | 0.063 | 247 | 160 | 1.529 | 1.045–2.239 | - | - | - | - | - | |||
| 35 | 27 | 1.911 | 0.870–4.195 | 0.107 | 85 | 69 | 2.751 | 1.825–4.147 | - | - | - | - | - | |||
| 3 | 3 | 7.440 | 1.928–28,712 | 10 | 9 | 3.036 | 1.445–6.379 | - | - | - | - | - | ||||
| - | - | - | - | - | - | - | - | - | - | 29 | 22 | 1 | ||||
| - | - | - | - | - | - | - | - | - | - | 122 | 81 | 1.267 | 0.769–2.087 | 0.353 | ||
| - | - | - | - | - | - | - | - | - | - | 131 | 81 | 1.266 | 0.766–2.094 | 0.358 | ||
| - | - | - | - | - | - | - | - | - | - | 30 | 14 | 1.089 | 0.531–2.233 | 0.816 | ||
| - | - | - | - | - | - | - | - | - | - | 106 | 64 | 1.143 | 0.684–1.911 | 0.610 | ||
| - | - | - | - | - | - | - | - | - | - | 5 | 2 | 0.861 | 0.196–3.790 | 0.844 | ||
| - | - | - | - | - | - | - | - | - | - | 117 | 82 | 1.595 | 0.934–2.723 | 0.087 | ||
| - | - | - | - | - | - | - | - | - | - | 9 | 6 | 1.140 | 0.418–3.112 | 0.798 | ||
| - | - | - | - | - | - | - | - | - | - | 3 | 3 | 2.006 | 0.573–7.021 | 0.276 | ||
| - | - | - | - | - | - | - | - | - | - | 9 | 5 | 1.186 | 0.442–3.181 | 0.735 | ||
| - | - | - | - | - | 406 | 262 | 1 | 525 | 342 | 1 | ||||||
| - | - | - | - | - | 29 | 13 | 0.703 | 0.386–1.283 | 0.251 | 32 | 15 | 0.759 | 0.433–1.331 | 0.336 | ||
| - | - | - | - | - | 315 | 187 | 1 | 436 | 268 | 1 | ||||||
| - | - | - | - | - | 124 | 91 | 1.621 | 1.230–2.135 | 125 | 92 | 1.542 | 1.193–1.994 | ||||
| 122 | 82 | 1.035 | 1.017–1.054 | 435 | 275 | 1.011 | 1.001–1.022 | 557 | 357 | 1.021 | 1.012–1.030 | |||||
| NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | 122 | 82 | 1 | ||||
| NA | NA | NA | NA | NA | NA | NA | NA | NA | NA | 439 | 278 | 0.640 | 0.472–0.869 | |||
a Cytogenetic risk was excluded from Cox regression analyses conducted for the meta-cohort due to differences in cytogenetic risk group stratification between the TCGA (stratification based on cytogenetic and molecular genetics) and GSE6891 (stratification based on cytogenetic abnormalities only) cohorts.
b Samples recorded as FAB-Mx (N = 1), FAB-RAEB (N = 4), and FAB-RAEBt (N = 13) were removed.
Significance levels:
* ≤ 0.05;
** ≤0.01;
*** ≤ 0.001;
Abbreviations: N, total sample size; HR, hazards ratio; CI, confidence intervals; HSC, hematopoietic stem cells; MEP, megakaryocyte-erythroid progenitors; GMP, granulocytic- monocytic progenitors; UC, unclassified; INTER, intermediate
Overview of enrichment patterns detected using gene set enrichment analysis (GSEA) in the reduced assignment probability meta-cohort (N = 573: NGSE6891 = 423, NTCGA = 150), investigating the Hallmark, the C2-CP, and the C3-TFT gene set collections retrieved from the MSigDB.
GSEA analysis was conducted for three comparisons: (A) HSC vs. Rest, (B) MEP vs. Rest, and (C) GMP vs. Rest. The total number of enriched gene sets (Total) with p-value ≤ 0.01 and FDR ≤ 0.25, and the numbers of enriched gene sets detected per group within each comparison are presented.
| 31 | 10 (32.3) | 21 (67.7) | |
| 256 | 30 (11.7) | 226 (88.3) | |
| 222 | 173 (77.9) | 49 (22.1) | |
| 25 | 11 (44.0) | 14 (56.0) | |
| 360 | 59 (16.4) | 301 (83.6) | |
| 101 | 32 (31.7) | 69 (68.3) | |
| 21 | 20 (95.2) | 1 (4.8) | |
| 254 | 252 (99.2) | 301 (0.8) | |
| 22 | 9 (40.9) | 69 (59.1) | |
Abbreviations: HSC, hematopoietic stem cells; MEP, megakaryocyte-erythroid progenitors; GMP, granulocytic-monocytic progenitors; UC, unclassified; MSigDB, Molecular Signature Database