Literature DB >> 32324791

Normal myeloid progenitor cell subset-associated gene signatures for acute myeloid leukaemia subtyping with prognostic impact.

Anna A Schönherz1,2,3, Julie Støve Bødker2, Alexander Schmitz2, Rasmus Froberg Brøndum2, Lasse Hjort Jakobsen1,2, Anne Stidsholt Roug1,2, Marianne T Severinsen1,2, Tarec C El-Galaly1,2, Paw Jensen2, Hans Erik Johnsen1,2, Martin Bøgsted1,2, Karen Dybkær1,2.   

Abstract

Acute myeloid leukaemia (AML) is characterised by phenotypic heterogeneity, which we hypothesise is a consequence of deregulated differentiation with transcriptional reminiscence of the normal compartment or cell-of-origin. Here, we propose a classification system based on normal myeloid progenitor cell subset-associated gene signatures (MAGS) for individual assignments of AML subtypes. We generated a MAGS classifier including the progenitor compartments CD34+/CD38- for haematopoietic stem cells (HSCs), CD34+/CD38+/CD45RA- for megakaryocyte-erythroid progenitors (MEPs), and CD34+/CD38+/CD45RA+ for granulocytic-monocytic progenitors (GMPs) using regularised multinomial regression with three discrete outcomes and an elastic net penalty. The regularisation parameters were chosen by cross-validation, and MAGS assignment accuracy was validated in an independent data set (N = 38; accuracy = 0.79) of sorted normal myeloid subpopulations. The prognostic value of MAGS assignment was studied in two clinical cohorts (TCGA: N = 171; GSE6891: N = 520) and had a significant prognostic impact. Furthermore, multivariate Cox regression analysis using the MAGS subtype, FAB subtype, cytogenetics, molecular genetics, and age as explanatory variables showed independent prognostic value. Molecular characterisation of subtypes by differential gene expression analysis, gene set enrichment analysis, and mutation patterns indicated reduced proliferation and overrepresentation of RUNX1 and IDH2 mutations in the HSC subtype; increased proliferation and overrepresentation of CEBPA mutations in the MEP subtype; and innate immune activation and overrepresentation of WT1 mutations in the GMP subtype. We present a differentiation-dependent classification system for AML subtypes with distinct pathogenetic and prognostic importance that can help identify candidates poorly responding to combination chemotherapy and potentially guide alternative treatments.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32324791      PMCID: PMC7179860          DOI: 10.1371/journal.pone.0229593

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.752


Introduction

Compelling evidence demonstrates that acute myeloid leukaemia (AML) is of clonal origin and represents the progeny of a single cell that enters leukaemic transformation due to multiple genetic events that impair cell differentiation and apoptosis and invoke uncontrolled cell proliferation. However, the evolution from the first somatic mutation to full-blown AML is not well mapped. The simplest models predict that each newly acquired somatic mutation during oncogenesis confers a selective advantage that drives successive waves of clonal expansion and deregulated differentiation, with the fittest clone becoming dominant at diagnosis and during relapse. [1,2] The understanding of linage-specific progenitor commitment during AML transformation and subsequent clonal evolution is fundamental to the pathogenesis and treatment of AML. We speculate that the genetic abnormality in clinically relevant clones can be traced back to the normal compartment or cell of origin (COO)–as we have previously shown for lymphoid B-cell malignancies [3-6]–and that knowledge of the COO and its deregulation could provide novel molecular and oncogenic insight into AML subtypes. Myeloid and lymphoid malignancies are particularly well suited for evaluating the cellular origin on malignant transformation due to our understanding of the normal haematopoietic hierarchy and the availability of analytical tools for the examination of phenotypically defined subpopulations at the single-cell level. Recently, we described a procedure to identify and study the gene expression of immunophenotype-based flow-sorted minor subsets. [7] These subsets can be profiled by global gene expression, and using statistical modelling, we were able to define normal subset-specific B-cell-associated gene signatures (BAGS) for assignment and prognostic evaluation in B-cell malignancies. [4,8,9] We found that the diagnostic heterogeneity in multiple myeloma, chronic lymphocytic leukaemia, and diffuse large B-cell lymphoma reflects a link between differentiation and oncogenesis–a contribution often overlooked. The phenotypic heterogeneity of AML was initially defined by cytology decades ago, when the French-American-British (FAB) Cooperative Group developed a classification system based on morphologic and cytochemical phenotypes associated with normal end stage myelopoiesis. However, as cytogenetics was introduced, the prognostic impact of the FAB subtypes (M0-M7) became unclear. [10-16] Immunophenotyping demonstrated that poorly differentiated AML blasts are deregulated and differentially disrupted early stem or progenitor cells; however, no uniform leukaemic phenotype has been defined by CD34 and CD38 membrane markers. [17] Here, we hypothesise that the phenotypic heterogeneity of AML is a consequence of deregulated differentiation with transcriptional reminiscence of the normal stem or progenitor COO phenotype. We generated myeloid progenitor cell subset-associated gene signatures (MAGS) and assigned subtypes in clinical cohorts to study their prognostic and potential pathogenetic impacts.

Material and methods

Data sets

This study is based on data sets already published and publicly available. Sorted normal myeloid data from healthy donors were retrieved from the Gene Expression Omnibus Database (GEO) and are referred to as the GSE63270, [18] GSE42519, [19] GSE19599, [20] GSE17054, [21] and GSE19429 [22] cohorts. For GSE63270, [18] and GSE17054, [21] human bone marrow mononuclear cells (BMMCs) from healthy donors were purchased from ALLCELLS (Emeryville, CA) where collection protocols and donor informed consent are approved by an institutional review board (IRB) in compliance to State and Federal regulations. For GSE42519, [19] GSE19599, [20] and GSE19429 [22] BMMCs from healthy donors were collected and analysed after IRB approval and written informed consent was obtained. The samples were previously phenotyped by multiparametric flow cytometry (MFC) and fluorescence-activated cell sorting (FACS) into myeloid subpopulations, and gene expression profiles (GEPs) were obtained using an Affymetrix Human Genome U13z½3 Plus 2.0 Array GeneChip (Affymetrix, Santa Clara, CA). Sorted normal myeloid data were restricted to those five cohorts due to the limited availability of publicly available data sets with comparable FACS and GEP protocols. The subpopulation compositions and cell sorting details are summarised in S1 and S2 Tables. The myeloid subpopulations of interest for the present study were CD34+/CD38- haematopoietic stem cells (HSCs), CD34+/CD38+/CD45RA+ granulocytic-monocytic progenitors (GMPs), and CD34+/CD38+/CD45RA- megakaryocyte-erythroid progenitors (MEPs). Clinical AML data were retrieved from the GEO database and The Cancer Genome Atlas (TCGA) database and are referred to as the GSE6891 [23,24] and TCGA [25] cohorts, respectively. For GSE6891, [23,24] and TCGA [25] bone marrow aspirates from AML patients were obtained and analysed after patients provided written informed consent in accordance with the Declaration of Helsinki, and the study was approved by all participating institutional review boards. The data sets were chosen because the GEPs were generated in a comparable manner as those of the sorted normal myeloid data sets, and they contained information on age, gender, FAB subtype, cytogenetic risk score, WBC, disease outcome, and genetic aberrations. The metadata of both cohorts are summarised in the S1 Data (S1 and S3 Tables).

Statistical analysis

All statistical analyses were performed with R version 3.3.3. [26] The statistical analysis is summarised here; for comprehensive documentation, see the (S1 Data). This study followed the guidelines of omics-directed medicine [27-29] whenever possible. Prior to the statistical analysis, all gene expression data were background corrected and normalised using the Robust Multichip Average (RMA) [30] algorithm and summarised at the gene level using a Brainarray custom CDF for the Affymetrix Human Genome U133 Plus 2.0 GeneChip (version 20.0). The clinical cohorts were RMA normalised per cohort, whereas the normal myeloid cohorts were combined into one meta-cohort prior to RMA normalisation. Six normal myeloid HSC, GMP, and MEP samples were randomly selected and used as a training cohort. The remaining 38 normal myeloid samples were used as a validation cohort. The MAGS classifier for the HSC, MEP, and GMP subsets was obtained from regularised multinomial regression with the cell type as the discrete outcome and the median-centred GEP of the training cohort as the explanatory variable. The model was fitted with an elastic net penalty. [31] The regularisation parameters were determined by cross-validation, and the parameters with the lowest multinomial deviance were chosen. Genes with non-zero coefficients were defined as predictive. Based on those genes, assignment probabilities were estimated to identify the most likely MAGS subtype of a sample. The prediction accuracy of the MAGS classification was validated in the validation cohort. To compensate for cohort-based technical batch effects, the validation cohort was median centred and scaled to have the same variance as the training cohort. The same procedure was applied to the clinical cohorts followed by cohort-based MAGS subtype assignment according to the subtype with the highest predicted probability score and assigning 15% of the samples with the lowest assignment probability as unclassified (UC). To improve statistical power, clinical cohorts were combined into a clinical meta-cohort following MAGS assignment. For biological characterisation, the GEPs of the clinical meta-cohort were batch corrected using the empirical Bayes approach, ComBat, [32] implemented in the Bioconductor package “sva” (version 3.18.0). [33] Survival analyses were performed using the Kaplan-Meier method, log-rank test statistics, and Cox proportional hazards regression analysis for the individual cohorts and the combined clinical meta-cohort. In addition, the prognostic importance of 13 other explanatory variables (FAB subtype, cytogenetic risk score, CEBPA aberrations, FLT3-itd aberrations, FLT3-tkd aberrations, IDH1 aberrations, IDH2 aberrations, KRAS aberrations, NPM1 aberrations, NRAS aberrations, white blood cell count (WBC), age, and cohort) was investigated by univariate Cox proportional hazards regression analysis in the TCGA (N = 122), GSE6891 (N = 439), and meta-cohort (N = 561), limited to samples with complete records for all investigated variables (for detailed information, see S1 Data). MAGS was evaluated as an independent explanatory variable by multivariate Cox proportional hazards regression analysis for overall survival in all three cohorts, including explanatory variables tested with prognostic effects in the univariate regression analysis in the respective cohorts. Cytogenetic risk score could not be investigated as a confounding variable in the meta-cohort due to differences in cytogenetic risk group stratification between the two cohorts. In the GSE6891 cohort, cytogenetic risk score stratification was based on cytogenetic abnormalities only, [23] whereas in the TCGA cohort, cytogenetic risk score stratification was based on cytogenetics and molecular genetics. [25,34,35] Instead, MAGS was tested for prognostic impact against the cytogenetic risk scores independently in the two clinical cohorts.

MAGS validation through biological characterisation

The biological characterisation and identification of biological processes associated with MAGS subtypes were investigated through differential gene expression (DGE) analysis, gene set enrichment analysis, and the identification of subtype-specific mutation patterns for well-documented driver mutations. To increase detection power, analyses were conducted for the meta-cohort, but they were restricted to clinical samples with MAGS assignment probabilities ≥ 0.75, if not stated otherwise.

DGE analysis

Was performed in R using the limma Bioconductor package (version: 3.26.9). [36,37] To identify MAGS subtype-specific differences, clinical samples assigned to one subtype were compared with the remaining samples (Rest), resulting in the following three comparisons: i) HSC vs. Rest, ii) GMP vs. Rest, and iii) MEP vs. Rest. For summary statistics, p-values were adjusted using the Benjamini and Hochberg algorithm, [38] and genes with p ≤ 0.001 were defined as differentially expressed.

Enrichment analysis

Was conducted using two different approaches: a classical Gene Ontology (GO) annotation using Fisher’s exact test to identify over-represented GO terms in differentially expressed genes and a computational gene set enrichment analysis (GSEA) approach developed by the Broad Institute that uses a pre-ranked gene list of all profiled genes (S1 Data). The GSEA was performed using the GSEA desktop application (version 3.0) [39] using 2000 permutations of gene set randomisation and the default settings. Gene sets included in the analysis were selected from the Molecular Signature Database (MSigDB, v6.0) [40] using the Hallmark, [39,41] C2-CP, [39] and C3-TFT MSigDB collections. [39,42] Gene sets including fewer than 15 or more than 500 genes were excluded from the analysis. Gene sets with an adjusted p ≤ 0.05 and a false discovery rate (FDR) q-value for normalised enrichment scores (ES) ≤ 0.25 were considered significantly enriched. [39]

Identification of subtype-specific mutation patterns

Was performed for well-documented driver mutations across seven AML oncogenes (CEBPA, IDH1, IDH2, FLT3 [including both the FLT3-itd and the FLT3-tkd aberrations], NPM1, NRAS, and KRAS) in the meta-cohort, irrespective of the MAGS assignment probabilities but limited to samples with recorded mutation information (N = 587: NGSE6891 = 457, NTCGA = 130). In addition, a second analysis was conducted for 112 genes associated with AML that had been previously characterised and classified. [43] Mutation records for those genes were extracted from exome-wide somatic mutation data available for a subset of the TCGA cohort (N = 130). [25] Potential associations with MAGS subtypes were investigated for each mutation using Fisher’s exact tests with a significance cut-off level of 0.05.

Results

Generation and validation of MAGS

The transcriptomic identity of normal myeloid subsets was validated by principal component analysis (Fig 1). Batch effects were partially removed using RMA normalisation (Fig 1A and 1B), and the subset identity could be confirmed for the three progenitor compartments through subset-specific segregation into discrete clusters (Fig 1C and 1D), allowing subsequent identification of MAGS. The MAGS classifier with the smallest deviance determined by cross-validation consisted of 92 genes (S4 Table, S1 Fig). The HSC subtype signature included 44 predictive genes, 30 of which were subtype-specific (68.2%); the GMP subtype signature included 37 predictive genes, 20 of which were subtype-specific (62.2%); and the MEP subtype signature included 33 predictive genes, 19 of which were subtype-specific (57.6%; Fig 2). The highest overlap of predictive genes was between the HSC and GMP subsets (N = 8), followed by GMP and MEP (N = 6) and HSC and MEP (N = 6). The prediction accuracy of the MAGS classifier was validated using sorted normal myeloid samples (N = 38: NHSC = 26, NGMP = 7, NMEP = 5), showing a prediction accuracy of 78.95% when all samples were assigned to one of the three MAGS subtypes and 90.63% when defining 15% of the samples with the lowest MAGS assignment probability as UC. For both assignment strategies, the prediction accuracy of the GMP and MEP subtypes was 100%. The MAGS assignment inconsistencies were restricted to the HSC subtype (Table 1A and 1B). Moreover, the majority of the samples wrongly assigned belonged to the GSE19429 cohort (six of the eight samples).
Fig 1

Principal component analysis (PCA) to illustrate variance between samples characterized by gene expression profiles of flow-sorted normal myeloid progenitor subsets.

(A) The PCA conducted for the meta-cohort of normal myeloid samples that were RMA normalized before combination and includes all myeloid subsets available. (B) The PCA conducted for the meta-cohort of normal myeloid samples that were combined prior to RMA normalization and includes all myeloid subsets available. (C) The PCA conducted for the meta-cohort described in (B) but limited to samples identified as early (HSC) and late (GMP, MEP) hematopoietic progenitors by FACS. (D) The PCA conducted for the meta-cohort described in (B) but limited to samples that were included in the training-cohort and median centered. Each data point represents the expression profile of one sample. In A-B, samples are color-coded based on their cohort identity; for C-D, samples are color-coded according to their myeloid progenitor subset identity (HSC, GMP, MEP). Samples with similar expression profiles will cluster together. Axis labels indicate the principal component (PC) plotted and the proportion of the variance explained by that PC. PCs are derived by orthogonal data transformation to reduce dimensionality and represent the directions of the data that explain a maximal amount of variation.

Fig 2

Venn diagram of predictive genes included in the MAGS classification.

Table 1

MAGS prediction accuracy assigning 100% (A) or 85% (B) of the samples to the defined MAGS subtypes HSC, GMP, MEP, and an additional UC subtype.

A)PredictedInvalid Prediction (%)
Observed  HSC (%)MEP (%)GMP (%)UC (%)
HSC (N = 26)18 (69.2)4 (15.4)4 (15.4)-8 (30.8)
MEP (N = 5)0 (0)5 (100)0 (0)-0 (0)
GMP (N = 7)0 (0)0 (0)7 (100)-0 (0)
B)PredictedInvalid Prediction (%)
ObservedHSC (%)MEP (%)GMP (%)UC (%)
HSC (N = 26)17 (65.4)3 (11.5)0 (0)6 (23.1)9 (34.6)
MEP (N = 5)0 (0)5 (100)0 (0)0 (0)0 (0)
GMP (N = 7)0 (0)0 (0)7 (100)0 (0)0 (0)

The prediction accuracy was estimated in the validation-cohort (N = 38). Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors; MEP, megakaryocyte-erythroid, UC, unclassified.

Principal component analysis (PCA) to illustrate variance between samples characterized by gene expression profiles of flow-sorted normal myeloid progenitor subsets.

(A) The PCA conducted for the meta-cohort of normal myeloid samples that were RMA normalized before combination and includes all myeloid subsets available. (B) The PCA conducted for the meta-cohort of normal myeloid samples that were combined prior to RMA normalization and includes all myeloid subsets available. (C) The PCA conducted for the meta-cohort described in (B) but limited to samples identified as early (HSC) and late (GMP, MEP) hematopoietic progenitors by FACS. (D) The PCA conducted for the meta-cohort described in (B) but limited to samples that were included in the training-cohort and median centered. Each data point represents the expression profile of one sample. In A-B, samples are color-coded based on their cohort identity; for C-D, samples are color-coded according to their myeloid progenitor subset identity (HSC, GMP, MEP). Samples with similar expression profiles will cluster together. Axis labels indicate the principal component (PC) plotted and the proportion of the variance explained by that PC. PCs are derived by orthogonal data transformation to reduce dimensionality and represent the directions of the data that explain a maximal amount of variation. The prediction accuracy was estimated in the validation-cohort (N = 38). Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors; MEP, megakaryocyte-erythroid, UC, unclassified.

MAGS assignment of clinical samples and prognostic impact

Clinical AML samples from two independent cohorts of adult patients diagnosed with de novo AML were classified into MAGS subtypes (S3 Table). We allowed 15% of the samples within each cohort to be assigned as UC, resulting in an assignment probability cut-off ≥ 0.71 (TCGA cohort = 0.71, GSE6891 cohort = 0.72). An unambiguous MAGS subtype assignment was achieved, and the subtype frequencies did not vary between the two clinical cohorts (Table 2). Subtype frequencies ranged from 28.1–31.2% in the GSE6981 cohort and 26.4–30.8% in the TCGA cohort when ignoring the UC-assigned samples. Furthermore, the GMP subtype was the most frequently assigned in both cohorts, followed by MEP and HSC.
Table 2

Distributions and frequencies of assigned MAGS subtypes across two clinical cohorts: TCGA (N = 182) and GSE6891 (N = 520).

Two-sided Fishers exact tests were used to determine significantly different distributions across data sets (p = 0.99).

CohortHSC (%)MEP (%)GMP (%)UC (%)Total (%)
TCGA50 (27.5)48 (26.4)56 (30.8)28 (15.4)182 (100)
GSE6891146 (28.1)134 (25.8)162 (31.2)78 (15.0)520 (100)

In total, 85% of clinical samples were assigned to MAGS subtypes (HSC, GMP, MEP), and 15% of each cohort was unclassified (UC).

Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors, MEP, megakaryocyte-erythroid progenitors; UC, unclassified samples (assignment frequency of UC = 15%)

Distributions and frequencies of assigned MAGS subtypes across two clinical cohorts: TCGA (N = 182) and GSE6891 (N = 520).

Two-sided Fishers exact tests were used to determine significantly different distributions across data sets (p = 0.99). In total, 85% of clinical samples were assigned to MAGS subtypes (HSC, GMP, MEP), and 15% of each cohort was unclassified (UC). Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors, MEP, megakaryocyte-erythroid progenitors; UC, unclassified samples (assignment frequency of UC = 15%) The prognostic impact of the MAGS subtypes was analysed both individually and collectively in a meta-analysis combining the MAGS-assigned samples of the GSE6891 and TCGA cohorts. The MAGS assignment showed a significant prognostic association with overall survival (Fig 3; log-rank test p ≤ 0.001). The lineage-committed MAGS subtypes GMP and MEP had superior prognoses compared with the undifferentiated AMLs captured by the HSC subtype. This was supported by univariate Cox regression analysis conducted for the GSE6891 and the cohort-adjusted clinical meta-cohort, revealing significant differences between the GMP and HSC (GSE6891: HR = 0.63, p < 0.001; meta-cohort: HR = 0.64, p < 0.001), the MEP and HSC (GSE6891: HR = 0.53, p ≪ 0.001; meta-cohort: HR = 0.51, p ≪ 0.001), and the UC and HSC (GSE6891: HR = 0.70, p = 0.03; meta-cohort: HR = 0.60, p < 0.001; Table 3) subtypes. In the TCGA cohort, significant differences were only observed between the MEP and HSC (HR = 0.44, p = 1.9e-03) and the UC and HSC (HR = 0.37, p = 1.6e-03; Table 3) subtypes. Moreover, multivariate Cox proportional hazards analysis conducted for the three cohorts (TCGA, GSE6891, meta-cohort) demonstrated that the MAGS subtypes added significant prognostic information that was not already explained by FAB subtype, cytogenetics, molecular genetics (well-documented driver mutations in CEBPA, FLT3, IDH1, IDH2, KRAS, NPM1, or NRAS), or age (Table 4).
Fig 3

Prognostic validation of the assigned MAGS subtypes for (A) the TCGA-cohort (N = 171), (B) the GSE6892-cohort (N = 520), and (C) the associated meta-cohort (N = 691), using a frequency cut-off of 85% for MAGS assigned and 15% for samples assigned as unclassified.

Kaplan Meier survival curves were generated for overall survival and P-values were estimated using a log-rank test. Only samples with complete survival information were included and the number at risk per MAGS subtype are provided for each cohort.

Table 3

Univariate Cox regression analysis for MAGS-assigned clinical cohorts (GSE6891 cohort: N = 520; TCGA-cohort: N = 171) and the corresponding AML meta-cohort (N = 691) that included both clinical cohorts.

Analyses were performed for overall survival. Samples with missing survival information (TCGA cohort: N = 11) were excluded. Table columns indicate the total sample size (N), the number of patients that died (Events) per MAGS subtype, the associated hazards ration (HR), the 95% lower and upper confidence intervals (CI), and the estimated p-value (P).

NEventsHR95% CIP
TCGA cohort
 HSC48401
 MEP44220.440.26–0.741.90e-03**
 GMP52340.680.43–1.080.1
 UC27140.370.20–0.691.63e-03**
GSE6891 cohort
 HSC1461151
 MEP134770.530.40–0.712.13e-05***
 GMP1621010.630.48–0.837.86e-04***
 UC78520.700.51–0.980.03
Meta-cohorta
 HSC1941551
 MEP178990.510.40–0.651.46e-07***
 GMP2141350.640.51–0.811.64e-04***
 UC105660.600.45–0.805.32e-04***

a Analysis of the meta-cohort was corrected for potential batch effects by including the cohort (TCGA, GSE6891) as an independent explanatory variable.

Significance levels:

* ≤ 0.05;

** ≤0.01;

*** ≤ 0.001;

Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors; MEP, megakaryocyte-erythroid progenitors; UC, unclassified samples (assignment frequency of UC = 15%);

Table 4

Cox regression analysis of potential confounding variables conducted for the TCGA (N = 122), GSE6891 (N = 439), and meta-cohort (N = 561).

Cohorts were limited to samples with complete records for all explanatory variables investigated. Results are shown for the (A) univariate Cox regression analysis per explanatory variable and (B) associated multivariate Cox regression analyses limited to confounding variables tested significant in univariate Cox regression analyses. Table columns are as described in Table 3. Analyses were performed for overall survival.

A) Simple Cox Regression
TCGA cohort (N = 122)GSE6891 cohort (N = 439)AML meta-cohort (N = 561)
NEventsHR95% CIPNEventsHR95% CIPNEventsHR95% CIP
MAGSHSC373211209311571251
MEP26150.5230.282–0.9710.040*108560.4910.352–0.6852.75E-05***134710.4860.363–0.6501.26E-06***
GMP40260.5950.353–1.0050.052147890.6320.472–0.8460.002**1871150.6220.483–0.8022.50E-04***
UC1990.3090.146–0.6520.002**64400.6660.460–0.9650.032*83490.5800.417–0.8070.001**
Cytogenetic riskaGOOD2410197401NANANANANA
INTER60422.6361.319–5.2680.006**2471601.9001.341–2.6903.02E-04***NANANANANA
POOR35272.8281.362–5.8720.005**85693.2222.173–4.7785.84E-09***NANANANANA
UC338.1592.188–30.4310.002**1093.8011.839–7.8573.14E-04***NANANANANA
FABbM013911613129221
M128211.7730.805–3.9040.15594600.6460.355–1.1770.153122810.7720.481–1.2360.281
M227171.1560.520–2.6590.697104640.6290.346–1.1420.128131810.6980.435–1.1190.135
M3630.5420.146–2.0130.36024110.4690.210–1.0480.06530140.4920.251–0.9620.038*
M427160.8610.379–1.9550.72079480.6350.344–1.1730.147106640.6830.420–1.1080.123
M4ENANANANANA520.3250.073–1.4440.140520.3300.077–1.4030.133
M514101.9570.785–4.8790.150103720.7800.432–1.4070.408117820.8460.528–1.3560.488
M6332.4520.649–9.2720.186630.4150.118–1.4560.170960.7020.284–1.7310.442
M7332.3110.615–8.6790.21500NANANA331.9200.573–6.4290.290
UC102.751NA0.996850.8690.310–2.4380.790950.8650.328–2.2860.771
CEBPANEG11980140626215253421
POS321.1840.289–4.8510.81529130.5550.318–0.9680.038*32150.5560.332–0.9330.026*
FLT3-ITDNEG12181131518714362681
POS115.7240.766–42.7660.089124911.6651.294–2.1417.18E-05***125921.4281.127–1.8110.003**
FLT3-TKDNEG10771139025414973251
POS15111.4780.778–2.8100.23348240.6990.460–1.0620.09363350.8200.579–1.1630.265
IDH1NEG11678140525615213341
POS640.7720.282–2.1140.61431190.9010.565–1.4360.66137230.8630.565–1.3160.493
IDH2NEG11378140025415133321
POS940.5090.186–1.3940.18936210.8490.544–1.3240.47045250.7650.509–1.1480.196
KRASNEG11678143627615523541
POS640.6690.244–1.8360.435321.0280.256–4.1320.970961.0310.460–2.3110.941
NMP1NEG10973130319814122711
POS1391.7390.864–3.5010.121136800.8740.674–1.1330.310149890.8450.665–1.0740.169
NRASNEG11577139625315113301
POS751.4350.578–3.5630.43642250.8260.548–1.2460.36249300.8430.580–1.2250.370
WBCWBC122821.0020.998–1.0060.402NANANANANANANANANANA
AgeAge122821.0411.022–1.0601.44E-05***4392781.0151.005–1.0260.003**5613601.0251.017–1.0332.64E-09***
CohortTCGANANANANANANANANANANA122821
GSE6891NANANANANANANANANANA4392780.5790.451–0.7441.99E-05***
B) Multivariate Cox Regression
TCGA cohort (N = 122)GSE6891 cohort (N = 439)AML meta-cohort (N = 561)
NEventsHR95% CIPNEventsHR95% CIPNEventsHR95% CIP
MAGSHSC373211209311571251
MEP26150.7810.404–1.5110.464108560.6000.419–0.8620.006**134710.5240.374–0.7341.69E-04***
GMP40260.7800.445–1.3650.383147890.7230.537–0.9730.032*1871150.5930.443–0.7934.32E-04***
UC1990.4670.213–1.0250.058*64400.7420.508–1.0840.12383490.5610.398–0.7919.55E-04***
Cytogenetic riskaGOOD2410197401-----
INTER60421.9960.964–4.1290.0632471601.5291.045–2.2390.029-----
POOR35271.9110.870–4.1950.10785692.7511.825–4.1471.36E-06***-----
UC337.4401.928–28,7120.004**1093.0361.445–6.3790.003**-----
FABbM0----------29221
M1----------122811.2670.769–2.0870.353
M2----------131811.2660.766–2.0940.358
M3----------30141.0890.531–2.2330.816
M4----------106641.1430.684–1.9110.610
M4E----------520.8610.196–3.7900.844
M5----------117821.5950.934–2.7230.087
M6----------961.1400.418–3.1120.798
M7----------332.0060.573–7.0210.276
UC----------951.1860.442–3.1810.735
CEBPANEG-----40626215253421
POS-----29130.7030.386–1.2830.25132150.7590.433–1.3310.336
FLT3-ITDNEG-----31518714362681
POS-----124911.6211.230–2.1356.04E-04***125921.5421.193–1.9949.52E-04***
AgeAge122821.0351.017–1.0541.71E-04***4352751.0111.001–1.0220.036*5573571.0211.012–1.0304.56E-06***
CohortTCGANANANANANANANANANANA122821
GSE6891NANANANANANANANANANA4392780.6400.472–0.8690.005**

a Cytogenetic risk was excluded from Cox regression analyses conducted for the meta-cohort due to differences in cytogenetic risk group stratification between the TCGA (stratification based on cytogenetic and molecular genetics) and GSE6891 (stratification based on cytogenetic abnormalities only) cohorts.

b Samples recorded as FAB-Mx (N = 1), FAB-RAEB (N = 4), and FAB-RAEBt (N = 13) were removed.

Significance levels:

* ≤ 0.05;

** ≤0.01;

*** ≤ 0.001;

Abbreviations: N, total sample size; HR, hazards ratio; CI, confidence intervals; HSC, hematopoietic stem cells; MEP, megakaryocyte-erythroid progenitors; GMP, granulocytic- monocytic progenitors; UC, unclassified; INTER, intermediate

Prognostic validation of the assigned MAGS subtypes for (A) the TCGA-cohort (N = 171), (B) the GSE6892-cohort (N = 520), and (C) the associated meta-cohort (N = 691), using a frequency cut-off of 85% for MAGS assigned and 15% for samples assigned as unclassified.

Kaplan Meier survival curves were generated for overall survival and P-values were estimated using a log-rank test. Only samples with complete survival information were included and the number at risk per MAGS subtype are provided for each cohort.

Univariate Cox regression analysis for MAGS-assigned clinical cohorts (GSE6891 cohort: N = 520; TCGA-cohort: N = 171) and the corresponding AML meta-cohort (N = 691) that included both clinical cohorts.

Analyses were performed for overall survival. Samples with missing survival information (TCGA cohort: N = 11) were excluded. Table columns indicate the total sample size (N), the number of patients that died (Events) per MAGS subtype, the associated hazards ration (HR), the 95% lower and upper confidence intervals (CI), and the estimated p-value (P). a Analysis of the meta-cohort was corrected for potential batch effects by including the cohort (TCGA, GSE6891) as an independent explanatory variable. Significance levels: * ≤ 0.05; ** ≤0.01; *** ≤ 0.001; Abbreviations: HSC, hematopoietic stem cells; GMP, granulocytic-monocytic progenitors; MEP, megakaryocyte-erythroid progenitors; UC, unclassified samples (assignment frequency of UC = 15%);

Cox regression analysis of potential confounding variables conducted for the TCGA (N = 122), GSE6891 (N = 439), and meta-cohort (N = 561).

Cohorts were limited to samples with complete records for all explanatory variables investigated. Results are shown for the (A) univariate Cox regression analysis per explanatory variable and (B) associated multivariate Cox regression analyses limited to confounding variables tested significant in univariate Cox regression analyses. Table columns are as described in Table 3. Analyses were performed for overall survival. a Cytogenetic risk was excluded from Cox regression analyses conducted for the meta-cohort due to differences in cytogenetic risk group stratification between the TCGA (stratification based on cytogenetic and molecular genetics) and GSE6891 (stratification based on cytogenetic abnormalities only) cohorts. b Samples recorded as FAB-Mx (N = 1), FAB-RAEB (N = 4), and FAB-RAEBt (N = 13) were removed. Significance levels: * ≤ 0.05; ** ≤0.01; *** ≤ 0.001; Abbreviations: N, total sample size; HR, hazards ratio; CI, confidence intervals; HSC, hematopoietic stem cells; MEP, megakaryocyte-erythroid progenitors; GMP, granulocytic- monocytic progenitors; UC, unclassified; INTER, intermediate

DGE and functional annotation of enriched gene sets

To assess biological differences between MAGS subtypes, we performed DGE analysis on 573 samples and compared each subtype with the combined other subtypes: HSC vs. Rest, GMP vs. Rest, and MEP vs. Rest. The largest number of differentially expressed genes (DEGs) was identified for the GMP subtype (NDEG = 6414), followed by the MEP (NDEG = 4279) and HSC (NDEG = 4071) subtypes. The most distinct DGE profile (number of subtype-specific DEGs) was discovered for the GMP subtype with 1657 DEGs, followed by MEP with 935 and the HSC subtype with 776 (S2 Fig). The top DEGs for the GMP and MEP subtypes overlapped (HBD, ALAS2, SPTA1, KLF1, EPB42, AHSP, and SELENBP1; S5A–S5C Table). They were upregulated in the MEP subtype and downregulated in the GMP subtype. Moreover, most of those genes were involved in erythrocyte differentiation (KLF1, ALAS2, and AHSP) or erythrocyte membrane or haemoglobin functions (HBD, SPTA1, and EPB42), which indicates transcriptional discrimination between erythrocytes and other cells. Hence, the results provide biological proof of concept that the MAGS classification of clinical AML samples enables separation into megakaryocyte-erythroid linage and granulocytic-monocytic linage COO subtypes. In contrast, the top DEGs associated with the HSC subtype were subtype-specific and did not reflect any lineage commitment. To further investigate potential functional patterns associated with the MAGS subtypes, gene enrichment analysis was performed for only the DEGs and taking all genes into consideration. Enrichment analysis of DEGs annotated to GO terms associated with biological processes identified 1066 GO terms enriched for the MEP subtype, followed by 697 for the GMP subtype and 556 for the HSC subtype. Moreover, general patterns of enriched biological processes differed between MAGS subtypes, suggesting different pathogenic courses (S6 Table). The DEGs detected for the HSC subtype were mainly enriched for cell cycle and metabolic processes (S6A Table), whereas DEGs in the GMP and MEP subtypes were mainly enriched for immune system processes and cellular responses to external stimuli, suggesting late progenitor cell functions (S6B and S6C Table). The GSEA revealed a general downregulation of genes associated with gene sets enriched in the HSC (Table 5A) and MEP subtypes (Table 5B), whereas genes associated with gene sets enriched in the GMP subtype were predominantly upregulated (Table 5C). Biological processes associated with subtype-specific enriched gene sets differed between subtypes. For the HSC subtype, enrichment patterns were characterised by a low cell-cycle activity signature with reduced metabolic rates, suggesting low cell proliferation or a prolonged quiescence phase (S7A Table, S3A Fig). The gene set enrichment patterns observed for the GMP subtype revealed a high metabolic activity signature with innate immune activation (S7B Table, S3B Fig), whereas the MEP subtype had a high cell-cycle activity signature with impaired innate immune activity (S7C Table, S3C Fig). Moreover, the MEP subtype was enriched for genes involved in the metabolism of heme- and erythroblast differentiation, which were downregulated in the GMP subtype, further supporting our hypothesis that malignant cells possess transcriptional reminiscence of the COO.
Table 5

Overview of enrichment patterns detected using gene set enrichment analysis (GSEA) in the reduced assignment probability meta-cohort (N = 573: NGSE6891 = 423, NTCGA = 150), investigating the Hallmark, the C2-CP, and the C3-TFT gene set collections retrieved from the MSigDB.

GSEA analysis was conducted for three comparisons: (A) HSC vs. Rest, (B) MEP vs. Rest, and (C) GMP vs. Rest. The total number of enriched gene sets (Total) with p-value ≤ 0.01 and FDR ≤ 0.25, and the numbers of enriched gene sets detected per group within each comparison are presented.

A) HSC vs. Rest
MSigDB collectionTotalEnriched in HSC (%)Enriched in Rest (%)
Hallmark3110 (32.3)21 (67.7)
C2-CP25630 (11.7)226 (88.3)
C3-TFT222173 (77.9)49 (22.1)
B) MEP vs. Rest
MSigDB collectionTotalEnriched in GMP (%)Enriched in Rest (%)
Hallmark2511 (44.0)14 (56.0)
C2-CP36059 (16.4)301 (83.6)
C3-TFT10132 (31.7)69 (68.3)
C) GMP vs. Rest
MsigDB collectionTotalEnriched in MEP (%)Enriched in Rest (%)
Hallmark2120 (95.2)1 (4.8)
C2-CP254252 (99.2)301 (0.8)
C3-TFT229 (40.9)69 (59.1)

Abbreviations: HSC, hematopoietic stem cells; MEP, megakaryocyte-erythroid progenitors; GMP, granulocytic-monocytic progenitors; UC, unclassified; MSigDB, Molecular Signature Database

Overview of enrichment patterns detected using gene set enrichment analysis (GSEA) in the reduced assignment probability meta-cohort (N = 573: NGSE6891 = 423, NTCGA = 150), investigating the Hallmark, the C2-CP, and the C3-TFT gene set collections retrieved from the MSigDB.

GSEA analysis was conducted for three comparisons: (A) HSC vs. Rest, (B) MEP vs. Rest, and (C) GMP vs. Rest. The total number of enriched gene sets (Total) with p-value ≤ 0.01 and FDR ≤ 0.25, and the numbers of enriched gene sets detected per group within each comparison are presented. Abbreviations: HSC, hematopoietic stem cells; MEP, megakaryocyte-erythroid progenitors; GMP, granulocytic-monocytic progenitors; UC, unclassified; MSigDB, Molecular Signature Database

Annotation of genetic mutation patterns

Potential associations between the MAGS subtypes and well-documented mutations in seven AML-associated oncogenes recorded for both the TCGA and the GSE6891 cohort (CEBPA, FLT3 [including both the FLT3-itd and FLT3-tkd aberrations], IDH1, IDH2, KRAS, NPM1, and NRAS) were investigated in the meta-cohort. Two genes, CEBPA and IDH2, showed subtype-specific mutation patterns. Mutations occurring in the CEBPA gene were associated with the MEP subtype (p = 5.79e-08), which was observed especially for the CEBPA double mutation. Mutations detected in the IDH2 gene were more frequently observed in the HSC subtype (p = 0.015; S8A Table). Furthermore, a set of 112 genes previously shown to harbour AML driver mutations were investigated for MAGS subtype-specific mutation patterns in the reduced TCGA cohort (N = 130). Mutations were detected in 68 genes, revealing significant subtype-specific mutation patterns for RUNX1, RUNX1T1, TP53, and WT1. RUNX1 mutations were associated with the HSC subtype (p = 0.005), while mutations detected in the WT1 gene were associated with the GMP subtype (p = 0.031). Mutations detected in the TP53 gene were negatively correlated with the GMP subtype, and mutations in RUNX1T1 were associated with the UC samples (p = 0.02; S8B Table).

Discussion

In AML, GEPs have successfully identified molecular cancer subtypes for stratifying patients into responders vs. non-responders and predicting survival. [19,44-49] These molecular classification systems are generally based on the GEPs of leukaemic cells or on well-documented oncogenic driver mutations and cytogenetic aberrations associated with AML oncogenesis. [16,21,40-45] Here, we examined and validated a classification system using the GEPs of normal myeloid progenitor cell compartments to classify AML into subtypes based on transcriptional reminiscence of the COO. We showed that the MAGS subtypes of AML cases are associated with prognosis. This observation supports the idea that one or more MAGS subtypes have pathogenic impact. The conclusions may be important for future diagnostic phenotyping and the implementation of individual precision therapy, although there are conceptual, molecular, statistical, and clinical considerations that need to be discussed before clinical implementation and validation. Our concept is that AML heterogeneity is a consequence of deregulated differentiation and that there is transcriptional reminiscence of the COO. Combining MFC, FACS, and GEP methodologies to phenotype the myeloid progenitor cells in normal bone marrow samples enabled the development of MAGS that differentiate between early (HSC) and late (GMP, MEP) progenitors by tracing transcriptional reminiscence expression patterns of the COO in end-stage AML samples. The MAGS classification assigned comparable subtype frequencies to AML samples within and across independent clinical cohorts. In a meta-analysis of 691 adult patients with de novo AML, we demonstrated a significant prognostic association with post-therapy outcome. Moreover, multivariate Cox proportional hazards analyses in the two clinical cohorts as well as in the meta-cohort supported that MAGS subtyping is independent of FAB subtype, cytogenetic risk score (not investigated in the meta-cohort due to different risk score stratifications across cohorts), and molecular genetics (well-documented driver mutations in CEBPA, FLT3 [including both FLT3-itd and the FLT3-tkd aberrations], IDH1, IDH2, KRAS, NPM1, and NRAS). These results suggest that we have identified distinct pathogenic mechanisms that require further investigation. The prognostic impact of MAGS-assigned subtypes conferred a superior prognosis to the lineage-committed GMP and MEP subtypes compared with the undifferentiated HSC subtype. The adverse association between leukaemic stem cell phenotypes and survival is well documented in primary AML. [50,51] In agreement with our findings, those studies demonstrated that leukaemic stem cell signatures were independent prognostic predictors that were associated with adverse clinical outcome. The molecular characterisation of MAGS subtype-specific mutation patterns revealed that RUNX1 and IDH2 mutations had non-random associations with the HSC subtype and that there was a tendency for the DNMT3A mutation to be overrepresented in this subtype. Thus, the HSC subtype is associated with well-documented driver mutations in key haematopoietic and epigenetic regulators involved in AML initiation. [25,52,53] In normal haematopoiesis, RUNX1 plays a critical role in HSC compartment maintenance, proliferation, and haematopoietic differentiation, while in malignancy, genetic aberrations in RUNX1 have been associated with highly aggressive AML and poor prognosis. [43,54-57] In contrast, IDH2 and DNMT3A are key epigenetic regulators, and pre-leukaemic driver mutations in these genes occur early in AML development and are associated with the pre-leukaemic HSC compartment. [2,17,52,58] The significant enrichment of stem cell-specific mutations in AML samples with the HSC subtype indicates a consensus between the transcriptional COO detected by MAGS and the genetic COO, suggesting that AML transformation is not accompanied by subsequent differentiation in the HSC subtype or that HSC-like transcriptional gene signatures were reactivated after AML transformation in more differentiated cells. Similarly, CEBPA mutations were overrepresented in the MEP subtype. CEBPA is a myeloid transcription factor involved in the balance between cell proliferation and terminal differentiation, especially granulocyte differentiation. Loss of CEBPA function in AML contributes to leukaemogenesis by blocking granulocytic differentiation, which is accompanied by the increased accumulation of earlier stem and myeloid progenitors as well as erythroid and megakaryocytic progenitors. [59-62] According to the WHO classification and the current ELN guidelines, patients with mutated CEBPA, particularly those with biallelic mutated CEBPA, represent a cohort with favourable prognosis, [54] which is in general accordance with current findings of superior prognosis in the MEP subtype. However, MAGS-based survival curves revealed smaller differences between subtypes than previously reported findings. [63] Although a positive association between MEP and CEBPA aberrations was observed, this is of associative and not definitive nature, as only 24 out of 147 MEP classified cases showed CEBPA mutations and recorded both contain mono and biallelic CEBPA mutations, possibly contributing to a lower overall survival range than previously reported. [63] Functional annotation revealed that genes involved in cell-cycle activity and metabolic processes were downregulated in the HSC subtype. This suggests slower cell proliferation or even a prolonged quiescence phase compared with the GMP or MEP subtype. The poor prognosis of the HSC subtype, thus, might reflect the inefficacy of antiproliferative chemotherapeutics, such as cytarabine, in eradicating slow or non-proliferating leukaemic cells. [64,65] Hence, reactivation of cell-cycle activity might be a critical step to re-establish chemotherapy sensitivity, as previously demonstrated in xenograft AML models. [66,67] In addition, the GMP subtype was characterised by enhanced innate immune activity, especially through Toll-like receptor (TLR) signaling, which was impaired in the MEP subtype. Enhanced expression of TLRs has been associated with haematopoietic malignancies, [68-71] including AML, [68,72,73] but their role in pathogenesis remains unclear. Nevertheless, enhanced TLR signaling could activate inflammatory cytokine secretion and downstream effectors, which might explain the observed upregulation of IL-6 JAK-STAT3 signaling and coregulation of TNF-α signaling through NF-κB and IFN-γ signaling in the GMP subtype. Accordingly, inhibition of TLRs or downstream effectors may confer therapeutic benefit in the GMP subtype but not necessarily in the MEP or HSC subtype, as described previously. [68,74] Statistical models used restricted multinomial regression to estimate the MAGS assignment probability for each sample. MAGS subtypes were defined a priori based on FACS and were independent of the GEP used to build the classifier and subsequent MAGS assignment in clinical samples. Furthermore, samples with low assignment probabilities were labelled UC. The frequency of UC samples in other gene expression-based COO classifications is approximately 15%. [4,75] The probability cut-offs observed for MAGS assignment in the clinical cohorts, when allowing for the assignment of 15% of the samples as UC, exceeded 0.70, which is well above the random assignment probability of one out of four. Furthermore, the prognostic robustness of MAGS was successfully validated for a wide range of assignment frequency cut-offs for the UC subtype (S4A–S4C Fig). The prediction accuracy of the MAGS classification was rather low at 78.95%, but defining 15% of the samples with a low assignment probability as UC improved the prediction accuracy to 90.63%. Incorrect subtype prediction was restricted to the HSC subtype, especially to the GSE19429 cohort, for which FACS information was limited. The findings, thus, may be associated with differences in the FACS procedures and poorly defined progenitor populations. This is further supported by recent findings indicating that FACS surface markers are limited in their capacity to fully capture the differentiation stage of haematopoietic progenitor cells. [76] As the prediction accuracy of the MAGS classification is highly dependent on the number and quality of normal myeloid reference populations, it may be improved by increasing the sample size of the training cohort, avoiding interlaboratory batch effects, and optimising the isolation and characterisation of normal haematopoietic cell compartments for a priori subtype assignments. Clinical considerations: Overall, patient survival was associated with the MAGS-assigned AML progenitor subtypes, independent of age, FAB subtype, and cytogenetic risk scores. These findings support the idea that initial hits in oncogenesis occur in the stem and progenitor cell compartments. MAGS subtype-specific mutation patterns of well-documented driver mutations also support the potential clinical impact of MAGS subtyping. Combination chemotherapy still forms the backbone of AML treatments; however, patients with relapsed or refractory diseases have an unmet need for predictive tests and precise companion diagnostics. This need may be fulfilled using MAGS subtyping with predictive information to guide targeted therapy. In agreement with previous work of our group, [4,8] the current analyses indicate that such information is available at diagnosis and could be used for the identification of candidates needing more precise strategies. We believe our results support the future inclusion of gene expression profiling in randomised prospective clinical trials aimed at improving AML treatment. In summary, we have developed and documented a novel classification system that associates normal myeloid progenitor subsets with AML subtypes and prognosis. The MAGS subtypes have different clinical courses, drug resistance mechanisms, and molecular pathogenesis. However, further studies are needed to examine subtype-specific therapeutic strategies. Interestingly, the results imply a consensus between the genetic and transcriptional COOs, suggesting a minor impact of cell plasticity in leukaemic end stage cells. Future prospective studies will be needed to prove this concept using clinical endpoints. (DOCX) Click here for additional data file.

Summary of cohort properties for sorted normal myeloid data sets and clinical AML data sets.

(DOCX) Click here for additional data file.

FACS antibody panels used to sort early (HSC) and late (GMP, MEP) myeloid progenitor cell subsets.

(DOCX) Click here for additional data file.

Molecular metadata information available for the GSE6891 and TCGA cohorts.

(DOCX) Click here for additional data file.

List of predictive genes defining the myeloid progenitor cell subset-associated gene signatures.

(DOCX) Click here for additional data file.

Top differentially expressed genes.

(DOCX) Click here for additional data file.

Top twenty Gene Ontology (GO) terms enriched for differentially expressed genes.

(DOCX) Click here for additional data file.

Top-enriched gene sets identified through GSEA analysis.

(DOCX) Click here for additional data file.

MAGS subtype-specific mutation patterns.

(DOCX) Click here for additional data file.

Association between the MAGS and FAB subtype.

(DOCX) Click here for additional data file.

Identification of regularisation parameters through cross-validation.

(DOCX) Click here for additional data file.

Venn diagram of MAGS subtype-specific differentially expressed genes.

(DOCX) Click here for additional data file.

GSEA enrichment plots.

(PDF) Click here for additional data file.

Prognostic validation of the assigned MAGS subtypes for the clinical meta-cohort.

(DOCX) Click here for additional data file.
  72 in total

1.  Association of a leukemic stem cell gene expression signature with clinical outcomes in acute myeloid leukemia.

Authors:  Andrew J Gentles; Sylvia K Plevritis; Ravindra Majeti; Ash A Alizadeh
Journal:  JAMA       Date:  2010-12-22       Impact factor: 56.272

2.  Diffuse large B-cell lymphoma classification system that associates normal B-cell subset phenotypes with prognosis.

Authors:  Karen Dybkær; Martin Bøgsted; Steffen Falgreen; Julie S Bødker; Malene K Kjeldsen; Alexander Schmitz; Anders E Bilgrau; Zijun Y Xu-Monette; Ling Li; Kim S Bergkvist; Maria B Laursen; Maria Rodrigo-Domingo; Sara C Marques; Sophie B Rasmussen; Mette Nyegaard; Michael Gaihede; Michael B Møller; Richard J Samworth; Rajen D Shah; Preben Johansen; Tarec C El-Galaly; Ken H Young; Hans E Johnsen
Journal:  J Clin Oncol       Date:  2015-03-23       Impact factor: 44.544

3.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

Review 4.  Molecular genetics in acute myeloid leukemia.

Authors:  Ulrike Bacher; Susanne Schnittger; Torsten Haferlach
Journal:  Curr Opin Oncol       Date:  2010-11       Impact factor: 3.645

Review 5.  The 2008 revision of the World Health Organization (WHO) classification of myeloid neoplasms and acute leukemia: rationale and important changes.

Authors:  James W Vardiman; Jüergen Thiele; Daniel A Arber; Richard D Brunning; Michael J Borowitz; Anna Porwit; Nancy Lee Harris; Michelle M Le Beau; Eva Hellström-Lindberg; Ayalew Tefferi; Clara D Bloomfield
Journal:  Blood       Date:  2009-04-08       Impact factor: 22.113

6.  Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia.

Authors:  Timothy J Ley; Christopher Miller; Li Ding; Benjamin J Raphael; Andrew J Mungall; A Gordon Robertson; Katherine Hoadley; Timothy J Triche; Peter W Laird; Jack D Baty; Lucinda L Fulton; Robert Fulton; Sharon E Heath; Joelle Kalicki-Veizer; Cyriac Kandoth; Jeffery M Klco; Daniel C Koboldt; Krishna-Latha Kanchi; Shashikant Kulkarni; Tamara L Lamprecht; David E Larson; Ling Lin; Charles Lu; Michael D McLellan; Joshua F McMichael; Jacqueline Payton; Heather Schmidt; David H Spencer; Michael H Tomasson; John W Wallis; Lukas D Wartman; Mark A Watson; John Welch; Michael C Wendl; Adrian Ally; Miruna Balasundaram; Inanc Birol; Yaron Butterfield; Readman Chiu; Andy Chu; Eric Chuah; Hye-Jung Chun; Richard Corbett; Noreen Dhalla; Ranabir Guin; An He; Carrie Hirst; Martin Hirst; Robert A Holt; Steven Jones; Aly Karsan; Darlene Lee; Haiyan I Li; Marco A Marra; Michael Mayo; Richard A Moore; Karen Mungall; Jeremy Parker; Erin Pleasance; Patrick Plettner; Jacquie Schein; Dominik Stoll; Lucas Swanson; Angela Tam; Nina Thiessen; Richard Varhol; Natasja Wye; Yongjun Zhao; Stacey Gabriel; Gad Getz; Carrie Sougnez; Lihua Zou; Mark D M Leiserson; Fabio Vandin; Hsin-Ta Wu; Frederick Applebaum; Stephen B Baylin; Rehan Akbani; Bradley M Broom; Ken Chen; Thomas C Motter; Khanh Nguyen; John N Weinstein; Nianziang Zhang; Martin L Ferguson; Christopher Adams; Aaron Black; Jay Bowen; Julie Gastier-Foster; Thomas Grossman; Tara Lichtenberg; Lisa Wise; Tanja Davidsen; John A Demchok; Kenna R Mills Shaw; Margi Sheth; Heidi J Sofia; Liming Yang; James R Downing; Greg Eley
Journal:  N Engl J Med       Date:  2013-05-01       Impact factor: 91.245

7.  Long-term follow-up of cytogenetically normal CEBPA-mutated AML.

Authors:  Friederike Pastore; Daniela Kling; Eva Hoster; Annika Dufour; Nikola P Konstandin; Stephanie Schneider; Maria C Sauerland; Wolfgang E Berdel; Thomas Buechner; Bernhard Woermann; Jan Braess; Wolfgang Hiddemann; Karsten Spiekermann
Journal:  J Hematol Oncol       Date:  2014-09-10       Impact factor: 17.388

Review 8.  The Role of Toll-Like Receptors in Hematopoietic Malignancies.

Authors:  Darlene A Monlish; Sima T Bhatt; Laura G Schuettpelz
Journal:  Front Immunol       Date:  2016-09-28       Impact factor: 7.561

9.  Minimum Information About a Microarray Experiment (MIAME)--successes, failures, challenges.

Authors:  Alvis Brazma
Journal:  ScientificWorldJournal       Date:  2009-05-29

10.  An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis.

Authors:  Namyoung Jung; Bo Dai; Andrew J Gentles; Ravindra Majeti; Andrew P Feinberg
Journal:  Nat Commun       Date:  2015-10-07       Impact factor: 14.919

View more
  1 in total

Review 1.  Molecular Systems Architecture of Interactome in the Acute Myeloid Leukemia Microenvironment.

Authors:  V A Shiva Ayyadurai; Prabhakar Deonikar; Kevin G McLure; Kathleen M Sakamoto
Journal:  Cancers (Basel)       Date:  2022-02-01       Impact factor: 6.639

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.