| Literature DB >> 31235852 |
Chi Huu Nguyen1,2, Tobias Glüxam1,2, Angela Schlerka1,2, Katharina Bauer1,2,3, Alexander M Grandits1,2, Hubert Hackl4, Oliver Dovey5, Sabine Zöchbauer-Müller1,2, Jonathan L Cooper5, George S Vassiliou5, Dagmar Stoiber6,7, Rotraud Wieser8,9, Gerwin Heller10,11,12.
Abstract
Acute myeloid leukemia (AML) is a heterogeneous disease with respect to its genetic and molecular basis and to patients´ outcome. Clinical, cytogenetic, and mutational data are used to classify patients into risk groups with different survival, however, within-group heterogeneity is still an issue. Here, we used a robust likelihood-based survival modeling approach and publicly available gene expression data to identify a minimal number of genes whose combined expression values were prognostic of overall survival. The resulting gene expression signature (4-GES) consisted of 4 genes (SOCS2, IL2RA, NPDC1, PHGDH), predicted patient survival as an independent prognostic parameter in several cohorts of AML patients (total, 1272 patients), and further refined prognostication based on the European Leukemia Net classification. An oncogenic role of the top scoring gene in this signature, SOCS2, was investigated using MLL-AF9 and Flt3-ITD/NPM1c driven mouse models of AML. SOCS2 promoted leukemogenesis as well as the abundance, quiescence, and activity of AML stem cells. Overall, the 4-GES represents a highly discriminating prognostic parameter in AML, whose clinical applicability is greatly enhanced by its small number of genes. The newly established role of SOCS2 in leukemia aggressiveness and stemness raises the possibility that the signature might even be exploitable therapeutically.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31235852 PMCID: PMC6591510 DOI: 10.1038/s41598-019-45579-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of AML gene expression data sets retrieved from GEO and TCGA/Cancer Browser databases.
| Accession number | Platform | Patients (N)a | Ageb | Cytogenetics |
|---|---|---|---|---|
| GSE12417, cohort 1 | Affymetrix, HG-U133A | 162 | 58 (17–83) | Normal |
| GSE12417, cohort 2 | Affymetrix, HG-U133_Plus_2 | 78 | 62 (18–85) | Normal |
| GSE6891, cohort 1 | Affymetrix, HG-U133_Plus_2 | 222 | 43 (15–60) | Heterogeneous |
| GSE6891, cohort 2 | Affymetrix, HG-U133_Plus_2 | 185 | 45 (17–60) | Heterogeneous |
| GSE37642 | Affymetrix, HG-U133A | 379 | 57 (18–83) | Heterogeneous |
| GSE71014 | Illumina HumanHT-12 V4.0 | 104 | na | Normal |
| TCGA_LAML | RNA-sequencing | 142 | 60 (18–88) | Heterogeneous |
aNumber of patients after exclusion of samples with FAB M3/unknown FAB type or with MDS. bAge in years, median (range). na, not available.
Survival associated gene expression model identified by forward gene selection using GSE12417 cohort 1.
| Probe ID | Gene Symbol | nloglik | AIC | Selected |
|---|---|---|---|---|
| 203373_at |
| 452.01 | 910.02 | * |
| 218086_at |
| 449.26 | 906.52 | * |
| 211269_s_at |
| 447.91 | 905.81 | * |
| 201397_at |
| 444.23 | 900.46 | * |
| 218966_at |
| 444.23 | 902.45 | |
| 209386_at |
| 443.2 | 902.39 | |
| 203372_s_at |
| 443.17 | 904.33 | |
| 201540_at |
| 441.85 | 903.71 | |
| 211597_s_at |
| 440.17 | 902.33 |
Nloglik, negative log likelihood; AIC, Akaike information criterion; AIC is provided for the prognostic model including the respective gene and all genes above that gene in the list.
Figure 1Prognostic value of the 4-GES in 162 cytogenetically normal AML patients (GSE12417, cohort 1). (A) Cut-off value for stratification of AML patients into 4-GESlow (blue) and 4-GEShigh (red) was calculated by maximally selected rank statistics. (B) Kaplan Meier curves for overall survival of 162 AML patients classified as 4-GESlow or 4-GEShigh. Log rank tests were calculated and p-values were adjusted for multiple testing according to Altman et al.[58]. (C) Heatmap summarizing expression values of SOCS2, IL2RA, NPDC1 and PHGDH in 4-GESlow (blue) and 4-GEShigh (red) AML patients. Blue, low expression; red, high expression.
Multivariable Cox regression analysis for overall survival of AML patients.
| Data set/cohort | Variable | 4-GES | L-24 | M-7 | W-3 | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HR | 95% CI | p-value | HR | 95% CI | p-value | HR | 95% CI | p-value | HR | 95% CI | p-value | ||
|
| GE score, high |
|
|
|
| na | Na | na |
|
|
| ||
| Age (years) |
|
|
|
|
|
| na | Na | na |
|
|
| |
|
| GE score, high |
|
|
| ns | ns | ns |
|
|
|
|
|
|
| Age (years) |
|
|
| ns | ns | ns |
|
|
|
|
|
| |
|
| GE score, high |
|
|
| 1.1 | 0.8–1.6 | 0.56 |
|
| ns | ns | ns | |
| Cytogenetic riska |
|
|
|
|
|
| ns | ns | ns | ||||
| CEBPAm (w/m/b) | 0.79 | 0.1–6.4 | 0.828 | 0.84 | 0.1–8.1 | 0.878 | 0.95 | 0.1–6.4 | 0.957 | ns | ns | ns | |
| CEBPAm (yes/no) | 0.94 | 0.0–57.1 | 0.976 | 0.71 | 0.0–61.2 | 0.878 | 0.66 | 0.01–27.5 | 0.83 | ns | ns | ns | |
| 0.82 | 0.5–1.5 | 0.511 | 0.95 | 0.5–1.7 | 0.854 | 0.89 | 0.5–1.6 | 0.681 | ns | ns | ns | ||
| FLT3-ITD | 1.38 | 0.9–2.0 | 0.106 |
|
|
|
|
|
| ns | ns | ns | |
|
| GE score, high |
|
|
| 1.33 | 0.9–2.0 | 0.173 |
|
|
| ns | ns | ns |
| Age (years) | 1.01 | 1.0–1.03 | 0.281 | 1.01 | 1.0–1.03 | 0.128 | 1.01 | 1.0–1.03 | 0.102 | ns | ns | ns | |
| Cytogenetic riska | 1.13 | 0.8–1.6 | 0.528 | 1.22 | 0.9–1.75 | 0.27 | 1.35 | 1.0–1.92 | 0.092 | ns | ns | ns | |
|
|
|
|
|
|
|
|
|
| ns | ns | ns | ||
|
| GE score, high |
|
|
|
|
| na | Na | na | ns | ns | ns | |
| Age (years) |
|
|
|
| na | Na | na | ns | ns | ns | |||
| ELN scoreb |
|
|
|
| na | Na | na | ns | ns | ns | |||
| FAB, M1 vs. others | 1.06 | 0.8–1.4 | 0.718 | 0.95 | 0.7–1.3 | 0.745 | na | Na | na | ns | ns | ns | |
| FAB, M4 vs. others | 1.32 | 1.0–1.8 | 0.074 |
|
|
| na | Na | na | ns | ns | ns | |
|
| GE score, high | 1.52 | 1.0–2.3 | 0.051 | 1.3 | 0.8–2.1 | 0.249 | ns | Ns | ns |
|
|
|
| Age (years) |
|
|
|
|
| ns | Ns | ns |
|
| |||
| Cytogenetic riska | 1.29 | 0.9–1.9 | 0.179 | 1.25 | 0.8–1.9 | 0.265 | ns | Ns | ns |
|
|
| |
Parameters provided with the respective data sets were first tested in univariable analyses (Supplementary Table S1); those that resulted as significant were included in the multivariable models, whose results are summarized in this table. 4-GES, 4-gene expression score; L-24, 24-gene expression signature by Li et al.[16]; M-7, 7-gene expression signature by Marcucci et al.[19]; W-3, 3-gene expression signature by Wilop et al.[20]; HR, hazard ratio; CI, confidence interval; GE score, gene expression score; CEBPAm, CEBPA mutation; w, wild type; m, monoallelic; b, biallelic. aAssignment to cytogenetic risk groups were included in the respective GEO entries. bAssignment to ELN risk groups was provided by T. Herold, University of Munich, Department of Internal Medicine III, Munich, Germany. No relevant patient data were provided in GSE71014; therefore, multivariable analyses could not be performed. Significant p-values and corresponding HRs and Cis are indicated in bold letters. na, score could not be calculated because 2 signature genes were not represented on HG-U133A microarrays; ns, no statistical significance found in univariable analyses, thus, no multivariable analyses were performed.
Figure 2Validation of the 4-GES in gene expression data set GSE6891, containing cytogenetically heterogeneous AML patients. (A) Cohort 1 (N = 222), (B) cohort 2 (N = 185). Kaplan Meier curves for overall survival of 4-GESlow (blue) and 4-GEShigh (red) patients are shown. Statistical significance was calculated using the log rank test and p-values were adjusted for multiple testing according to Altman et al.[58].
Figure 3Validation of the 4-GES in gene expression data set GSE37642, consisting of 379 cytogenetically heterogeneous AML patients. (A) Cut-off value for stratification of AML patients into 4-GESlow (blue) and 4-GEShigh (red) was calculated by maximally selected rank statistics. (B) Kaplan Meier curves for overall survival of 379 AML patients classified as 4-GESlow (blue) and 4-GEShigh (red). (C) Kaplan Meier curves for overall survival of 107 ELN favourable, 174 ELN intermediate I/II and 86 ELN adverse risk AML patients stratified into 4-GESlow (blue) and 4-GEShigh (red). Statistical significance was calculated using the log rank test and p-values were adjusted for multiple testing according to Altman et al.[58]. (D) Kaplan Meier curves for overall survival of AML patients stratified into favourable, intermediate, and adverse based on ELN 2010 classification (left) and ELN 2010 + 4-GES classification (right). ELN, European Leukemia Net.
Figure 4Validation of the 4-GES in the gene expression data sets (A) GSE71014 and (B) TCGA_LAML. Kaplan Meier curves for overall survival of 4-GESlow (blue) and 4-GEShigh (red) patients are shown. Statistical significance was calculated using the log rank test and p-values were adjusted for multiple testing according to Altman et al.[58].
Figure 5Effects of Socs2 knock-down in an MLL-AF9 driven mouse model of AML. (A) Socs2 mRNA levels in MLL-AF9+ BM LCs and in BM and spleen cells of healthy mice were determined by qRT-PCR and normalised to those of the housekeeping gene ß-2-microglobulin using the ∆∆CT method. Mean ± SEM of three independent experiments; ***p < 0.001 (Student’s two-tailed t-test). (B) Left panel, immunoblot analysis for SOCS2 expression in shCtrl or shSocs2 transduced MLL-AF9+ LCs. Right panel, quantification of immunoblot results; mean ± SEM of two independent experiments; *p < 0.05 (Student’s two-tailed t-test). (C) shCtrl or shSocs2 transduced MLL-AF9+ LCs were maintained in suspension culture and counted on the indicated days. Mean ± SEM of three independent experiments. **p < 0.01; ***p < 0.001 (2-way ANOVA followed by Bonferroni’s post-hoc test). (D) Kaplan-Meier plot of mice transplanted with shCtrl or shSocs2 transduced MLL-AF9+ LCs (300.000 Venus+ RFP+ cells per mouse, n = 4). **p < 0.01 (log-rank test). (E–H) Analyses of LCs from mice terminally ill after transplantation with shCtrl or shSocs2 transduced MLL-AF9+ LCs. Mean ± SEM of 3 independent experiments. *p < 0.05, **p < 0.01, ***p < 0.001 (2-way ANOVA followed by Bonferroni’s post-hoc test). (E) Myeloid differentiation of spleen LCs. Mac-1+ Gr-1−, immature LCs; Mac-1+ Gr-1+, mature LCs. (F) Abundance of LSC enriched cells (LSCe; Lin− cKit+ Sca-1− CD34+ CD16/CD32hi cells) among Venus+ RFP+ spleen LCs. (G) Cell cycle distribution of spleen LSCe was determined by Ki67 and DAPI staining. Quiescent cells were defined as Ki67− cells with a 2n DNA content. (H) Colony formation by BM LCs serially plated into methyl cellulose.