Literature DB >> 34556857

Mutational synergy during leukemia induction remodels chromatin accessibility, histone modifications and three-dimensional DNA topology to alter gene expression.

Haiyang Yun1,2,3, Nisha Narayan1,2, Shabana Vohra1,2, George Giotopoulos1,2, Annalisa Mupo1,2,4,5, Pedro Madrigal1,2, Daniel Sasca1,2,6, David Lara-Astiaso1,2, Sarah J Horton1,2, Shuchi Agrawal-Singh1,2, Eshwar Meduri1,2, Faisal Basheer1,2, Ludovica Marando1,2, Malgorzata Gozdecka1,2,4, Oliver M Dovey1,2,4, Aracely Castillo-Venzor1, Xiaonan Wang1,2, Paolo Gallipoli1,2,7, Carsten Müller-Tidow3, Cameron S Osborne8, George S Vassiliou1,2,4, Brian J P Huntly9,10.   

Abstract

Altered transcription is a cardinal feature of acute myeloid leukemia (AML); however, exactly how mutations synergize to remodel the epigenetic landscape and rewire three-dimensional DNA topology is unknown. Here, we apply an integrated genomic approach to a murine allelic series that models the two most common mutations in AML: Flt3-ITD and Npm1c. We then deconvolute the contribution of each mutation to alterations of the epigenetic landscape and genome organization, and infer how mutations synergize in the induction of AML. Our studies demonstrate that Flt3-ITD signals to chromatin to alter the epigenetic environment and synergizes with mutations in Npm1c to alter gene expression and drive leukemia induction. These analyses also allow the identification of long-range cis-regulatory circuits, including a previously unknown superenhancer of Hoxa locus, as well as larger and more detailed gene-regulatory networks, driven by transcription factors including PU.1 and IRF8, whose importance we demonstrate through perturbation of network members.
© 2021. The Author(s), under exclusive licence to Springer Nature America, Inc.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34556857      PMCID: PMC7611829          DOI: 10.1038/s41588-021-00925-9

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Introduction

The functional specification of tissues in metazoans occurs through the generation of cell-specific proteomes, in turn, controlled by diverse transcriptional programs[1,2]. These transcriptional programs are tightly spatiotemporally regulated through the utilization of tissue-specific cis-regulatory elements such as enhancers, that are licensed and brought into direct communication with their cognate promoters through the functions of transcription factors (TFs), chromatin regulators (CRs) and genome structural proteins[3,4]. A classical model of this complexity is the hematopoietic system, where multiple mature effector cells derive from a single cell type; the hematopoietic stem cell (HSC)[5]. Hematopoiesis, the dynamic and reactive process of blood production is exquisitely regulated by a complicated interplay between multiple pioneer, lineage-specific and signal-dependent TFs, CRs and genome structural proteins[6-8]. These players are known to remodel the epigenetic landscape and reshape genome topology to allow communication between regulatory elements (e.g. promoter-enhancer). In malignancies such as acute myeloid leukemia (AML), recurrent mutations in TFs, CRs and genome structural proteins, including components of the Cohesin complex and CTCF have been observed at high frequencies[9-11]. AML is the most common acute leukemia in adults, serves as a paradigm for aberrant hematopoietic differentiation and is an aggressive disease with a dismal overall survival rate of < 30%[12]. However, although aberrant transcription is a cardinal feature of AML[13], not all cases carry mutations in the above classes of proteins, suggesting that the effects of other common mutations, such as signaling alterations, indirectly converge on the same epigenetic, transcriptional and genome regulatory machinery, although the mechanistic detail of this remains obscure. This study addresses these fundamental questions of how AML mutations, even those that do not directly control gene expression, co-opt the transcriptional and epigenetic machinery to alter chromatin states, 3D DNA topology and communication between enhancers and promoters to generate leukemia-specific transcriptional programs. To do so we have experimentally “deconstructed” AML utilizing an allelic series of mice that model different “transition states” during AML induction; normal, pre-malignant and overt leukemia. In addition, analysis of the single mutant (SM) pre-malignant mice also allows deconvolution of the contribution of individual mutations to altered epigenetic regulation. These studies allow us to elucidate the interplay between epigenetic and 3D genomic states and the generation of leukemia-specific transcriptional programs.

Results

Murine models show transcriptional synergy of AML mutations

FLT3-ITD and NPM1c mutations are the most common in AML, occurring individually in ~25-30% and co-occuring in ~15% of all cases[9-11]. Mouse models carrying knock-in mutations of Npm1 (conditional Npm1 flox-cA/+;Mx1-Cre, hereafter referred to as “Npm1c”) or Flt3 (constitutive Flt3 ITD/+, hereafter “Flt3-ITD”) and referred to as single mutant (SM) mice, are associated with individual subtle, non-fatal but obvious pre-malignant phenotypes[14,15]. However, when combined (Npm1 flox-cA/+;Mx1-Cre;Flt3 ITD/+, “Npm1c/Flt3-ITD” or double mutant, DM) mice develop an aggressive AML with short latency[16]. To prospectively assess dynamic remodeling of the cis-regulatory landscape and 3D genome topology during leukemia development, we utilized an allelic series of wild-type (WT), Npm1c, Flt3-ITD and DM mice to model AML induction (Figure 1a, upper panel), analyzing the same population enriched for hematopoietic stem and progenitor cells (HSPC, negative for lineage markers, Lin-) from WT and mutant mice. Assessing gene expression (RNA-seq), multiple chromatin activation states by chromatin immunoprecipitation and sequencing (ChIP-seq), chromatin accessibility by assay for transposon accessible chromatin (ATAC-seq)[17] and promoter-anchored 3D chromatin interaction by promoter capture HiC (pCHiC)[18] Figure 1a, lower panel) across our allelic series, we derived a high-quality dataset with strong reproducibility between sample replicates (Supplementary Figure 1).
Figure 1

Murine models show transcriptional synergy of AML mutations.

a, Schematic of overall experimental design. b, Principal component analysis (PCA) of mRNA expression data of 16,771 protein-coding genes. c, Volcano plots showing differential expression between mutant and WT HSPC. The p values were attained by the Wald test from DEseq2 and were two-tailed and corrected for multiple testing using the Benjamini and Hochberg method (adjP). Up- or downregulation were defined by setting adjP < 0.05 and absolute fold change (FC) ≥ 1.5. d, Immune response related Hallmark gene sets from GSEA analysis of differential gene expression in Flt3-ITD or DM HSPC. e, GSEA enrichment plots showing gene set of TNFA signalling via NFKB in Flt3-ITD or DM HSPC. NES, normalised enrichment score. f, Overlap of upregulated genes in each mutant. Numbers of genes in each segment and representative genes are indicated. Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown. g, Heatmap of normalised mRNA expression (Z-score) of representative genes in WT and mutant HSPC with replicate samples. h, Overlapping analysis between Npm1c/Flt3-ITD up- or downregulated genes (with a 2-fold change) in human AML and mouse DM leukemia. Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown.

We first analysed differential gene expression between WT, individual pre-malignant SM and leukemic DM stages. Principal Component Analysis (PCA) and pairwise comparisons demonstrated that the two single mutations induced only very modest changes of global gene expression in isolation (Figure 1b,c, Extended Data Figure 1a and Supplementary Table 1). In stark contrast, when combined, Npm1c and Flt3-ITD strongly synergised to induce marked differential gene expression. Utilizing gene set enrichment analysis (GSEA), Flt3-ITD HSPC displayed signatures related to immune activation that were shared with DM Leukemic HSPC (Figure 1d,e and Extended Data Figure 1b). Conversely, genes downregulated in either Npm1c or DM HSPC were significantly enriched for leukemia-associated repressive gene signatures (Extended Data Figure 1c-e). Of note, many of the genes altered in the single mutant HSPC were also dysregulated in the same direction upon leukemia development (Figure 1f and Supplementary Figure 2a). Moreover, their expression often demonstrated an intermediate stepwise change from the single to double mutant stage, suggesting mutational synergy (Figure 1g and Supplementary Figure 2b). Examples of genes upregulated in Npm1c and DM cells included Hoxa genes (e.g. Hoxa9, Hoxa10) (Figure 1f,g). A small number of genes upregulated by both single mutations included exemplars of the immune activation programme including the AP-1 complex members Jun, Fos and Fosb, whose expression increased still further in DM HSPC. However, some genes such as Spi1, encoding the ETS-family TF Pu.1, Setbp1, and immune activation-associated Interferon regulatory factors (e.g. Irf4, Irf8) were upregulated only in DM HSPC (Figure 1f,g). Comparably, Gata1, Gata2 and tumour suppressor genes Gfi1 and Gfi1b were significantly downregulated only in DM HSPC (Figure 1g and Supplementary Figure 2a). Importantly, comparing genes differentially expressed in NPM1c/FLT3-ITD mutated human AML[19] with our differentially expressed genes between DM and WT HSPC, we could demonstrate a highly significant overlap with 27% up- (119/441, p=1.28e-07) and 15% (74/503, p=1.27e-07) of downregulated human genes also evident in our dataset Figure 1h). Taken together, these data demonstrate marked mutational synergy not evident when only single mutations are present.
Extended Data Fig. 1

Transcriptional changes across WT and mutant HSPC.

a, Heatmap showing unsupervised clustering analysis of global gene expression in WT and mutant HSPC. b, Top 10 positively enriched gene ontology (GO) gene sets revealed by GSEA for each mutant (vs WT). c, Commonly enriched gene sets from GSEA analysis of differential gene expression by Npm1c and DM. d, GSEA enrichment plots showing gene set of Kong_E2F3_Targets in Npm1c or DM vs WT by GSEA analysis. e, Gene sets positively or negatively enriched in all three mutants vs WT.

Flt3-ITD and Npm1c mutations alter chromatin accessibility

Transcriptional activation is facilitated by the recruitment of specific TFs leading to nucleosome depletion at cis-regulatory elements such as enhancers and promoters[20]. Using ATAC-seq, we therefore first assessed the dynamics of chromatin accessibility across our allelic series. In contrast to the minimal effects on gene expression conferred by individual mutations, both single mutations demonstrated marked alterations in accessibility, though to a less extent than in combination (Figure 2a,b and Supplementary Table 2). Compared with gene expression, more alterations in accessibility were preserved between the Flt3-ITD and DM HSPC (Figure 2c). In comparison to the degree of increased or decreased chromatin accessibility elicited by DM at differentially accessible regions, changes were intermediate in Flt3-ITD HSPC (Figure 2d). In contrast, less overlap was seen for the changes of accessibility evident between Npm1c and DM HSPC (Figure 2c and Extended Data Figure 2a). However, within this overlap were four regions located in the Hoxa locus (Figure 2e), linking accessibility with an increase in gene expression in both Npm1c and DM HSPC. Regions that became sequentially less accessible in Npm1c and DM HSPC included the Gata2 promoter, as well as its distal and proximal enhancers[21,22] (Extended Data Figure 2b). Interestingly, accessibility changes in single mutant HSPC did not significantly alter gene expression, but did correlate with gene expression upon leukemia induction (Extended Data Figure 2c).
Figure 2

Both FLT3-ITD and Npm1c alter chromatin accessibility.

a, PCA of ATAC-seq signals across all four cellular states. Counts per million reads were used for the analysis. b, MA plots showing ATAC-seq peaks with significantly differential accessibility in mutant vs WT HSPC. Significant increase or decrease were determined by edgeR with setting FDR (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. c, Venn diagram of ATAC-seq peaks with accessibility gain or loss induced by each mutant. Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown. d, Profile plots of ATAC-seq signals at regions demonstrating gain or loss of accessibility induced by DM across four cellular states. e, ATAC-seq tracks showing chromatin accessibility at the Hoxa cluster in all four HSPC and wildtype neutrophils (NE). Regions showing increased accessibility by Npmc1 and DM compared with WT were highlighted. f, De novo motifs significantly enriched at genomic regions with gain or loss of accessibility in DM HSPC. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected). g, BaGFoot analysis illustrates TFs with differential footprint depth and accessibility in DM vs WT HSPC. The data points within bag and fence area include 50% and > 97% of the population, respectively, are not significant. Motifs outside the fence and with a p value (two-tailed, not multiple testing corrected) < 0.05 are statistically significant outliers; n.s., not significant.

Extended Data Fig. 2

Global chromatin accessibility across WT and mutant HSPC.

a, Heatmaps and profile plots of ATAC-seq enrichment across WT and mutant HSPC over regions with gain or loss of accessibility in the presence of Npm1c. Peaks were ranked by average enrichment across all samples. b, Chromatin accessibility at the Gata2 genes and its upstream enhancers in all four HSPC and wildtype neutrophils. Regions showing loss of accessibility in Npmc1 and DM HSPC are highlighted. c, Linking differential accessibility at gene promoters to their mRNA expression changes by each mutant. Up- or downregulation were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. d-f, De novo motifs significantly enriched at genomic regions with altered accessibility by Npm1c (d) and Flt3-ITD (e) or open chromatin sites with static accessibility (f) where FDR of differential analysis > 0.2 in any mutant vs WT HSPC. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected). g and h, BaGFoot analysis illustrates TFs with differential footprint depth and accessibility in Npm1c (g) and Flt3-ITD (h) vs WT HSPC. Motifs outside the fence and with a p value (two-tailed, not multiple testing corrected) < 0.05 are statistically significant outliers; n.s., not significant.

Performing de novo motif analysis, we identified binding sites of TFs whose function may be modulated by these accessibility changes, and demonstrated enrichment for the binding sites of a number of TFs at regions gained, lost or static (Figure 2f and Extended Data Figure 2d-f). Further Bivariate Genomic Footprinting (BaGFoot) analysis[23] simultaneously assessed differential footprint depth of TFs and altered accessibility flanking their motifs between conditions (Figure 2g and Extended Data Figure 2g,h). This confirmed that GATA factors demonstrated decreased accessibility and reduced footprint depth suggesting a loss of binding in DM HSPC (Figure 2g). Conversely, increased accessibility and footprint depth was noted for Cebpa, Cebpd and for the AP-1 complex members Atf2 and Atf7. Of interest, increased accessibility but no increase in footprinting was observed for PU.1 motifs in DM cells, suggesting a potential role for prior-bound PU.1 in determining this accessibility upon leukemia development.

Only Flt3-ITD remodels the histone modification landscape

We next assessed chromatin modifications associated with regulatory activity (mono- or tri-methylation of Histone 3 lysine 4, H3K4me1 or H3K4me3 and acetylation of Histone 3 lysine 27, H3K27ac) at cis-regulatory elements in WT and mutant HSPC using ChIP-seq. Enhancers were considered cis-regulatory modules usually flanked by high levels of H3K4me1 and no or low levels of H3K4me3 (n=98,365, Extended Data Figure 3a and Supplementary Table 3) and were designated “primed” when lacking H3K27ac or “active” if marked with H3K27ac[24]. To further enrich for cis-regulatory elements we overlapped this compendium with our ATAC-seq analysis to identify “accessible” enhancers (n=48,052, Extended Data Figure 3b and Supplementary Table 3). This cis-regulatory element repertoire was highly dynamic across the cellular states; ~51% demonstrated differential H3K4me1 between mutant and WT HSPC (Extended Data Figure 3c). Of note, a large number of dynamic alterations of H3K4me1 were observed for both Flt3-ITD and DM HSPC, with an increased number in the DM leukemic state (Figure 3a,b and Supplementary Table 4) but a marked overlap between the two (Extended Data Figure 3d). Strikingly, and by marked contrast, Npm1c HSPC were virtually indistinguishable from WT in H3K4me1 patterns (Figure 3a,b). Moreover, among all DM dynamic elements, intermediate changes of H3K4me1 were observed in Flt3-ITD but not Npm1c HSPC Figure 3c and Extended Data Figure 3e).
Extended Data Fig. 3

Chromatin modifications at accessible enhancers across WT and mutant HSPC.

a, Distribution of all H3K4me3 peaks based on read counts across all four cell types. Active promoters were marked by high H3K4me3 with log2 CPM > 4. b, Genome distribution of ATAC-seq consensus peaks. c, Number of total accessible enhancer peaks defined in all four HSPC samples and percentage of dynamic enhancers which were defined by differential H3K4me1 in the presence of any mutations. d, Numbers of overlapping accessible enhancers with gain or loss of H3K4me1 in the presence of Flt3-ITD or DM. Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown. e, Heatmaps of H3K4me1 enrichment over gained or lost enhancers during DM leukemia induction. Peaks were ranked by average enrichment across all samples. f, Percentage of accessible enhancers marked by both H3K4me1 and H3K27ac in total accessible enhancers across all four HSPC states. g, Percentage of enhancers showing dynamic H3K27ac modification in the presence of single or double mutations. h. Heatmaps and profile plots of H3K27ac enrichment over enhancers showing gain or loss of H3K27ac during DM leukemia induction. Peaks were ranked by average enrichment across all samples.

Figure 3

Flt3-ITD but not Npm1c remodels the enhancer landscape.

a, Correlation matrix of H3K4me1 reads at accessible enhancers in mutant vs WT HSPC. Counts per million (CPM) reads were log2 transformed and Pearson correlation coefficients are shown. b and d, MA plots showing accessible enhancers with significantly differential H3K4me1 (b) or H3K27ac (d) levels in mutant vs WT HSPC. Significant increase or decrease were determined by edgeR with setting FDR (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. c, Profile plots of H3K4me1 enrichment scores at accessible enhancers (1kb ± ATAC-seq peak summit) with gain (upper panel) or loss (lower panel) of H3K4me1 by DM across four cellular states. e, Chromatin profiles of Socs2 gene and its proximal region with enhancer marks, accessibility and RNA-seq reads in all four HSPC and wildtype neutrophils (NE). Regions showing intermediate increase of enhancer marks by Flt3-ITD were highlighted. f, Heatmaps of the H3K4me1, H3K27ac and ATAC-seq profiles at DM gained and lost enhancers across WT and DM HSPC, wildtype neutrophils (NE) and granulocyte/macrophage progenitors (GMP). Group Gain-2 demonstrates “leukemia-specific” changes. Peaks were ranked by average enrichment across all samples. g, De novo motifs significantly enriched at individual enhancer groups. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected).

We next assessed activation of these elements by overlaying the H3K27ac changes on the H3K4me1 and accessibility landscape. H3K27ac alterations were relatively modest (Extended Data Figure 3f,g). Flt3-ITD alone and DM demonstrated H3K27ac changes at hundreds of enhancer regions, whereas Npm1c did not significantly alter global H3K27ac signal (Figure 3d and Supplementary Table 4). Once again, the effects of Flt3-ITD alone were often intermediate when compared with DM, as exemplified at the Socs2 locus (Figure 3e and Extended Data Figure 3h). Although no significant alterations in the cellular composition of the broader Lin- HSPC compartment are evident between WT and the SM mice, this compartment is larger in DM mice and demonstrates a relative increase in LSK (Lin-Sca1+cKit+, enriched for HSC) and particularly the granulocyte/macrophage progenitors (GMP) compartments, as well as a modest reduction in the common myeloid progenitors (CMP) and megakaryocyte/erythroid progenitors (MEP) compartments (Extended Data Figure 4a,b). Therefore, to address whether altered cis-regulatory elements were truly leukemia-specific or were associated with other hematopoietic states, such as normal myeloid differentiation, we replicated our integrated genomic analysis in normal neutrophils and included similar available GMP maps[25] for the comparison. For DM gained enhancers, 31% (3,637/11,892) overlapped with similar changes in neutrophils and GMP (Figure 3f group Gain-1). However, the remaining 69% appeared leukemia-specific (group Gain-2, Supplementary Table 5). In contrast, of DM lost enhancers, the majority showed similar changes in both neutrophils and GMP (83%, 10,362/12,509, group Loss-2), while ~17%, although distinct from neutrophils, were highly similar to GMP (group Loss-1). Exemplar leukemia specific regulatory elements are shown for the Socs2 gene (Figure 3e). Of note, subtle specific enrichment for motifs were observed for leukemia-specific elements, in particular, the presence of an ETS-IRF composite element for PU.1/IRF in the leukemia-specific enhancer regions Figure 3g).
Extended Data Fig. 4

Immunophenotypic characterization of HSPCs isolated from bone marrow of wildtype and mutant mice.

a, Representative flow cytometry plots showing proportions of hematopoietic stem and progenitors within bone marrow compartment. Lin-, lineage negative; HPC, hematopoietic progenitors (Lin-Sca1-cKit+) containing all myeloid progenitors; LSK, Lin-Sca1+cKit+; CMP, common myeloid progenitors; GMP, granulocyte/macrophage progenitors; MEP, megakaryocyte/erythroid progenitors. b, Comparison of proportions of hematopoietic stem and progenitors in their parental populations in mutant mice (Npm1 flox-cA/+;Mx1-Cre, n=9; Flt3 ITD/+, n=8; Npm1 flox-cA/+;Mx1-Cre;Flt3 ITD/+, n=9) compared with wildtype mice (WT, n=7). Student’s unpaired t-tests (two-sided) were performed for the comparisons; data are presented as mean values +/- standard deviation; Only the p values < 0.05 were shown.

Leukemia programs utilize novel and existing 3D contacts

To capture genome-scale alterations in 3D DNA topology, we employed pCHiC[18] to generate a compendium of promoter-associated DNA interactions and demonstrate how they differed between WT and mutant HSPC. A total of 88,624 high-confidence interactions were captured across our HSPC series (Figure 4a, Extended Data Figure 5a-c and Supplementary Table 6).
Figure 4

Leukemia programs utilize novel and existing 3D contacts.

a, PCA of CHiCAGO scores of high-confidence interactions across all four HSPC states. High-confidence interactions were defined as significant interactions (CHiCAGO score ≥ 5) present in both replicates. b, Pan-chromosome correlation of chromatin compartments between mutant and WT HSPC. Dots represent individual compartment and the correlation of their PC1 values. c, Numbers of flipped chromatin compartments upon mutations. d, Illustration of chromatin compartments A/B levels at a DM-induced “B to A” flipped region (highlighted) containing Setbp1. e, Differential total interaction reads at individual promoters between mutant and WT HSPC. The p values were two-tailed and corrected for multiple testing (adjP). Increase or decrease were determined by setting adjP < 0.05 and absolute FC > 1. f, Scatter plot showing correlation between promoters with altered interactions induced by either Flt3-ITD (y axis) or DM (x axis). Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown. g, Proportion of rewired and hard-wired interactions among all high-confidence interactions across four HSPC states. h, Number of gained or lost interactions in the presence of mutations (compared to WT HSPC). i and j, Promoter-contact plots showing the read counts of promoter bait to target pairs for Gfi1b (i) and Irf8 (j). Dots represent chromatin interaction fragments at defined distances from the bait; Grey lines, expected read counts; dashed lines, the upper bound of the 95 % confidence intervals. Regions highlighted yellow show loss (i) or gain (j) of interactions in DM vs WT HSPC.

Extended Data Fig. 5

3D chromatin interaction profiles across WT and mutant HSPC.

a, Numbers of total and high-confidence pCHiC interactions in all four HSPC states. High-confidence interactions were defined as significant interactions (with CHiCAGO score ≥ 5) overlapping in both replicates of each cell type. b, Numbers of pCHiC interactions captured by individual promoters. c, Distance of interacting regions from their target promoters. Median distance was shown. d, Illustration of chromatin compartments A/B levels at a DM-induced “B to A” flipped region (highlighted) containing the Igf1 oncogene. e. Illustration of chromatin compartments A/B levels at a DM-induced “B to A” flipped region (highlighted) containing an Ifi (interferon inducible) gene cluster (in red). f and g, Significant interactions, represented by arcs, associated with the Gfi1b (f) or Irf8 (g) promoters in different cell types.

pCHiC data can subdivide the genome into A (active) and B (inactive) compartments using PCA[26,27]. Only the combination of mutations altered the compartment structure, with several hundred compartments changing their assignment (Figure 4b,c and Supplementary Table 7). A total of 290 regions altered their compartment assignment from “inactive” B to “active” A (Figure 4c). These regions contained 345 expressed genes, including known oncogenes such as Setbp1, Igf1 and a contiguous region involving an interferon-inducible Ifi gene cluster (Figure 4d and Extended Data Figure 5d,e). Alteration from an active to an inactive compartment, A to B, occurred at 274 regions (Figure 4c) and covered 453 genes including the tumour suppressor gene Dach1 and the Fancf gene that is silenced by methylation in AML[28]. Assessing total interactions at individual promoters, only a few promoters were altered for Npm1c HSPC in comparison to WT. In contrast, in the presence of the Flt3-ITD, several hundred promoters showed lost- or gained-interactions, with this number roughly doubling in DM HSPC (Figure 4e). Moreover, nearly half of those promoters altered by Flt3-ITD overlapped with the change in DM HSPC (Figure 4f). At the level of individual interactions, of note, only 11% of all high-confidence interactions were significantly differential (gained or lost) in SM or DM HSPC, based on stringent alterations of the ranking of CHiCAGO scores[29,30] (Figure 4g), with more interactions lost than newly formed in the mutant states (Figure 4h and Supplementary Table 8). We named these “rewired” interactions, and these interactions could be either lost or gained (e.g. Gfi1b or Irf8, respectively; Figure 4i,j and Extended Data Figure 5f,g). In contrast, the remainder of interactions were already pre-set in WT HSPC and were termed “hard-wired” DNA contacts Figure 4g).

Integrated analysis identifies critical regulatory networks

We first applied a multi-layered approach to integrate the multi-omics chromatin analysis[31] at all cis-regulatory regions across wildtype and mutant HSPC. Briefly, focusing on 2 kilobase (kb) bins of accessible regions (±1kb from ATAC-seq peak summit, n=75,457), normalised reads for chromatin modifications and accessibility of each HSPC sample were integrated. Treating these bins as a separate data point across all 16 conditions (4 analyses in 4 cellular states), we were then able to use dimensionality reduction tools developed for scRNA-Seq (Seurat[32]) to classify and visualise clusters of regions showing similar patterns (Figure 5a). This classified 10 individual clusters Figure 5a,b). Cis-regulatory regions with a loss of enhancer signatures during leukemia induction, characterised by concurrent loss of H3K4me1 and accessibility, with or without evident loss of H3K27ac (Cluster-9 or -10, respectively; Figure 5b,c, Extended Data Figure 6a and Supplementary Table 9). Conversely, cis-regulatory regions were also identified with variable gains of enhancer marks and accessibility, which were separated by marked gain of H3K27ac (Cluster-6) and accessibility (Cluster-7), as well as pre-established H3K4me1 in WT (Cluster-8). Regions with patterns reflecting a loss of marks associated with enhancer function (Clusters 9, 10) were enriched for GATA and KLF factors (Figure 5d and Extended Data Figure 6b). For regions that gained marks associated with enhancer function (Clusters 6-8), enrichment for PU.1, CEBP, RUNX and AP-1 motifs were seen.
Figure 5

Integrated analyses identifies critical regulatory networks.

a, tSNE plot showing Seurat-guided clustering of all accessible regions (±1kb from ATAC-seq peak summit) based on chromatin profiles (H3K4me1, H3K4me3 and H3K27ac, normalised read counts) and accessibility (normalised ATAC-seq reads) across WT and mutant HSPC. b, Heatmap presenting levels of chromatin modifications and accessibility at chromatin regions from the 10 clusters. c, Profile plots of chromatin marks and accessibility at accessible regions in Cluster-6 and -10 regions across all four cellular states. d, De novo motifs significantly enriched at Cluster-6 and -10 chromatin regions. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected). e, Gene ontology (GO) analysis of genes that are linked to the chromatin regions in Cluster-6 by pCHiC profiles and upregulated during leukemia induction. f, Volcano plot showing differential expression of Cluster-6 linked genes in DM vs WT HSPC. Up- or downregulation were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. g, Overlap of genes linked to chromatin regions with increased accessibility in NPM1c/FLT3-ITD leukemia in mouse (Cluster-6 genes annotated by pCHiC profiles) and human[19]. Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown. h and i, GO analysis (h) and differential expression (i) of Cluster-6 genes with human overlap. Up- or downregulation were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5.

Extended Data Fig. 6

Integrative analysis with Seurat-guided clustering of chromatin profiles across WT and mutant HSPC.

a, Profile plots of chromatin marks and accessibility at accessible regions in each of the 10 clusters (except for Cluster-6 and -10). b, De novo motifs significantly enriched at Clusters 7-9 chromatin regions. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected). c, GO analysis of genes that are linked to the chromatin regions in Cluster-10 using pCHiC profiles and downregulated during leukemia induction. d, Volcano plot showing differential expression of Cluster-10 linked genes in DM vs WT HSPC. Up- or downregulation were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. e, Percentage of Cluster-6 genes with human overlap showing their expression in DM vs WT HSPC. Differential expression was defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5.

The chromatin regions with gain or loss of enhancer signatures (Cluster-6 or -10) were further linked to their linearly localised genes or with spatially-contacted genes using chromatin interaction profiles. Genes upregulated (n=978) in Cluster-6 demonstrated significant enrichment for immune process (Figure 5e). These included multiple genes of interest for leukemogenesis, including Jun, Fos, Hoxa10 and Irf8 (Figure 5f and Supplementary Table 10). By contrast, genes downregulated in Cluster-10 were enriched for DNA replication and contained hematopoietic TFs including Gata1, Gata2, Gfi1b and Myb (Extended Data Figure 6c,d). To further demonstrate relevance for human disease, we used a publicly available dataset of upregulated genes linked to cis-regulatory elements with increased accessibility in NPM1c/FLT3-ITD mutated human AML[19]. We could demonstrate that over a quarter of the human genes (275/1031, 26%, p=2.58e-06) were present within our Cluster-6 genes, demonstrated enriched signatures for immune regulatory processes and the majority showing increased gene expression in our system (Figure 5g-i and Extended Data Figure 6e). We next investigated the genome-wide relationship between chromatin states, chromatin interactions and gene expression programs. As expected, alterations of enhancer-associated marks correlated globally with gene expression upon leukemia induction, albeit modestly (Extended Data Figure 7a). Chromatin interaction frequencies reflected by the flipping of genome compartments generally correlated with gene expression (Figure 6a and Extended Data Figure 7b). Integrating chromatin modifications with differential interactions, we demonstrated a strong correlation of H3K4me1 alteration with “rewired” interactions (Figure 6b and Extended Data Figure 7c,d). Similarly, flipping of genome compartments positively associated with alteration of enhancer marks including H3K4me1 and H3K27ac, as well as chromatin accessibility (Figure 6c).
Extended Data Fig. 7

Correlating enhancer alterations with differential expression of target genes or DNA topology changes.

a, Percentage of dynamic enhancers interacting with gene promoters and the percent of up- or downregulation of the genes they linked to. Dynamic enhancers were marked by either differential H3K4me1 only, or together with differential H3K27ac and accessibility. b, Percentage of genes involving flipped chromatin compartments and expressed in DM vs WT HSPC. Differential expression was defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. c, Percentage of enhancers with dynamic interactions showing their H3K4me1 read counts in DM vs WT HSPC. Differential H3K4m1 levels were defined by setting FDR (two-tailed and multiple testing corrected) < 0.05 and FC ≥ 1.5. d, Example genomic region demonstrating correlation of H3K4me1 changes with “rewired” interactions.

Figure 6

Integrated analyses identifies critical transcriptional networks nodes.

a, Differential expression of genes involved in flipped chromatin compartments in DM vs WT HSPC. Up- or downregulation were determined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. b, Comparison of H3K4me1 alterations between WT and DM states at enhancers that gained (upper graph) or lost (lower graph) interactions. Increase or decrease were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. c, Enrichment of chromatin marks and accessibility involved in flipped chromatin compartments in DM vs WT HSPC. d-f, Combined profiles of chromatin states, accessibility at enhancers, target mRNA expression, as well as 3D interactions between enhancers and target promoters for Spi1 (d), Irf8 (e) and Hoxa cluster (f) in all four HSPC samples and wildtype neutrophils. Significant chromatin interactions were defined by CHiCAGO score ≥ 5 and are represented by arcs.

Our analysis also identified 801 super-enhancers (SE) across WT and mutant HSPC (Extended Data Figure 8a and Supplementary Table 11). Patterns of loss and gain were similar to standard enhancers when linked to mutations (Extended Data Figure 8b,c and Supplementary Table 11). However, for Npm1c one of the three gained SE demonstrated a long-range contact to the Hoxa locus and was associated with upregulation of most Hoxa genes (Extended Data Figure 8d). Linking altered SE to their regulated genes using DNA interaction profiles revealed a strong correlation to expression during DM leukemia induction (Extended Data Figure 8d). Genes upregulated in DM HSPC were again found to be enriched for inflammatory response, as well as myeloid differentiation, while downregulated genes were enriched for erythroid differentiation (Extended Data Figure 8e).
Extended Data Fig. 8

Differential Super-enhancer usage across WT and mutant HSPC.

a, Definition of super-enhancers (SEs) by ranking H3K27ac peaks that were overlapped in both replicates of each cell type based on normalised H3K27 counts. The top-ranked 801 regions were considered as SEs across four HSPC. b, Number of super-enhancers (SE) with increase (UP) or decrease (DOWN) in H3K27ac modification in mutant vs WT HSPC. c, Heatmaps and profile plots of H3K27ac enrichment in WT and mutant HSPC over SEs showing gain or loss of H3K27ac during DM leukemia induction. d, Linking differential activity of SEs to altered mRNA expression of their target genes (determined by pCHiC interactions) in mutant vs WT HSPC. Upregulated genes connecting to SEs with H3K27ac gain or downregulated genes connecting to SEs with H3K27ac loss were determined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and FC ≥ 1.5 for expression, or FDR (two-tailed and multiple testing corrected) < 0.05 and FC ≥ 1.5 for H3K27ac. Several Hoxa genes were indicated in Npm1c vs WT. e, Significantly enriched GO terms for genes linked to gained SEs and upregulated in DM vs WT (upper panel) or for genes linked to lost SEs and downregulated (lower panel). f and g, Promoter-contact plots showing the read counts of promoter bait to target pairs (bait-”other end”) for Spi1 (f) and Hoxa9/Hoxa10 (g). Regions highlighted yellow show rewired interactions.

Leukemic regulation of exemplar reprogrammed loci

Our integrated analysis suggested Spi1/Pu.1 as an important network regulator in Flt3-ITD/Npm1c leukemia (Figure 6d and Extended Data Figure 8f). All WT and single mutant HSPC demonstrated H3K4me1 priming and accessibility at the well described Spi1 upstream regulatory element (URE)[33-36]. However, H3K27ac increased only in DM HSPC, where it associated with a “rewiring” of the URE to communicate with the Spi1 promoter and upregulate its expression. Irf8 was similarly identified as a likely network player. A slightly different chromatin pattern was observed to be associated with “rewiring” and increased gene expression at the DM stage, with increased H3K4me1 and accessibility observed in both Flt3-ITD and DM HSPC, but no alteration of H3K27ac until the DM stage (Figure 6e and 4j). Definitively linking promoters to cis-regulatory regions using pCHiC also allowed us to identify novel long-range regulatory interactions. An exemplar of this was seen for the Hoxa locus, where “hard-wired” interactions with an H3K4me1 modified region ~1MB upstream (Figure 6f and Extended Data Figure 8g) could be demonstrated for all WT and mutant HSPC. However, as previously described, this same region, which we have named the Hoxa-long range super-enhancer (Hoxa-LRSE), demonstrated a marked increase in H3K27ac modification in both Npm1c and in DM HSPC, where it appears to collaborate with the increased accessibility across the promoters of Hoxa genes (Supplementary Figure 3a). In addition, and as a direct link to human AML, this region is syntenic with a homologous region on human chromosome 7 where the same interaction can be demonstrated (Supplementary Figure 3b). Using published data (including ChIP-seq on H3K4me1[6] and CTCF[37], ATAC-seq[38], together with pCHiC[18]) of human CD34+ HSPC, we could demonstrate that the interaction was also present across species, and ChIP-seq data from ourselves[39] and others[40,41] indicated that this region also contained a SE in human AML cells that overexpress HOXA genes (OCI-AML3 cell line) but not in cells that do not (Kasumi-1 cell line) (Supplementary Figure 3b). The human HOXA-LRSE was also demonstrated to have an open chromatin confirmation in NPM1 mutant AML patients by DNase hypersensitivity analysis[19], further demonstrating cross-species conservation of this long-range element.

Network perturbation abrogates leukemia maintenance

Utilizing our integrated analysis to identify exemplar putative network nodes, we focussed on the AP-1 complex members c-Fos and c-Jun, the TFs Spi1/Pu.1, Irf8, the oncogene Igf1, the Hoxa genes and the regulatory loci of Spi1 and the Hoxa cluster, validating their critical role in the maintenance of Flt3-ITD/Npm1c AML using RNAi and CRISPR-editing. As proof of this principle, knockdown of c-Jun and c-Fos significantly decreased clonogenic capacity in murine DM cells in vitro (Extended Data Figure 9a,b). Depletion of Spi1 could abrogate leukemia cell growth and clonogenicity (Extended Data Figure 9c-e). To validate Spi1 regulatory elements as critical for leukemia maintenance, CRISPR-Cas9 mediated genetic excision of the Spi1-URE was achieved using dual guide RNAs to target its 5kb central region in DM cells carrying Cas9[42] Figure 7a and Extended Data Figure 9f,g). Removal of the URE resulted in ~30% reduction of Spi1 expression in bulk cultures (Figure 7b), significantly decreased cell growth and clonogenicity (Figure 7c,d and Extended Data Figure 9h).
Extended Data Fig. 9

Perturbation of critical cis-regulatory hubs and their target genes abrogates leukemia maintenance in mouse DM leukemia.

a, mRNA expression of AP-1 components (Jun, Fos) detected by RT-qPCR in DM cells expressing shRNAs targeting Jun or Fos relative to control shRNA (n=3 independent experiments). b, CFU assays of DM cells expressing shRNAs targeting Jun or Fos or control gRNAs (n=3 independent experiments). c, mRNA expression of Hoxa9, Hoxa10, and Spi1 in DM cells expressing shRNAs targeting them specifically relative to control shRNA (n=3 independent experiments). d and e, CFU assays (d) and ex vivo cell proliferation (e) of DM cells expressing shRNAs targeting Hoxa9, Hoxa10, Spi1 or control shRNA (n=3 independent experiments). f and i, Gel electrophoresis of PCR products on genomic DNA of DM-Cas9 cells expressing control gRNAs, Spi1-URE (f) or Hoxa-LRSE (i) gRNAs. PCR experiment was performed once on three independent gRNA transductions (marked with #). g and j, Sanger sequencing to confirm the deletion of Spi1-URE (g) or Hoxa-LRSE (j) in DM-Cas9 cells expressing Spi1-URE gRNAs. h and k, Ex vivo cell growth of DM-Cas9 cells expressing Spi1-URE gRNAs (h) or Hoxa-LRSE gRNAs (k) in comparison to cells expressing control gRNAs (n=3 independent experiments). Statistical analyses in a-d, h and k were performed by running Student’s unpaired t-tests and p values (two-sided, not multiple testing corrected) were shown; error bars represent mean and standard error of mean.

Figure 7

Network perturbation abrogates leukemia maintenance.

a, Experimental strategy targeting Spi1-URE for deletion using CRISPR-Cas9 plus dual guide RNAs (gRNA) flanking its ~5kb central region. Primer 1 and 2 are oligos for PCR confirmation of deletion. b, Spi1 mRNA expression detected by RT-qPCR in DM-Cas9 cells expressing Spi1-URE gRNAs relative to control gRNAs (n=3 independent experiments). c and d, Ex vivo proliferation (c) and colony-forming unit (CFU) assays (d) of DM-Cas9 cells expressing Spi1-URE gRNAs relative to control gRNAs (n=3 independent experiments). e, Experimental strategy targeting Hoxa-LRSE for deletion using CRISPR-Cas9 plus dual gRNAs flanking its ~2.4kb central region. f, mRNA expression of multiple Hoxa genes detected in DM-Cas9 cells expressing Hoxa-LRSE gRNAs relative to control gRNAs (n=3 independent experiments). g, Ex vivo proliferation of DM-Cas9 cells expressing Hoxa-LRSE gRNAs relative to control gRNAs (n=3 independent experiments). h, CFU assays of DM-Cas9 cells expressing Hoxa-LRSE gRNAs or control gRNAs (n=3 independent experiments). # indicates three independent single cell derived clones. i-k, Cell proliferation of OCI-AML3 cells expressing human gRNAs targeting SPI1 (i), IRF8 (j) and IGF1 (k) relative to control gRNAs in liquid culture (n=3, 4, 4 independent experiments, respectively). # indicates two independent gRNAs. l, CFU assays of OCI-AML3 cells expressing human gRNAs targeting SPI1, IRF8 and IGF1 relative to control gRNAs in methylcellulose culture (n=3 independent experiments). Statistical analyses in b-d and f-l were performed by running Student’s unpaired t-tests and p values (two-sided, not multiple testing corrected) were shown; error bars represent mean and standard error of mean.

The requirement of sustained expression of Hoxa9 or Hoxa10 for the maintenance of leukemia cell growth and clonogenicity was also demonstrated (Extended Data Figure 9c-e). Furthermore, we used a similar experimental strategy to examine the role of the Hoxa-LRSE by removing its ~2.4kb central region in DM-Cas9 cells (Figure 7e and Extended Data Figure 9i,j). Genetic disruption of Hoxa-LRSE resulted in significantly reduced expression of all Hoxa genes tested (Hoxa3, 5, 6, 7, 9, 10, Figure 7f). In addition, excision of the Hoxa-LRSE also induced a significant decline in cell proliferation and reduced colony-forming capacity (Figure 7g,h and Extended Data Figure 9k). To further correlate the relevance of these networks to human AML we also validated SPI.1/PU.1 as well as extending the analysis to other targets, including IRF8 and IGF1. These targets were edited via CRISPR-Cas9 in the human HOXA-dependent AML cells lines OCI-AML3 (mutated for NPM1, but also DNMT3A) and MOLM-13 (that carries a FLT3-ITD mutation, but also an MLL-AF9 rearrangement). Importantly, these experiments corroborated our murine findings, with editing decreasing proliferation and clonogenic function (Figure 7i-l and Extended Data Figure a-f). Taken together, perturbation of these exemplar nodes validate our integrated analysis strategy to identify critical network members and their cis-regulatory elements and demonstrated these nodes to be critically required for the maintenance of the leukemia.

Discussion

AML is associated with a relatively uncomplicated genome by comparison with other tumors, with each case harboring only between 3 and 5 mutations on average[11]. Our study provides mechanistic evidence why multiple mutations are necessary for the full malignant phenotype and suggests that significant synergism between them may explain their relatively low number. By deconstructing an experimental model of a common AML subtype, we could show that single mutations lead to only modest changes in the epigenetic landscape that do not translate into significant alterations of gene expression. However, when mutations co-occur their combinatorial effects produce marked synergy at every epigenetic level examined. Moreover, our study prospectively and categorically confirms that common AML mutations that lack direct epigenetic or transcriptional effects can indirectly alter the epigenetic landscape and transcription to generate convergent leukemia-associated transcriptional programs. Overlap between the Flt3-ITD and DM states suggests Flt3-ITD to have a dominant role in malignant remodeling of the epigenome and 3D genome. Moreover, overt gene expression changes seen in DM cells were often preceded by alterations in chromatin modification, usually of H3K4me1, in regulatory elements associated with the same genes in Flt3-ITD HSPC, observations reminiscent of epigenetic changes preceding gene expression in normal hematopoietic differentiation[25]. These stepwise changes may mark the leukemogenic “potential” of these elements to be subsequently modified further by the actions of a collaborating mutation. Mechanistically, GSEA and motif analysis for differential chromatin modifications, particularly H3K4me1, linked Flt3-ITD signaling to upregulated inflammatory response gene programs. Moreover, de novo motif analysis, RNAi and CRISPR editing experiments demonstrated the role of signal-inducible TFs such as AP-1[43,44], interferon response factors (particularly IRF4 and -8)[43] and signal responsive TFs such as PU.1[45] and RUNX1[46] in this regulation. Of note, these findings accord with and extend similar genomic analysis in patient samples from FLT3-ITD mutated AML[19,47]. By contrast, the major alteration that occurred in HSPC expressing Npm1c was an alteration of chromatin accessibility, with an obvious exemplar the changes at the Hoxa locus demonstrated to drive leukemogenesis. The nature of this alteration in chromatin accessibility is not immediately apparent, however, may relate to the known histone chaperone function of NPM1[48], to its interaction with ATP-dependent chromatin remodeling complexes[20] or to its role as the major protein component of the nucleolus, a structure that regulates genome 3D topology[49,50] and further work is warranted to investigate these possibilities. Our study has prospectively modelled the alterations in 3D genome organization that accompany the induction of hematological malignancy. Of note, the combined mutant state was able to alter gross genome organization at the level of genome compartments, although only 10% of the interactions, those associated with alterations of H3K4me1[51], were dynamically rewired in leukemic HSPC. The majority of interactions remain stable, with the gene expression alterations presumably reflecting alterations of chromatin modification and accessibility at these loci. However, utilization of other cis-regulatory elements, such as those upregulating Socs2 and Irf8 (Figure 3e and 6e), was novel and specific for leukemia. These data demonstrate that malignant transformation is promiscuous in its enhancer usage, as has also been previously suggested[19,47]; it can generate novel DNA-topologies and enhancer states but may also utilize enhancers and DNA topologies associated with other differentiation states in the tissue from which it is derived. Our experimental framework has also allowed us to identify exemplar regulatory network nodes. Of note, when we compare these same factors with our own and other CRISPR screens performed in human AML cell line models[52,53], we see that they constitute similar vulnerabilities across multiple AML genotypes. PU.1 is classically regarded as a tumour suppressor in AML. Mice with a genetically engineered decrease in PU.1 expression develop AML[35,36,54,55]. Furthermore, heterozygous deletions of the locus or rare loss-of-function mutations of PU.1 have been described in patients with AML[9,11,56]. Moreover, oncogenic events abrogate the activity of PU.1[57,58] and therapeutically, the anti-leukemic activity of LSD1 inhibitors has been associated with re-activation of a PU.1 driven transcriptional programme[59,60]. However, in contrast, our study interestingly identifies Spi1/PU.1 as an oncogenic TF in Npm1c/Flt3-ITD AML, and is in accord with our previous work and that of others that have demonstrated PU.1 to be a vulnerability in certain AML subtypes[52,53,61]. These findings suggest the role of PU.1 to be cell context, genotype and possibly stage-specific in AML and warrant further study. Of note, PU.1’s oncogenic effect appears to occur in combination with its known co-activators, AP-1[62,63], IRFs and Hoxa9[64,65], with this observation a further demonstration of the synergy between Npm1c and Flt3-ITD where coordinated upregulation of these synergistic TF occurs. Finally, our findings describe an entirely novel Hoxa-long range super-enhancer (Hoxa-LRSE) that appears to regulate the entire Hoxa cluster following Npm1 mutation. Importantly, we demonstrate that the Hoxa-LRSE is not species-specific but is conserved within human cells. Further study of the signals integration at the Hoxa-LRSE will inform its critical roles for hematopoiesis and leukemogenesis[66,67]. Taken together our work demonstrates the complicated interplay that occurs between synergistic mutations to remodel the epigenetic landscape and rewire the epigenome to induce and maintain leukemogenic transcriptional programs and identifies critical network characteristics that might be targeted for therapeutic success.

Methods

Ethical compliance

The studies were approved by the UK Medical Research Council and University of Cambridge Animal Welfare and Ethical Review Body (AWERB). All animal procedures were regulated under UK Home Office Animals (Scientific Procedures) Act 1986 Amendment Regulations 2012 under project license 80/2564. The work does not contain any experiments processing human primary samples performed by any of the authors.

Mice

C57/BL6 strain mice carrying single mutations (conditional Npm1 flox-cA/+;Mx1-Cre+, or constitutive Flt3 ITD/+) or the combined mutations (Npm1 flox-cA/+;Flt3 ITD/+ ;Mx1-Cre+) have been described previously[14,15,16]. The activation of the Npm1 cA allele was achieved by induction of Cre-mediated recombination in 5- to 6-week-old Npm1 flox-cA/+;Mx1-Cre+ mice which were administered six doses of polyinosinic-polycytidylic acid (pIpC). The activation of Npm1 cA allele in mice carrying Npm1 flox-cA/+;Flt3 ITD/+ ;Mx1-Cre+ does not require administration of pIpC to induce Mx1-Cre for recombination[16]. For tissue harvest, mice of each genotype were utilized regardless of their sex. Wildtype (WT) C57/BL6 mice and Flt3 ITD/+ mice were sacrificed at 10-12 weeks of age; Npm1 flox-cA/+ mice were sacrificed immediately after 6x pIpC treatment (aged 11-12 weeks of age); mice carrying the combined mutations were sacrificed at 5-6 weeks of age. All mice were maintained in a standard SPF facility (12 light/12 dark cycle, 19-23°C with 40-60% humidity) and were checked on a daily base to identify signs of heavy leukemia burden, including ruffled fur, hunchback, heavy shivering, weight loss, extreme weakness, or significant loss of activity. No mice were allowed to exceed the limit of leukemia burden, under the ethical regulations of University of Cambridge Animal Welfare and Ethical Review Body (AWERB).

Flow cytometry analysis

Flow cytometry Measurement was conducted in 7-9 mice for each genotype. Briefly, bone marrow (BM) single-cell suspensions were isolated in PBS from mice femurs and tibias and stained with combinations of following anti-mouse antibodies for different populations: Mouse Lineage antibody cocktail (containing CD3e/CD11b/B220/Ly-76/Ly-6G/Ly-6C; APC conjugated; BD; 1:300 dilution), Ly-6A/E (Sca-1; Pacific Blue (PB) conjugated; BioLegend; 1:500 dilution), CD117 (c-Kit; PE-Cy7 conjugated; BioLegend; 1:500 dilution), CD34 (FITC conjugated; BD; 1:1000 dilution), CD16/32(FcR III/II) (PE conjugated; BioLegend;1:500 dilution), CD135 (Flt3; PE conjugated; BioLegend; 1:500 dilution). Hematopoietic stem and progenitor populations (HSPC) were determined using the combined immunophenotyping markers as below: Lin-, lineage negative; HPC, hematopoietic progenitors (containing all myeloid progenitors), Lin-Sca1-cKit+; LSK, enriched for HSC, Lin-Sca1+cKit+; CMP, common myeloid progenitors, Lin-Sca1-cKit+CD34+CD16/32int; GMP, granulocyte/macrophage progenitors, Lin-Sca1-cKit+CD34+CD16/32hi; MEP, megakaryocyte/erythroid progenitors, Lin-Sca1-cKit+CD34-CD16/32lo. Assays were performed on a BD LSRFortessa cell analyzer and all data were analyzed with FlowJo software (Tree Star, version 10.5.0).

Gene expression analysis

RNA-seq reads were quality filtered and mapped using STAR[68] (version 2.4.0) against the mouse genome (mm10). Uniquely mapped reads were quantified with HTSeq (version 0.6.0) vand protein-coding genes with non-zero read count in wildtype or mutant HSPC (n = 16,771) were included for downstream analysis. Reads Per Kilobase Million (RPKM) mapped reads for each protein-coding genes were calculated using Bioconductor package edgeR (version 3.28.1)[69]. Principal component analysis (PCA) was performed on RPKM values of 16,771 protein-coding genes to assess reproducibility[70]. Differential expression of protein-coding genes was analysed with these counts using Bioconductor package DESeq2 (version 1.12.4)[71]. Significantly differential expression was considered by setting adjusted P value (adjP) < 0.05 and fold change (FC) ≥ 1.5 between mutants and wildtype HSPC. Unsupervised clustering was used to generate the heatmap.

ChIP-seq

Chromatin immunoprecipitation was performed with iDeal ChIP-seq kit for Histones (Diagenode) following manufacturer’s recommendations. In brief, 1 x 106 Lin- HSPC or neutrophils isolated from wildtype or mutant mice (2-3 mice per genotype per sample replicate) were crosslinked with formaldehyde (Thermo Fisher Scientific) at a final concentration of 1% for 10 minutes and then quenched by addition of Glycine (Thermo Fisher Scientific) for 5 minutes incubation. Lysis buffers iL1 and iL2 were used to prepare the nuclei pellets. Chromatin was sheared in Shearing buffer iS1 using the Bioruptor Plus (Diagenode) for 20 cycles (each cycle for 30 seconds “ON” and 45 seconds “OFF”) at high power setting. Immunoprecipitation, wash, decrosslinking and elution were carried out as per the manufacturer’s protocol, using 1 μg antibody (anti-H3K4me1, #ab8895, Abcam; anti-H3K27ac, #ab4729, Abcam; anti-H3K4me3, #c15410003, Diagenode). 10% of the sheared chromatins were kept aside as input samples. ChIP-seq library preparation of ChIP DNA or input DNA was performed using TruSeq ChIP Sample Prep Kit (Illumina) following standard procedures from manufacturer. The library DNA was quantified using KAPA Library Quantification Kit (Kapa Biosystems) and library average size was determined by Agilent DNA 1000 Kit (Agilent Technologies). Libraries were pooled for single-read (1 x 50 bp) sequencing on an Illumina HiSeq 2500 or HiSeq 4000 platform at CRUK Cambridge Institute Genomics Core. Experiments were performed in duplicate on biologically independent samples as recommended[72].

Analysis of histone modifications

ChIP-seq reads were aligned to the mouse reference genome (mm10) using Bowtie (version 2.1.0)[73]. Duplicates were removed using PICARD tools (version 2.2.1). As for H3K4me1 or H3K4me3, enriched regions (peaks) were identified using findPeaks program from HOMER software (version 4.10.4)[26], with the setting of “-size 1000 -minDist 1000”. Peaks overlapped in both replicates of individual cell type were kept. If centers of two consecutive peaks were less than 500 bp, only the peak with higher enrichment signal was counted. H3K4me1 or H3K4me3 peak sets were generated by combining total peaks in all four HSPC samples and extending +/- 500 bp. Frequency distribution of H3K4me3 peaks were then plotted based on H3K4me3 enrichment, and peaks with high levels of H4me3 (CPM > 16, Extended Data Figure 3a) were considered to be active promoters. H3K4me1-based total enhancer catalog was created by excluding H3K4me1 peaks that intersect with active promoters, and then merging the overlapped ones. Accessible enhancers were defined as ATAC-seq consensus peaks overlapping with total enhancers. H3K27ac ChIP-seq peaks were called using MACS2 (version 2.0.1)[26] against input sample of each cell type with a p-value cutoff of 1e-9 and with setting “nomodel”. A list of H3K27ac consensus peak set was made using DiffBind (version 2.0.1)[74]. Accessible enhancers that overlap with any H3K27ac peaks were defined as active enhancers. Differential enrichment of chromatin marks (H3K4me1 and H3K27ac) was analysed using edgeR (version 3.28.1), with significant changes being defined by FDR value < 0.05 and FC ≥ 1.5 (gain or loss) in the presence of any mutations. Accessible enhancers with gain or loss of H3K4me1 levels in any mutant vs WT HSPC were defined as dynamic enhancers, likewise, with gain or loss of H3K27ac enrichment were considered with dynamic activity.

Identification of leukemia-specific enhancer changes

Accessible enhancers with DM-induced significantly differential H3K4me1 were selected for plotting enhancer marks and chromatin accessibility in WT HSPC, DM HSPC, WT neutrophils (NEU) and GMP. Histone profiles and accessibility for GMP cells were previously reported[25] and available in Gene Expression Omnibus (H3K4me1, GSM1441289; H3K27ac, GSM1441273; ATAC-seq, GSM1463173). Sequencing reads were mapped to mm10 and processed similarly as for WT HSPC. DM gained or lost enhancer groups were divided into two subgroups based on H3K4me1 enrichment patterns by k-means clustering (n = 2) using computeMatrix and plotHeatmap in deepTools (version 3.1.3)[75]. Enhancers showing unique changes in DM HSPC as compared to WT HSPC and neutrophils were considered “leukemia-specific” changes.

Chromatin accessibility analysis

Chromatin accessibility probed by ATAC-seq was analysed in like manner as H3K27ac ChIP-seq analysis, including the procedures of trimming, mapping, filtering and peak calling. Briefly, trimmed sequences were mapped against mm10 reference genome using Bowtie2 and only uniquely mapped reads were kept. Peaks were called using MACS2 with the setting “-nomodel -nolambda” and only those with p-values less than 1e-20 were considered significant. A list of ATAC-seq consensus peak set was made using DiffBind. Differential enrichment was analysed using edgeR, with significant changes being defined by FDR value < 0.05 and FC ≥ 2 (gain or loss) in the presence of any mutations.

Motif enrichment and footprinting analyses

Scanning de novo motifs was performed at ±100 bp from ATAC-seq peak summit within the investigated chromatin regions, using findMotifsGenome.pl with the setting “-mask -p 12 -S 10” from HOMER package. In most cases, motifs occurred with significant p-values and with a coverage of > 10% of target regions were displayed. Combined analysis of TFs’ occurrence and chromatin accessibility at regions of interest was assessed by Bivariate Genomic Footprinting (BaGFoot, version 0.9.7) algorithm following the procedures as described[23]. Analysis was performed on ATAC-seq profiles at all accessible regions represented by ATAC-seq consensus peaks. In brief, known TF motifs were first scanned at ± 100 bp from ATAC-seq peak summit and their occurrence was aggregated. Next, for each motif, ATAC-seq reads were counted at aggregated motif-flanking regions (±200 bp from motif center) and normalised to total sequencing counts in each sample. Differential accessibility was then determined as altered read counts flanking each motif in mutant vs WT. To probe TF footprinting depth, expected cuts were computed by mapping ATAC-seq reads to each motif occurred genome-wide and was set as baseline, and observed cuts only counted the reads within ± 100 bp from ATAC-seq peak summit. Footprinting depth was calculated as cut bias represented by a log ratio difference of observed cuts divided by expected cuts. Together, for each comparison between mutant and WT HSPC, the differences of footprinting depth and flanking accessibility for each TF motif were plotted in a Bagplot, where the inner polygon (“bag”) encompasses at most half of the TFs and the outer polygon “fence” is formed by inflating the bag geometrically by a default factor of 2.5. The outliers with a p-value less than 0.05 were considered significant.

Promoter-anchored interaction analysis

Paired-end sequences of pCHiC were processed using the HiCUP (version 0.5.8)[76] pipeline with default parameters for the following steps: quality control, identification of reads containing HiC junctions, mapping to reference genome mm10, and filtering duplicated HiC di-tags. Output bam files containing valid HiC di-tags were processed by Bioconductor package CHiCAGO (version 1.1.1)[77], to call significant promoter-based interactions. CHiCAGO considers distance effect on interaction frequencies by virtue of a convolution background model and a distance-weighted p-value. CHiCAGO scores represent -log weighted p-values, the higher the more likelihood of interaction formed. CHiCAGO scores were calculated for each pCHiC sample, and significant interactions were called when CHiCAGO scores ≥ 5. Significant interactions were computed per HSPC by merging their replicates, and were combined to form a matrix of total unique interactions. While, interactions present in both replicates per HSPC were considered with high confidence. Total pCHiC reads at individual promoters were summed to perform differential analysis in mutant vs WT HSPC using edgeR. Differential total interaction reads were defined by adjP < 0.05 and absolute FC > 1. Rewired interactions were identified by comparing and ranking their CHiCAGO scores in WT and mutant cells. Those high-confidence interactions, absent in WT (score < 5) but present in mutant (score ≥ 5), with scores ranked in the bottom quartile in WT but in the top quartile in mutant, were considered mutation-associated gained interactions. And vice versa, high-confidence interactions that were present in WT but absent in mutant, ranked in the top quartile in WT but in the bottom quartile in mutant, were considered as lost interactions by mutations. In addition, total unique interaction profiles facilitated the annotation of distal cis-regulatory elements (e.g. SEs) to their target genes in HSPC.

Chromatin compartment analysis

Sub-nuclear compartmentation represented by self-associating chromatin domains were analysed by means of principal component analysis (PCA) on capture HiC data in a similar way as described on HiC data[27], mainly using HOMER software. To do so, uniquely mapped reads from HiCUP analysis were used to create “Tag Directory” using makeTagDirectory from HOMER. Principal component (PC1) values were calculated by running runHiCpca.pl with default setting (resolution at 50 kb). This led to the separation of chromatin into two compartments, with positive PC1 regions reflecting “active” chromatin and negative PC1 regions indicative of “inactive” chromatin. Regions of continuous positive or negative PC1 values were stitched to be identified as A or B compartments, respectively. Genome-wide correlation of compartment PC1 values between mutant and WT cells was performed by running getHiCcorrDiff.pl from HOMER. Flipped compartments were identified using HOMER findHiCCompartments.pl. Genes involved in the flipped compartments were selected to analyse their mRNA expression changes by mutations.

Seurat-guided clustering analysis to identify differential patterns of accessible enhancers

A multi-layered multiomic approach to integrate chromatin analysis[31] at all accessible regions across wildtype and mutant HSPC was applied. Firstly, within 2-kb bins at ATAC-seq consensus peaks (±1 kb from peak summit, n=75,457), normalised sequencing read counts for each chromatin modifications (H3K4me1, H3K4me3 and H3K27ac) and chromatin accessibility of each HSPC population (WT, Npm1c, Flt3-ITD and DM) were extracted and computed as input data matrix. This matrix was composed of 16 rows (with 4x chromatin marks and 4x cell populations) and 75,457 columns (with bins of accessible regions). Then processing the data in a similar way as for single cell RNA-sequencing with Seurat (version 3.2.3) package[32], all 75,457 accessible regions (as columns) were treated as a separate data point across all 16 conditions (as rows). This allowed dimensionality reduction to classify and visualise clusters of regions whose pattern of alterations were similar. Data were plotted on a tSNE plot to visualize all separated clusters (n=10) and a heatmap was shown to illustrate the specific patterns of chromatin changes in each cell type (no row-wise ranking). These clusters were further loaded as regions of interest into deepTools to probe average profiles of chromatin modifications and accessibility across WT and mutant conditions, further supporting the separation of tSNE clusters and their unique changes of chromatin states.

Gene perturbation through shRNA-mediated knock-down

Npm1c/Flt3-ITD leukemia cells (termed DM cells) were derived by isolation of Lin- BM HSPC from double mutant mice and were cultured in X-Vivo medium (Lonza) plus 10% fetal bovine serum (Gibco) in the presence of 10 ng/mL mIL-3, 10 ng/mL hIL-6, and 50 ng/mL mSCF (Peprotech). Lentiviral shRNA plasmids were obtained from Sigma-Aldrich in the form of MISSION pLKO.1-puro vectors and shRNA sequences were provided in Supplementary Table 12. Lentiviral particles were produced by co-transfection of shRNA plasmids with psPAX and pMDG.2 in 293T cells using the Trans-IT LT-1 transfection reagent (Mirus). 293T cells were cultured in DMEM (Gibco) plus 10% fetal bovine serum (Gibco). DM cells were infected by shRNA lentivirus twice via spinoculation in the presence of 5 μg/mL polybrene (Sigma-Aldrich). 72 hours post transduction, cells were selected with 2 μg/mL puromycin (Sigma-Aldrich) for 72 hours.

Disruption of cis-regulatory elements through CRISPR-mediated deletion

Mouse leukemia cells carrying Npm1c/Flt3-ITD/Cas9 (termed DM-Cas9 cells)[42] were cultured in X-Vivo medium (Lonza) plus 10% fetal bovine serum (Gibco) supplemented with cytokines as described above. In the presence of Cas9, using dual guide RNAs (gRNAs) to specifically target two genomic loci has been demonstrated previously[52]. Paired dual gRNAs targeting specific cis-regulatory elements or scramble control dual gRNAs were designed with IDT web tools (https://eu.idtdna.com/site/order/designtool/index/CRISPR_CUSTOM) and their sequences were available in Supplementary Table 12. Oligos of dual gRNAs were cloned into lentivirual vector pKLV2.2-h7SKgRNA5(SapI)-hU6gRNA5(BbsI)-PGKpuroBFP-W (#72666, Addgene). Production of gRNA particles and transduction of DM-Cas9 cells were carried out same as for shRNA. To confirm the target deletion in the bulk transduced cells, PCR primers were designed to amplify the wildtype or modified allele (Supplementary Table 12). Size of PCR products was checked by agarose gel electrophoresis. Furthermore, PCR products were subject to TA cloning (Promega). Plasmid DNA was extracted from individual colonies and was confirmed for deletion by Sanger sequencing.

CRISPR-mediated gene loss of function in human leukemia cells

OCI-AML3 and MOLM-13 cells were obtained from the Sanger Institute Cancer Cell Line Panel, with constitutive expression of Cas9 were generated by lentiviral transduction using pKLV2-EF1aBsd2ACas9-W vector (Addgene #67978)[52]. OCI-AML3 and MOLM-13 cells were maintained mycoplasma free in MEM-alpha supplemented with 20% FCS and RPMI supplemented with 10% FCS, respectively. For each of the human targets (including IRF8, SPI1 and IGF1), two guide RNAs were designed and subcloned into the pKLV2-U6gRNA5(BbsI)-PKGpuro2ABFP-W expression vector (Addgene #67974)[52]. A non-target control gRNA was also included. Target sequences of all gRNAs are detailed in Supplemental Table 12. For cell proliferation assays, 50,000 cells transduced with specific gRNAs were plated and cells were counted every 2-3 days for a period of 7-14 days. Three replicates for each target or control were carried out. Data were presented as a fold change in total cell numbers relative to corresponding non-targeting control. For colony forming assays, 500-1000 cells transduced with specific gRNAs were seeded in duplicate in Methocult semi-solid medium (H4531, STEMCELL Technologies). Total colony forming units were enumerated 7 days post-culture. Three replicates for each target or control were carried out. Data were presented as a fold change in colony forming units or colony cellularity, relative to non-targeting counterparts.

Statistics & Reproducibility

Statistical analyses in the studies were specified in details in figure legends. Sample sizes were not predetermined using any statistical method. Randomization was applied to mouse samples preparation and cell perturbation experiments. The Investigators who collected the mouse samples were not informed about the sample allocation for NGS experiments. The Investigators who performed other experiments were not blinded to allocation during experiments and outcome assessment. No data were excluded for analysis, except for the exclusion of reads mapped to mitochondrial DNA, as the overall study aim was to investigate only the chromatin landscape of the non-mitochondrial genome. Student’s unpaired or paired t-tests were performed as two-tailed with GraphPad Prism (version 8.2.1); One-tailed hypergeometric distribution was analysed in R (version 3.6.1); p values in HOMER de novo motif analysis were calculated as one-tailed or in BaGFoot analysis as two-tailed. These analyses were not corrected for multiple testing. Statistical calculations in all differential analyses of sequencing data were performed with DESeq2 or edgeR, generating two-tailed and multiple testing corrected p (with the Benjamini and Hochberg method, adjP) or FDR values. Output data with significance p, adjP, or FDR values ≤ 0.05 were considered statistically significant. Number of independent experiments or independent samples were specified in figure legends. Representative data or images were replicated in at least three independent experiments or three independent samples.

Transcriptional changes across WT and mutant HSPC.

a, Heatmap showing unsupervised clustering analysis of global gene expression in WT and mutant HSPC. b, Top 10 positively enriched gene ontology (GO) gene sets revealed by GSEA for each mutant (vs WT). c, Commonly enriched gene sets from GSEA analysis of differential gene expression by Npm1c and DM. d, GSEA enrichment plots showing gene set of Kong_E2F3_Targets in Npm1c or DM vs WT by GSEA analysis. e, Gene sets positively or negatively enriched in all three mutants vs WT.

Global chromatin accessibility across WT and mutant HSPC.

a, Heatmaps and profile plots of ATAC-seq enrichment across WT and mutant HSPC over regions with gain or loss of accessibility in the presence of Npm1c. Peaks were ranked by average enrichment across all samples. b, Chromatin accessibility at the Gata2 genes and its upstream enhancers in all four HSPC and wildtype neutrophils. Regions showing loss of accessibility in Npmc1 and DM HSPC are highlighted. c, Linking differential accessibility at gene promoters to their mRNA expression changes by each mutant. Up- or downregulation were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. d-f, De novo motifs significantly enriched at genomic regions with altered accessibility by Npm1c (d) and Flt3-ITD (e) or open chromatin sites with static accessibility (f) where FDR of differential analysis > 0.2 in any mutant vs WT HSPC. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected). g and h, BaGFoot analysis illustrates TFs with differential footprint depth and accessibility in Npm1c (g) and Flt3-ITD (h) vs WT HSPC. Motifs outside the fence and with a p value (two-tailed, not multiple testing corrected) < 0.05 are statistically significant outliers; n.s., not significant.

Chromatin modifications at accessible enhancers across WT and mutant HSPC.

a, Distribution of all H3K4me3 peaks based on read counts across all four cell types. Active promoters were marked by high H3K4me3 with log2 CPM > 4. b, Genome distribution of ATAC-seq consensus peaks. c, Number of total accessible enhancer peaks defined in all four HSPC samples and percentage of dynamic enhancers which were defined by differential H3K4me1 in the presence of any mutations. d, Numbers of overlapping accessible enhancers with gain or loss of H3K4me1 in the presence of Flt3-ITD or DM. Hypergeometric test p values (one-tailed, not multiple testing corrected) are shown. e, Heatmaps of H3K4me1 enrichment over gained or lost enhancers during DM leukemia induction. Peaks were ranked by average enrichment across all samples. f, Percentage of accessible enhancers marked by both H3K4me1 and H3K27ac in total accessible enhancers across all four HSPC states. g, Percentage of enhancers showing dynamic H3K27ac modification in the presence of single or double mutations. h. Heatmaps and profile plots of H3K27ac enrichment over enhancers showing gain or loss of H3K27ac during DM leukemia induction. Peaks were ranked by average enrichment across all samples.

Immunophenotypic characterization of HSPCs isolated from bone marrow of wildtype and mutant mice.

a, Representative flow cytometry plots showing proportions of hematopoietic stem and progenitors within bone marrow compartment. Lin-, lineage negative; HPC, hematopoietic progenitors (Lin-Sca1-cKit+) containing all myeloid progenitors; LSK, Lin-Sca1+cKit+; CMP, common myeloid progenitors; GMP, granulocyte/macrophage progenitors; MEP, megakaryocyte/erythroid progenitors. b, Comparison of proportions of hematopoietic stem and progenitors in their parental populations in mutant mice (Npm1 flox-cA/+;Mx1-Cre, n=9; Flt3 ITD/+, n=8; Npm1 flox-cA/+;Mx1-Cre;Flt3 ITD/+, n=9) compared with wildtype mice (WT, n=7). Student’s unpaired t-tests (two-sided) were performed for the comparisons; data are presented as mean values +/- standard deviation; Only the p values < 0.05 were shown.

3D chromatin interaction profiles across WT and mutant HSPC.

a, Numbers of total and high-confidence pCHiC interactions in all four HSPC states. High-confidence interactions were defined as significant interactions (with CHiCAGO score ≥ 5) overlapping in both replicates of each cell type. b, Numbers of pCHiC interactions captured by individual promoters. c, Distance of interacting regions from their target promoters. Median distance was shown. d, Illustration of chromatin compartments A/B levels at a DM-induced “B to A” flipped region (highlighted) containing the Igf1 oncogene. e. Illustration of chromatin compartments A/B levels at a DM-induced “B to A” flipped region (highlighted) containing an Ifi (interferon inducible) gene cluster (in red). f and g, Significant interactions, represented by arcs, associated with the Gfi1b (f) or Irf8 (g) promoters in different cell types.

Integrative analysis with Seurat-guided clustering of chromatin profiles across WT and mutant HSPC.

a, Profile plots of chromatin marks and accessibility at accessible regions in each of the 10 clusters (except for Cluster-6 and -10). b, De novo motifs significantly enriched at Clusters 7-9 chromatin regions. HOMER outputs motifs with target coverage > 10% and ranked by p values (one-tailed, not multiple testing corrected). c, GO analysis of genes that are linked to the chromatin regions in Cluster-10 using pCHiC profiles and downregulated during leukemia induction. d, Volcano plot showing differential expression of Cluster-10 linked genes in DM vs WT HSPC. Up- or downregulation were defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. e, Percentage of Cluster-6 genes with human overlap showing their expression in DM vs WT HSPC. Differential expression was defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5.

Correlating enhancer alterations with differential expression of target genes or DNA topology changes.

a, Percentage of dynamic enhancers interacting with gene promoters and the percent of up- or downregulation of the genes they linked to. Dynamic enhancers were marked by either differential H3K4me1 only, or together with differential H3K27ac and accessibility. b, Percentage of genes involving flipped chromatin compartments and expressed in DM vs WT HSPC. Differential expression was defined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and absolute FC ≥ 1.5. c, Percentage of enhancers with dynamic interactions showing their H3K4me1 read counts in DM vs WT HSPC. Differential H3K4m1 levels were defined by setting FDR (two-tailed and multiple testing corrected) < 0.05 and FC ≥ 1.5. d, Example genomic region demonstrating correlation of H3K4me1 changes with “rewired” interactions.

Differential Super-enhancer usage across WT and mutant HSPC.

a, Definition of super-enhancers (SEs) by ranking H3K27ac peaks that were overlapped in both replicates of each cell type based on normalised H3K27 counts. The top-ranked 801 regions were considered as SEs across four HSPC. b, Number of super-enhancers (SE) with increase (UP) or decrease (DOWN) in H3K27ac modification in mutant vs WT HSPC. c, Heatmaps and profile plots of H3K27ac enrichment in WT and mutant HSPC over SEs showing gain or loss of H3K27ac during DM leukemia induction. d, Linking differential activity of SEs to altered mRNA expression of their target genes (determined by pCHiC interactions) in mutant vs WT HSPC. Upregulated genes connecting to SEs with H3K27ac gain or downregulated genes connecting to SEs with H3K27ac loss were determined by setting adjP (two-tailed and multiple testing corrected) < 0.05 and FC ≥ 1.5 for expression, or FDR (two-tailed and multiple testing corrected) < 0.05 and FC ≥ 1.5 for H3K27ac. Several Hoxa genes were indicated in Npm1c vs WT. e, Significantly enriched GO terms for genes linked to gained SEs and upregulated in DM vs WT (upper panel) or for genes linked to lost SEs and downregulated (lower panel). f and g, Promoter-contact plots showing the read counts of promoter bait to target pairs (bait-”other end”) for Spi1 (f) and Hoxa9/Hoxa10 (g). Regions highlighted yellow show rewired interactions.

Perturbation of critical cis-regulatory hubs and their target genes abrogates leukemia maintenance in mouse DM leukemia.

a, mRNA expression of AP-1 components (Jun, Fos) detected by RT-qPCR in DM cells expressing shRNAs targeting Jun or Fos relative to control shRNA (n=3 independent experiments). b, CFU assays of DM cells expressing shRNAs targeting Jun or Fos or control gRNAs (n=3 independent experiments). c, mRNA expression of Hoxa9, Hoxa10, and Spi1 in DM cells expressing shRNAs targeting them specifically relative to control shRNA (n=3 independent experiments). d and e, CFU assays (d) and ex vivo cell proliferation (e) of DM cells expressing shRNAs targeting Hoxa9, Hoxa10, Spi1 or control shRNA (n=3 independent experiments). f and i, Gel electrophoresis of PCR products on genomic DNA of DM-Cas9 cells expressing control gRNAs, Spi1-URE (f) or Hoxa-LRSE (i) gRNAs. PCR experiment was performed once on three independent gRNA transductions (marked with #). g and j, Sanger sequencing to confirm the deletion of Spi1-URE (g) or Hoxa-LRSE (j) in DM-Cas9 cells expressing Spi1-URE gRNAs. h and k, Ex vivo cell growth of DM-Cas9 cells expressing Spi1-URE gRNAs (h) or Hoxa-LRSE gRNAs (k) in comparison to cells expressing control gRNAs (n=3 independent experiments). Statistical analyses in a-d, h and k were performed by running Student’s unpaired t-tests and p values (two-sided, not multiple testing corrected) were shown; error bars represent mean and standard error of mean.

Perturbation of critical target genes impairs human leukemia cells.

a and b, Relative mRNA expression of target gene expression upon CRISPR-mediated knockdown in OCI-AML3 (a) and MOLM-13 (b) leukemia cell lines (n=3 independent experiments). c-e, Cell proliferation of MOLM-13 cells expressing human gRNAs targeting SPI1 (c), IRF8 (d) and IGF1 (e) relative to control gRNAs (n=3, 4, 3 independent experiments, respectively). # indicates two independent gRNAs. f, CFU assays of MOLM-13 cells expressing human gRNAs targeting SPI1, IRF8 and IGF1 relative to control gRNAs (n=3 independent experiments). Statistical analyses in a-f were performed by running Student’s unpaired t-tests and p values (two-sided, not multiple testing corrected) were shown; error bars represent mean and standard error of mean.
  73 in total

Review 1.  Transcriptional regulatory elements in the human genome.

Authors:  Glenn A Maston; Sara K Evans; Michael R Green
Journal:  Annu Rev Genomics Hum Genet       Date:  2006       Impact factor: 8.929

2.  Genomic Classification and Prognosis in Acute Myeloid Leukemia.

Authors:  Elli Papaemmanuil; Moritz Gerstung; Hartmut Döhner; Peter J Campbell; Lars Bullinger; Verena I Gaidzik; Peter Paschka; Nicola D Roberts; Nicola E Potter; Michael Heuser; Felicitas Thol; Niccolo Bolli; Gunes Gundem; Peter Van Loo; Inigo Martincorena; Peter Ganly; Laura Mudie; Stuart McLaren; Sarah O'Meara; Keiran Raine; David R Jones; Jon W Teague; Adam P Butler; Mel F Greaves; Arnold Ganser; Konstanze Döhner; Richard F Schlenk
Journal:  N Engl J Med       Date:  2016-06-09       Impact factor: 91.245

Review 3.  Epigenetics of haematopoietic cell development.

Authors:  Howard Cedar; Yehudit Bergman
Journal:  Nat Rev Immunol       Date:  2011-06-10       Impact factor: 53.106

Review 4.  Transcription factors: from enhancer binding to developmental control.

Authors:  François Spitz; Eileen E M Furlong
Journal:  Nat Rev Genet       Date:  2012-08-07       Impact factor: 53.242

Review 5.  Mechanisms of transcriptional precision in animal development.

Authors:  Mounia Lagha; Jacques P Bothma; Michael Levine
Journal:  Trends Genet       Date:  2012-04-16       Impact factor: 11.639

6.  Dynamic Control of Enhancer Repertoires Drives Lineage and Stage-Specific Transcription during Hematopoiesis.

Authors:  Jialiang Huang; Xin Liu; Dan Li; Zhen Shao; Hui Cao; Yuannyu Zhang; Eirini Trompouki; Teresa V Bowman; Leonard I Zon; Guo-Cheng Yuan; Stuart H Orkin; Jian Xu
Journal:  Dev Cell       Date:  2016-01-11       Impact factor: 12.270

Review 7.  Long-range enhancer-promoter contacts in gene expression control.

Authors:  Stefan Schoenfelder; Peter Fraser
Journal:  Nat Rev Genet       Date:  2019-08       Impact factor: 53.242

Review 8.  From haematopoietic stem cells to complex differentiation landscapes.

Authors:  Elisa Laurenti; Berthold Göttgens
Journal:  Nature       Date:  2018-01-24       Impact factor: 49.962

9.  Hematopoiesis: an evolving paradigm for stem cell biology.

Authors:  Stuart H Orkin; Leonard I Zon
Journal:  Cell       Date:  2008-02-22       Impact factor: 41.582

Review 10.  Molecular landscape of acute myeloid leukemia in younger adults and its clinical relevance.

Authors:  David Grimwade; Adam Ivey; Brian J P Huntly
Journal:  Blood       Date:  2015-12-10       Impact factor: 25.476

View more
  5 in total

1.  Multiomics data integration to reveal chromatin remodeling and reorganization induced by gene mutational synergy.

Authors:  Haiyang Yun; Shabana Vohra; David Lara-Astiaso; Brian J P Huntly
Journal:  STAR Protoc       Date:  2022-10-13

2.  A novel network pharmacology approach for leukaemia differentiation therapy using Mogrify®.

Authors:  Lin Ming Lee; Eleni G Christodoulou; Pavithra Shyamsunder; Bei Jun Chen; Kian Leong Lee; Tsz Kan Fung; Chi Wai Eric So; Gee Chuan Wong; Enrico Petretto; Owen J L Rackham; S Tiong Ong
Journal:  Oncogene       Date:  2022-10-21       Impact factor: 8.756

Review 3.  Dissecting the Genetic and Non-Genetic Heterogeneity of Acute Myeloid Leukemia Using Next-Generation Sequencing and In Vivo Models.

Authors:  Rhea H Desai; Niloofar Zandvakili; Stefan K Bohlander
Journal:  Cancers (Basel)       Date:  2022-04-27       Impact factor: 6.575

Review 4.  Murine Models of Acute Myeloid Leukemia.

Authors:  Kristen J Kurtz; Shannon E Conneely; Madeleine O'Keefe; Katharina Wohlan; Rachel E Rau
Journal:  Front Oncol       Date:  2022-06-08       Impact factor: 5.738

5.  Dissection of the MEF2D-IRF8 transcriptional circuit dependency in acute myeloid leukemia.

Authors:  Bianca Y Pingul; Hua Huang; Qingzhou Chen; Fatemeh Alikarami; Zhen Zhang; Jun Qi; Kathrin M Bernt; Shelley L Berger; Zhendong Cao; Junwei Shi
Journal:  iScience       Date:  2022-09-15
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.