Xiufei Chen1,2, Wenjie Zhou1,3, Ren-Hua Song4, Shuang Liu5, Shu Wang6, Yujia Chen1,3, Chao Gao1,3, Chenxi He1,3, Jianxiong Xiao1,3, Lei Zhang1, Tianxiang Wang1,3, Peng Liu1,3, Kunlong Duan1,3, Zhouli Cheng1,3, Chen Zhang1,3, Jinye Zhang1,3, Yiping Sun1,3, Felix Jackson7, Fei Lan1,3, Yun Liu8, Yanhui Xu1, Justin Jong-Leong Wong4, Pu Wang1,3, Hui Yang9, Yue Xiong10, Tong Chen6, Yan Li5, Dan Ye1,3,11. 1. Huashan Hospital, Fudan University, and Shanghai Key Laboratory of Medical Epigenetics, International Co-laboratory of Medical Epigenetics and Metabolism (Ministry of Science and Technology), and Molecular and Cell Biology Lab, Institutes of Biomedical Sciences, Shanghai Medical College of Fudan University, Shanghai, China. 2. Target Discovery Institute, NDM Research Building, Oxford Ludwig Institute of Cancer Research, Oxford University, Old Road Campus, Roosevelt Drive, Oxford OX3 7FZ, UK. 3. Key Laboratory of Metabolism and Molecular Medicine, Ministry of Education, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fudan University, Shanghai, China. 4. Epigenetics and RNA Biology Program, Centenary Institute, The University of Sydney, Camperdown 2050, Australia. 5. MOE Key Laboratory of Model Animals for Disease Study, Department of Rheumatology and Immunology, Nanjing Drum Tower Hospital, The Affiliated Hospital of Nanjing University Medical School, Chemistry and Biomedicine Innovation Center (ChemBIC), Model Animal Research Center, Nanjing University Medical School, Nanjing University, Nanjing, China. 6. Department of Hematology, Huashan Hospital, Fudan University, Shanghai, China. 7. Department of Computer Science, University of Oxford, 15 Parks Rd, Oxford OX1 3QD, UK. 8. MOE Key Laboratory of Metabolism and Molecular Medicine, Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences and Zhongshan Hospital, Fudan University, Shanghai, China. 9. Department of Neurosurgery, Huashan Hospital, Fudan University, Shanghai, China. 10. Lineberger Comprehensive Cancer Center, Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599, USA. 11. Department of General Surgery, Huashan Hospital, Fudan University, Shanghai, China.
Abstract
DNA methyltransferases (DNMTs) catalyze DNA methylation, and their functions in mammalian embryonic development and diseases including cancer have been extensively studied. However, regulation of DNMTs remains under study. Here, we show that CCAAT/enhancer binding protein α (CEBPA) interacts with the long splice isoform DNMT3A, but not the short isoform DNMT3A2. CEBPA, by interacting with DNMT3A N-terminus, blocks DNMT3A from accessing DNA substrate and thereby inhibits its activity. Recurrent tumor-associated CEBPA mutations, such as preleukemic CEBPAN321D mutation, which is particularly potent in causing AML with high mortality, disrupt DNMT3A association and cause aberrant DNA methylation, notably hypermethylation of PRC2 target genes. Consequently, leukemia cells with the CEBPAN321D mutation are hypersensitive to hypomethylation agents. Our results provide insights into the functional difference between DNMT3A isoforms and the regulation of de novo DNA methylation at specific loci in the genome. Our study also suggests a therapeutic strategy for the treatment of CEBPA-mutated leukemia with DNA-hypomethylating agents.
DNA methyltransferases (DNMTs) catalyze DNA methylation, and their functions in mammalian embryonic development and diseases including cancer have been extensively studied. However, regulation of DNMTs remains under study. Here, we show that CCAAT/enhancer binding protein α (CEBPA) interacts with the long splice isoform DNMT3A, but not the short isoform DNMT3A2. CEBPA, by interacting with DNMT3A N-terminus, blocks DNMT3A from accessing DNA substrate and thereby inhibits its activity. Recurrent tumor-associated CEBPA mutations, such as preleukemic CEBPAN321D mutation, which is particularly potent in causing AML with high mortality, disrupt DNMT3A association and cause aberrant DNA methylation, notably hypermethylation of PRC2 target genes. Consequently, leukemia cells with the CEBPAN321D mutation are hypersensitive to hypomethylation agents. Our results provide insights into the functional difference between DNMT3A isoforms and the regulation of de novo DNA methylation at specific loci in the genome. Our study also suggests a therapeutic strategy for the treatment of CEBPA-mutated leukemia with DNA-hypomethylating agents.
DNA methylation of cytosine-guanine (CpG) dinucleotides is a covalent modification involved in the regulation of gene expression and plays crucial functions in mammalian development and diseases (, ). In mammalian genomes, DNA methylation is usually excluded from CpG-rich promoters of actively transcribed genes, and the correlation between DNA methylation and gene silencing increases with the density of CpG islands (CGIs) at gene promoters (). There are two de novo DNA methyltransferases (DNMTs) in mammals, DNMT3A and DNMT3B, which both contain a highly conserved catalytic domain in the C terminus and recognize unmethylated CpG. Genetic studies in mouse models have linked the function of Dnmt3a and Dnmt3b genes to diverse biological pathways. For instance, mice with whole-body knockout (KO) of Dnmt3a gene are normal at birth but die at about 4 weeks of age (). Dnmt3b-null mouse embryos encounter rostral neural tube defects and growth impairment and die in utero (), suggesting that Dnmt3b may play a more substantial role during embryonic development than Dnmt3a. Catalytically inactive Dnmt3b can restore a majority of methylation and expression changes and rescue mouse embryonic defect by accessory functions to support other Dnmts (). In addition, Dnmt3a is also essential for paternal and maternal genome imprinting and germ cell development (, ). Tissue-specific KO of Dnmt3a has revealed its important regulatory roles in adipose (), bone (), muscle (, ), T cells (), and the innate immune system (). Although the function of DNMT enzymes in mammalian development and diseases has been extensively studied, it is still unclear how these epigenetic enzymes are regulated at specific genomic regions such as CpG-rich promoters and thereby precisely regulate gene expression by DNA methylation.Alternate promoter usage at the DNMT3A locus gives rise to two isoforms: DNMT3A1 (also known as DNMT3A) and DNMT3A2. The long splice isoform DNMT3A contains a unique N-terminal domain containing 219 and 223 amino acid residues in mouse and human, respectively (), and is expressed broadly. The short isoform DNMT3A2 is expressed restrictively in embryonic stem cells (ESCs) and tissues with active proliferation and differentiation. Both isoforms of Dnmt3a are equally active in in vitro assays (, ), but they have different chromatin binding preferences in mouse ESCs (). Dnmt3a is substantially more active than Dnmt3a2 in increasing DNA methylation at the major and minor satellite repeats in Dnmt3a/3b/3l triple-KO mouse ESCs (), and isoform-specific function of Dnmt3a is also required during germ cell development in mice (). The current knowledge on the function of the N-terminal region uniquely present in DNMT3A is very limited. This domain contains a nuclear localization signal sequence, which may account for the exclusive nuclear localization of Dnmt3a and its heterochromatin enrichment (), and can also facilitate the interaction with other chromatin-remodeling proteins, such as lysine methyltransferase G9a (, ).The transcription factor CCAAT/enhancer binding protein α (CEBPA) controls lineage-specific gene expression and acts as a master regulator of terminal differentiation of several cell types including granulocytes (). The CEBPA gene comprises a single exon with an internal translational start site, resulting in the full-length CEBPA (42 kDa; p42) and a shorter N-terminal–truncated isoform (30 kDa; p30). As compared to full-length p42, p30 lacks the first 117 amino acids including the transactivating domain 1 (TAD1) and exhibits lower transactivation potential (, ). The CEBPA gene is mutated in a subset of acute myeloid leukemia (AML), and most CEBPA-mutated AML patients harbor a unique biallelic mutation pattern: an N-terminal frameshift mutation in one allele that generates a dominant negative p30 isoform () and a C-terminal missense mutation in the other allele that targets the basic region–leucine zipper domain (bZIP) and affects DNA binding and/or homo- or heterodimerization with other CEBP family members (, ). Among tumor-associated missense mutations targeting the C-terminal region of CEBPA, N321D mutation is particularly potent to cause AML with high mortality in murine models (). In an effort to determine how mutations of CEBPA contribute to AML development, we found that CEBPA mutations are associated with CpG hypermethylation, especially Polycomb repressive complex 2 (PRC2) target genes. We found that CEBPA interacts with and acts as a novel inhibitor of DNMT3A and further demonstrated that AML-derived C-terminal mutations in CEBPA disrupt DNMT3A interaction and relieve DNMT3A inhibition, leading to DNA hypermethylation and sensitizing leukemia cells to hypomethylation agents.
RESULTS
CEBPA physically interacts with DNMT3A
Aberrant DNA methylation is a well-established cancer hallmark, including regional hypermethylation at CGIs of tumor suppressor genes (, ). A previous study by Sinha and colleagues () reported genetic drivers of DNA hypermethylation in AML patients, including isocitrate dehydrogenase 2 (IDH2), CEBPA, and Wilms tumor protein 1 (WT1). In accord, we found that mutations in IDH2, CEBPA, and WT1 led to increased methylation of 43,417, 42,537, and 33,077 CpG sites, respectively, in The Cancer Genome Atlas (TCGA) AML dataset consisting of 194 patients (Fig. 1A). As compared to DNMT3A mutation, which causes 29,056 hypomethylated CpG sites, much fewer hypomethylated CpG sites were found in AML patients with mutations in IDH2 (803), CEBPA (1901), or WT1 (1808), confirming that mutations in these three genes are associated with DNA hypermethylation in AML. We and others have previously demonstrated that ten-eleven translocation 2 (TET2) DNA dioxygenase is catalytically inactivated by d-2-hydroxyglutarate, an oncometabolite produced by the mutated IDH (, ), which occurs in a mutually exclusive manner with TET2 mutations (); the transcription factor WT1 recruits TET2 to enable target promoter demethylation and that AML-associated recurrent mutations in WT1 or TET2 disrupt their interaction and cause focal CGI hypermethylation (, ). In AML cases examined, mutations in CEBPA are linked to the second largest number of hypermethylated CpG sites, but the underlying mechanism is yet to be elucidated.
Fig. 1.
CEBPA physically interacts with DNMT3A.
(A) Heatmap and Venn diagram showing different DNA methylation in 194 sequenced AML patients from the TCGA cohort, including mutations that are mostly linked to DNA hypermethylation (WT1, IDH2, and CEBPA) and hypomethylation (DNMT3A). (B) Schematic representation of the mammalian two-hybrid system and dual-luciferase reporter system used to search for CEBPA-interacting DNA-modifying enzymes in HEK293K cells. Average values of triplicated results with SD are shown. (C) Myc-tagged DNMT proteins, including DNMT1, DNMT3A, DNMT3B, and DNMT3L, were coexpressed with Flag-CEBPA in HEK293T. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (D) Schematic representation of two isoforms of DNMT3A. (E) HA-tagged DNMT3A or DNMT3A2 was coexpressed with Flag-CEBPA in HEK293T cells. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (F) Endogenous DNMT3A protein was purified by IP with an anti-DNMT3A antibody from HL-60 cells with or without knockout (KO) of DNMT3A, followed by Western blot to detect CEBPA. WT, wild type.
CEBPA physically interacts with DNMT3A.
(A) Heatmap and Venn diagram showing different DNA methylation in 194 sequenced AML patients from the TCGA cohort, including mutations that are mostly linked to DNA hypermethylation (WT1, IDH2, and CEBPA) and hypomethylation (DNMT3A). (B) Schematic representation of the mammalian two-hybrid system and dual-luciferase reporter system used to search for CEBPA-interacting DNA-modifying enzymes in HEK293K cells. Average values of triplicated results with SD are shown. (C) Myc-tagged DNMT proteins, including DNMT1, DNMT3A, DNMT3B, and DNMT3L, were coexpressed with Flag-CEBPA in HEK293T. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (D) Schematic representation of two isoforms of DNMT3A. (E) HA-tagged DNMT3A or DNMT3A2 was coexpressed with Flag-CEBPA in HEK293T cells. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (F) Endogenous DNMT3A protein was purified by IP with an anti-DNMT3A antibody from HL-60 cells with or without knockout (KO) of DNMT3A, followed by Western blot to detect CEBPA. WT, wild type.To determine how CEBPA affects DNA methylation, we applied a dual-luciferase reporter assay system and identified DNMT3A as a potential CEBPA-interacting protein, which exhibited the strongest luciferase reporter activation followed by DNMT3B (Fig. 1B). The protein association of Flag-tagged CEBPA with Myc-tagged DNMT3A and DNMT3B, but not DNMT1 and DNMT3L, was confirmed by coimmunoprecipitation (co-IP) and Western blotting (Fig. 1C). We then focused on DNMT3A, given that its interaction with CEBPA was the strongest among all DNA methylation regulators examined. We observed that CEBPA selectively bound to DNMT3A, but not DNMT3A2, when co-overexpressed in human embryonic kidney (HEK) 293T cells (Fig. 1, D and E), implying that the N terminus of DNMT3A might be crucial for the interaction with CEBPA. By constructing a series of truncated proteins of DNMT3A, we found that CEBPA specifically interacted with the N-terminal region of DNMT3A (amino acid residues 1 to 291), but not the other regions, such as the Pro-Trp-Trp-Pro domain (PWWP; amino acid residues 292 to 481), the ATRX-DNMT3-DNMT3L domain (ADD; amino acid residues 482 to 633), and the catalytic domain (CD; amino acid residues 634 to 912) (fig. S1A). In situ proximity ligation assay revealed the physical closeness of endogenous CEBPA and DNMT3A in K562 leukemia cells (fig. S1B). The interaction between endogenous DNMT3A and CEBPA proteins was confirmed in K562 and HL-60 cells via IP coupled with Western blotting (fig. S1C). To further confirm the specificity of endogenous co-IP, we found that CEBPA was immunoprecipitated by the DNMT3A antibody in wild-type HL-60 cells, but not in DNMT3A-KO cells (Fig. 1F). In addition, in vitro pull-down assay demonstrated the direct protein interaction between CEBPA and DNMT3A (fig. S1, D to F). Collectively, these data suggest that CEBPA physically interacts with DNMT3A and that the N-terminal domain of DNMT3A is primarily responsible for this interaction.
AML-derived mutations in CEBPA C terminus disrupt DNMT3A binding
To map the binding regions between CEBPA and DNMT3A, we constructed a series of CEBPA truncations and observed the retention of hemagglutinin (HA)–tagged DNMT3A in the immunoprecipitates of Flag-tagged p30 and bZIP domain (amino acid residues 278 to 358), but not the region containing TAD1 (amino acid residues 1 to 120) or TAD2 (amino acid residues 121 to 277) (Fig. 2A), indicating that the C-terminal bZIP domain of CEBPA mediates the interaction with DNMT3A. Besides CEBPA (alpha), the CEBP family transcription factors contain CEBPB (beta), CEBPG (gamma), CEBPD (delta), and CEBPE (epsilon), all of which have the conserved bZIP domain in their C terminus (fig. S2, A and B). Moreover, we found that DNMT3A N terminus is conserved in mouse and human (fig. S2C) and that CEBP proteins interacted with the N-terminal domain (amino acid residues 1 to 291) of both human and mouse DNMT3A when co-overexpressed in HEK293T cells (fig. S2, D and E). Collectively, these findings suggest that the bZIP domain of CEBP proteins mediates the interaction with DNMT3A through its N-terminal domain.
Fig. 2.
AML-derived mutations in CEBPA C terminus disrupt DNMT3A binding.
(A) Wild-type CEBPA and its deletion mutants, as shown in the schematic illustration (top), were coexpressed with HA-DNMT3A in HEK293T cells. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (B and C) HA-tagged wild-type CEBPA or AML-derived mutants, as shown in the schematic illustration (B), were transiently coexpressed with Flag-DNMT3A in HEK293T cells. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (D) Summary of tested AML-derived CEBPA mutations that affect DNMT3A binding, including 19 mutations in the BR and LZ regions of CEBPA bZIP domain.
AML-derived mutations in CEBPA C terminus disrupt DNMT3A binding.
(A) Wild-type CEBPA and its deletion mutants, as shown in the schematic illustration (top), were coexpressed with HA-DNMT3A in HEK293T cells. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (B and C) HA-tagged wild-type CEBPA or AML-derived mutants, as shown in the schematic illustration (B), were transiently coexpressed with Flag-DNMT3A in HEK293T cells. Protein-protein interaction was examined by IP-Western using the indicated antibodies. (D) Summary of tested AML-derived CEBPA mutations that affect DNMT3A binding, including 19 mutations in the BR and LZ regions of CEBPA bZIP domain.According to the CEBPA-DNA cocrystal structure (), the bZIP domain of CEBPA can be divided into a DNA contacting basic region (BR; amino acid residues 278 to 309) and a leucine zipper coiled-coil interface for dimerization (LZ; amino acid residues 310 to 358) (fig. S3, A and B). Among the TCGA AML patients, the C-terminal bZIP domain contains hotspot mutations in CEBPA (Fig. 2B), but only a few mutations in this region have been functionally characterized (). Notably, of a total of 19 AML-derived CEBPA mutants targeting the bZIP domain tested, almost all reduced or abolished the CEBPA-DNMT3A interaction (Fig. 2, C and D, and fig. S3, C to F). These findings suggest that disrupting CEBPA-DNMT3A interaction may be a shared mechanism underlying tumor-derived C-terminal mutations in CEBPA.
CEBPA inhibits DNMT3A by decreasing its accessibility to DNA substrate
It was previously reported that the N-terminal domain of DNMT3a was the major determinant of its accessibility to DNA substrate (), which led us to speculate that CEBPA might interfere with the DNA binding activity of DNMT3A. In vitro electrophoretic mobility shift assay (EMSA) demonstrated that DNMT3A bound to DNA with no sequence selectivity, while CEBPA selectively recognized sequences containing the specific motif (GCAAT) (fig. S4, A and B). The dimerization-defective CEBPAN321D mutant failed to bind this DNA motif (fig. S5A), and this mutation in CEBPA also failed to influence the binding of DNMT3A N terminus (amino acid residues 1 to 291) to random DNA (fig. S5B). CEBPA remarkably reduced the binding of either DNMT3A N terminus or full-length protein to random DNA (fig. S5, C and D), and this inhibitory effect on the DNA binding activity of DNMT3A N terminus or full-length protein was in a CEBPA dose-dependent manner (Fig. 3, A and B). Considering that CEBPA mutations are heterozygous in most clinical cases of AML, we found that the DNA binding activity of DNMT3A N terminus was diminished by the coexistence of wild-type CEBPA and N321D mutant proteins in vitro (Fig. 3C).
Fig. 3.
CEBPA inhibits the activity of DNMT3A by decreasing its accessibility to substrate DNA.
(A and B) EMSA for DNA binding activity of N terminus (residues 1 to 291) (A) or full length (B) of DNMT3A in the absence or presence of CEBPA. See Materials and Methods for more details. Data are representative of three independent experiments. (C) EMSA showing the DNA binding ability of DNMT3A N terminus in the absence or presence of both wild-type CEBPA and N321D mutant proteins at the mix ratios of 1:1 or 1:2. (D and E) Superimposed fluorescence polarization plots for DNA binding affinities of the N terminus (D) or full length (E) of DNMT3A in the absence or presence of wild-type CEBPA or the N321D mutant. Random DNA substrate (forward, 5′-6-FAM/TGGATATCTAGGGGCGCTATGATATCT-3′; reverse, 5′-AGATATCATAGCGCCCCTAGATATCCA-3′) was used. See Materials and Methods for more details. (F and G) In vitro methyltransferase activity assays for DNMT3A activity measured in the presence of wild-type CEBPA or N321D mutant protein, using random DNA substrate (F) or CEBPA binding motif (GCAAT) containing DNA (G). See Materials and Methods for more details. n.s., not significant; CPM, counts per minute.
CEBPA inhibits the activity of DNMT3A by decreasing its accessibility to substrate DNA.
(A and B) EMSA for DNA binding activity of N terminus (residues 1 to 291) (A) or full length (B) of DNMT3A in the absence or presence of CEBPA. See Materials and Methods for more details. Data are representative of three independent experiments. (C) EMSA showing the DNA binding ability of DNMT3A N terminus in the absence or presence of both wild-type CEBPA and N321D mutant proteins at the mix ratios of 1:1 or 1:2. (D and E) Superimposed fluorescence polarization plots for DNA binding affinities of the N terminus (D) or full length (E) of DNMT3A in the absence or presence of wild-type CEBPA or the N321D mutant. Random DNA substrate (forward, 5′-6-FAM/TGGATATCTAGGGGCGCTATGATATCT-3′; reverse, 5′-AGATATCATAGCGCCCCTAGATATCCA-3′) was used. See Materials and Methods for more details. (F and G) In vitro methyltransferase activity assays for DNMT3A activity measured in the presence of wild-type CEBPA or N321D mutant protein, using random DNA substrate (F) or CEBPA binding motif (GCAAT) containing DNA (G). See Materials and Methods for more details. n.s., not significant; CPM, counts per minute.To quantitatively assess the effect of CEBPA on impairing the DNA binding activity of DNMT3A, we measured the superimposed fluorescence polarization of DNMT3A with the presence of CEBPA (i.e., p42) by using a FAM-labeled random DNA sequence containing one CG dinucleotide. As shown, the N terminus of DNMT3A exhibited lower dissociation constant (Kd) values than the full-length protein for binding with random DNA substrate in vitro (Fig. 3, D and E), which is in line with previous studies reporting that the N terminus allowed Dnmt3a to anchor in the nucleus and recognize nucleic acids or nucleoproteins (, ). The DNA binding affinity of DNMT3A N terminus was reduced by CEBPA recombinant protein, with the increment of Kd value from 1.0 ± 0.1 μM to 7.2 ± 0.5 μM (Fig. 3D). Likewise, the DNA binding affinity of full-length DNMT3A was reduced by wild-type CEBPA, as the Kd value was increased from 7.6 ± 0.4 μM to 117.5 ± 35.3 μM (Fig. 3E). The inhibitory effect on the DNA binding activity of DNMT3A N terminus or full-length protein was, however, not observed when using the recombinant protein of CEBPAN321D mutant (Fig. 3D and fig. S5E).Next, we evaluated the effect of CEBPA on regulating DNMT3A enzyme activity in vitro. DNMT3A is known to be structurally stimulated by histone H3K4me0, which disrupts ADD-CD interaction and induces a large movement of the ADD domain and releases the autoinhibition form of DNMT3A (). In the presence of H3K4me0 peptide and random DNA substrate, the enzymatic activity of DNMT3A was significantly suppressed (P < 0.001) by recombinant wild-type CEBPA, but not the N321D mutant (Fig. 3F). The DNMT3A D529A mutant within the ADD link, which can hamper ADD-CD interaction and thus release autoinhibition of DNMT3A (), was still significantly inhibited (P < 0.001) by recombinant wild-type CEBPA, but not N321D mutant (fig. S6A), implying that CEBPA inhibits DNMT3A activity without causing H3K4me0-mediated DNMT3A protein structural change. Furthermore, we found that the inhibitory effect of CEBPA on DNMT3A activity was more profound when the specific motif (GCAAT) of CEBPA was used as DNA substrate (Fig. 3G). In contrast, CEBPA failed to inhibit the activity of DNMT3A2 or DNMT3APWWP-ADD-CD (fig. S6, B and C), both of which lack the N-terminal CEBPA-interacting domain. Together, these in vitro results support a model that CEBPA inhibits DNMT3A activity by decreasing the accessibility of DNMT3A to DNA substrate.
CEBPA decreases CGI methylation at target gene promoters
Approximately 10% of AML patients show mutations in the CEBPA coding region, including N-terminal frameshift mutations and/or C-terminal in-frame insertions/deletion mutations in the bZIP domain (Fig. 4A). Notably, total CpG methylation was significantly increased (P < 0.05) in C-terminal CEBPA-mutated patients, but not in patients harboring N-terminal mutations, as compared to patients with wild-type CEBPA (Fig. 4B). In agreement with the findings in AML patient samples, overexpression of wild-type CEBPA, but not the N321D mutant, led to decreased global 5mC in cells (Fig. 4C and fig. S7, A and B). CEBPA did not affect either DNMT3A homodimer formation or DNMT3A-DNMT3L interaction (fig. S7, C and D), both of which are important for stimulating DNMT3A activity (, ). Moreover, we found that both wild-type CEBPA and the N321D mutant localized in the nucleus when expressed in cells (fig. S8A). However, the chromatin/DNA binding of the CEBPAN321D mutant was remarkably reduced as compared to wild-type CEBPA (fig. S8B). The capability of CEBPAN321D mutant to form dimers with the full-length CEBPA (p42) or its truncated isoform (p30) was also impaired in cells (fig. S8C), which is in line with a previous study showing that dimerization of bZIP proteins occurs in the absence of DNA and is a prerequisite for DNA binding ().
Fig. 4.
CEBPA decreases CGI methylation at target gene promoters.
(A) Summary of CEBPA mutations in total 5976 AML patients according to the TCGA database and other literatures. (B) Reanalysis of DNA methylation in 194 sequenced AML patients by dividing the TCGA cohort into four groups, including CEBPA nonmutation (i.e., WT), single N-terminal mutations (i.e., TADmut), single C-terminal mutations (i.e., bZIPmut), and double mutations (i.e., both TADmut/bZIPmut). The patient numbers for each group are shown. Asterisks denote statistical significance with two-tailed unpaired Student’s t test. (C) In HL-60 cells, Flag-tagged wild-type CEBPA or N321D mutant protein was ectopically expressed, followed by Western blot to detect overexpressed proteins (top). In these cells, genomic DNA was isolated and then subjected to LC-MS/MS analysis to quantify global 5mC levels. See Materials and Methods for more details. (D) Stable HL-60 cells with knockdown of endogenous CEBPA and reinduction of lentiviral vectors expressing empty vector (VEC) or Flag-tagged wild-type CEBPA or the N321D mutant to a comparable level as endogenous CEBPA protein. In these stable cells, the occupancy of Flag-CEBPA at the promoter regions of indicated target genes was determined by ChIP-qPCR. See Materials and Methods for more details. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was included as a negative control. (E) In stable HL-60 cells with CEBPA knockdown and put-back as described in (E), 5mC levels at CEBPA binding sites at the promoter regions of indicated CEBPA target genes were determined by BS-PCR. See Materials and Methods for more details. 5mC level was calculated as the percentage of methylated cytosines. (F) In stable HL-60 cells in (E), the mRNA expression of indicated CEBPA target genes was determined by quantitative reverse transcription (qRT-PCR). Average values of triplicated results with SD are shown. Asterisks denote statistical significance with two-tailed unpaired Student’s t test. *P < 0.05, **P < 0.01, and ***P < 0.001 for the indicated comparison.
CEBPA decreases CGI methylation at target gene promoters.
(A) Summary of CEBPA mutations in total 5976 AML patients according to the TCGA database and other literatures. (B) Reanalysis of DNA methylation in 194 sequenced AML patients by dividing the TCGA cohort into four groups, including CEBPA nonmutation (i.e., WT), single N-terminal mutations (i.e., TADmut), single C-terminal mutations (i.e., bZIPmut), and double mutations (i.e., both TADmut/bZIPmut). The patient numbers for each group are shown. Asterisks denote statistical significance with two-tailed unpaired Student’s t test. (C) In HL-60 cells, Flag-tagged wild-type CEBPA or N321D mutant protein was ectopically expressed, followed by Western blot to detect overexpressed proteins (top). In these cells, genomic DNA was isolated and then subjected to LC-MS/MS analysis to quantify global 5mC levels. See Materials and Methods for more details. (D) Stable HL-60 cells with knockdown of endogenous CEBPA and reinduction of lentiviral vectors expressing empty vector (VEC) or Flag-tagged wild-type CEBPA or the N321D mutant to a comparable level as endogenous CEBPA protein. In these stable cells, the occupancy of Flag-CEBPA at the promoter regions of indicated target genes was determined by ChIP-qPCR. See Materials and Methods for more details. Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was included as a negative control. (E) In stable HL-60 cells with CEBPA knockdown and put-back as described in (E), 5mC levels at CEBPA binding sites at the promoter regions of indicated CEBPA target genes were determined by BS-PCR. See Materials and Methods for more details. 5mC level was calculated as the percentage of methylated cytosines. (F) In stable HL-60 cells in (E), the mRNA expression of indicated CEBPA target genes was determined by quantitative reverse transcription (qRT-PCR). Average values of triplicated results with SD are shown. Asterisks denote statistical significance with two-tailed unpaired Student’s t test. *P < 0.05, **P < 0.01, and ***P < 0.001 for the indicated comparison.How is DNMT3A regulated by CEBPA at specific genomic regions in the cell? An appealing model is that CEBPA interacts with DNMT3A N terminus and subsequently hampers the accessibility of DNMT3A to DNA, including sites containing the CEBPA binding motif, and thus prevents CGI methylation at target gene promoters. To test this hypothesis, we generated stable HL-60 cells with depletion of endogenous CEBPA and reinduction of lentiviral vectors expressing empty vector (VEC) or Flag-tagged wild-type CEBPA or DNMT3A-binding defective N321D mutant to a comparable level as endogenous CEBPA protein (fig. S8D). In these stable cells, coupled chromatin IP and quantitative polymerase chain reaction (ChIP-qPCR) analysis confirmed the binding of wild-type CEBPA, but not the N321D mutant, to transcription start sites of target genes, including GATA3, CTNNA1, and HOXA5, but not GAPDH (a negative control) (Fig. 4D). Bisulfite (BS)–PCR and sequencing analysis showed that 5mC levels at the CpG-rich promoter regions of GATA3, CTNNA1, and HOXA5 were remarkably lower in stable cells expressing wild-type CEBPA than those expressing the CEBPAN321D mutant or empty vector (Fig. 4E). As a result, put-back of CEBPA, but not the N321D mutant, led to up-regulation of target gene expression in stable HL-60 cells with endogenous CEBPA depletion (Fig. 4F). In addition, we also reanalyzed the Illumina 450K methylation array and RNA sequencing (RNA-seq) results from the TCGA-AML cohort and confirmed that down-regulation of selected CEBPA target genes (e.g., GATA3, CTNNA1, and HOXA5/9) was associated with CpG hypermethylation in CEBPA-mutated AML patient samples (fig. S9, A to C). Collectively, these results indicate that CEBPA inhibits DNMT3A and prevents CGI methylation to induce target gene expression in both cultured leukemia cells and AML patients.
AML-derived mutations in CEBPA C terminus cause DNA hypermethylation of PRC2 targets
To systematically investigate the impact of CEBPA mutation on promoter CGI methylation and gene regulation, we next carried out gene set enrichment analysis (GSEA) in the TCGA AML dataset and found that hypermethylated genes associated with CEBPA mutations, hereafter referred to as CEBPAmut genes, were overrepresented in multiple gene sets of PRC2 component targets and genes associated with histone H3 lysine 27 trimethylated markers (H3K27me3) (P < 3.51 × 10−85 for multiple sets) (fig. S10A). There was a significant overlap between CEBPAmut genes and high CpG–containing promoters as defined by the hypergeometric test (P = 1.62 × 10−21) (fig. S10B). Supporting the negative correlation between DNA methylation and gene expression, three-fifths of CEBPAmut genes overlap with down-regulated genes in the TCGA AML cohort (P = 7.64 × 10−15, hypergeometric test) (fig. S10C).We analyzed the hypermethylated genes associated with CEBPA C-terminal mutations, hereafter referred to as CEBPAC-mut genes, and again found that PRC2 component target genes and H3K27me3-associated genes were significantly hypermethylated (Fig. 5, A and B). There was a significant overlap between CEBPAC-mut genes and high CpG–containing promoters (P = 3.65 × 10−30, hypergeometric test) (Fig. 5C). Again, supporting the negative correlation between DNA methylation and gene expression, two-thirds of CEBPAC-mut genes overlap with down-regulated genes in the TCGA AML cohort (P = 3.05 × 10−16, hypergeometric test) (Fig. 5D). Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis revealed that CEBPAC-mut genes were enriched in multiple signaling pathways critical for the pathogenesis of AML, including the Wnt signaling, the stem cell pluripotency–regulating pathway, and the Rap1 signaling pathway (Fig. 5E and table S1).
Fig. 5.
AML-derived C-terminal mutations in CEBPA are associated with DNA hypermethylation of PRC2 targets.
(A) Summary of gene sets enriched in the list of CEBPA C-terminal mutation–associated hypermethylated genes (CEBPAC-mut) from the TCGA dataset. Gene sets were obtained from the Molecular Signatures Database (MSigDB). Hypergeometric test P values are indicated, and gene numbers are also illustrated. (B) Enrichment plots from GSEA showing significant enrichment of gene sets associated with CEBPA C-terminal mutations in TCGA AML patients. P values were determined using a hypergeometric test. (C) Venn diagram showing overlap of high CpG promoter genes and CEBPAC-mut genes that are hypermethylated PRC2 target genes in TCGA AML patients. Overlap of these gene sets was measured by a hypergeometric test. (D) Venn diagram showing overlap of the CEBPAC-mut genes from the TCGA dataset and genes that are significantly down-regulated in AML. Overlap of these gene sets was measured by a hypergeometric test. (E) Pathway enrichment of CEBPAC-mut genes that are hypermethylated PRC2 target genes derived from the TCGA dataset. cAMP, adenosine 3′,5′-monophosphate; MAPK, mitogen-activated protein kinase; NES, normalized enrichment score.
AML-derived C-terminal mutations in CEBPA are associated with DNA hypermethylation of PRC2 targets.
(A) Summary of gene sets enriched in the list of CEBPA C-terminal mutation–associated hypermethylated genes (CEBPAC-mut) from the TCGA dataset. Gene sets were obtained from the Molecular Signatures Database (MSigDB). Hypergeometric test P values are indicated, and gene numbers are also illustrated. (B) Enrichment plots from GSEA showing significant enrichment of gene sets associated with CEBPA C-terminal mutations in TCGA AML patients. P values were determined using a hypergeometric test. (C) Venn diagram showing overlap of high CpG promoter genes and CEBPAC-mut genes that are hypermethylated PRC2 target genes in TCGA AML patients. Overlap of these gene sets was measured by a hypergeometric test. (D) Venn diagram showing overlap of the CEBPAC-mut genes from the TCGA dataset and genes that are significantly down-regulated in AML. Overlap of these gene sets was measured by a hypergeometric test. (E) Pathway enrichment of CEBPAC-mut genes that are hypermethylated PRC2 target genes derived from the TCGA dataset. cAMP, adenosine 3′,5′-monophosphate; MAPK, mitogen-activated protein kinase; NES, normalized enrichment score.Of all hypermethylated promoters associated with CEBPA mutations or C-terminal mutations in the TCGA AML cohort, ~97% contained at least one CEBPA binding motif (G/CCAAT) (fig. S10, D and E). Of the total PRC2 target genes, 44.8% (291 of 649) and 41.8% (271 of 649) were hypermethylated at their CpG-containing promoters in AML patients with CEBPA mutations and C-terminal mutations, respectively (table S2). These results indicate that CEBPA-DNMT3A interaction regulates a considerable proportion of PRC2 target genes by modulating their promoter methylation.
Leukemia cells with CEBPAN321D mutation are susceptible to DNA-hypomethylating agents
5-Azacytidine (5-Aza) and 5-aza-2′deoxycitidine [decitabine (DAC)] are U.S. Food and Drug Administration (FDA)–approved drugs and can reverse DNA hypermethylation patterns and restore the expression of tumor suppressor genes silenced by DNA methylation, thereby exerting antitumoral effects in the clinic for treating myelodysplastic syndrome (MDS) and AML (, ). The finding that AML-derived CEBPA mutations were associated with DNA hypermethylation encouraged us to examine the susceptibility of CEBPA-mutated leukemia cells to DNMT inhibitors (DNMTis). As expected, genomic 5mC was reduced by 5-Aza or DAC treatment in K562 cells expressing wild-type CEBPA or CEBPAN321D mutant (fig. S11A). In addition, we found that benzidine-positive cells corresponding to hemoglobin synthesis were increased by 5-Aza or DAC treatment in K562 cells and that erythroid differentiation seemed to be impaired in cells expressing CEBPAN321D mutant as compared to those expressing wild-type CEBPA when treated with either 5-Aza or DAC for 4 days (fig. S11B). K562 cells expressing CEBPAN321D mutant underwent significantly increased apoptosis as compared to those expressing wild-type CEBPA when treated with either 5-Aza or DAC (fig. S11C). The 50% inhibitory concentrations (IC50) of 5-Aza was threefold lower in CEBPAN321D-expressing K562 cells than those expressing wild-type CEBPA (IC50, 0.3 and 1.5 μM, respectively) (fig. S11D). Likewise, the IC50 value of DAC was also significantly lower in CEBPAN321D-expressing K562 cells than those expressing wild-type CEBPA (IC50, 15.4 and >100 μM, respectively) (fig. S11D). By monitoring cell proliferation over 4 days in both K562 and HL-60 stable cells treated with or without 5-Aza or DAC, we found a more profound reduction in cell proliferation at day 4 in CEBPAN321D-expressing cells compared to those expressing wild-type CEBPA (Fig. 6A). The effect of CEBPA N321D mutation on increasing cellular susceptibility to DNMTis was eliminated in DNMT3A-KO K562 and HL-60 cells (Fig. 6A), suggesting that AML-derived hotspot mutation in CEBPA sensitizes leukemia cells to DNA-hypomethylating agents in a DNMT3A-dependent manner.
Fig. 6.
CEBPAN321D mutation sensitizes leukemia cells to DNMTis.
(A) K562 and HL-60 stable cells with knockdown of endogenous CEBPA and reinduction of lentiviral vectors expressing empty vector (VEC) or Flag-tagged wild-type CEBPA or N321D mutant to a comparable level as endogenous CEBPA protein. Moreover, DNMT3A was deleted by CRISPR-Cas9–mediated genome editing technique in these stable cells. For each group, 2 × 106 cells were initially grown in 3 ml of culture medium without or with 5-Aza (1 μM) or DAC (1 μM), and the cell numbers were monitored during the period of 4 days by cell counting. DMSO, dimethyl sulfoxide. (B) Strategy to generate K562 CDX mouse model. See Materials and Methods for more details. (C) Numbers of mWBC and K562 cells in peripheral blood samples of xenografted mice after DAC therapy (n = 9 for each group). The data were collected on the date of sacrifice when an individual animal had lost ≥25% of body weight, and the results are representative of two independent experiments. (D) Representative data for Giemsa staining in peripheral blood from xenografted mice after DAC therapy (blue arrow, mWBC; red arrow, K562). (E) Kaplan-Meier survival curves and the log-rank test in xenografted mice with K562 overexpressing wild-type CEBPA or CEBPAN321D with DAC therapy (n = 9 to 10 and n = 16 per group for PBS and DAC treatment, respectively). Average values of triplicated results with SD are shown. Asterisks denote statistical significance with two-tailed unpaired Student’s t test. *P < 0.05, **P < 0.01, and ***P < 0.001 for the indicated comparison.
CEBPAN321D mutation sensitizes leukemia cells to DNMTis.
(A) K562 and HL-60 stable cells with knockdown of endogenous CEBPA and reinduction of lentiviral vectors expressing empty vector (VEC) or Flag-tagged wild-type CEBPA or N321D mutant to a comparable level as endogenous CEBPA protein. Moreover, DNMT3A was deleted by CRISPR-Cas9–mediated genome editing technique in these stable cells. For each group, 2 × 106 cells were initially grown in 3 ml of culture medium without or with 5-Aza (1 μM) or DAC (1 μM), and the cell numbers were monitored during the period of 4 days by cell counting. DMSO, dimethyl sulfoxide. (B) Strategy to generate K562 CDX mouse model. See Materials and Methods for more details. (C) Numbers of mWBC and K562 cells in peripheral blood samples of xenografted mice after DAC therapy (n = 9 for each group). The data were collected on the date of sacrifice when an individual animal had lost ≥25% of body weight, and the results are representative of two independent experiments. (D) Representative data for Giemsa staining in peripheral blood from xenografted mice after DAC therapy (blue arrow, mWBC; red arrow, K562). (E) Kaplan-Meier survival curves and the log-rank test in xenografted mice with K562 overexpressing wild-type CEBPA or CEBPAN321D with DAC therapy (n = 9 to 10 and n = 16 per group for PBS and DAC treatment, respectively). Average values of triplicated results with SD are shown. Asterisks denote statistical significance with two-tailed unpaired Student’s t test. *P < 0.05, **P < 0.01, and ***P < 0.001 for the indicated comparison.Last, we performed in vivo transplantation experiments by establishing K562 cell line–derived xenograft (CDX) model in NOD-Prkdcem26Cd52Il2rgem26Cd22Kitem1Cin(V831M)/Gpt (NCG-X) mice (Fig. 6B). Upon doxycycline (DOX) treatment, the ectopic expression of wild-type CEBPA or the N321D mutant was induced without affecting cell proliferation under cultured condition (fig. S12A). At day 14 after CDX, DOX was supplied in drinking water to induce the expression of wild-type CEBPA or the N321D mutant in NCG-X mice (fig. S12B). To verify a correct K562 chimeric ratio, we collected peripheral blood samples and confirmed that the count and percentages of mouse white blood cells (mWBCs) and K562 cells were not affected by DOX treatment (fig. S12, C to E). Moreover, DOX treatment did not change the body weight of mice xenografted with K562 expressing wild-type CEBPA or the N321D mutant (fig. S12F), and animal survival was also not affected (fig. S12, G and H). At day 19 after CDX, xenografted mice received repeated injections of DAC or phosphate-buffered saline (PBS) every 3 days, and blood samples were collected on the date of sacrifice when an individual animal had lost ≥25% of body weight. Phenotypically, we found that WBC counts in xenografted NCG-X mice were around 1 × 105/50 μl (fig. S12D) and that the counts were reduced by fivefold to less than 2 × 104/50 μl in DAC-treated moribund mice (Fig. 6C). NCG-X mice xenografted with K562 expressing CEBPAN321D exhibited fewer human leukemia cells in peripheral blood than those expressing wild-type CEBPA after DAC treatment (Fig. 6, C and D), and the survival of these mice was significantly (P < 0.01) prolonged compared to controls expressing wild-type CEBPA (Fig. 6E). These results provide in vivo evidence to support that the CEBPAN321D mutation sensitizes leukemia cells to DNMTi chemotherapy.
DISCUSSION
DNA methylation is established and maintained by DNMT enzymes, including DNMT3A and DNMT3B, which are responsible for de novo DNA methylation (), and DNMT1, which is critical for the maintenance of methylation patterns after DNA replication (, ). Like most epigenetic modifying enzymes, DNMT enzymes have little intrinsic DNA sequence specificity, and they bind to many sites across the genome. DNA methylation within the genome is not random, and regional DNA methylation forms a basis for regulation of tissue-specific gene transcription (). Previous studies have shown that DNMT enzymes require chromatin-remodeling proteins or other factors to gain access to preferred genomic loci. For instance, DNMTs can form complexes with several sequence-specific binding proteins such as PML-RAR (), Myc/Miz-1 (), and PU.1 () and are delivered to gene promoters for targeted DNA methylation. Here, we show a previously unrecognized mechanism of DNMT regulation—binding with a sequence-specific transcription factor—by which the de novo DNMT activity of DNMT3A is inhibited at specific sequences in the genome.Global epigenetic reprogramming during early development sets key developmental genes in a bivalent state marked by both gene-activating H3K4me3 and gene-repressive H3K27me3 modifications. An important observation of this epigenetic regulation is that bivalent histone methylation and DNA methylation are compatible throughout most of the genome, except for the PRC2 target genes enriched with CGIs, where these two marks are mutually exclusive (). This is believed to poise bivalently marked developmental genes in a temporarily silenced, but not permanently repressed, state, which is subsequently resolved during differentiation into either active or repressive state. It is largely unclear how these epigenetically primed states are established specifically at the promoters of PRC2 target genes. In the current study, we speculate that CEBPA may be a part of this epigenetic reprogramming (fig. S13): CEBPA interacts with DNMT3A on specific DNA and prevents DNMT3A from accessing to and catalyzing CpG methylation, thereby keeping DNMT3A activity in a “poised” state near CEBPA binding sites. Once tumor-associated mutations in the bZIP domain of CEBPA occur to disrupt CEBPA-DNA binding, it relieves DNMT3A inhibition and causes hypermethylation of PRC2 target genes. Given that about 40% of PRC2 target genes are hypermethylated in AML patients with CEBPA mutations, it is possible that additional factors may regulate PRC2 target genes by modulating their promoter CGI methylation. Besides CEBPA, other members of the CEBP family can also form protein complexes with DNMT3A, and these transcription factors are crucial regulators of progenitor differentiation in a range of cell types, including granulocytes, adipocytes, mammary epithelial cells, and granulosa cells, and play a central role in liver metabolism, adipogenesis, hematopoiesis, inflammatory processes, and tumorigenesis (). Whether CEBP-DNMT3A interaction represents a general mechanism for the regulation of PRC2 target genes remains unknown and needs further investigation. This line of research may provide new insights into the regulation of DNMT and de novo site-specific DNA methylation in a wide range of biological processes.Aberrant DNA methylation due to DNMT dysfunction is an important molecular event during leukemogenesis. FDA-approved DNMTis have been widely used as the standard of care for treating hematological malignancies. However, only 40 to 50% of patients with MDS or AML demonstrated a positive clinical response to DNMTi chemotherapy (, ). In addition, the responses are universally transient with loss of response within 2 years and being associated with very poor prognosis (, ). Therefore, there is an unmet medical need for the identification of more biomarkers to precisely predict patients’ response to DNA-hypomethylating agents and to provide better guidelines for AML treatment, and this relies on further mechanistic insight into the preferential regulation of DNMT particularly at specific genomic loci. CEBPA mutation in C-terminal bZIP domain is abundant in biallelic mutation and is a strong favorable prognostic factor in de novo AML (). In this study, we show that AML-associated mutations in the CEBPA C terminus, rather than the N terminus, cause CGI hypermethylation and dysregulation of PRC2 target genes, including those critical for self-renewal and cancer progression. Leukemia cells with the hotspot N321D mutation in CEBPA are more susceptible to DNMTi treatment than those with wild-type CEBPA both under culture conditions and in a mouse xenograft model. In terms of clinical manifestation, AML with biallelic CEBPA mutation is more frequent in children or young adults with favorable prognosis, where DNMTi is not included into the standard treatment at this age (, ). Our findings not only shed light on the pathological significance of CEBPA-DNMT3A interaction and reinforce the importance of the bZIP domain of CEBPA in DNMT3A regulation but also suggest that frequently occurring C-terminal mutations of CEBPA may be a useful clinical “signature” and predictor of sensitivity to DNA hypomethylation agents with potential therapeutic implications.
MATERIALS AND METHODS
Antibodies
Antibody specific to CEBPA (Santa Cruz Biotechnology, sc-166258), DNMT3A (Abcam, 2850 for IP), DNMT3A (Hua’an ET1609-31 for Western blot), DNMT3A (Proteintech, 20954-1-AP for immunofluorescent staining), β-actin (Cell Signaling Technology), Flag (Shanghai Genomic), HA (Santa Cruz Biotechnology), or Myc (Hua’an) was purchased commercially. Secondary antibodies for goat anti-mouse immunoglobulin G (IgG) light chain specific (Jackson ImmunoResearch) and mouse anti-rabbit IgG light chain specific (Jackson ImmunoResearch) were also purchased commercially.
Cell culture and transfection
HEK293T, HeLa, HL-60, and K562 cell lines were obtained from the American Type Culture Collection. HEK293T and HeLa cells were cultured in Dulbecco’s modified Eagle’s medium (Invitrogen), and HL-60 and K562 cells were cultured in 1640 (Gibco), both supplemented with 10% fetal bovine serum (FBS; Gibco) at 37°C with 5% CO2. Plasmid transfection was carried out by either PEI (polyethylenimine) (1 mg/ml; pH 7.0; Polysciences Inc.) or Lipofectamine 2000 (Invitrogen) according to the manufacturer’s instruction.
DNA methylation analysis in AML patients
DNA methylation was analyzed for 194 AML patient samples in the TCGA Research Network (TCGA) dataset. A sample was included in the CEBPA C-terminal mutant group if it has one or more C-terminal mutations on different alleles of CEBPA. There were 11 patients with CEBPA C-terminal mutation and 183 patients without CEBPA C-terminal mutation. In a separate analysis, a sample was included in the CEBPA mutant group if it has one or more mutations in CEBPA. There were 13 patients with CEBPA mutation and 181 patients without CEBPA mutation.Raw IDAT files from Infinium HumanMethylation450 BeadChip array were downloaded and used to obtain raw signal intensities by the RnBeads Bioconductor package (, ). The rule-based filtering criteria were applied to the preprocess step. For each probe, beta values were calculated using (Meth/[Meth+Unmeth]) and further normalized using the beta-mixture quantile normalization (). DNA methylation differences were analyzed using the limma method (), and P values were corrected for multiple testing using the false discovery rate (FDR) method. Differential methylated probes were defined as having a change in methylation (absolute delta beta) of >0.1 and an FDR-adjusted P value of <0.05. KEGG pathway analysis was conducted using the clusterProfiler Bioconductor package.
Gene set enrichment analysis
The hypermethylated CpG sites in the CEBPA C-terminal mutant or CEBPA mutant group were annotated to genes using the annotation file provided by Illumina (Gene Expression Omnibus: GPL13534). The genes were ranked according to the strength of the association between the mutation and a CpG site corresponding to the gene. A lower P value of differential methylation translated into a higher rank for any given gene.By using the GSEA software from Broad Institute, the ranked gene list was compared to the gene sets in the gene set collection C2 from Molecular Signatures Database (MSigDB) for enrichment analysis. Analysis was run with 2000 permutations and a hypergeometric (classic) test. Normalized enrichment score and FDR-adjusted P values were measured to obtain enrichments with statistical significance. All gene sets that have FDR < 0.05 were considered as significantly enriched.
Gene promoter CpG density analysis
We selected the genes from the top 10 enriched gene sets as CEBPA C-terminal mutant or CEBPA mutant genes and used the set of 10,497 genes that were studied and classified as having high CpG promoter genes (). The overlap between CEBPAC-mut or CEBPAmut genes and high CpG promoter genes was determined using a hypergeometric test.
RNA-seq analysis in AML patients
For TCGA AML patients, normalized gene quantification data were downloaded to perform differential expression analysis. Comparison between CEBPA C-terminal mutant or CEBPA mutant samples and the remaining samples was performed using t test. The overlap between CEBPAC-mut or CEBPAmut genes and down-regulated genes was determined using a hypergeometric test.
Mammalian two-hybrid assay
UAS-luciferase reporter plasmid, cytomegalovirus (CMV)–Renilla control plasmid, human DNA-modifying enzymes fusing with VP16 transactivation domain, and CEBPA fusing with Gal4-DBD were cotransfected into HEK293T cells. After transfection for 30 hours, the luciferase reporter activity was detected using a commercial kit (Promega, E1910) with a Turner BioSystems luminometer reader (Promega). The individual luciferase raw data are shown in table S3.
Generation of stable cells
To generate CEBPA knockdown and rescue stable cells, the lentivirus PLKO.1-shCEBPA and PCDH-Flag-CEBPA/CEBPAN321D plasmids were cotransfected into HEK293T cells with packing plasmids PSPAX2 and PMD2G at a 4:3:1 ratio. After 48 hours, virus was collected to infect HL-60 and K562 cells for another 24 hours. Polybrene (8 μg/ml) was mixed to increase infection efficiency. Infected cells were selected in puromycin (2 μg/ml) for 1 week. The short hairpin RNA (shRNA) sequences targeting CEBPA are 5′-GTAGAAGTCGGCCGACTCCA-3′ and 5′-CAGTTCCAGATCGCGCACTG-3′.To generate DNMT3A KO K562 or HL-60 cells, the lentiCRISPR (Addgene, plasmid #52961) DNMT3A KO plasmids were cotransfected into HEK293T cells with packing plasmids PSPAX2 and PMD2G at a 4:3:1 ratio. After 48 hours, virus was collected to infect HL-60 and K562 cells for another 24 hours. Polybrene (8 μg/ml) was mixed to increase infection efficiency. Infected cells were selected in puromycin (2 μg/ml) for 1 week. The single-guide RNA sequences used for targeting DNMT3A were 5′-GCATGATGCGCGGCCCAAGG-3′ and 5′-GCAGAGGACGAGCCGGGACG-3′. After that, single cells were sorted into 96-well plates by using fluorescence-activated cell sorting, and KO clones were validated by Western blot analysis.
Co-IP and Western blot
Cells were lysed in 1 ml of NP-40 buffer containing 50 mM tris-HCl (pH 7.5), 500 mM NaCl, 0.3% NP-40, and protease inhibitor cocktail (Roche) with rotation at 4°C for 60 min and then centrifuged at 12,000 rpm for 15 min at 4°C. IP was carried out either by incubating Flag beads (Sigma-Aldrich) at 4°C with lysate for 3 hours or by incubating cell lysate with the DNMT3A primary antibody for 1 hour, followed by incubating with protein A/G beads (GE) for another 3 hours at 4°C. Then, samples were harvested by washing beads three times and boiled in water bath for 10 min.Western blotting was performed after loading the samples into 10% SDS–polyacrylamide gel electrophoresis (PAGE) gel and running for about 1 hour at 120 V. The membrane was then blocked with 5% fat-free milk (BD Biosciences). After incubation with primary antibodies overnight at 4°C and secondary antibodies for 1 hour at room temperature, the membrane was washed three times and analyzed by Typhoon FLA 9500 image scanning (GE Healthcare).
Immunofluorescence assay
Cells were fixed with 4% phenylmethylsulfonyl fluoride (Sangon) overnight at 4°C and then permeabilized with 0.4% Triton X-100 for 15 to 30 min at room temperature. Next, the cells were incubated with blocking buffer [3% bovine serum albumin (BSA) in PBS; Sigma-Aldrich] for 1 hour at 4°C, followed by incubation with primary antibodies against HA (dilution at 1:500; Santa Cruz Biotechnology) and Flag [dilution at 1:500; MCB (The Molecular and Cell Biology Research Lab)] overnight at 4°C, and then the Alexa Fluor 594 (red) and 488 (green)–conjugated secondary antibodies (Invitrogen) for 1 hour at room temperature. Cell nuclei were stained with 4′,6-diamidino-2-phenylindole (DAPI; Invitrogen). Images were captured using a Leica fluorescence microscope.
Proximity ligation assay
Briefly, 2 × 105 cells were harvested and washed with prechilled PBS. Cells were then fixed and permeabilized for 10 min using 80% prechilled methanol and centrifuged to remove the methanol at 500g for 5 min at 4°C. Afterward, the cells were blocked in blocking solution (PBS and 1% BSA) and incubated with the anti-DNMT3A and anti-CEBPA primary antibodies (dilution at 1:100 in blocking solution). The next steps of the assay were performed according to the manufacturer’s instruction (DUO92101; Sigma-Aldrich). Cell nuclei were stained with DAPI (Invitrogen). Images were captured using a Leica fluorescence microscope.
Protein purification
DNMT3A protein was purified from HEK293T cells at 24 hours after transfection, and cells were harvested and lysed in buffer containing 50 mM tris-Cl (pH 8.0), 500 mM NaCl, and 0.01% 2-mercaptoethanol. The fusion protein was immunoprecipitated by Flag beads and then cleaved by PreScission protease. Moreover, DNMT3A N terminus (residues 1 to 291) and D529A mutant and wild-type and N321D mutant CEBPA proteins were purified from Escherichia coli strain BL21 (DE3). The transformed E. coli cells were grown at 37°C to reach an OD600 (optical density at 600 nm) of 0.6 to 0.8 and were then induced with 1 mM isopropyl-β-d-thiogalactopyranoside at 16°C for 20 hours. The cell lysate after mechanical disruption and centrifugation was applied for glutathione S-transferase purification. Then, the elution was digested with PreScission protease. The recombinant proteins were stored at −80°C until further analysis.
Electrophoretic mobility shift assay
Briefly, FAM-labeled DNA (20 nM) and indicated amounts of purified proteins were incubated in a 20-μl reaction system [20 mM Hepes, 100 mM KCl, 5% glycerol, and BSA (0.5 mg/ml) (pH 7.5)] for 45 min at room temperature. The samples were subjected to a 6 to 18% gradient PAGE and analyzed by Typhoon FLA 9500 image scanning (GE Healthcare).The FAM-labeled substrate DNA was commercially synthesized. Random DNA substrates are as follows: 5′-6-FAM/TGGATATCTAGGGGCGCTATGATATCT-3′ (forward) and 5′-AGATATCATAGCGCCCCTAGATATCCA-3′ (reverse). The CEBPA motif (GCAAT)–containing DNA substrates are as follows: 5′-6-FAM/CTAGGGCTTGCGCAATCTATATTCG-3′ (forward) and 5′-CGAATATAGATTGCGCAAGCCCTA-3′ (reverse).
DNA affinity and fluorescence polarization assay
FAM-labeled double-stranded DNA (10 nM) substrate was incubated with increased amounts of DNMT3A with wild-type or N321D mutant CEBPA proteins for 45 min at room temperature in the same reaction buffer as EMSA assay. Fluorescence polarization was measured with a Synergy 4 microplate reader (BioTek) at room temperature. Each reaction was performed in triplicate. The curves were fitted using GraphPad Prism 7.
In vitro DNMT3A activity assay
Briefly, biotin-labeled DNA was commercially synthesized as a substrate. Proteins of 0.5 mM DNMT3A or DNMT3A D529A/DNMT3A2/DNMT3A-CD and 0.5 mM DNMT3L were preincubated with the same amount of CEBPA or CEBPAN321D protein on ice for 30 min. Then, the mixed proteins were added to the reaction buffer, which contains substrate DNA (100 ng), 2.5 mM [methyl-3H] SAM (NET155V), 5% glycerol, 0.01% 2-mercaptoethanol, BSA (0.5 mg/ml), 1 μg of H3K4me0 peptide, and 25 mM tris-HCl (pH 7.5). The reactions were incubated in 37°C water bath for 30 min. Next, the reaction was stopped by adding 500 μl of cold wash buffer (500 mM NaCl and 1 mM EDTA in PBST). The DNA was collected by streptavidin beads, washed three times with cold washing buffer, and then subjected to liquid scintillation counting (PerkinElmer). Each reaction was performed in triplicate.
Dot-blot assay
DNA dot-blot assay was performed as described previously with some modifications (). Briefly, genomic DNA was spotted on a nitrocellulose membrane (Whatman). The membrane was placed under an ultraviolet lamp for 20 min to cross-link DNA, and then blocked with 5% milk in tris-buffered saline (TBS)–Tween 20 for 1 hour, followed by incubation with the anti-5mC antibody (Active Motif) at 4°C overnight. After incubation with a horseradish peroxidase–conjugated secondary antibody (GenScript) for 1 hour at room temperature, the membrane was washed with TBS–Tween 20 three times and then scanned with a Typhoon scanner (GE Healthcare), and the quantification of 5mC was done by ImageQuant software (GE Healthcare).
LC-MS/MS analysis
Genomic DNA from cells was digested with DNA Degradase Plus (Zymo Research) at 37°C for 5 hours. The digested samples were then subjected to liquid chromatography–tandem mass spectrometry (LC-MS/MS) analysis using a Shimazu LC (LC-20AB pump) system coupled with TSQ-vantage triple quadrupole mass spectrometer (Thermo Fisher Scientific). A C18 column (250 mm by 2.1 mm inside diameter, 3 μm particle size, ULTIMATE) was used. The mass spectrometer was optimized and set up in selected reaction monitoring scan mode for monitoring the [M + H]+ of 5mC [mass/charge ratio (m/z), 242.1→126.1] and deoxyguanosine (dG; m/z, 268.1→152.3). The standard nucleosides of dG and 5mC were commercially purchased (Thermo Fisher Scientific). The Analyst Software Xcalibur 2.2 sp1 was used for analysis, and global 5mC level was quantified as 5mC/dG.
ChIP-qPCR assays
Briefly, 5 × 106 cells were collected and cross-linked with 1% paraformaldehyde for 10 min and immediately stopped by adding 0.125 M glycine. After sonication at 4°C for 25 min (Bioruptor, 90% power, 15 s on, 45 s off), chromatin was immunoprecipitated at 4°C for 3 hours with the antibody against Flag (Sigma-Aldrich) and DNMT3A (Abcam) and for another 2 hours with the protein A + G beads (GE). The binding DNA was washed with 3× high-salt buffer, 2× low-salt buffers, and 1× TE (Tris-EDTA) buffer. The product DNA was de-cross-linked and collected with the PCR Purification Kit (QIAGEN) for further analysis by real-time qPCR. Primer sequences are listed in table S4.
BS-PCR assay
BS-PCR experiments were performed with a commercial EZ DNA Methylation-Gold kit following the manufacturer’s instruction. Briefly, 500 ng of genomic DNA extracted from cells was used for BS conversion and PCR amplification. The target DNA products were inserted to a T vector, and 10 clones from each sample were randomly picked for sequencing. Primer sequences are listed in table S4.
RNA isolation and qRT-PCR analysis
Total RNA was extracted from cultured cells by TRIzol reagent (Invitrogen) following the manufacturer’s instruction. RNA was reversely transcribed with oligo(dT) primers. Diluted complementary DNA was then used for real-time PCR with gene-specific primers in the presence of TB Green Advantage qPCR Premix (Takara) by using a QuantStudio 6 Flex Real-Time PCR system (Applied Biosystems). β-Actin was used as a housekeeping control. Primer sequences are listed in table S4.
Benzidine staining
K562 cells were treated with hypomethylating agents or hemin (25 μM) for 96 hours. Cells (1 × 106) were harvested and washed twice with prechilled PBS, and suspended cells (in 50 μl) were incubated with 10 μl of freshly prepared benzidine solution (2 mg/ml in glacial acetic acid containing 0.59% H2O2) for 10 min at room temperature. The stained cells were imaged with a microscope, and the percentages of benzidine-positive cells were calculated.
Cell proliferation, apoptosis, and viability assays
Cells were seeded in six-well plates in RPMI 1640 medium at the density of 5 × 104 cells per well and then treated with hypomethylating agents with indicated concentrations for indicated time periods. Cell numbers were quantified with an automated counter (Costar). In addition, cells were also collected and stained with the FITC Apoptosis Detection Kit (BD Biosciences) following the manufacturer’s instruction. The stained cells were detected by BD Accuri C6 to calculate the percentage of apoptotic cells (annexin V positive) and viable cells (both propidium iodide and annexin V negative). Each experiment was performed in triplicate.
CDX model
Severe immunodeficient NCG-X mice were purchased from GemPharmatech. Male animals were housed in individual ventilated cages under specific pathogen–free condition and used for experiments at 8 to 10 weeks of ages. To generate the CDX model, 6 × 106 K562 cells expressing Ptripz-CEBPAN321D or Ptripz-CEBPA were suspended in IMDM (Iscove Modified Dulbecco Media) (Sangon) containing 1% penicillin and streptomycin and were then intravenously injected into NCG-X mice via tail vein. At day 14 after injection, mice were continuously treated with DOX [2 mg/ml; MCE (MedChemExpress)] in drinking water to induce the expression of wild-type CEBPA or the N321D mutant until the end of experiment. Afterward, DOX-treated mice were intraperitoneally given PBS or DAC (0.4 mg/kg per 36 hours; Selleck). The body weight of mice was measured twice a week from days 0 to 19 and daily since day 20. Upon sacrifice, mice were euthanized, and peripheral blood was collected for further analysis. The Institutional Animal Care and Use Committee at Model Animal Research Center of Nanjing University approved all animal procedures used in this study (AP#: LY-01), which complied with all relevant ethical regulations.
Flow cytometry analysis
Peripheral blood of mice was collected in an anticoagulant tube and stored at 4°C. To verify proper K562 chimeric ratio, 5 μl of 123count eBeads counting beads (Invitrogen) was added into 50 μl of peripheral blood, which was then incubated with anti-mouse CD45-VioGreen (Miltenyi, clone REA737) and anti-human CD45-phycoerythrin-Cy7 (BioLegend, clone 2D1) for 30 min at 4°C in darkness. After that, FACS Lysing Solution (BD Biosciences) was used according to the manufacturer’s instruction. Cells were washed and suspended with PBS containing 2% FBS, 0.5 mM EDTA, and 1% penicillin and streptomycin. Fortessa (BD Biosciences) and Penteon (Agilent) were used for acquisition, and FlowJo v10.6.2 (TreeStar) was applied for data analysis.
Giemsa staining
Mouse peripheral blood samples were lysed by 1× red blood cell lysis buffer (eBioscience), cytospinned onto microscope adhesion slides, and then stained with a rapid Giemsa staining kit (Sangon) according to the manufacturer’s instruction. Images of cells were captured with a stereoscopic SOPTOP IC X 41 microscope.
Statistics
Two-tailed unpaired Student’s t test and hypergeometric test were applied for statistical analyses as indicated in the figure legends. With the exception of publicly available large population datasets, all other data were obtained from triplicated independent experiments, with SD of the mean (mean ± SD) displayed. The value of P < 0.05 was considered to be statistically significant.
Authors: Carmen Brenner; Rachel Deplus; Céline Didelot; Axelle Loriot; Emmanuelle Viré; Charles De Smet; Arantxa Gutierrez; Davide Danovi; David Bernard; Thierry Boon; Pier Giuseppe Pelicci; Bruno Amati; Tony Kouzarides; Yvan de Launoit; Luciano Di Croce; François Fuks Journal: EMBO J Date: 2004-12-16 Impact factor: 11.598
Authors: Danuta M Jeziorska; Robert J S Murray; Marco De Gobbi; Ricarda Gaentzsch; David Garrick; Helena Ayyub; Taiping Chen; En Li; Jelena Telenius; Magnus Lynch; Bryony Graham; Andrew J H Smith; Jonathan N Lund; Jim R Hughes; Douglas R Higgs; Cristina Tufarelli Journal: Proc Natl Acad Sci U S A Date: 2017-08-21 Impact factor: 11.205
Authors: Luciano Di Croce; Veronica A Raker; Massimo Corsaro; Francesco Fazi; Mirco Fanelli; Mario Faretta; Francois Fuks; Francesco Lo Coco; Tony Kouzarides; Clara Nervi; Saverio Minucci; Pier Giuseppe Pelicci Journal: Science Date: 2002-02-08 Impact factor: 47.728
Authors: Andrew E Teschendorff; Francesco Marabita; Matthias Lechner; Thomas Bartlett; Jesper Tegner; David Gomez-Cabrero; Stephan Beck Journal: Bioinformatics Date: 2012-11-21 Impact factor: 6.937
Authors: Ekaterina I Romanova; Anatoliy V Zubritskiy; Anna V Lioznova; Adewale J Ogunleye; Vasily A Golotin; Anna A Guts; Andreas Lennartsson; Oleg N Demidov; Yulia A Medvedeva Journal: Int J Mol Sci Date: 2022-09-27 Impact factor: 6.208