Literature DB >> 31409710

The catalytic core of DEMETER guides active DNA demethylation in Arabidopsis.

Changqing Zhang^1,2, Yu-Hung Hung^1,2, Hyun Jung Rim^3,4,5, Dapeng Zhang⁶, Jennifer M Frost⁷, Hosub Shin^3,4,5, Hosung Jang^3,4,5, Fang Liu^1,2,8, Wenyan Xiao⁶, Lakshminarayan M Iyer⁹, L Aravind⁹, Xiang-Qian Zhang^10,2,11, Robert L Fischer¹², Jin Hoe Huh^13,4,5, Tzung-Fu Hsieh^10,2.

Abstract

The Arabidopsis DEMETER (DME) DNA glycosylase demethylates the maternal genome in the central cell prior to fertilization and is essential for seed viability. DME preferentially targets small transposons that flank coding genes, influencing their expression and initiating plant gene imprinting. DME also targets intergenic and heterochromatic regions, but how it is recruited to these differing chromatin landscapes is unknown. The C-terminal half of DME consists of 3 conserved regions required for catalysis in vitro. We show that this catalytic core guides active demethylation at endogenous targets, rescuing dme developmental and genomic hypermethylation phenotypes. However, without the N terminus, heterochromatin demethylation is significantly impeded, and abundant CG-methylated genic sequences are ectopically demethylated. Comparative analysis revealed that the conserved DME N-terminal domains are present only in flowering plants, whereas the domain architecture of DME-like proteins in nonvascular plants mainly resembles the catalytic core, suggesting that it might represent the ancestral form of the 5mC DNA glycosylase found in plant lineages. We propose a bipartite model for DME protein action and suggest that the DME N terminus was acquired late during land plant evolution to improve specificity and facilitate demethylation at heterochromatin targets.

Entities: Chemical Disease Gene Species

Keywords: Arabidopsis thaliana; active DNA demethylation; endosperm development; epigenetic reprogramming; gene imprinting

Mesh：

Substances：

Year: 2019 PMID： 31409710 PMCID： PMC6717269 DOI： 10.1073/pnas.1907290116

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

DNA methylation is a covalent modification that influences the transcription of nearby genes and regulates important processes in eukaryotic genomes, including cell differentiation, transposable element (TE) silencing, and genomic imprinting (1, 2). Plant DNA methylation occurs in CG, CHG, and CHH sequence contexts (H = A, C, or T) and is targeted primarily to TEs. Flowering plants and mammals can also exhibit gene body methylation (gbM) in the CG context, generally in constitutively expressed genes, but the function of gbM is not fully understood (3, 4). DNA methylation homeostasis is essential for genome stability, notably in maintaining TE silencing, and for the stable inheritance of epigenetic information (5, 6). In plants, this is achieved by maintenance and de novo DNA methylation, as well as by active DNA demethylation (1, 7, 8). Active DNA demethylation is catalyzed by a family of DNA glycosylases, including REPRESSOR OF SILENCING 1 (ROS1), DEMETER (DME), and DEMETER-LIKE 2 (DML2) and DML3, through a base excision repair pathway (9–11). However, epigenetic profiles are dynamic in response to biotic and abiotic stress and during reproduction and development. Similar to mammals, flowering plants require epigenetic reprogramming during gamete formation, as characterized by extensive DNA demethylation in Arabidopsis by DME (12–14). DME encodes a bifunctional 5mC DNA glycosylase/lyase that is essential for reproduction (9, 15). Paralogs ROS1, DML2, and DML3 function primarily in the sporophyte to counteract the spread of DNA methylation mediated by RNA-dependent DNA methylation (16, 17). The A, glycosylase, and B regions of the C-terminal half of DME are conserved among the DME/ROS1 DNA glycosylase clade and are absolutely required for DME 5mC excision in vitro, composing the catalytic core for its enzymatic activity (9, 15). DME acts primarily in the central cell and the vegetative nucleus (15, 18, 19). The vegetative nucleus contributes to germination and growth of the pollen tube, which delivers the sperm cells to the female gametophyte. Following double fertilization, the egg and central cell develop into the embryo and the nutritive endosperm, respectively, the latter of which accumulates starch, lipids, and storage proteins to nourish the developing embryo. The endosperm is the site of plant genomic imprinting, resulting from allelic inheritance of differential epigenetic states (20–22). DNMT1 homolog MET1-mediated DNA methylation and DME-mediated demethylation are important regulators of plant gene imprinting. For example, DME demethylation is required to activate MEA, FIS2, and FWA expression in the central cell, which persists in the endosperm, while MET1 maintains the silencing of FIS2 and FWA paternal alleles (20–22). Imprinting is essential for reproduction in Arabidopsis, and seeds that inherit a maternal dme allele abort due to failure to activate MEA and FIS2, essential components of the Polycomb Repressive Complex 2 (PRC2) required for seed viability (20–22). Although DME preferentially targets small AT-rich and nucleosome-poor euchromatic transposons, it also demethylates intergenic and heterochromatin targets (13). How DME is recruited to target sites with various chromatin structures is unknown, although the Facilitates Chromatin Transactions (FACT) histone chaperone is required at heterochromatic targets and some imprinted loci (23, 24). Other than the glycosylase domain, the catalytic core region of DME contains multiple conserved globular domains of unknown function. Here we show that expressing a nuclear-localized DME catalytic region controlled by a native DME promoter complements dme seed abortion and pollen germination defects and partially rescues the DNA hypermethylation phenotype in endosperm. Our DNA methylation analysis revealed that the majority of canonical DME target sites are demethylated by the catalytic core, indicating that this region is sufficient to direct DME localization. However, without the N-terminal region, the degree of demethylation is reduced, and demethylation of heterochromatin targets is particularly impeded. In addition, we observed prevalent ectopic demethylation, specifically at genic sequences, by the catalytic core. Thus, the N-terminal region of DME is likely needed for the full breadth and depth of demethylation and to prevent gene body demethylation. We show that the N-terminal conserved domains are specific to the angiosperm lineage, acquired late during land plant evolution, potentially to ensure robust demethylation in nucleosome-rich heterochromatin targets.

Results

Nuclear-Localized Catalytic Core of DME Rescues dme Developmental Defects.

DME produces at least 2 alternatively spliced variants, encoding 2 hypothetical polypeptides of 1,729 (DME.1) and 1,987 (DME.2) amino acids in length (11, 15). The amino acid positions denoted in this study correspond to DME.2, the predominant isoform expressed in floral tissues (18). First, we detailed the domain structure and predicted characteristics of the catalytic and noncatalytic regions of the DME protein. The N-terminal half of DME consists of a large portion of unstructured, low-complexity sequences (amino acids 364–947; Fig. 1), a stretch of basic amino acid-rich repeats (amino acids 291–363; basic stretch) known to direct nuclear localization (15), and a 120-aa N-terminal domain (amino acids 1–120, DemeN) of unknown function present only among the angiosperm DME/ROS1-like proteins. Within the DemeN domain is a sWIPxTPxKs motif that is highly conserved () but is absent in the shorter DME.1 isoform. This motif has a hydrophobic core and may mediate protein–protein interactions and/or be subjected to posttranslational modification (i.e., phosphorylation or methylation at the lysine residue). The basic stretch region is highly conserved among angiosperm DME-like proteins and is reminiscent of the AT-hook motifs that can bind DNA in a non–sequence-specific manner (25), suggesting that the basic stretch might also bind DNA along with directing DME to the nucleus.

Fig. 1.

Complementation results. (A) Domain architecture and the positions of conserved domains along DME protein. nDMECTD is the construct used for complementation and methylome analyses. (B) In dme-2/dme-2 siliques, >99% of seeds are aborted. A single copy of the nDME transgene reduces seed abortion to 50%, and in the dme-2/dme-2; nDME/nDME siliques, all seeds develop normally. (Scale bar: 0.5 mm.) (C) Percentages of viable seeds in DME/dme-2 or in dme-2/dme-2 plants that were complemented by nDME transgene. Error bars represent SD. (D) The nDME transgene restores DME target gene FWA and FIS2 expression. WT: Col-0; nDMECTD: dme-2/dme-2; nDME/nDME; dme-2: dme-2 homozygotes. Total RNA was isolated from stage F1 to F12 floral buds. It was determined that the C-terminal half of DME (amino acids 936–1987; hereinafter the DMECTD) constitutes the catalytic core for 5mC excision in vitro (9, 26). This core contains the glycosylase domain of the HhH (helix-hairpin-helix) motif, followed by the [4Fe-4S] cluster loop (FCL) motif (27). The conserved B region contains an RNA recognition motif fold (RRMF) and a divergent, circularly permuted version of a methylated CpG-discriminating CXXC domain (27). The presence of the permuted CXXC and RRMF raises the possibility that the enzymatic core might contain intrinsic targeting information (28, 29). To test this possibility, we investigated whether DMECTD can rescue the dme seed abortion phenotype when expressed by an endogenous DME promoter. A classical SV40 nuclear localization signal (NLS) (PKKKPKV) was included to ensure robust nuclear localization (Fig. 1). The resulting transgene, nDME, was transformed into DME/dme-2 heterozygous Col-gl plants. Self-pollinated DME/dme-2 plants produce 50% viable (inherited DME maternal allele) and 50% aborted (inherited dme-2 maternal allele) F1 seeds. In contrast, transgenic lines that are DME/dme-2 and hemizygous for a single-locus nDME transgene produced a 3:1 ratio of viable to aborted F1 seeds when self-pollinated (1,292:439, 3:1, χ2 = 0.12, P > 0.7; ), indicating that nDME fully complemented the dme seed abortion phenotype. When nDME was transformed into dme-2/dme-2 homozygous plants (), the T1 transgenic lines (dme-2/dme-2;nDME/∼) displayed a 50% to 75% rate of viable seeds when self-pollinated (), compared with selfed nontransformed dme-2/dme-2 plants, which bear <0.05% viable seeds, again indicating full seed abortion complementation by nDME (Fig. 1 and ). These results show that nDME rescues the dme-2 seed abortion phenotype. Since seed abortion is partially due to defects in activating imprinted PRC2 genes (22), we used qRT-PCR to measure the expression of PRC2 subunit FIS2 and of FWA, whose maternal expression is enabled by DME-mediated DNA demethylation, and found that nDME restored the expression of these genes (Fig. 1). DME is also expressed in the vegetative cells of pollen, and mutations in DME reduce pollen germination, resulting in lower transmission of the paternal dme allele in certain ecotypes (19). Under our growth conditions, when DME/dme-2 heterozygous Col-gl plants were self-pollinated, approximately 20% to 30% (compared with the expected 50% when the dme-2 allele is normally transmitted) of the F1 progeny were dme-2 heterozygotes (). To test whether nDME can rescue the dme pollen phenotype, we pollinated wild-type Col-0 with pollens from transgenic lines that are dme-2 homozygous and hemizygous for the nDME transgene (dme-2/dme-2; nDME/∼ lines with ∼50% seed abortion rates; ). If nDME does not complement dme-2 pollen germination defects, we would expect roughly one-half of the F1 progeny to carry the nDME transgene (hygromycin-resistant), because mutant pollen with or without the transgene would germinate with equal frequency. Instead, we observed that 65% to 90% of the F1 progeny are hygromycin-resistant (resistant:sensitive = 190:52, 1:1, χ2 = 79.69, P = 7.3E−19; ), indicating that nDME rescues dme-2 pollen defects. These results show that nDME can rescue the dme developmental phenotype, indicating that DME targeting information is contained within the catalytic core.

Canonical DME Target Loci Are Demethylated by nDMECTD.

The molecular cause of DME mutant phenotypes is a loss of DME-mediated DNA demethylation. To test the extent of nDMECTD complementation, we compared the methylome of the nDME-complemented endosperm with the methylomes of wild-type and dme-2 endosperm (13). nDME-complemented endosperm methylomes from 3 independent lines (dme-2/dme-2, nDME/nDME) were generated and the reads combined for downstream analyses. Pearson correlation coefficients between independent lines showed that they were highly concordant (). We compared the differentially methylated regions (DMRs) between dme-2 and wild-type endosperm (canonical DME targets; n = 8,872), and the DMRs between dme-2 and nDME-complemented endosperm (nDMECTD targets; n = 8,939). Looking genome-wide, but excluding all DME and nDMECTD targets, the Pearson correlation coefficients between our combined independent lines and the previously published wild-type and dme-2 endosperm datasets ranged between 0.92 and 0.94 (), indicating a high level of concordance between the methylomes used in this study. We found several DME-regulated imprinting control regions of maternally and paternally expressed genes (MEGs and PEGs, respectively) to be hypomethylated in the nDME-complemented endosperm compared with dme-2 endosperm (), suggesting that nDMECTD is active at these loci. CG methylation does not return to wild-type levels, however, indicating that the genome is demethylated to a lesser degree by nDMECTD than by wild-type DME (Fig. 2 and ). The DME and nDMECTD DMRs largely overlap (Fig. 2), and for the DMRs that appear unique to DME, the same regions are also demethylated by nDMECTD (Fig. 2, black solid line trace), but to a reduced degree (the solid black peak is on the left of the dotted peak) that falls below our DMR cutoff (fractional CG methylation difference ≥0.3; P < 10−10, Fisher’s exact test). The shared DMRs are also slightly less demethylated in nDME-complemented endosperm compared with wild-type endosperm (Fig. 2, red trace the left of the black trace). Taken together, these data show that nDME rescues the dme hypermethylation phenotype, but only partially. As far as we could ascertain, this was not a result of lower transgene expression, since qRT-PCR analyses of endosperm tissue showed abundant expression of nDME ().

Fig. 2.

Methylome analysis. (A) Boxplot of CG methylation of canonical DME target sites in dme-2 mutant (gray), wild-type (white), or nDME-complemented (red) endosperm. (B) Venn diagram depicting unique and shared regions and loci numbers between DME and nDMECTD DMRs. (C) Kernel density plots of CG methylation differences between dme-2 and wild-type endosperm (black dotted trace), or between dme-2 and nDME-complemented endosperm (black trace), for DME unique DMRs and CG methylation difference between dme-2 and wild-type endosperm for nDMECTD-unique DMRs (orange trace). (D) Kernel density plots of CG methylation differences between dme-2 and nDME-complemented endosperm (black trace), or between dme-2 and wild-type endosperm (orange trace), within the DME and nDMECTD shared DMRs that completely overlap. (E) Chromatin state distribution, and total length covered, within nDMECTD-unique (I), nDMECTD-DME shared (II), DME-unique (III), DME-all (II+III), and nDMECTD-all (I+II) DMRs. States 1 to 7 correspond to euchromatin, and states 8 and 9 correspond to AT- and GC-rich heterochromatin, respectively. The numbers in the parentheses show fold changes (total length) in nDMECTD relative to DME DMRs. To investigate whether chromatin features influence nDMECTD demethylation, we assessed histone marks and genomic characteristics (30) in target sites that are nDMECTD-unique, DME-unique, or shared between the 2 types. Compared with nDMECTD-DME shared DMRs, DME-unique target sites are highly enriched for heterochromatin states 8 and 9 (χ2 test, P = 8.29E−7, kb as unit of length; Fig. 2). nDMECTD DMRs (unique and shared) are enriched for open chromatin states (χ2 test, P = 4.46E−117), but show significant reductions in heterochromatic chromatin states 8 and 9 compared with DME DMRs (χ2 test, P = 8.3.45E−153; Fig. 2). Thus, nDMECTD demethylates poorly at heterochromatic loci and preferentially targets euchromatin.

Reduced Demethylation Efficiency at Long Heterochromatic Target Sites by nDMECTD.

Longer DME DMRs almost exclusively reside in heterochromatin (86.3% of 1–1.5 kb and 95.5% of ≥1.5 kb; Fig. 3). We postulate that this is due to the dense methylation associated with heterochromatin, which may result in longer stretches of DNA demethylation during DME occupancy at these sites. Interestingly, the number of long DMRs is dramatically reduced in nDME-complemented endosperm (Fig. 3 ). This reduction in the number of longer DMRs was not due to a lack of nDMECTD targeting to these sites, since the partial demethylation characteristic of nDMECTD activity occurred in all targets regardless of their length (). However, when we analyzed the length of the nDMECTD demethylated regions, we found that it produced much shorter DMRs in the heterochromatin targets (). For example, there are 250 DME DMRs longer than 1.5 kb (median length, 1.9 kb). Among these, 165 are also DMRs of nDMECTD but are much shorter (median length, 400 bp) (Fig. 3 ). Thus, removal of the DME N-terminal region significantly reduced the extent of demethylation in these long targets.

Fig. 3.

DME and nDMECTD DMRs. (A) Distribution of euchromatin and heterochromatin within each DMR length group. (B) DME and nDMECTD DMRs grouped by size, with the total length that they cover shown. (C and D) The majority of the 250 longer DME DMRs overlap with the DMRs of nDMECTD(C), but the nDMECTD DMRs are shorter (D). (E) FACT dependency of the DME targets demethylated by DME (FACT-dependent, dark-green trace; FACT-independent, light-green trace) or by nDMECTD (FACT-dependent, magenta trace; FACT-independent, orange trace). The histone chaperone FACT complex is required for demethylation of approximately one-half of DME targets in Arabidopsis, particularly those in heterochromatin (23). DME colocalizes with SPT16 (the larger FACT subunit) in an in vivo bimolecular fluorescence complementation assay, suggesting that DME might recruit the FACT complex to these heterochromatic loci (23). Of the 250 long DME DMRs, 87% of them require FACT, raising the possibility that nDMECTD might be defective in recruiting FACT. To test this hypothesis, we examined how nDMECTD demethylates FACT-dependent vs. FACT-independent loci. In wild-type endosperm, both target groups are demethylated to a similar degree (Fig. 3 , blue and green traces with similar shape and peak location). In nDME-complemented endosperm, FACT-independent loci are only slightly less demethylated compared with wild-type endosperm (Fig. 3 , orange trace moderately shifted to the left of blue and green traces). In contrast, demethylation at FACT-dependent loci is more severely impeded (Fig. 3 , magenta trace). These results support a model in which DME recruits FACT via its N-terminal region to make heterochromatic targets accessible.

Prevalent Ectopic Gene Body Demethylation by nDMECTD.

We identified a set of new DMRs unique to nDMECTD, which we term “ectopic” targets (Fig. 2). The CG methylation difference between dme-2 and wild-type endosperm for these DMRs is minimal but not absent (Fig. 2 , the red trace peaks close to 0, with a positive shoulder). We plotted the methylation status of nDMECTD-unique loci in wild-type endosperm to assess how they are demethylated by DME. This resulted in positive peaks for shorter (red trace) or longer (blue trace) TEs and intergenic sequences (green trace), showing that these nDMECTD-unique loci are also demethylated by DME but not to a sufficient degree to reach the DMR cutoff. In contrast, most nDMECTD-unique loci within coding sequences are not demethylated by DME (Fig. 4 , orange trace). This indicates that genic sequences are the primary ectopic targets of nDMECTD. This is also reflected by an increase in nDMECTD DMR frequency (Fig. 4 , red and orange traces) and a decrease in average CG methylation () within coding genes.

Fig. 4.

nDMECTD induces ectopic genic demethylation. (A) Kernel density plot of CG methylation differences between dme-2 and wild-type endosperm for nDMECTD-unique loci that reside in short (<500 bp, red trace) or longer (≥500 bp, blue trace) TEs, intergenic regions (green trace), or genic sequences (orange trace). (B) Distribution frequency of DMRs with respect to coding genes. Genes were aligned at the 5′ end or the 3′ end, and the proportion of genes with DMRs in each 100-bp interval is plotted. DMR distribution is shown with respect to all DME DMRs (dark-green trace), DME-unique DMRs (light-green trace), all nDMECTD DMRs (red trace), and nDMECTD-unique DMRs (orange trace). (C) Venn diagram showing the numbers of coding genes associated with DME and nDMECTD DMRs. (D) Transcriptional scores (FPKM) in wild-type endosperm using expression data from ref. 33 for each group of genes indicated. Differences in expression level between nDMECTD-unique target genes and all genes, DME-unique, or DME-nDMECTD-shared target genes are significant (*P < 0.0001 for all comparisons, Wilcoxon rank-sum test). (E) Average CG methylation level of DME-unique target genes (blue trace), DME-nDMECTD-shared target genes (gray trace), and nDMECTD-unique target genes (red trace) in wild-type endosperm. Gene body CG methylation, or gbM, is an evolutionarily conserved feature in mammals and angiosperms, but the origin and function of gbM has not been elucidated (3, 4). Approximately 15% (∼5,000) of Arabidopsis genes contain gbM (31). In our endosperm methylomes, we found 2,202 and 3,213 genes associated with DME and nDMECTD DMRs, respectively, which were largely mutually exclusive and thus constituted almost all of the genes with gbM (Fig. 4). Among these, 2,260 genes were ectopically targeted by nDMECTD (Fig. 4 and Dataset S1). Compared with DME-targeted coding genes, these nDMECTD-unique genes have higher expression levels (Fig. 4) and greater CG methylation (Fig. 4). They include genes across most actively used cellular processes (), consistent with the current theory that moderate gbM positively correlates with constitutively transcribed genes (3). Thus, nDMECTD has a greater tendency than DME to target higher CG methylated coding sequences compared with lower methylated genes.

Evolutionary History of the DME/ROS1 Glycosylase Family.

We show that the catalytic region of DME is able to demethylate DNA in vivo and also has targeting ability. We also demonstrate that the N terminus likely plays a role in fine-tuning DME targeting to heterochromatin and restricting it from gene bodies. To provide clues as to the evolutionary origin of these bipartite domains, we carried out comparative analyses across plant lineages. Using various DME homologs as query sequences, we revealed a diversity of N-terminal domains associated with the DME catalytic core across various clades (Fig. 5). These indicate that a shorter protein, comprising only the C terminus of Arabidopsis DME, may represent the ancestral form of the 5mC glycosylase found in all plant lineages. Variations of the N-terminal domains are found in land plants and charophytes (Streptophyta), which have a divergent circularly permuted CXXC domain between the FCL and RRMF domains. In contrast, 1 or more copies of the classical CXXC can be found in chlorophyte and stramenopile algae at distinct positions. Chlorophyte and stramenopile homologs also have additional chromatin readers, such as Tudor and PHD; DNA-binding domains, such as the AT-hook; and the Hsp70-interacting DnaJ domains (27). These accessory domains suggest a mode of regulating DNA glycosylase activity according to methylation status (via CXXC) or chromatin states (via PHD and Tudor). The DemeN and basic stretch of the DME N-terminal region are restricted to the angiosperm lineage and appear to be a late acquisition during land plant evolution (Fig. 5). Thus, the acquisition of this region coincides with the origins of double fertilization in plants and the emergence of plant gene imprinting.

Fig. 5.

Evolution of plant DME-like proteins. A phylogenetic tree was reconstructed using the PhyML program. Only nodes supporting values >0.80 from maximum likelihood bootstrap analyses are shown. The representative domain architectures of DME homologs in major plant clades are shown along the tree, demonstrating domain fusions during evolution. DemeN, N-terminal domain of DME-like proteins in angiosperms; DnaJ, DnaJ molecular chaperone homology domain (Pfam: PF00226); FCL, [Fe4S4] cluster loop motif (also called Iron-sulfur binding domain of endonuclease III; Pfam: PF10576); HhH-GL, HhH-GPD superfamily base excision DNA repair protein (Pfam: PF00730); PHD, PHD finger (Pfam: PF00628); RRMF, RNA recognition motif fold (Pfam: PF00076); Tudor, Tudor domain (Pfam: PF00567). The scale bar below the tree represents the number of substitutions per site.

Discussion

DME regulates gene imprinting and influences transgenerational epigenetic inheritance in Arabidopsis (32). DME demethylates the central and vegetative gamete companion cell genomes at thousands of loci, but the mechanism of DME targeting remains elusive. This is due to the restriction of its expression to the ephemeral nuclei embedded within gametophytes, which largely prohibits biochemical interrogation by currently available tools. In contrast, genetic analysis coupled with endosperm transcriptome and methylome profiling has been instrumental in revealing DME’s molecular function (9, 12–14, 23, 33). Here we used genetic complementation and endosperm DNA methylation profiling to show that the catalytic core of DME is sufficient to rescue the dme seed abortion phenotype and pollen germination defects (Fig. 1). We present evidence that nDMECTD can demethylate most canonical DME target sites, implying that the catalytic region contains targeting information. We propose the DME protein has a bipartite structure (Fig. 6) and demonstrate a requirement for the N-terminal region in assisting heterochromatin demethylation, possibly via FACT.

Fig. 6.

A bipartite model for DME-mediated active demethylation. We propose a bipartite organization of DME in which the targeting and recruitment information is contained within the C-terminal catalytic core. In the absence of the N-terminal domain, processing of heterochromatin demethylation is significantly impeded, presumably by the chromatin structure. The N-terminal domain is required to recruit the FACT complex for the heterochromatic targets to overcome nucleosomal obstacles. The N-terminal conserved domains were acquired late during land plant evolution and are restricted to the angiosperm linage, suggesting a mode of demethylation regulation specific to angiosperm genomes. Although nDME complements dme-associated developmental defects (Fig. 1 and ), it does not fully rescue the dme endosperm DNA hypermethylation phenotype. Instead, we observed a reduced degree of demethylation by nDMECTD in all the endogenous DME target loci, regardless of length (Fig. 2 and ). To investigate the cause of this partial demethylation, we carried out data analyses to rule out technical differences in how experiments were performed. First, the nDMECTD methylomes were generated from selfing Col transgenic lines, whereas the control wild-type and dme-2 endosperm methylomes were derived from crosses of Col and Ler parents (13). To assess the extent to which the difference between ColxLer and ColxCol F1 endosperm affects methylomes and/or how sample collection and preparation methods might influence the DNA methylation profile, we compared the methylation differences between wild-type (ColxLer) and nDME-complemented (ColxCol) endosperm at DME target sites and non-DME target sites. The kernel density plot shows a general trend toward slight hypomethylation at non-DME target sites in nDME-complemented endosperm. In contrast, a more substantial hypermethylation at DME target sites in nDME-complemented endosperm compared with wild-type endosperm was detected (), indicating that the difference between ColxLer and ColxCol endosperm, or differing sample preparations, does not contribute to the hypermethylation of DME target loci in nDME-complemented endosperm. We performed the following experiment to rule out the possibility that when propagating dme-2 homozygous lines, loss of DME activity might cause epigenome alterations that render certain DME target loci no longer recognizable by nDMECTD. Among the 3 independent nDME-complemented lines, 1 line (nDME-3) was propagated from a transformed DME/dme-2 heterozygous plant in which the sporophyte was never dme-2/dme-2 in the absence of the nDME transgene. Two other lines were generated by directly transforming dme-2/dme-2 homozygous plants, with dme-2 ovules derived from dme-2/dme-2 megaspore mother cells before nDME complementation. Examination of the demethylation profiles of these 3 independent lines at DME canonical target loci revealed that the 3 lines had very similar demethylation profiles (), indicating that nDMECTD partial demethylation is likely not a result of unexpected epigenome alternations induced by dme-2/dme-2 plants. The nDME transgene, although controlled by a native DME promoter and fully complemented dme-2 seed abortion (Fig. 1 ), might lack critical regulatory elements needed to drive stable protein production to the levels comparable to the endogenous DME. Thus, an endosperm methylome complemented by a full-length DME cDNA construct within the otherwise similar T-DNA backbone as the nDME is needed to rule out any unforeseen artifacts introduced by the transgene. Throughout the course of this study, a full-length DME cDNA (DME; ref. 15) was included side by side as a control for complementation assays and for methylation profiling. However, for reasons not completely understood, we observed that although it complemented dme-2 seed abortion (3:1 ratio of viable to aborted F2 seeds, 1,518:556, χ2 = 3.62, P = 0.057; ) as reported previously (15, 34), the DME consistently suffered from negative interference in the dme-2 mutant background that resulted in very low demethylation activity in vivo, which most likely does not reflect the activity of the full-length DME protein (). To avoid confounding the interpretation, we excluded the DME results and focused only on methylome comparisons between wild-type and nDME-complemented endosperm. Although we cannot completely rule out a negative effect of the dme-2 genetic background on the nDME transgene, 2 factors suggest that this negative effect, if any, would be minimal. First, nDMECTD generates roughly the same number of DMRs as the endogenous DME (Fig. 2). Second, within the shared DMRs between nDMECTD and DME, the degree of demethylation is quite comparable (Fig. 2 and ). When directly comparing the difference between DME and nDMECTD (DME minus nDMECTD) in these completely overlapping DMRs that cover >1.2 million bases, the density plot peaks slightly to the negative side with a broader shoulder (), indicating that nDMECTD is capable of demethylation to almost the DME level in these regions. Thus, we suspect that nDME-complemented endosperm methylome data can reasonably reflect the catalytic activity of the nuclear-localized DMECTD. However, we also caution that in the absence of a proper full-length DME cDNA control, we cannot completely rule out that partial demethylation of DMECTD might be caused by other factors. Finally, it is also possible that the N-terminal region may assist the glycosylase by binding to DNA templates (via the AT hook-like motif) to promote demethylation, and that without it, DMECTD has a reduced affinity to target DNA and exhibits lower demethylation activity. Previous studies of ROS1 support such a model, showing that the basic stretch/AT-hook region of ROS1 binds strongly to DNA templates in vitro in a non–sequence-specific manner, and that removal of the ROS1 basic stretch region impairs the sliding capacity of ROS1 on the DNA template (35), significantly reducing ROS1 5mC excision activity (25). DME preferentially targets smaller euchromatic transposons that flank coding genes, and also targets gene-poor heterochromatin regions for demethylation (13). Since heterochromatin regions are compacted, demethylation in these regions requires substantial chromatin remodeling, including the temporary eviction of nucleosomes for DME to gain access to DNA. The FACT complex has been shown to be required for DME-mediated demethylation, primarily at heterochromatin targets (23), and we also noted that these sites have increased nucleosome occupancy (). It is tempting to speculate that the DME N-terminal region is required to recruit factors such as FACT, and indeed we found that the overwhelming majority of the 250 longer DME DMRs (87%) that were not properly demethylated in the absence of the N-terminal region also require FACT activity for demethylation (Dataset S2). SPT16 has been shown to colocalize with DME in vivo (23), suggesting a direct or indirect interaction between DME and FACT. nDMECTD also displayed a reduced capacity for demethylating FACT-independent loci. Thus, it is possible that the N-terminal region is needed to recruit other chromatin remodelers at FACT-independent targets, that is, if nucleosomes are natural barriers for DME demethylation in euchromatin as well as heterochromatin (). We envision a working model (Fig. 6) in which the catalytic region directs DME to target sites, while the N-terminal region is required to interact with the local chromatin environment, stabilizing binding to the chromosomal template and/or aiding demethylation of flanking sequences by remodeling nucleosomes. gbM is evolutionarily conserved, and approximately 15% of Arabidopsis genes contain gbM (4). CG-methylated genes are often constitutively expressed housekeeping genes (3), raising the possibility that these genic methylated genes reside in open chromatin that is more accessible by nDMECTD than by DME. It is also possible that DME is actively repelled by certain open chromatin histone marks, and such repulsion is missing in nDMECTD. This scenario would be analogous to the mammalian de novo DNA methyltransferase DNMT3, where its binding to an allosteric activator, unmethylated histone H3, is strongly inhibited by H3K4 methylation (36). Equally probable is that additional factor(s) (e.g., methyl-binding proteins) might be associated with these higher CG-methylated genes (Fig. 4) that restrict DME access. Tracing the evolutionary history of DME-like genes (Fig. 5), we found that a bacterial version of the HhH-FCL pair underwent a horizontal gene transfer to the ancestor of plants, followed by a gene duplication. One copy was fused to an RRMF domain and further acquired an insert in the glycosylase domain, giving the ancestral form of DME in plants. This was likely then transferred to the stramenopiles from a secondary chlorophyte endosymbiont of this lineage. Finally, at the base of the streptophyte radiation, DME acquired a permuted CXXC, and later the DemeN domain and associated charged repeats were acquired in angiosperms, possibly to facilitate and ensure robust and thorough DNA demethylation. Thus, the adoption of a DME-based demethylation system for DNA base modification appears to have occurred early in the plant lineage. The presence of several accessory domains in addition to the conserved core suggests variation in the chromatin environment in specific lineages. For example, the presence of the DemeN and basic stretch/AT-hook motifs in angiosperms and the permuted CXXC domain in the Streptophyta lineage likely reflects adjustment to the unique methylation and chromatin environment of the larger Streptophyta and land plant genomes. Our study shows that the catalytic core of DME was present in ancient plant ancestors and is alone capable of targeting and demethylation. The N-terminal domains, confined to flowering plant DME proteins, likely evolved to facilitate access to diverse chromatin states, in turn mediating gene imprinting and the transgenerational silencing of transposons (13).

Experimental Procedures

Detailed descriptions of the experimental methods are provided in . Sequencing data have been deposited in the Gene Expression Omnibus database (accession no. GSE130559).

Plant Materials and Seed Phenotype Analysis.

Heterozygous DME/dme-2 lines in the Col-gl background were subjected to Agrobacterium-mediated transformation. Seeds were sterilized with 30% bleach solution; plated on a 0.5× MS nutrient medium with 1.5% sucrose, 0.8% agar, and 40 µg/mL hygromycin; and stratified at 4 °C for 2 d. Germinated seedlings were transferred to soil and grown in a growth room under a 16-h light/8-h dark cycle at 23 °C. Siliques from T1 transgenic plants were dissected at 14–16 d after self-pollination using a stereoscopic microscope (SteREO Discovery.V12; Carl Zeiss). The numbers of viable and aborted seeds in transgenic lines were statistically analyzed with the χ2 test. The probabilities of deviation from a 1:1 or 3:1 segregation ratio for viable and aborted seeds were also calculated.

RNA Extraction, cDNA Synthesis, and qRT-PCR Analysis.

Total RNA was extracted using TRIzol reagent (Invitrogen) and treated with TURBO DNase (Ambion) according to the manufacturers’ instructions. For cDNA synthesis, 5 mg of total RNA was reverse-transcribed using SuperScript III reverse transcriptase and oligo(dT) primer (Invitrogen). cDNA was treated with RNase H (Invitrogen) at 37 °C for 20 min and then diluted 10-fold with H2O. For each 15-µL qPCR reaction, 1 µL of diluted cDNA was used. qRT-PCR analyses were run on an Applied Biosystems 7500 Fast Real-Time PCR System using Roche FastStart Universal SYBR Green Master Mix. The qRT-PCR primers are listed in . Ct values were normalized against ACT2 (At3g18780) mRNA or UBC (At5g25760) mRNA. The abundance of mRNAs was expressed as relative to controls, with the control value set to 1. The error bars represent the SD of 4 biological replicates.

Protein Domain Analysis and Phylogenetic Inference.

We used a domain-centric computational strategy to study DME and its related proteins. Specifically, we identified DME homologs using iterative profile searches with PSI-BLAST (37) from the protein nonredundant database at the National Center for Biotechnology Information. Multiple sequence alignments were built using PROMALS (38), followed by careful manual adjustments. Consensus secondary structures were predicted using the PSIPRED (39) JPred program (40). Conserved domains were further characterized based on comparisons with available domain models from Pfam (41) and sequence/structural features. PhyML (42) was used to determine the maximum likelihood tree using the Jones–Taylor–Thornton model for amino acids substitution with a discrete gamma model (4 categories with gamma shape parameter 1.096). The tree was rendered using MEGA Tree Explorer (43).

42 in total

1. DEMETER and REPRESSOR OF SILENCING 1 encode 5-methylcytosine DNA glycosylases.

Authors: Teresa Morales-Ruiz; Ana Pilar Ortega-Galisteo; María Isabel Ponferrada-Marín; María Isabel Martínez-Macías; Rafael R Ariza; Teresa Roldán-Arjona
Journal: Proc Natl Acad Sci U S A Date: 2006-04-19 Impact factor: 11.205

2. PROMALS: towards accurate multiple sequence alignments of distantly related proteins.

Authors: Jimin Pei; Nick V Grishin
Journal: Bioinformatics Date: 2007-01-31 Impact factor: 6.937

3. DEMETER DNA glycosylase establishes MEDEA polycomb gene self-imprinting by allele-specific demethylation.

Authors: Mary Gehring; Jin Hoe Huh; Tzung-Fu Hsieh; Jon Penterman; Yeonhee Choi; John J Harada; Robert B Goldberg; Robert L Fischer
Journal: Cell Date: 2006-02-10 Impact factor: 41.582

4. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0.

Authors: Koichiro Tamura; Joel Dudley; Masatoshi Nei; Sudhir Kumar
Journal: Mol Biol Evol Date: 2007-05-07 Impact factor: 16.240

5. DNA demethylation in the Arabidopsis genome.

Authors: Jon Penterman; Daniel Zilberman; Jin Hoe Huh; Tracy Ballinger; Steven Henikoff; Robert L Fischer
Journal: Proc Natl Acad Sci U S A Date: 2007-04-04 Impact factor: 11.205

6. Role of the Arabidopsis DNA glycosylase/lyase ROS1 in active DNA demethylation.

Authors: Fernanda Agius; Avnish Kapoor; Jian-Kang Zhu
Journal: Proc Natl Acad Sci U S A Date: 2006-07-24 Impact factor: 11.205

7. An invariant aspartic acid in the DNA glycosylase domain of DEMETER is necessary for transcriptional activation of the imprinted MEDEA gene.

Authors: Yeonhee Choi; John J Harada; Robert B Goldberg; Robert L Fischer
Journal: Proc Natl Acad Sci U S A Date: 2004-05-05 Impact factor: 11.205

8. DNMT3L connects unmethylated lysine 4 of histone H3 to de novo methylation of DNA.

Authors: Steen K T Ooi; Chen Qiu; Emily Bernstein; Keqin Li; Da Jia; Zhe Yang; Hediye Erdjument-Bromage; Paul Tempst; Shau-Ping Lin; C David Allis; Xiaodong Cheng; Timothy H Bestor
Journal: Nature Date: 2007-08-09 Impact factor: 49.962

9. DEMETER, a DNA glycosylase domain protein, is required for endosperm gene imprinting and seed viability in arabidopsis.

Authors: Yeonhee Choi; Mary Gehring; Lianna Johnson; Mike Hannon; John J Harada; Robert B Goldberg; Steven E Jacobsen; Robert L Fischer
Journal: Cell Date: 2002-07-12 Impact factor: 41.582

Review 10. Stability and flexibility of epigenetic gene regulation in mammalian development.

Authors: Wolf Reik
Journal: Nature Date: 2007-05-24 Impact factor: 49.962

9 in total

1. AtHDA6 functions as an H3K18ac eraser to maintain pericentromeric CHG methylation in Arabidopsis thaliana.

Authors: Qianwen Wang; Xiucong Bao; Shengjie Chen; Huan Zhong; Yaqin Liu; Li Zhang; Yiji Xia; Friedrich Kragler; Ming Luo; Xiang David Li; Hon-Ming Lam; Shoudong Zhang
Journal: Nucleic Acids Res Date: 2021-09-27 Impact factor: 16.971

Review 2. DNA Methylation in Plant Responses and Adaption to Abiotic Stresses.

Authors: Minghui Sun; Zhuo Yang; Li Liu; Liu Duan
Journal: Int J Mol Sci Date: 2022-06-21 Impact factor: 6.208

3. Roles of DEMETER in regulating DNA methylation in vegetative tissues and pathogen resistance.

Authors: Wenjie Zeng; Huan Huang; Xueqiang Lin; Chen Zhu; Ken-Ichi Kosami; Chaofeng Huang; Huiming Zhang; Cheng-Guo Duan; Jian-Kang Zhu; Daisuke Miki
Journal: J Integr Plant Biol Date: 2021-03-16 Impact factor: 7.061

4. Genome-wide identification and analysis of DNA methyltransferase and demethylase gene families in Dendrobium officinale reveal their potential functions in polysaccharide accumulation.

Authors: Zhenming Yu; Guihua Zhang; Jaime A Teixeira da Silva; Mingzhi Li; Conghui Zhao; Chunmei He; Can Si; Mingze Zhang; Jun Duan
Journal: BMC Plant Biol Date: 2021-01-06 Impact factor: 4.215

5. Targeting Induced Local Lesions in the Wheat DEMETER and DRE2 Genes, Responsible for Transcriptional Derepression of Wheat Gluten Proteins in the Developing Endosperm.

Authors: Nuan Wen; Claudia E Osorio; Rhoda A T Brew-Appiah; Jaime H Mejías; Tariq Alam; Samneet Kashyap; Steffen Reinbothe; Christiane Reinbothe; Charles P Moehs; Diter von Wettstein; Sachin Rustgi
Journal: Front Nutr Date: 2022-03-03

6. The maize gene maternal derepression of r1 encodes a DNA glycosylase that demethylates DNA and reduces siRNA expression in the endosperm.

Authors: Jonathan I Gent; Kaitlin M Higgins; Kyle W Swentowsky; Fang-Fang Fu; Yibing Zeng; Dong Won Kim; R Kelly Dawe; Nathan M Springer; Sarah N Anderson
Journal: Plant Cell Date: 2022-09-27 Impact factor: 12.085

7. Loss of Function of the RRMF Domain in OsROS1a Causes Sterility in Rice (Oryza sativa L.).

Authors: Jian-Hong Xu; Faiza Irshad; Yan Yan; Chao Li
Journal: Int J Mol Sci Date: 2022-09-26 Impact factor: 6.208

8. DNA demethylase ROS1 prevents inheritable DREB1A/CBF3 repression by transgene-induced promoter methylation in the Arabidopsis ice1-1 mutant.

Authors: June-Sik Kim; Satoshi Kidokoro; Kazuo Shinozaki; Kazuko Yamaguchi-Shinozaki
Journal: Plant Mol Biol Date: 2020-09-30 Impact factor: 4.076

9. The DME demethylase regulates sporophyte gene expression, cell proliferation, differentiation, and meristem resurrection.

Authors: Seohyun Kim; Jin-Sup Park; Jaehoon Lee; Kiseok Keith Lee; Ok-Sun Park; Hee-Seung Choi; Pil Joon Seo; Hyung-Taeg Cho; Jennifer M Frost; Robert L Fischer; Yeonhee Choi
Journal: Proc Natl Acad Sci U S A Date: 2021-07-20 Impact factor: 11.205

9 in total