Daniel M Messerschmidt1, Barbara B Knowles, Davor Solter. 1. Developmental Epigenetics and Disease, Institute of Molecular and Cell Biology (IMCB), Agency for Science, Technology, and Research (A*STAR), 138673 Singapore,;
Abstract
Methylation of DNA is an essential epigenetic control mechanism in mammals. During embryonic development, cells are directed toward their future lineages, and DNA methylation poses a fundamental epigenetic barrier that guides and restricts differentiation and prevents regression into an undifferentiated state. DNA methylation also plays an important role in sex chromosome dosage compensation, the repression of retrotransposons that threaten genome integrity, the maintenance of genome stability, and the coordinated expression of imprinted genes. However, DNA methylation marks must be globally removed to allow for sexual reproduction and the adoption of the specialized, hypomethylated epigenome of the primordial germ cell and the preimplantation embryo. Recent technological advances in genome-wide DNA methylation analysis and the functional description of novel enzymatic DNA demethylation pathways have provided significant insights into the molecular processes that prepare the mammalian embryo for normal development.
Methylation of DNA is an essential epigenetic control mechanism in mammals. During embryonic development, cells are directed toward their future lineages, and DNA methylation poses a fundamental epigenetic barrier that guides and restricts differentiation and prevents regression into an undifferentiated state. DNA methylation also plays an important role in sex chromosome dosage compensation, the repression of retrotransposons that threaten genome integrity, the maintenance of genome stability, and the coordinated expression of imprinted genes. However, DNA methylation marks must be globally removed to allow for sexual reproduction and the adoption of the specialized, hypomethylated epigenome of the primordial germ cell and the preimplantation embryo. Recent technological advances in genome-wide DNA methylation analysis and the functional description of novel enzymatic DNA demethylation pathways have provided significant insights into the molecular processes that prepare the mammalian embryo for normal development.
Entities:
Keywords:
DNA methylation; development; epigenetics; reprogramming
“When you arrive at a fork in the road, take it.”—Yogi BerraCells in adult higher organisms are either committed to highly specialized tasks or reside in adult stem cell pools with limited potential; while they share the same genetic information, each cell type is defined by a specialized gene expression pattern. During development, the precursors of these cells commit to their fate in a step-by-step differentiation process, which is driven by a multitude of inputs (position, signaling, etc.) and is accompanied by epigenetic measures reinforcing the commitment decisions. These epigenetic changes form barriers to ensure that cell type specification is a one-way street. Such epigenetic modifications are stable and heritable through mitosis, allowing a faithful and directed differentiation process and the propagation of lineage-specific transcription profiles over many cell divisions.However, these epigenetic barriers also pose a major challenge to sexual reproduction, where preparation for the next generation requires a reset of the (epi)genome to a basic, totipotent state. Particularly in mammals (where germ cells are not defined at fertilization but rather arise from later embryonic tissues), resetting the epigenome is of great importance (Hayashi et al. 2007; Strome and Lehmann 2007). The preparation for sexual reproduction is a three-step process consisting of (in chronological order) erasure of somatic signatures in the germ cell precursors (primordial germ cells [PGCs]) via a comprehensive reprogramming process, establishment of sex-specific and germ cell-specific epigenetic signatures and transcription profiles that enable the highly complex and specialized processes of meiotic maturation and fertilization, and finally, the post-fertilization removal of these signatures to trigger the embryonic developmental program and beginning of a new life cycle.Although the basic principles of epigenetic reprogramming in embryos and germ cells have been known and studied for many years, major aspects, including the dynamics of these processes, remain enigmatic. In recent years, however, two main advances have propelled the field rapidly forward. Next-generation sequencing has allowed genome-wide epigenetic analysis of ever decreasing amounts of biological material, giving unprecedented insights into the epigenetic states of germ cells and early embryos at multiple stages (Borgel et al. 2010; Popp et al. 2010; Smallwood et al. 2011; Smith et al. 2012; Seisenberger et al. 2012; Hackett et al. 2013; Kobayashi et al. 2013). Furthermore, a long-standing void in our understanding of epigenetic reprogramming was filled by the discovery of the previously elusive enzymatic activities that initiate active erasure of DNA methylation in germ cells and embryos (Ito et al. 2010; He et al. 2011; Inoue and Zhang 2011; Inoue et al. 2011).At present, our knowledge of epigenetic reprogramming is accumulating at a dizzying pace. Here we give a snapshot of the current understanding of epigenetic dynamics during the mammalian life cycle, with a focus on DNA methylation and genomic imprinting of PGCs and the preimplantation embryo.
The fifth base—5-methylcytosine (5mC) and its functions
The methylation of the fifth carbon of cytosine produces 5mC, often referred to as the “fifth base” of the DNA code. DNA methylation is common in eukaryotes ranging from fungi to vertebrates, although its significance and function in these organisms varies greatly. While undetectable in certain yeast, nematode, and fly species, DNA methylation is vital for mammalian development and adult homeostasis. Here, DNA methylation is most frequently found in a symmetrical CpG context, and although rare non-CpG (CpH) methylation occurs, its function remains unclear (Ramsahoye et al. 2000; Smallwood et al. 2011; Ziller et al. 2011). The first global methylomes showed that CpG methylation follows a bimodal distribution (Fig. 1A). While “single” CpGs are generally hypermethylated (60%–90%, depending on cell type), CpG islands (CGIs; 500- to 2000-base-pair-long regions with increased CpG density) remain predominantly hypomethylated (Deaton and Bird 2011). Given the instability of 5mC due to its propensity to deaminate to thymidine, this bimodal distribution may account for the evolutionary underrepresentation of CpGs in the mammalian genome outside of CGIs and, conversely, its enrichment within unmethylated CGIs (Weber et al. 2007; Cohen et al. 2011).
Figure 1.
DNA methylation in the mammalian genome. (A) Genomic CpG distribution: CGIs are generally hypomethylated and found at promoters or intergenic regions (orphan CpG). Non-CGI CpGs are generally hypermethylated. (B) Three categories of gene promoters according to CpG density respond differently to methylation. (LCP) Low CpG density promoter; (ICP) intermediate CpG density promoter; (HCP) high CpG density promoter. (C) Retrotransposons are repressed by DNA methylation. Derepression can result in coactivation of neighboring genes. (D) Basal transcription of centromeric repeats interferes with chromosome alignment and is repressed by DNA methylation. (E) DNA methylation reinforces gene silencing on the inactivated X chromosome. Transcribed genes on the active X chromosome (and globally) display gene body methylation, possibly repressing spurious expression from cryptic transcription start sites or aiding RNA processing. (F) Genomic imprinting drives allele-specific gene expression. DNA methylation at imprinting control regions (ICRs) regulates binding of insulator proteins or expression of cis-acting, noncoding RNAs.
DNA methylation in the mammalian genome. (A) Genomic CpG distribution: CGIs are generally hypomethylated and found at promoters or intergenic regions (orphan CpG). Non-CGI CpGs are generally hypermethylated. (B) Three categories of gene promoters according to CpG density respond differently to methylation. (LCP) Low CpG density promoter; (ICP) intermediate CpG density promoter; (HCP) high CpG density promoter. (C) Retrotransposons are repressed by DNA methylation. Derepression can result in coactivation of neighboring genes. (D) Basal transcription of centromeric repeats interferes with chromosome alignment and is repressed by DNA methylation. (E) DNA methylation reinforces gene silencing on the inactivated X chromosome. Transcribed genes on the active X chromosome (and globally) display gene body methylation, possibly repressing spurious expression from cryptic transcription start sites or aiding RNA processing. (F) Genomic imprinting drives allele-specific gene expression. DNA methylation at imprinting control regions (ICRs) regulates binding of insulator proteins or expression of cis-acting, noncoding RNAs.
Role of DNA methylation in transcriptional control
Promoters
CGIs are often associated with gene promoters, particularly those of developmental or housekeeping genes, where they remain hypomethylated even if the locus is transcriptionally silent (Weber et al. 2007; Meissner et al. 2008; Deaton and Bird 2011). Intragenic or intergenic CGIs behave like promoter CGIs, yet their function in the genome is not fully understood (Fig. 1A). The hypomethylated state of promoter CGIs is associated with transcription factor binding (which prevents the DNA methylation machinery from targeting these regions) (Brandeis et al. 1994; Lienert et al. 2011; Meissner 2011; Stadler et al. 2011) as well as their specialized, histone 3 Lys4 trimethylation (H3K4me3)-heavy chromatin state (Erfurth et al. 2008; Thomson et al. 2010; Smith and Meissner 2013).Historically, DNA methylation has been linked to transcriptional repression. However, detailed analysis of CGI-associated promoters and their effect on transcriptional activity has allowed for their classification into three categories based on CpG content and sequence length (Fig. 1B; Weber et al. 2007; Meissner et al. 2008). High CpG density promoters (HCPs) are rarely DNA-methylated, which supports the traditional view that DNA methylation represses transcription. In fact, the rare methylation of HCPs does result in efficient gene silencing. Similarly, intermediate CpG density promoters (ICPs) are inactive when methylated (Meissner et al. 2008; Borgel et al. 2010). However, ICPs more frequently acquire differentiation-dependent hypermethylation; i.e., at pluripotency gene promoters, where DNA methylation is thought to act as a safety mechanism to reinforce silencing during differentiation, as well as at promoters of germ cell-specific genes (Maatouk et al. 2006; Weber et al. 2007; Farthing et al. 2008; Meissner et al. 2008; Borgel et al. 2010). In contrast, low CpG density promoters (LCPs), which are generally hypermethylated, remain transcriptionally active regardless of their methylation state (Weber et al. 2007; Meissner et al. 2008).
Enhancers
Transcriptional enhancers physically interact with gene promoters and support tissue-specific differentiation. Like promoters, enhancers have characteristic DNA methylation patterns (Stadler et al. 2011), and hypomethylation is correlated with active gene expression (Carone et al. 2010; Sandovici et al. 2011). Remarkably, such enhancer methylation was shown to be more closely associated with gene expression changes in cancer cells than promoter methylation itself (Aran et al. 2013).
Role of DNA methylation in controlling specialized regions
Transposons
DNA methylation is conserved throughout evolution to repress endogenous transposable elements, which comprise ∼40% of the mammalian genome and thus significantly contribute to the overall hypermethylated state of the genome (Lander et al. 2001; Mouse Genome Sequencing Consortium 2002). In particular, LINEs (long interspersed nuclear elements) and LTR (long terminal repeat)-containing elements carry strong, generally hypermethylated promoters (Fig. 1C). Loss of methylation can cause massive transcriptional activation and, potentially, retrotranspositions (Walsh et al. 1998). Derepression of retrotransposons is also linked to the coactivation of neighboring genes or chimeric transcripts composed of retrotransposable elements and endogenous genes (Peaston et al. 2004). Particularly in oocytes and two-cell embryos, this indirect retrotransposon-mediated gene regulation may be critical for successful development (Peaston et al. 2004; Macfarlan et al. 2012).
Pericentromeric repeats
These elements show latent transcriptional activity, which is repressed by DNA methylation to achieve proper chromosome alignment and segregation (Fig. 1D). The importance of this repression is evident in the rare immunodeficiency, centromeric instability, and facial anomaly (ICF) syndrome, which is caused by missense mutations in the DNA methyltransferase gene DNMT3B. Continued transcription of the pericentromeric repeats causes rearrangements in the vicinity of the centromeres, likely originating from chromosome misalignments during mitosis (Okano et al. 1999; Xu et al. 1999; Bestor 2000; Chen and Li 2004; Jin et al. 2008; Gopalakrishnan et al. 2009).
X-chromosome inactivation
DNA methylation mediates gene dosage control via inactivation of the second X chromosome in females (Fig. 1E). In the case of X-chromosome inactivation, DNA methylation is the outcome of a cascade of events beginning with activation of a cis-acting noncoding RNA, Xist, which coats the X chromosome and therefore triggers displacement of transcription factors, chromatin changes, and, ultimately, CpG methylation at promoter CGIs. Loss of methylation, however, has only moderate effects on X inactivation, suggesting that it functions as long-term insurance rather than an initiator of X inactivation. Studies of DNA methylation in X inactivation led to the interesting observation that active genes display higher methylation levels in gene bodies, which could indicate transcriptional repression from cryptic promoters or merely the greater accessibility of actively transcribed DNA to the methylation machinery (Fig. 1E; Hellman and Chess 2007). Alternatively, DNA methylation at intragenic regions may correlate with RNA processing and alternative splicing, since methylation patterns can mark intron–exon boundaries (Hodges et al. 2009; Anastasiadou et al. 2011).
Imprinted genes
DNA methylation is vital to the controlled expression of imprinted genes (Fig. 1F; Bartolomei 2009; Ferguson-Smith 2011). The hallmark of imprinted genes is their parent-of-origin-specific expression, which is coordinated by differential DNA methylation (on paternal or maternal alleles) at imprinting control regions (ICRs). ICR methylation controls region-specific downstream mechanisms, such as insulator protein binding or antisense noncoding RNA expression. This results in allele-specific expression of gene clusters in which genes are either repressed or activated. Parent-of-origin-specific DNA methylation is introduced during gamete differentiation and maintained throughout life. Genomic imprinting therefore poses the greatest complication to epigenetic reprogramming in the mammalian life cycle, as their methylation patterns must first be erased in PGCs, re-established in an allele-specific manner in gametes, and then preserved during embryonic reprogramming. Genomic imprinting patterns play essential roles in development and growth, and their deregulation impacts grievously on the embryonic phenotype and may well be the chief cause of defects in methylation-deficient mutants.
The machinery
DNA methylation
DNA methyltransferases catalyze the transfer of a methyl group from S-adenosyl-l-methionine to the fifth position of cytosine residues in DNA (Chen and Li 2004). The predominant positioning of 5mC in the symmetrical CpG context led to the early proposal of DNA methylation inheritance through semiconservative DNA replication, which implied two distinct modes of DNA methylation activity: maintenance methylation and de novo methylation (Holliday and Pugh 1975; Riggs 1975).
Methylation maintenance
A requisite for a maintenance mechanism is a high affinity for hemimethylated CpGs (i.e., methylation of the palindromic CpG on only one strand). The first eukaryotic enzyme with DNA methyltransferase activity (DNMT1) (Bestor 1988) displayed this predicted function (Ruchirawat et al. 1987; Hitt et al. 1988; Yoder et al. 1997; Pradhan et al. 1999). Dnmt1 expression is activated by cell cycle-dependent transcription factors in S phase and thus is highly expressed in most mitotic cells (Kishikawa et al. 2003). Attracted by its proliferating cell nuclear antigen (PCNA)-interacting binding partner, NP95, DNMT1 is localized to replication foci, where it restores hemimethylated CpGs to full methylation (Leonhardt et al. 1992; Arand et al. 2012). NP95 specifically attracts DNMT1 to the parental, methylated strand, orienting the enzyme and its activity to the newly synthesized, unmethylated strand (Bostick et al. 2007; Sharif et al. 2007).In mice, the Dnmt1 locus encodes two functionally identical isoforms: a somatic (Dnmt1s) and an oocyte-specific (thus only maternally expressed) isoform (Dnmt1o) (Rouleau et al. 1992; Gaudet et al. 1998; Mertineit et al. 1998; Ding and Chaillet 2002). Deletion of mouse Dnmt1 is lethal at/after gastrulation, the result of a substantial, global loss of DNA methylation (Ruchirawat et al. 1987; Hitt et al. 1988; Lei et al. 1996; Yoder et al. 1997; Kurihara et al. 2008; Arand et al. 2012). Loss of NP95 causes similar defects (Sharif et al. 2007).
De novo methylation
Dnmt1 deletion in mouse embryonic stem cells (mESCs) causes dramatic DNA hypomethylation but not a complete loss, predicting the presence of further enzymes with methyltransferase activity. Two additional enzymes, DNMT3A and DNMT3B, were identified and shown to act as de novo DNA methyltransferases (Okano et al. 1998a). Dnmt3a is maternally provided and predominates in oocytes and early preimplantation embryos. DNMT3A establishes the differential DNA methylation patterns at ICRs in male and female gametes (Kaneda et al. 2004; Kato et al. 2007). Dnmt3b is transcribed upon zygotic gene activation (ZGA) and is robustly expressed by the blastocyst stage, at which time its presence is predominantly restricted to the epiblast lineage (Watanabe et al. 2002). Deletion of Dnmt3b causes embryonic lethality, while Dnmt3a knockouts are partially viable (Okano et al. 1999). Combined genetic deletion results in earlier embryonic lethality, indicating at least partial functional redundancy of the two enzymes (Okano et al. 1999).A third de novo DNA methyltransferase, DNMT3L, lacks the characteristic N-terminal catalytic domain but is nonetheless required for DNA methylation, predominantly establishing ICR methylation in gametes (Bourc’his et al. 2001; Bourc’his and Bestor 2004). DNMT3L is a crucial activating cofactor of DNMT3A/B, explaining its methylation effects despite the lack of inherent catalytic activity (Chedin et al. 2002; Gowher et al. 2005; Jia et al. 2007). Dnmt3l knockout mice are viable, but the lack of de novo methylation in the germline causes sterility in males and embryonic lethality of maternal null-derived embryos (Bourc’his et al. 2001; Bourc’his and Bestor 2004).Last, DNMT2 differs structurally from other DNA methyltransferases, and knockout mice display no phenotype (Okano et al. 1998b). Indeed, DNMT2 is a misnomer, as it shows methylation activity toward RNA (Goll et al. 2006).
DNA demethylation
In contrast to the enzymatically controlled, straightforward methylating mechanism, a direct DNA demethylase capable of breaking carbon–carbon bonds has not yet been identified. Instead, several alternative, active demethylation mechanisms that are passive and indirect have been proposed and demonstrated (Wu and Zhang 2014, 2010).
Passive, replication-dependent dilution
Loss of 5mC in mitotic cells can be achieved by down-regulation or exclusion of the DNA methylation maintenance machinery (DNMT1 or its recruitment factors, such as NP95) from the nucleus (Fig. 2A). Although allowing efficient global DNA demethylation, the passive mechanism depends on repeated DNA replication and therefore cannot account for the rapid loss of DNA methylation in slowly or nondividing cells. Furthermore, this mechanism does not allow locus-specific, but only global, removal of DNA methylation marks.
Figure 2.
DNA methylation and demethylation mechanisms. (A) The palindromic nature of the CpG DNA replication creates hemimethylated DNA. DNMT1 restores hemimethylated DNA to full methylation. Absence of the DNMT1 machinery produces unmethylated DNA after a subsequent round of cell division. (Black filled circles) Methylated CpG; (white filled circles) unmethylated CpG. (B) Possible active demethylation pathways: A direct demethylase converting 5mC to cytosine is, to date, speculative. TET enzymes oxidize 5mC to 5hmC, 5fC, and 5caC. Deamination of 5mC and 5hmC, potentially by AID, produces thymidine or hydroxymethyluracil, respectively. The base excision repair (BER) mechanism may target AID deamination products and possibly 5fC and 5caC directly. Direct deformylation or decarboxylation of 5fC and 5caC has been proposed but remains speculative.
DNA methylation and demethylation mechanisms. (A) The palindromic nature of the CpG DNA replication creates hemimethylated DNA. DNMT1 restores hemimethylated DNA to full methylation. Absence of the DNMT1 machinery produces unmethylated DNA after a subsequent round of cell division. (Black filled circles) Methylated CpG; (white filled circles) unmethylated CpG. (B) Possible active demethylation pathways: A direct demethylase converting 5mC to cytosine is, to date, speculative. TET enzymes oxidize 5mC to 5hmC, 5fC, and 5caC. Deamination of 5mC and 5hmC, potentially by AID, produces thymidine or hydroxymethyluracil, respectively. The base excision repair (BER) mechanism may target AID deamination products and possibly 5fC and 5caC directly. Direct deformylation or decarboxylation of 5fC and 5caC has been proposed but remains speculative.
Active demethylation
An indirect, enzyme-catalyzed DNA demethylation mechanism could involve the deamination of 5mC to thymidine by the activation-induced deaminase (AID) or apolipoprotein B mRNA-editing enzyme catalytic polypeptide 1 (APOBEC1) (Fig. 2B). Deamination of 5mC creates T:G mismatches, recognized by thymine–DNA glycosylase (TDG) or methyl-CpG-binding domain protein 4 (MBD4), which catalyze removal of the thymine base. Abasic sites then trigger the base pair excision repair (BER) machinery to reinstate an unmodified cytosine, effectively resulting in removal of the methyl mark (Fig. 2B; Morgan et al. 2004). The extent of AID-triggered DNA demethylation mechanisms required for the reprogramming in PGCs and early embryos is a matter of debate and is described in detail below.
The existence of a ‘sixth base’
5-hydroxymethylcytosine (5hmC) and the recent discovery of the ten-eleven translocation (TET) family of dioxygenases (Tahiliani et al. 2009) imply new possibilities of active DNA demethylation. TET enzymes catalyze the iterative oxidation of 5mC to 5hmC and further to 5-formylcytosine (5fC) and 5-carboxycytosine (5caC) (Fig. 2B; He et al. 2011; Inoue et al. 2011; Ito et al. 2011). All three derivatives are found in cells displaying TET activity, yet their biological significance is unresolved. Given the existence of a decarboxylating enzyme in the pyrimidine salvage pathway, a similar enzyme, elusive to date, could directly restore cytosine from 5caC (Ito et al. 2010). Alternatively, TDG, MBD4, or a yet unidentified glycosylase may remove oxidized methyl-cytosine derivatives, with the resulting abasic site triggering BER. As recent observations suggest, TDG has direct activity on 5fC and 5caC without requiring a previous deamination step (Fig. 2B; He et al. 2011; Shen et al. 2013).5hmC may also aid passive DNA demethylation even in the presence of DNMT1, which itself has low affinity for 5hmC hemimethylated DNA (Valinluck and Sowers 2007). Conversely, NP95 interacts with both 5mC and 5hmC. Further studies are required to define this particular mechanism. The observed, divergent affinities of many interaction partners to 5mC and 5hmC (Valinluck and Sowers 2007; Liu et al. 2012) may indicate other functions of 5hmC; i.e., altering the local chromatin environment by recruitment and/or displacement of methylation- or hydroxymethylation-dependent binding proteins.TET-catalyzed conversion of 5mC to 5hmC is a plausible mechanism to allow cell cycle-independent removal of DNA methylation. In mice, the TET family comprises three members (Tet1–3), which are differentially expressed during development and in adult tissues (Tahiliani et al. 2009). Tet1 and Tet2 are detected in ESCs and PGCs. Tet2 expression is essential in hematopoietic cells; knockouts are viable but develop severe, lethal hematopoietic malignancies at 4–6 mo of age (Li et al. 2011; Moran-Crusio et al. 2011; Quivoron et al. 2011; Koh and Rao 2013). Tet1 knockout mice are viable, displaying somewhat reduced body size, suggesting potential developmental delay, and both knockout males and females are subfertile (Dawlaty et al. 2011; Yamaguchi et al. 2012, 2014). Given their partially overlapping expression patterns, a certain degree of functional redundancy of Tet1 and Tet2 has been suggested. Indeed, double-knockout embryos show more pronounced developmental defects, yet loss of both Tet1 and Tet2 can still be compatible with normal development (Dawlaty et al. 2013). Tet3 expression patterns share little overlap with Tet1/2 expression and are found to be highly expressed in oocytes, spermatozoa, and at early, preimplantation embryonic stages. Maternal loss of Tet3 can cause developmental arrest in a subset of embryos (Gu et al. 2011).
Epigenetic reprogramming in PGCs: the true blank slate
PGCs are either defined by the inherent germplasm in the oocyte (flies, nematodes, frogs, and fish) or are derived from somatic precursors in the embryo by instructive signals (mammals) (Fig. 3; Strome and Lehmann 2007; Saitou and Yamaji 2012). In mice, a small group of PGCs originates from the proximal embryonic ectoderm at embryonic days 6.5–7.5 (E6.5–E7.5) upon receiving instructive signals (the earliest being BMP4 signaling) from the visceral endoderm and the surrounding embryonic ectoderm itself (McLaren and Lawson 2005; Hayashi et al. 2007; Seki et al. 2007; Saitou and Yamaji 2012). As the epiblast rapidly adopts somatic epigenetic features after implantation, the DNA methylation levels and patterns of the embryonic ectoderm in the E6.5 mouse embryo are more closely related to somatic tissues than to the inner cell mass of the blastocyst (Borgel et al. 2010; Popp et al. 2010). In particular, pluripotency genes (i.e., Oct4 and Nanog) and CGI-associated germline-specific genes are tightly repressed by DNA methylation, preventing any deleterious effects that could occur if they were ectopically activated (Maatouk et al. 2006; Shen et al. 2007; Weber et al. 2007; Straussman et al. 2009; Borgel et al. 2010; Arand et al. 2012; Seisenberger et al. 2012; ). Thus, PGCs at this point display the transcription profiles, chromatin modifications, and DNA methylation levels and patterns characteristic of their somatic origin (Ohinata et al. 2005; Maatouk et al. 2006; Hajkova et al. 2010; Popp et al. 2010; Seisenberger et al. 2012).
Figure 3.
Biphasic demethylation dynamics in mouse PGCs. PGCs are derived from the embryonic ectoderm of the E6.5 embryo and display high (somatic) 5mC levels (green lines) and low 5hmC levels (red lines). Upon migration, PGCs proliferate, and 5mC levels are passively diluted. Coincidently, hemimethylated DNA strands accumulate transiently and are subsequently lost (purple dashed line). Post-migratory PGCs enter a phase of active DNA demethylation, resulting in an almost complete loss of 5mC and a transient enrichment of 5hmC. At E13.5, both 5mC and 5hmC levels are low.
Biphasic demethylation dynamics in mouse PGCs. PGCs are derived from the embryonic ectoderm of the E6.5 embryo and display high (somatic) 5mC levels (green lines) and low 5hmC levels (red lines). Upon migration, PGCs proliferate, and 5mC levels are passively diluted. Coincidently, hemimethylated DNA strands accumulate transiently and are subsequently lost (purple dashed line). Post-migratory PGCs enter a phase of active DNA demethylation, resulting in an almost complete loss of 5mC and a transient enrichment of 5hmC. At E13.5, both 5mC and 5hmC levels are low.The major task of epigenetic reprogramming to totipotency that takes place in early PGCs is accomplished by an elaborate transcriptional program driving PGCs away from differentiation and forcing the intimately linked re-expression of key pluripotency markers (for review, see Hayashi et al. 2007; Saitou and Yamaji 2012). During this process, PGCs proliferate and migrate from the proximal epiblast along the hindgut to the genital ridges, where their genomes display very low levels of global DNA methylation (E12.5–E13.5) (Fig. 3; Popp et al. 2010; Guibert and Weber 2012; Seisenberger et al. 2012). This reprogramming process also reverses other somatic epigenetic features: Parental imprints are erased (Hajkova et al. 2002; Sasaki and Matsui 2008), and the silenced X chromosome is reactivated in PGCs of female mouse embryos (Chuva de Sousa Lopes et al. 2008).
DNA demethylation mechanisms in PGCs
It was initially speculated that the proliferation rate of PGCs is insufficient to allow passive demethylation (Fig. 2A), and all active DNA demethylation mechanisms (Fig. 2B) were proposed and explored. However, a reliable, robust, genome-wide removal of nearly all 5mCs by active means seems energetically unlikely and, given the required double-strand breaks involved (if BER-mediated), highly risky. Indeed, recent studies indicate that passive dilution of methylation might well be sufficient for global demethylation, providing an energetically more plausible, robust mechanism. With the help of genome-wide analysis methods applicable to small cell numbers, a combinatorial, almost sequential, and perhaps context-dependent process of active and passive DNA demethylation is emerging (Fig. 3).
Passive DNA demethylation in PGCs
The silencing of key genes of the DNA methylation machinery in PGCs could facilitate passive DNA demethylation. By E9.5, de novo methyltransferases (Dnmt3a/b) are repressed and remain virtually absent throughout the reprogramming period (Seki et al. 2005; Yabuta et al. 2006; Kurimoto et al. 2008; Seisenberger et al. 2012; Kagiwada et al. 2013). In addition, although the maintenance methyltransferase Dnmt1 remains expressed and the protein remains abundant, its essential cofactor, Np95, is repressed at E9.5, and NP95 protein is excluded from the nucleus (Kurimoto et al. 2008; Seisenberger et al. 2012; Kagiwada et al. 2013). Consequently, DNMT1 staining is strongly reduced at S-phase replication foci, suggesting poor methylation maintenance in PGCs. It is only after E12.5 that NP95 protein levels increase slightly, and DNMT1 localization to replication foci is slowly re-established (Kagiwada et al. 2013).The majority of PGCs are arrested in the G2 phase of the cell cycle, while they migrate toward the hindgut (E7.5–E8.5), but proliferation rapidly ensues at E9.5 (Fig. 3; Seki et al. 2007). In contrast to early data suggesting a 16-h cell cycle in PGCs (Tam and Snow 1981) recent BrdU pulse-chase experiments describe faster (∼12 h per cycle), exponential proliferation until at least E12.5, significantly increasing chances for passive demethylation (Kagiwada et al. 2013). This replication-dependent DNA demethylation creates hemimethylated DNA strands (Fig. 2A), which can indeed be detected by hairpin bisulfite sequencing (Arand et al. 2012), with increased incidence in E9.5–10.5 PGCs (Fig. 3, pink dashed line; Seisenberger et al. 2012).Bisulfite conversion (BSC) and sequencing of PGC-derived DNA revealed that demethylation indeed affects the global genome, including promoters, CGIs, introns, exons, and intergenic regions. However, a subset of sequences, including imprinted genes, promoters of genes necessary for germ cell formation and meiosis, and CGIs of the inactive X chromosome in females, appears to follow slower kinetics and become fully hypomethylated only after a second wave of demethylation (Fig. 3; Seisenberger et al. 2012). The observed late reprogramming of differentially methylated regions (DMRs) in PGCs is in agreement with some studies showing erasure of imprints after E11.5, when PGCs enter the gonads (Hajkova et al. 2002; Guibert and Weber 2012), yet is in conflict with other studies showing early DMR demethylation, with kinetics paralleling global demethylation (Lee et al. 2002; Kagiwada et al. 2013).
Active DNA demethylation in PGCs
The two-phase demethylation process in PGCs (Guibert and Weber 2012; Seisenberger et al. 2012; Hackett et al. 2013; Yamaguchi et al. 2013) raises the question as to the molecular mechanisms involved in this distinction. It is likely that specific sequences are protected from passive demethylation in early PGCs but are then demethylated by either passive, active, or a combination of both means at later stages. The prevailing expression and nuclear presence of DNMT1 but reduction and nuclear exclusion of NP95 could indicate a noncanonical DNA methylation maintenance mechanism, possibly paralleling methylation protection of imprinted gene loci in the early embryo (see below).The early assumptions of faster PGC proliferation kinetics have led to an intense investigation of active DNA demethylation mechanisms in these cells (Hajkova et al. 2008, 2010; Feng et al. 2010; Popp et al. 2010; Wu and Zhang 2010; Cortellino et al. 2011; Dawlaty et al. 2011, 2013; Yamaguchi et al. 2012, 2013, 2014; Hackett et al. 2013; Vincent et al. 2013). AID is expressed in E12.5 PGCs (Morgan et al. 2004) and linked to active DNA demethylation. Aid-deficient E13.5 PGCs display a very mild increase in DNA methylation levels over their wild-type counterparts; however, they also undergo evident global DNA demethylation between E8.5 and E13.5 (Popp et al. 2010). General demethylation processes in PGCs are thus either AID-independent or compensated for by other deaminases, possibly APOBEC proteins. However, loss of either APOBEC1, APOBEC2/3, or AID does not severely impact fertility (Muramatsu et al. 2000; Mikl et al. 2005; Popp et al. 2010). AID activity on 5mC is also much lower than on unmethylated cytosine in vitro (Neuberger and Rada 2007), and Aid and Apobec1 expression appears to be very low or absent throughout the PGC reprogramming period (Kagiwada et al. 2013), calling into question whether or to what extent AID and/or other deaminases are involved in PGC reprogramming. Locus-specific re-examination of Aid-deficient PGCs, particularly of imprinted gene loci, using refined analysis methods should resolve these uncertainties.Judging by mRNA expression levels, Tdg is expressed in PGCs (Kagiwada et al. 2013). However, the protein was not detected by immunofluorescence (IF) (E10.5–13.5) (Hajkova 2010). The biallelic methylation of the Igf2r imprinted region can be detected in Tdg mutant PGCs, suggesting that TDG activity is at least required for maintenance of the unmethylated allele at this locus. However, it is likely that the hypermethylation occurs prior to PGC specification, as de novo DNA methylation is repressed in PGCs (Cortellino et al. 2011). Components of the BER pathway were also detected in PGCs (Hajkova et al. 2002), perhaps a testimony to DNA repair after AID/TDG or even TET activity.Given the questionable expression of Aid but continued expression of Tet enzymes and Tdg, sequential oxidation of 5mC to 5fC or 5caC, which can be direct targets of TDG, appears to be the most likely scenario for an active demethylation mechanism in PGCs. Preliminary efforts failed to detect drastic changes of 5fC/5caC levels throughout the reprogramming period by IF (Hackett et al. 2013; Yamaguchi et al. 2013). However, until the importance of each of the 5mC oxidation derivatives is thoroughly addressed in vivo, these questions will remain unanswered.Two recent studies addressed the distribution and dynamics of 5hmC in PGCs, adding another layer of complexity to our understanding of the demethylation process (Hackett et al. 2013; Yamaguchi et al. 2013). These findings show a temporal increase of 5hmC in E9.5/E10.5 PGCs, accompanied by decreasing 5mC levels followed by a reduction of 5hmC levels at E11.5/E12.5 (Fig. 3, red and green lines). Combined knockdown of Tet1 and Tet2 revealed slightly elevated 5mC levels in in vitro generated PGC-like cells, yet in vitro PGC specification itself was unaffected (Hackett et al. 2013).Detailed analysis of a Tet1 knockout mouse strain revealed a reduction in female germ cells and ovary size. This phenotype is most likely the consequence of the reduced expression of genes affecting meiosis, which retain uncommonly high promoter methylation levels in mutant gametes (Yamaguchi et al. 2012). More recently, TET1 was also shown to be required for the efficient erasure of imprints in the paternal germline (Yamaguchi et al. 2014). However, the low penetrance of the phenotype suggests a redundant function and partial rescue of the phenotype with/by TET2. Tet1/2 double-knockout PGCs completely lack 5hmC (as detected by IF). Interestingly, global 5mC levels are not increased in these cells (also detected by IF), and TET1/2 loss can be compatible with development, with both mutant males and females maintaining fertility (Dawlaty et al. 2013). However, the majority of Tet1/2-null mutants die during embryogenesis or shortly after birth due to various and variable developmental abnormalities (Dawlaty et al. 2013). Crucially, progeny of Tet1/2-null males and females display hypermethylation at several imprinted gene regions, supporting a role for 5hmC removal of imprints in PGCs, probably during the second demethylation wave (Fig. 3). However, these defects are variable, and even in the absence of TET1/2, normal imprint erasure can occur in a fraction of the progeny (Dawlaty et al. 2013), likely by passive means (Lee et al. 2002; Kagiwada et al. 2013). A caveat to these studies is they were performed in mixed genetic backgrounds, which may itself explain the observed phenotypic variability (Dawlaty et al. 2013).
Germ cell specialization and sex-specific remethylation
Sex-specific differentiation of the mouse embryo ensues from E12.5 onward (McLaren 1984; Saitou and Yamaji 2012). The establishment of germ cell-specific methylation marks after PGC reprogramming occurs at different times and in different cellular environments in males and females, ultimately leading to sperm- and oocyte-specific patterns (Sasaki and Matsui 2008; Saitou and Yamaji 2012). In female germ cells, DNA methylation levels remain low at E16.5. In contrast, by the same time, male PGCs have already attained 50% global DNA methylation, as remethylation initiates as early as E14.5 in prospermatogonia, which are arrested in the G1 phase of mitosis (Kota and Feil 2010). The methylation patterns in male germ cells are fully established at birth (days 19–21) and are then maintained throughout many cycles of mitotic divisions before the cells enter meiosis (Davis et al. 2000; Henckel et al. 2009). The chances of accumulating DNA methylation errors and the propagation of mutations occurring due to spontaneous deamination of 5hmC during this extended period of replication and cell division are much greater in males than in females. In females, remethylation of gametes initiates only at birth during the oocyte growth phase, while the eggs are arrested in prophase of meiosis I (Kota and Feil 2010).Despite divergent development and remethylation dynamics, DNMT3L and DNMT3A are required to establish the sex-specific imprints (Bourc’his et al. 2001; Hata et al. 2002; Bourc’his and Bestor 2004; Kaneda et al. 2004; Kato et al. 2007). Loss of DNMT3A or DNMT3L in males causes catastrophic meiotic arrest in spermatogonia, up-regulation of IAPs and LINE elements, and apoptotic loss of spermatocytes (Walsh et al. 1998; Bourc’his and Bestor 2004). On the other hand, de novo methylation-impaired oocytes can give rise to embryos that die at E10.5 due to the lack of imprinting control (Bourc’his et al. 2001).Although the central players of ICR methylation have been identified in the differentiating germ cells, it is still unclear how they are targeted to imprinted regions. Several suggestions, such as sequence specificity, underlying chromatin structure and histone modifications, or traversing transcription, have been made and reviewed in excellent studies elsewhere (Smallwood and Kelsey 2012; Strogantsev and Ferguson-Smith 2012; Kelsey and Feil 2013). Recent efforts to decipher oocyte-specific, sperm-specific, and early embryo-specific methylomes, however, have cast a different light on imprinting-specific, DNA methylation targeting mechanisms (Smallwood et al. 2011). In addition to maternal germline ICRs, which are structurally CGIs, more than a thousand nonimprinted CGIs are also differentially methylated in oocytes compared with sperm. However, during post-fertilization reprogramming (see below), <15% of these actually retain their (differential) DNA methylation patterns, inclusive of all maternally imprinted regions (Smallwood et al. 2011). Therefore, a new concept suggesting that maternally methylated CGIs are not targeted by the underlying sequence but rather by histone modifications defined by active transcription enabling access to de novo methylation has emerged (Kelsey and Feil 2013).
Epigenetic reprogramming in preimplantation embryos: selective DNA methylation maintenance
The second wave of global epigenetic reprogramming occurs during early embryogenesis and is crucial to establishing pluripotency. The newly formed embryo undergoes massive, global DNA demethylation such that by the time the early blastocyst stage (32–64 cells) is reached, methylation levels are at their lowest (Fig. 4A). However, the process in embryos differs from that in PGCs. First, demethylation is close to absolute in PGCs, with the exception of a few resistant retroelements, while DNA methylation of imprinted gene regions is preserved in embryos, enabling parent-of-origin-specific gene expression in later tissues. Also, the imprinted paternal X inactivation found in early mouse embryos is not reversed until the late epiblast stage. Second, the genome of the zygote (which contains haploid contributions from the oocyte and sperm genome, each with their own specific chromatin properties) follows different DNA demethylation kinetics after fertilization (Fig. 4A; Mayer et al. 2000; Oswald et al. 2000; Santos et al. 2002; Santos and Dean 2004).
Figure 4.
DNA demethylation dynamics and imprinting maintenance in preimplantation embryos. (A) Distinct characteristics of maternal and paternal genomes impose an epigenetic asymmetry in the zygote. The maternal genome (red pronucleus; red line) undergoes passive DNA demethylation throughout several rounds of DNA replication. The paternal genome (blue pronucleus; blue lines) undergoes active demethylation before DNA replication in the zygote ensues. Concomitant with global loss of paternal 5mC, 5hmC (blue dotted line) and the further oxidation derivatives (5fC and 5caC; blue dashed line) are enriched. Although selected loci are restored to unmodified cytosines, the bulk of paternal 5hmC is passively diluted, paralleling demethylation of the maternal genome. (B) In the zygote, STELLA prevents TET3-dependent oxidation of 5mC through binding to H3K9me2-marked chromatin (maternal genome and paternally imprinted regions) and subsequent active restoration of cytosine by BER (or other pathways). (C) Throughout early cleavage stages, DNMT1 is largely excluded from the nucleus and requires noncanonical targeting to imprinted regions by the ZFP57/TRIM28 complex binding to its methylated consensus sequence found at most ICRs. Nonimprinted regions are efficiently demethylated through replication, while ICRs are maintained by DNMT1 and DNMT3A/B. (D) At later stages of embryogenesis and in adult tissues, high DNMT1/NP95 levels during replication maintain DNA methylation by targeting hemimethylated DNA in a canonical fashion.
DNA demethylation dynamics and imprinting maintenance in preimplantation embryos. (A) Distinct characteristics of maternal and paternal genomes impose an epigenetic asymmetry in the zygote. The maternal genome (red pronucleus; red line) undergoes passive DNA demethylation throughout several rounds of DNA replication. The paternal genome (blue pronucleus; blue lines) undergoes active demethylation before DNA replication in the zygote ensues. Concomitant with global loss of paternal 5mC, 5hmC (blue dotted line) and the further oxidation derivatives (5fC and 5caC; blue dashed line) are enriched. Although selected loci are restored to unmodified cytosines, the bulk of paternal 5hmC is passively diluted, paralleling demethylation of the maternal genome. (B) In the zygote, STELLA prevents TET3-dependent oxidation of 5mC through binding to H3K9me2-marked chromatin (maternal genome and paternally imprinted regions) and subsequent active restoration of cytosine by BER (or other pathways). (C) Throughout early cleavage stages, DNMT1 is largely excluded from the nucleus and requires noncanonical targeting to imprinted regions by the ZFP57/TRIM28 complex binding to its methylated consensus sequence found at most ICRs. Nonimprinted regions are efficiently demethylated through replication, while ICRs are maintained by DNMT1 and DNMT3A/B. (D) At later stages of embryogenesis and in adult tissues, high DNMT1/NP95 levels during replication maintain DNA methylation by targeting hemimethylated DNA in a canonical fashion.
Active DNA demethylation of the paternal genome
The mature sperm genome shows 80%–90% overall CpG methylation, the highest global DNA methylation level of any cell in the mouse (Popp et al. 2010), yet the paternal genome is apparently completely demethylated shortly after zygote formation (Fig. 4A, blue line; Mayer et al. 2000; Oswald et al. 2000). This loss must be due to an active demethylation mechanism, as it is completed before the onset of DNA replication at the pronuclear stage 3 (PN3). Conversely, the maternal genome shows lower global methylation levels (∼40%) and undergoes replication-dependent demethylation (Fig. 4A, red line), thereby establishing a significant epigenetic asymmetry in the early embryo (Fig. 4; Mayer et al. 2000; Oswald et al. 2000; Santos et al. 2002).Conflicting observations made by IF analysis in zygotes and DNA sequencing after BSC created a conundrum regarding DNA demethylation and reprogramming. While the loss of 5mC in the paternal pronucleus was evident by IF, BSC analysis did not wholly support this observation (Hajkova et al. 2008; Wossidlo et al. 2010). It was only after the description of 5hmC, the oxidation product of 5mC by TET enzymes, that a unifying explanation came to hand (Gu et al. 2011; Iqbal et al. 2011; Wossidlo et al. 2011). BSC cannot distinguish between 5mC and 5hmC, whereas oxidation of 5mC removes the epitope recognized in IF.Indeed, TET3 specifically localizes to the paternal pronucleus (Gu et al. 2011), where it was shown to be responsible for 5mC to 5hmC conversion (Fig. 4A, dotted blue line). In its absence, 5hmC is not detectable (Gu et al. 2011; Wossidlo et al. 2011), thus excluding functional redundancy with other TET proteins in the zygote. Lack of TET3 may result in the delayed activation of paternal alleles of genes required for embryonic development and establishment of the pluripotent epiblast (e.g., Nanog and Oct4), potentially causing the reduced fecundity and partial developmental failure of maternal TET3-null offspring (Gu et al. 2011).
The fate of paternal 5hmC
The bulk of 5mCs in the paternal genome is hydroxylated in the late zygote, but only a few regions were shown to completely revert to unmodified cytosine before the first cleavage division. BER activity could account for this loss of 5mC and the hypomethylation of the Nanog and Oct4 promoters (Gu et al. 2011). Several components of the BER pathway are specifically localized to the paternal pronucleus; i.e., XRCC1 (X-ray repair cross-complementing protein 1) tightly binds to paternal, but not maternal, DNA (Hajkova et al. 2010; Wossidlo et al. 2010). Concomitant with 5hmC, higher oxidation products of 5mC (5fC and 5caC), which can be direct targets for the TDG/BER pathway (He et al. 2011; Maiti and Drohat 2011), were also observed in the zygote (Fig. 4A, dotted/dashed blue lines; Inoue et al. 2011). However, this route of demethylation requires further investigation, since TDG was not detected in zygotes (Hajkova 2010). Certainly, other enzymes with similar activity could perform glycosylation. Alternatively, direct decarboxylation of 5caC has been described in other systems (Schiesser et al. 2012).However, there is also compelling evidence that paternal 5mC is actively converted to 5hmC (and possibly further to 5fC and 5caC), which then undergoes replication-mediated dilution throughout subsequent cleavage divisions (Fig. 4A, dotted/dashed blue lines; Inoue and Zhang 2011; Inoue et al. 2011). Fittingly, the maintenance methyltransferase DNMT1 shows very limited affinity to oxidized 5mC derivatives (Hashimoto et al. 2012) and is generally excluded from the nucleus of preimplantation embryos (Howell et al. 2001; Hirasawa et al. 2008).
Distinction of parental pronuclei
How is TET3 targeted to the paternal genome without affecting the maternal genome? Prior to description of 5hmC dynamics in the zygote, the protein STELLA (Payer et al. 2003) was found to convey specificity to the, then elusive, active demethylation machinery in the early zygote (Fig. 4B; Nakamura et al. 2006). In its absence, loss of 5mC is observed in both pronuclei, accompanied by 5hmC accumulation in the maternal pronucleus (Wossidlo et al. 2011). Thus, the divergent demethylation dynamics in maternal and paternal genomes in the zygote are the consequence of specific protection of the maternal genome from TET3-mediated 5mC oxidation by STELLA. Although STELLA appeared to localize to both pronuclei in the zygote, its binding to the paternal genome is weak (Nakamura et al. 2012). The protective function is specifically mediated by STELLA’s interaction with dimethylated histone H3 Lys9-marked chromatin (H3K9me2), which is enriched in the maternal but not paternal pronucleus (Santos et al. 2005). This interaction of STELLA and H3K9me2 nucleosomes alters chromatin configuration, preventing TET3 binding and activity (Nakamura et al. 2012). Aside from its global function in the maternal genome, STELLA also protects at least two paternally methylated, imprinted gene loci (Rasgrf1 and H19, but not the IG-DMR) from aberrant demethylation (Nakamura et al. 2006). These loci retain H3K9me2-marked chromatin during spermatogenesis and protamine exchange, which mediates their protection after fertilization (Nakamura et al. 2012). However, not all paternally or maternally imprinted regions are affected equally, and possibly other mechanisms (described below) act in partial redundancy with STELLA (Messerschmidt 2012). Remarkably, in the absence of STELLA, the loss of DNA methylation at imprinted gene loci in the zygote is complete (Nakamura et al. 2006). Thus, at least the imprinted gene regions undergo active removal of 5hmC or rapidly iterated oxidations to 5caC by TET3 (Wu and Zhang 2010). Interestingly, the BER component XRCC1, which ordinarily exclusively localizes to the paternal genome, is found in both pronuclei in STELLA mutants (Hajkova 2010).Embryos lacking STELLA display severe phenotypes, rarely developing past the four-cell stage, with few embryos surviving to birth. Remarkably, though, STELLA-deficient oocytes show neither methylation nor developmental defects prior to fertilization (Nakamura et al. 2006). This is particularly intriguing, as TET3 is present in the oocyte but only affects (in the absence of STELLA) the maternal genome after fertilization, in the zygotic context (Nakamura et al. 2006; Gu et al. 2011). The interplay of these two antagonistic factors must be addressed in more detail to define whether TET3 is merely prevented from acting on the maternal genome or must be molecularly targeted to its paternal substrates.
Is active DNA demethylation of the paternal genome required?
It is undisputed that global demethylation in the early mouse embryo is required to impose an open, totipotent or pluripotent state in the forming epiblast. However, it is not clear why only the paternal genome is targeted for active demethylation or, indeed, whether active demethylation is at all required. Although TET3-mediated demethylation of Nanog and Oct4 promoters was linked to embryo viability, loss of TET3 is nonetheless compatible with normal development (Gu et al. 2011). In fact, embryos derived from round spermatid-injected oocytes (containing histone-bound paternal DNA, which is not actively demethylated in the zygote) can develop into viable pups (Polanski et al. 2008). In other mammalian species, active demethylation of the paternal genome is followed by immediate de novo remethylation before parallel, passive demethylation of the maternal and paternal genomes occurs (Fulka et al. 2004; Park et al. 2007; Abdalla et al. 2009). Thus, while active demethylation of the paternal genome is beneficial, perhaps it simply provides an additional measure to ensure efficient reprogramming. Another hypothesis is that the oocyte is programmed to remove distinguishing paternal epigenetic features, which may provide a developmental advantage to any individual embryo. It is in the “interest of the mother” to distribute resources equally among her progeny, necessitating the removal of any unique paternal epigenetic marks that would favor a particular embryo (Moore and Haig 1991). More detailed investigations addressing 5hmC and locus-specific demethylation in round spermatid injection-derived embryos, coupled with a base-resolution view of the dynamics of 5mC and its oxidation products, are required to deepen our understanding of these processes.
Passive DNA demethylation and maintenance of parental imprints
The maternal genome is, at least globally, resistant to hydroxylation by TET3 yet nonetheless loses the bulk of its oocyte-specific DNA methylation pattern by replication-mediated 5mC dilution during preimplantation development (Fig. 4A, red line). Here, passive loss of 5mC is achieved by nuclear exclusion of DNMT1 (Howell et al. 2001; Ratnam et al. 2002; Branco et al. 2008; Hirasawa et al. 2008) rather than by down-regulation of NP95, as proposed in PGCs (Kagiwada et al. 2013).Nuclear exclusion of DNMT1, however, poses a problem for the required maintenance of genomic imprints and other sequences that must retain their DNA methylation patterns throughout development. Although zygotic deletion of DNMT1 was known to cause massive loss of DNA methylation both globally and at imprinted gene loci and to be embryonic-lethal (Li et al. 1992, 1993), the role of DNMT1 in early preimplantation embryos was only recently resolved. The oocyte-specific form of DNMT1 (DNMT1o) was shown to be required for imprint maintenance only during one cell cycle at the eight-cell stage, where DNMT1o was found to transiently translocate into the nucleus (Carlson et al. 1992; Howell et al. 2001; Ratnam et al. 2002). Because the somatic form of DNMT1 (DNMT1s) was not detected until the blastocyst stage, the maintenance of imprints during early cleavage divisions was attributed to unidentified DNA methyltransferases. Re-examination of DNMT3A, DNMT3B, and DNMT1o/s expression and function finally resolved the issue by first excluding the de novo DNA methyltransferases from general imprinting maintenance in the embryo (Hirasawa et al. 2008). Complete (maternal and zygotic) knockout of DNMT1, however, completely abolished DNA methylation at all imprinted gene loci (Cirio et al. 2008; Hirasawa et al. 2008; Kurihara et al. 2008). Thus, both DNMT1o and DNMT1s contribute to DNA methylation maintenance at imprinted regions even though the vast amount of protein is excluded from the nucleus (Branco et al. 2008).
Noncanonical targeting of DNMT1 to imprinted regions in preimplantation embryos
DNMT1 is required to maintain imprints, yet, at the same time, its nuclear protein levels are drastically reduced to allow global demethylation. How is DNMT1 targeted to imprinted gene loci? STELLA is required for imprinting maintenance. However, STELLA’s global binding and protection of the whole maternal genome from active demethylation makes it an unlikely candidate for DNA methylation maintenance at specific loci. ZFP57, a Krueppel-associated box (KRAB) domain zinc finger protein, has also been associated with imprinting maintenance (Fig. 4C; Li et al. 2008; Mackay et al. 2008). Loss of ZFP57 in mouse embryos and ESCs causes hypomethylation of both paternal and maternal ICRs and misregulation of imprinted genes (Li et al. 2008; Quenneville et al. 2011; Zuo et al. 2012). This function of ZFP57 is evolutionarily conserved; in humans, loss-of-function mutations also result in ICR hypomethylation, ultimately causing transient, neonatal diabetes (Mackay et al. 2006, 2008).KRAB zinc finger proteins often act as epigenetic repressors through their interaction with TRIM28. TRIM28 is a component of a multifunctional repressor complex comprised of, at least in this instance, the nucleosome remodeling and histone deacetylation (NuRD) complex, the H3K9me3-catalyzing histone methyltransferase SETDB1, the heterochromatin protein 1 (HP1), and DNA methyltransferases DNMT1, DNMT3A, and DNMT3B (Fig. 4C; Schultz et al. 2001, 2002; Iyengar and Farnham 2011; Quenneville et al. 2011; Zuo et al. 2012). While the DNA methylation maintenance defects are less pronounced in zygotic ZFP57 mutants and the lack of maternal ZFP57 is rescued by expression of paternal Zfp57 (Li et al. 2008), loss of maternal Trim28 alone is embryonic-lethal (Messerschmidt et al. 2012). The times of death and the embryonic phenotypes of maternal Trim28 mutants are highly variable, as is the occurrence of hypomethylation at several maternal and paternal ICRs (Messerschmidt et al. 2012). DNA methylation analysis of individual blastomeres confirmed that stochastic, random methylation defects (and phenotypes) are based on the mosaic composition of early Trim28 maternal null embryos carrying normally and aberrantly imprinted gene loci (Messerschmidt 2012; Messerschmidt et al. 2012; Lorthongpanich et al. 2013).Binding of both proteins and the presence of H3K9me3, the product of the TRIM28 complex, were detected at imprinted loci in embryos (Messerschmidt et al. 2012) and mESCs (Quenneville et al. 2011). Being a DNA-binding transcription factor, ZFP57 localization to ICRs opens the exciting prospect of sequence-specific recognition and maintenance of imprinted loci. Indeed, sequence analysis of loci enriched for H3K9me3, TRIM28, and ZFP57, as identified by chromatin immunoprecipitation (ChIP) combined with deep sequencing (ChIP-seq) in mESCs, revealed a hexanucleotide consensus ZFP57 recognition site (TGCCGC), which is highly conserved in 81 of 91 identified (H3K9me3/TRIM28/ZFP57) sites (Quenneville et al. 2011). Remarkably, binding of TRIM28 to the hypomethylated sites was abrogated in Trim28 maternal null embryos and was not restored by TRIM28 re-expression from the paternal allele in the two- to four-cell stage embryo, indicating the DNA methylation-dependent binding of the ZFP57/TRIM28 complex. In fact, ZFP57 displays much higher binding affinity for its methylated consensus binding site in vitro (Quenneville et al. 2011; Liu et al. 2012). Once lost, DNA methylation cannot be restored via the ZFP57/TRIM28 complex in vivo (Messerschmidt et al. 2012) or ectopic re-expression of ZFP57 in Zfp57-deficient ESCs (Zuo et al. 2012). To a point, the interaction of TRIM28 with DNMT1/NP95 suggests the noncanonical, ZFP57-mediated targeting of the DNA methylation maintenance machinery to ICRs in the preimplantation embryo (Fig. 4C). This targeting mode would compensate for the drastic reduction of nuclear DNMT1 during early cleavage divisions and thus enable DNA methylation maintenance at imprinted regions (Messerschmidt 2012). Only at later embryonic stages, after DNMT1 levels are significantly increased in the nuclei, does canonical DNA methylation maintenance ensue (Fig. 4D).The maintenance of methylation at imprinted regions in preimplantation embryos was thought to be very robust until genome-wide methylation analysis showed that even ICRs are partially demethylated, particularly at peripheral regions (Tomizawa et al. 2011; Kobayashi et al. 2012). Such loss, however, would be of little to no consequence as long as the ZFP57-binding site itself remains methylated and targeted by the ZFP57/TRIM28 complex. De novo methylation enzymes DNMT3A and DNMT3B, which are also found in the ZFP57/TRIM28 complex (Quenneville et al. 2011; Zuo et al. 2012), may account for the recovery of DNA methylation at these peripheral regions at later stages of development (Tomizawa et al. 2011; Kobayashi et al. 2012).Finally, given that DNA demethylation mechanisms act during early embryonic development, a drastic reduction of methylation levels is expected and in fact found at later preimplantation stages. Surprisingly, in addition to imprinted genes and retrotransposons, a substantial quantity of differentially methylated CGIs in oocyte-specific and a subset of sperm-specific methylated regions retain DNA methylation at much higher levels than expected if unhindered passive demethylation were to take place (Smallwood et al. 2011; Kobayashi et al. 2012). It remains to be seen whether these CGIs, as has been shown for ICRs, are targeted and protected by ZFP57/TRIM28, other KRAB zinc finger proteins, or entirely different mechanisms.
Concluding remarks
The veritable flood of new, genome-wide methylation data and insights into epigenetic reprogramming of embryos and PGCs allows us to view this process from a new angle. It is now clear that passive DNA demethylation is the most parsimonious mechanism of both PGCs and preimplantation embryos and is probably sufficient for that purpose. Moreover, the exact time and concentration of gene products at specific sites in the genome are now clarified. Nevertheless, these new insights also accentuate the limitations of our current understanding of the reprogramming process. Although global, genome-wide methylation and demethylation pathways have been identified, their disruption produces embryos with less than fully penetrant phenotypes, leading to confounding and confusing interpretations (Li et al. 2008; Popp et al. 2010; Dawlaty et al. 2011, 2013; Gu et al. 2011; Messerschmidt et al. 2012; Yamaguchi et al. 2012). Loss of TET3 in the early embryo, for instance, causes developmental arrest in only a subset of embryos, which is likely the result of delayed activation of the paternal pluripotency genes (Gu et al. 2011). Nonetheless, in wild-type embryos, the bulk of paternal 5mC undergoes conversion to 5hmC, which is then passively removed over subsequent cleavages. However, TET3-independent passive demethylation also occurs in the maternal genome, thus calling into question the significance of the conversion of the global paternal genome. Similarly, TET1/2-mediated oxidation in PGCs might have a broader genomic range than merely to ensure robust demethylation of meiotic and imprinted genes. It is therefore prudent not to assume active reversion to unmodified cytosine if TET-mediated oxidation is involved; divergent biological function and relevance of 5mC and 5hmC must also be considered.We are far from understanding active DNA demethylation. The direct removal of 5mC or its deamination via AID/APOBECs has still not been conclusively addressed in vivo. TET-dependent demethylation can occur via multiple, interconnected pathways (Wu and Zhang 2010, 2014). However, even in the absence of TETs, AID, APOBECs, etc., PGCs and early embryos can develop into functional gametes or give rise to live pups, although often inefficiently. The long-term transgenerational effects of epigenetic defects in these surviving pups have yet to be addressed. Therefore, it appears that several redundant, fail-safe proofing mechanisms have evolved to act in parallel to ensure the profound erasure of epigenetic marks. Future experiments addressing these redundancies will determine whether the enzymes mediating active demethylation do indeed act in separate, parallel pathways or upstream of or downstream from one another.With new experimental data, more parallels between PGC and embryo reprogramming have begun to manifest themselves. The protection of imprinted regions in the embryo is demonstrated in PGCs, where imprinted and meiotic genes appear to be specifically targeted for demethylation after being excluded from the wave of passive demethylation. Whether this delay is of functional significance or merely a byproduct of the imprint-specific maintenance mode evolved in embryos yet impacting on PGCs remains to be seen. Importantly, demethylation of imprinted regions following the same kinetics as that of the bulk of the genome has also been observed (Kagiwada et al. 2013), and even in the absence of TET enzymes, meiotic and imprinted genes are eventually reprogrammed (Dawlaty et al. 2013). This is in clear contrast to the case of the early embryo, where it is vital that DNA methylation maintenance of imprints is never overcome.The novel concept of noncanonical DNA methylation maintenance in the face of reprogramming is now well established during preimplantation development. ZFP57/TRIM28-mediated targeting of scarce nuclear DNMT1 protein to imprinted gene regions ensures their faithful maintenance (Messerschmidt 2012). Are genes such as Trim28 and Zfp57 also expressed in PGCs, and do they influence the demethylation dynamics of these regions? Preliminary findings suggest that ZFP57-binding sites are enriched in “late demethylating” genomic regions (Seisenberger et al. 2012). Also, in preimplantation embryos, DNA methylation maintenance may go beyond imprinting. Many other transiently, differentially methylated regions detected in oocytes are protected through the blastocyst stage (Smallwood et al. 2011; Kobayashi et al. 2012; Smith et al. 2012). Furthermore, ChIP-seq in mESCs has identified numerous additional ZFP57 targets in addition to known imprinted regions. These findings offer an exciting prospect that merits closer scrutiny (Quenneville et al. 2011). Although ZFP57 is the only TRIM28-interacting KRAB zinc finger protein known to display methylation-dependent DNA binding, it is possible that other KRAB family members with similar properties mediate analogous functions at different target sites.The overall question is: Why do mammals exhibit these complex epigenetic reprogramming processes in the first place? Does the goal of totipotency, which entails such developmental flexibility, offset the lack of guiding, developmental determinants found in frogs, flies, fish, or worms? Did these processes evolve to enable and preserve genomic imprinting as an essential mechanism for mammalian development? Epigenetic reprogramming differs in details among mammalian species, suggesting that demethylation–methylation in PGCs and subsequent demethylation–methylation in the embryo are novel mechanisms and that we are witnessing the evolutionary selection of the optimal one.
Authors: Benjamin R Carone; Lucas Fauquier; Naomi Habib; Jeremy M Shea; Caroline E Hart; Ruowang Li; Christoph Bock; Chengjian Li; Hongcang Gu; Phillip D Zamore; Alexander Meissner; Zhiping Weng; Hans A Hofmann; Nir Friedman; Oliver J Rando Journal: Cell Date: 2010-12-23 Impact factor: 41.582
Authors: B H Ramsahoye; D Biniszkiewicz; F Lyko; V Clark; A P Bird; R Jaenisch Journal: Proc Natl Acad Sci U S A Date: 2000-05-09 Impact factor: 11.205
Authors: M Brandeis; D Frank; I Keshet; Z Siegfried; M Mendelsohn; A Nemes; V Temper; A Razin; H Cedar Journal: Nature Date: 1994-09-29 Impact factor: 49.962
Authors: Susana M Chuva de Sousa Lopes; Katsuhiko Hayashi; Tanya C Shovlin; Will Mifsud; M Azim Surani; Anne McLaren Journal: PLoS Genet Date: 2008-02 Impact factor: 5.917
Authors: Michael C Golding; Matthew Snyder; Gayle L Williamson; Kylee J Veazey; Michael Peoples; Jane H Pryor; Mark E Westhusin; Charles R Long Journal: Theriogenology Date: 2015-07-29 Impact factor: 2.740
Authors: Joseph Kochmanski; Elizabeth H Marchlewicz; Raymond G Cavalcante; Maureen A Sartor; Dana C Dolinoy Journal: Epigenetics Date: 2018-08-23 Impact factor: 4.528