Tianyi Zhang1, Sarah Cooper2, Neil Brockdorff1. 1. Developmental Epigenetics, Department of Biochemistry, University of Oxford, Oxford, UK. 2. Developmental Epigenetics, Department of Biochemistry, University of Oxford, Oxford, UK sarah.cooper@bioch.ox.ac.uk.
Abstract
Histones are subject to a vast array of posttranslational modifications including acetylation, methylation, phosphorylation, and ubiquitylation. The writers of these modifications play important roles in normal development and their mutation or misregulation is linked with both genetic disorders and various cancers. Readers of these marks contain protein domains that allow their recruitment to chromatin. Interestingly, writers often contain domains which can read chromatin marks, allowing the reinforcement of modifications through a positive feedback loop or inhibition of their activity by other modifications. We discuss how such positive reinforcement can result in chromatin states that are robust and can be epigenetically maintained through cell division. We describe the implications of these regulatory systems in relation to modifications including H3K4me3, H3K79me3, and H3K36me3 that are associated with active genes and H3K27me3 and H3K9me3 that have been linked to transcriptional repression. We also review the crosstalk between active and repressive modifications, illustrated by the interplay between the Polycomb and Trithorax histone-modifying proteins, and discuss how this may be important in defining gene expression states during development.
Histones are subject to a vast array of posttranslational modifications including acetylation, methylation, phosphorylation, and ubiquitylation. The writers of these modifications play important roles in normal development and their mutation or misregulation is linked with both genetic disorders and various cancers. Readers of these marks contain protein domains that allow their recruitment to chromatin. Interestingly, writers often contain domains which can read chromatin marks, allowing the reinforcement of modifications through a positive feedback loop or inhibition of their activity by other modifications. We discuss how such positive reinforcement can result in chromatin states that are robust and can be epigenetically maintained through cell division. We describe the implications of these regulatory systems in relation to modifications including H3K4me3, H3K79me3, and H3K36me3 that are associated with active genes and H3K27me3 and H3K9me3 that have been linked to transcriptional repression. We also review the crosstalk between active and repressive modifications, illustrated by the interplay between the Polycomb and Trithorax histone-modifying proteins, and discuss how this may be important in defining gene expression states during development.
In eukaryotes, DNA is packaged in the form of chromatin. The basic unit of chromatin, the nucleosome, is comprised of 147 bp of DNA wrapped around a histone octamer made of two dimers of H2A and H2B and a tetramer of H3 and H4 proteins. The N- and C-terminal histone tails protrude from the nucleosome core and have the potential to interact with adjacent nucleosomes and the linker DNA. All histones can be posttranslationally modified, and the sites of modification are often on the histone tails. These modifications can regulate chromatin structure directly and frequently act as binding sites for the recruitment of other non-histone proteins to chromatin. The most abundant histone modifications are acetylation, phosphorylation, methylation, and ubiquitylation, although many other modifications have been reported (reviewed recently in 1).Transcriptionally active and silent chromatin is characterized by distinct posttranslational modifications on the histones or combinations thereof. Active genes typically carry high levels of lysine acetylation on the H3 and H4 tails, trimethylation of H3 lysine 4, trimethylation of H3 lysine 79, ubiquitylation of H2B, and trimethylation of H3 lysine 36 (Fig†). Marks associated with repressed genes include trimethylation of lysine 27, ubiquitylation of H2A on lysine 119, and trimethylation of H3 lysine 9 (Fig†). The chromatin-modifying enzymes that catalyze these marks can be recruited to target sites by sequence-specific DNA-binding transcription factors that regulate transcriptional states of particular genes. However, other more general features of the DNA such as its global CG content and DNA methylation status can be read by the DNA-binding Zn-finger CxxC domain present in many chromatin-modifying enzymes 2. Equally, the act of transcription can direct the recruitment of writers that associate with the transcriptional machinery, leading to the accumulation of specific marks such as H3K4me3 and H3K36me3.
Figure 1
The distribution of histone modifications over active and repressed genes
The distribution of histone modifications over active and repressed genes
Given the large number of different histone modifications, the potential combinatorial complexity is vast. Advances in technology over the past decade such as ChIP-sequencing have allowed us to map the distribution and co-localization of histone marks at high-resolution genome wide, while mass spectrometry, often in combination with stable isotope labeling, enables the analysis of histone marks and dynamics at the level of a single histone tail. Interestingly, mass spectrometric data suggest that there are many combinations of modifications that are either more likely to occur together, or are mutually exclusive, suggesting crosstalk between these marks. Such crosstalk can occur in cis between distinct modifications on the same histone tail, or in trans either on neighboring histones within the same nucleosome or on neighboring nucleosomes in a chromatin domain.The patterns of histone marks associated with distinct transcriptional states are established through a dynamic interplay between histone readers, writers, and erasers. Importantly, the writers that place these marks contain chromatin-reading domains that can bind preexisting histone marks. Studies have shown that such crosstalk between histone marks can both positively and negatively regulate binding and catalytic activity of writers, resulting in positive and negative feedback loops. Therefore, writers that can also read the histone modifications are required for the establishment and maintenance of chromatin states at active and repressed genes and may play important roles in the memory and switching of gene expression states.In this review, we will focus on several examples of the positive and negative feedback mechanisms that regulate the formation, reinforcement, and maintenance of the distinct patterns of histone marks associated with active and repressed transcriptional states (Fig2). However, such features are likely to be more general features of chromatin states, and the principles seen in these examples are likely to be applicable to the plethora of other chromatin modifications whose function is still unclear.
Figure 2
Crosstalk between chromatin writers and histone marks at active and repressed genes
Chromatin writers and chromatin marks associated with active genes positively reinforce each other through various positive feedback mechanisms. The same holds true for writers and marks associated with repressed genes. Additionally, negative feedback mechanisms and mutual inhibition between writers and marks associated with the opposite gene expression state also reinforce distinct transcriptional states.
Crosstalk between chromatin writers and histone marks at active and repressed genes
Chromatin writers and chromatin marks associated with active genes positively reinforce each other through various positive feedback mechanisms. The same holds true for writers and marks associated with repressed genes. Additionally, negative feedback mechanisms and mutual inhibition between writers and marks associated with the opposite gene expression state also reinforce distinct transcriptional states.
Active histone modifications
In eukaryotic organisms, gene expression is regulated through the synergistic actions of multiple factors, including but not limited to, transcription factors, the transcriptional machinery, chromatin remodelers, and the presence of specific histone variants and histone modifications. Active chromatin domains are characterized by a distinct array of histone marks. H3K27ac and H3K4me1 are associated with active enhancers 3, and high levels of H3K4me3 and H3 and H4 acetylation are found at the promoters of active genes 4–6. The bodies of active genes are enriched in H3 and H4 acetylation 7, H3K79me3 8, and H2BK120u1 9,10, and increasing H3K36me3 toward the 3′ end 11. These histone marks may regulate transcription by creating an open chromatin structure and recruit effectors that mediate a transcriptionally competent state. While the function of many active histone modifications is not fully understood, there is abundant evidence that their deposition is required for the proper regulation of gene expression. Positive crosstalk mechanisms between different histone modifications play an important role in the recruitment and maintenance of active histone modifications at active genes.
Establishment and maintenance of H3K4me3
H3K4me3 is a highly conserved histone modification and its association with transcription is evolutionarily conserved in eukaryotes. In mammals, H3K4 methylation is catalyzed by six related homologs of the yeast SET1—SETD1A, SETD1B, MLL1, MLL2, MLL3, and MLL4 12. These complexes are comprised of the catalytic SETD1/MLL subunits and four core subunits WDR5, RBBP5, ASH2L, and DPY30, and as well many other complex-specific subunits 13–15. H3K4me3 is a hallmark of active genes and is distributed along the promoter and TSS regions 6,16,17. Work in yeast shows that SET1 associates with the PAF complex and the Ser5-phosphorylated initiating form of Pol II and is co-transcriptionally deposited 18 (Fig3). Additionally, the recruitment of SETD1 and MLL to specific target genes is mediated by many cell type-specific transcription factors or transcriptional coactivators 19–23. However in higher organisms especially, more general mechanisms of H3K4me3 recruitment and establishment are also at play.
Figure 3
Establishment of H3K4me3 and interplay with H2BK120u1
The SETD1 complex associates with Pol II, and H3K4me3 is deposited co-transcriptionally. CFP1 (associated with SETD1) and MLL1/2 can be recruited to promoters de novo via CxxC domain binding to CpG islands. H2BK120u1 can recruit H3K4 writers, possibly through recognition of H2BK120u1 by the ASH2L subunit.
Establishment of H3K4me3 and interplay with H2BK120u1
The SETD1 complex associates with Pol II, and H3K4me3 is deposited co-transcriptionally. CFP1 (associated with SETD1) and MLL1/2 can be recruited to promoters de novo via CxxC domain binding to CpG islands. H2BK120u1 can recruit H3K4 writers, possibly through recognition of H2BK120u1 by the ASH2L subunit.Notably, the distribution of H3K4me3 is highly coupled to the presence of CpG islands, regions of CpG- and GC-dense DNA that are predominately unmethylated and found at 50–70% of vertebrate promoters 24. A biochemical link between CpGI promoters and H3K4me3 was eluciated with the discovery of the Zn-finger CxxC domain which specifically binds nonmethylated CpGs and is present in MLL1/2 and the CFP1 subunit of SETD1A/B 2 (Fig3). All CpGI promoters are marked with H3K4me3, and the level of H3K4me3 is correlated to gene activity 25,26. Emerging evidence suggests that in vivo, MLL2 is responsible for maintaining H3K4me3 at CpGI promoters with low expression 27,28, while the SETD1-specific subunit CFP1 is preferentially enriched at active gene promoters with higher levels of H3K4me3 29. In ESCs, CpGI promoters linked to developmentally regulated genes are bivalent and harbor the repressive H3K27me3 mark as well as H3K4me3 30. Importantly, it has been suggested that the ability of H3K4 writers to sample CpGIs genome wide and the presence of H3K4me3 at CpGI promoters may poise silent genes for rapid activation upon differentiation.SETD1/MLL complexes may reinforce their binding through recognition of their own mark, H3K4me3. The PHD finger domain of CFP1 is known to read H3K4me3 and mediates SETD1 interaction with H3K4me3 31–33. The third PHD domain in MLL1 is important for H3K4me3 binding and MLL1 recruitment to target sites in the Hox locus 34. Other PHD domains within SETD1/MLL may also interact with H3K4me3 but remain to be further characterized 35. The ability of SETD1/MLL to sample promoters and bind H3K4me3 may be involved in the maintenance of this mark at active genes. These mechanisms of H3K4me3 binding by H3K4 writers suggest that once established, this mark may positively reinforce its own deposition.
Crosstalk between H2BK120u1, H3K4me3, and H3K79me3
One of the best-studied pathways of positive histone crosstalk is the stimulation of H3K4me3 and H3K79me3 by H2BK120u1 (or H2BK123u1 in yeast). In yeast, H2BK123u1 is established by the ubiquitin ligase RAD6/BRE1 during transcriptional initiation and localizes to the TSS and along the bodies of active genes 36. Depletion of RAD6/BRE1 or mutation of H2BK123 causes severe loss of H3K4me3 and H3K79me3 37,38. This positive crosstalk between H2BK123u1 and H3K4me3 and H3K79me3 is specific and does not extend to the regulation of H3K36me3, another mark associated with transcription 37,38.H2BK123 lies in close proximity to H3K79 on the exposed nucleosome surface, and the H3K79 methyltransferase DOT1 in yeast has been shown to be influenced by deletion and mutation of residues on the H2B tail 39. In humans, the situation is more complex, as H3K79me3 and DOT1L distribution is not solely dependent on H2BK120u1. Human DOT1L localizes at active genes and peaks around the TSS and moreover has been shown to bind both Ser5- and Ser2-phosphorylated forms of the Pol II CTD 40. As such, H3K79me3 is a marker of active genes, yet its exact role in transcriptional regulation remains to be discovered.The crosstalk between H2BK120u1 and H3K4me3 is conserved in mammals, as knockdown of the BRE1 homologs RNF20/40 leads to global reduction in H3K4me3 41 (Fig3). More recently, studies on the MSL1/MSL2 E3 ligase that catalyzes H2BK34u1 have also revealed a crosstalk between H2BK34u1 and H3K4me3 42. Both H2BK120u1 and H2BK34u1 are now known to allosterically stimulate the activity of the MLL complex through binding to the ASH2L subunit 43. Sites of ubiquitylation at H2BK120 and H2BK34 reside on the nucleosome surface and may provide a more favorable substrate for SET1 or MLL complex binding and activity 43. As ASH2L is a core subunit of all writers of H3K4 methylation, H2B ubiquitylation may be one mechanism of H3K4me3 maintenance at active promoters through a positive feedback loop whereby transcription results in deposition of H2Bub, which subsequently activates the H3K4 methyltransferases.
H3K4me3 and histone acetylation
Histone lysine acetylation is a highly abundant mark and is known to regulate many cellular processes including transcription. Acetylation of histones H3 and H4 is highly correlated with gene expression. A unique structural motif, the bromodomain, specifically recognizes acetylated lysines and is present in many proteins involved in transcriptional regulation 44. Besides the direct recruitment of effectors, histone acetylation has also been proposed to physically alter chromatin structure by neutralizing the positive charge of lysines and disrupting intra- and internucleosomal interactions, which lead to an open chromatin environment permissible to transcription. Lysine acetylation of three residues on the H3 globular domain H3K56, H3K64, and H3K122, all of which lie at the H3–DNA interface, may disrupt electrostatic interactions within the nucleosome and have been linked to gene activation 45–47. H3K122ac has been shown to directly promote in vitro transcription through stimulating histone eviction 47. H3 and H4 histone tail acetylations enhance DNA unwrapping, while H3 acetylation sensitizes nucleosomes to salt-induced dissociation 48.H3K4me3 and H3/H4 acetylation coexist at the promoter and TSS of active genes, and there are many studies that suggest H3K4me3 promotes downstream H3/H4 acetylation through recruitment of HATs (Fig4). H3K4me3 readers have been identified in many HAT complexes. SGF29, a component of the SAGA HAT complex, contains a tandem tudor domain that binds H3K4me3 and overlaps with H3K4me3 at gene promoters. SGF29 deletion causes loss of H3K9ac and loss of SAGA complex at target sites 49. Similarly, yeast NuA3 50 and NuA4 51, and mammalian HBO1 52 provide other examples of HAT complexes that contain PHD fingers that preferentially bind H3K4me3. Dynamic turnover of H3 lysine acetylation through the combinatorial action of the HAT p300/CBP and HDAC has been shown to occur on histone H3 tails with preexisting H3K4me3, but not other modifications associated with active gene expression such as H3K79me3 or H3K36me3 53. This H3K4me3-linked acetylation is conserved in higher eukaryotes including fly, mouse, and human. Loss of H3K4me3 upon CFP1 deletion leads to loss of CpGI-associated H3K9ac in ESCs 29. Further work using the Dictyostelium discoideum model shows that upon knockout of SET1 and loss of H3K4me3, dynamic H3 acetylation was lost 54. The dynamic turnover of acetylation rather than the modification itself may be key in transcriptional activation (reviewed in 55). In support of this, many members of the H3K4me3-binding PHD fingers are associated with HDACs as well as HATs 56. As H3K4me3 has been found to be promoter-associated before transcription initiation, H3K4me3-dependent co-targeting of both HATS and HDACs may facilitate the dynamic turnover of histone acetylation. The above examples illustrate that positive crosstalk between H3K4me3 and histone acetylation is an evolutionarily conserved pathway and that the cooperativity between H3K4me3 and hyperacetylation as well as the dynamic turnover of acetylation is important in ensuring proper transcriptional regulation.
Figure 4
Interplay between H3K4me3, H3K36me3, and H3/H4 acetylation
H3K4me3 reinforces H3 and H4 acetylation at the promoters of active genes. Various H3K36 writers catalyze H3K36me1/2 and SETD2 associates with elongating Pol II and catalyzes H3K36me3 co-transcriptionally. H3K36me2/3 recruits HDACs that deacetylate histones over gene bodies.
Interplay between H3K4me3, H3K36me3, and H3/H4 acetylation
H3K4me3 reinforces H3 and H4 acetylation at the promoters of active genes. Various H3K36 writers catalyze H3K36me1/2 and SETD2 associates with elongating Pol II and catalyzes H3K36me3 co-transcriptionally. H3K36me2/3 recruits HDACs that deacetylate histones over gene bodies.
H3K36me3 and histone deacetylation
Methylation at histone H3K36 is an abundant histone mark highly conserved in eukaryotes. H3K36 mono-, di-, and trimethylation exist in yeast and all of these states are catalyzed by SET2. Mammals on the other hand have multiple writers of H3K36 methylation, including the NSD1/2/3 family, ASH1L, SMYD2, SETMAR, and SETD2, but SETD2 is the sole enzyme responsible for H3K36 trimethylation in vivo (reviewed in 57) (Fig4). Interestingly, the uncoupling of H3K36me3 activity from H3K36me1/2 over evolution alludes to specific biologically distinct roles of each methylation state.H3K36me3 is highly correlated with the transcribed regions of active genes and levels of H3K36me3 increase toward the 3′ end of genes 11. This particular distribution results from the association of Set2 with the elongating Ser2-phosphorylated CTD of Pol II, which is predominant over the bodies and 3′ ends of active genes 58–60. Like H3K4me3, H3K36me3 has also been linked to regulation of histone acetylation. H3K36me3 recruits HDACs to sites of active transcription (Fig4). In yeast, recognition of H3K36me2/3 by the bromodomain-containing EAF3 complex recruits the HDAC RPD3S complex, which deacetylates histones and prevents spurious transcription initiation from within gene bodies 61–63. H3K4me3 and histone hyperacetylation at gene promoters may regulate transcriptional initiation from the TSS, while H3K36me2/3-mediated deacetylation is required in the wake of the transcriptional machinery to prevent initiation from aberrant sites within the gene body. This mutual exclusivity of H3K4me3 and H3K36me3 may be important for maintaining transcriptional integrity. This idea is supported by work showing that promoters lack the H3K36me2/3 mark, and the H3K36me2 demethylases KDM2A/B co-localize with H3K4me3 at CpGI promoters, ensuring active removal of H3K36me2 from transcriptional start sites 64,65.H3K36me2/3 is recognized by a protein motif, the PWWP domain, found in many nuclear chromatin-binding proteins 66–69. Notably, all three members of the NSD family of H3K36 methyltransferases that catalyze H3K36me1/2 each contain two PWWP domains 70 and have been shown to preferentially bind H3 peptides containing H3K36me3 69. This implies that H3K36me2/3 recognition by its writers may be important for the propagation of H3K36me1 and H3K36me2 at certain sites. Mono-/dimethylation of H3K36 is more pervasive than H3K36me3 and not restricted to sites of active transcription or euchromatin domains 71,72. The biological function of mono-/dimethylation is unknown, though an increase in H3K36me2 levels as a result of mutations in NSD2 has been linked to upregulation of gene expression profiles in cancers 73–75. H3K36me2 may have an important biological function in its own right or may be required to serve as a substrate for subsequent SETD2-mediated H3K36 trimethylation. The broad distribution of H3K36me2 and H3K36me3 over active chromatin may also prevent the spreading and accumulation of silencing marks such as H3K27me3 through direct inhibition of the Polycomb complex PRC2 76,77, which will be discussed below.
Repressive histone modifications
The methylation of residues lysine 27 and lysine 9 of H3 and the ubiquitinylation of H2A on lysine 119 are hallmarks of repressive chromatin and are often found at silent gene loci. H3K27me3 and H2AK119u1 are associated with the formation of facultative heterochromatin, whereas H3K9me2/3, as well as having important roles in the formation of constitutive heterochromatin, also plays a part in regulating gene expression during development.
H3K27me3 and H2AK119u1 crosstalk
The Polycomb Repressive Complex 2 (PRC2) is responsible for the methylation of lysine 27 and contains four core subunits, EZH2/1, SUZ12, EED, and RBAP46/8 78. The catalytic subunit is the SET domain-containing protein EZH2 (or the related EZH1), although these enzymes are only functional in the context of the full core complex 79–81. There are also accessory proteins that can associate with the core PRC2 complex to form two types of PRC2: PRC2.1 which includes a Polycomb-like subunit (PCL1/2/3) and PRC2.2 which includes the JARID2 and AEBP2 subunits 82. The function of these accessory proteins remains unclear, although they have been shown to modulate activity of PRC2 and may also play a role in targeting PRC2 to chromatin. PRC2 is able to mono-, di-, and trimethylate H3K27, although there is some dispute if PRC2 is the only H3K27me1 methyltransferase. These different methylation states have very different roles, and although H3K27me3 is linked to gene repression, recent studies have suggested that H3K27me1 may be important for gene activation and is enriched over the bodies of genes 83. The H3K27me2 modification is very prevalent in the genome, with MS/MS analysis demonstrating that it accounts for 60–80% of all nucleosomes in mESCs 84, although little is known about its function or binding proteins. H3K27me3 is the most well-characterized mark in terms of facultative heterochromatin formation and is critical for the repression of key transcriptional regulators during development. Therefore, in terms of gene silencing, we will focus on the trimethylation state of H3K27.In ES cells, H3K27me3 is present at the promoters of several thousand genes, including the Hox gene clusters, where it is associated with heritable gene silencing 85. H3K27me3 modification is also highly enriched on the inactive X chromosome suggesting a role in facultative heterochromatin formation 86. In more differentiated cell types, larger domains of H3K27me3, termed BLOCS, are often visualized over silent loci in the genome 87. As described above, for many of the enzymes associated with active gene expression, there are also positive feedback loops important for the establishment and spreading of repressive domains. The PRC2 component EED contains an aromatic cage that is able to specifically bind to H3K27me3 88. It has been shown that the binding of PRC2 to the modification it deposits is required for the full establishment of H3K27me3 domains, and such a positive feedback mechanism could also be important for the inheritance of the H3K27me3 mark through cell division 89. PRC2 has also been shown to be stimulated by dense chromatin via an interaction of the SUZ12 subunit with the H3 tail (A31-R42) 90. In this way, positive feedback from the local chromatin structure will also allow robust domains of H3K27me3 to be maintained over repressed genes.The Polycomb repressive complex 1 (PRC1) is an E3 ubiquitin ligase complex that can modify chromatin by monoubiquitylation of H2A on lysine 119. All PRC1 complexes contain the catalytic RING1A/B subunit, and one of six PCGF proteins 91. The presence of different PCGF subunits is thought to define the class of PRC1 complex, for example, PCGF2 (MEL18) and PCGF4 (BMI) make up the canonical PRC1 complexes which also contain CBX (2,4,6,7,8) and polyhomeotic subunits 92. Variant complexes include either the RYBP or YAF2 protein, the presence of which is mutually exclusive with the CBX component 91,93. These variant complexes, such as the complex containing RING1B/PCGF1/RYBP/BCOR/KDM2B, have been implicated in recruitment of PRC1 and have been shown to have higher H2AK119u1 activity compared with canonical PRC1 complexes 91,94. Interestingly, RYBP also contains a ubiquitin-binding domain and has been shown to bind H2AK119u1 95. This suggests that a positive reinforcement mechanism could be important to establish or maintain high levels of H2AK119u1 at PRC1 target domains, in a similar manner to PRC2 where EED binds to H3K27me3.Both PRC1 and PRC2, along with their associated chromatin modifications, H2AK119u1 and H3K27me3, have been shown to co-localize at many regions of the genome, such as the promoters of developmentally regulated genes and the inactive X chromosome 96–99. A hierarchical recruitment model, whereby the H3K27me3 modification placed by PRC2 is read by PRC1, has been proposed to explain this co-recruitment of both PRC1 and PRC2 to chromatin 100. This occurs by a specific interaction of the H3K27me3 modification with the chromodomain of the CBX protein found in canonical PRC1 complexes 101. Hence, all PRC2 targets would also become PRC1 targets and a repressive domain would be established. However, this hierarchical model is not able to account for all PRC1 recruitment to chromatin since even in the absence of PRC2, the variant RYBP-containing complexes still localize to the correct regions of the genome 93,102. More recently, data from three laboratories have shown that the reverse mechanism is also possible, whereby PRC1 is recruited first, followed by PRC2. In this model, the H2AK119u1 placed by a variant PRC1 complex is recognized by PRC2 103–105. At present, we do not know how H2AK119u1 is recognized by PRC2, but it has been shown that the PRC2.2 complex (containing the accessory factors AEBP2 and JARID2) is enriched in chromatin containing the H2AK119u1 modification 105. Additionally, in vitro, this PRC2.2 complex is more active on an H2AK119u1 nucleosome substrate compared with unmodified nucleosomes 105. A remaining question is whether this PRC2 recruitment to H2AK119u1 is via a direct recruitment mechanism, similar to CBX binding to H3K27me3, or by a change in chromatin state or structure associated with the large H2AK119u1 modification.In summary, the establishment of Polycomb repressive domains may require these enzymes to read not only their own mark, for example, EED-binding H3K27me3 or RYBP-binding H2AK119u1, but also the marks placed by their partner complex. In this way, H3K27me3 can establish or reinforce H2AK119u1 modifications, and H2AK119u1 can establish or reinforce H3K27me3 deposition (Fig5). Which modification or Polycomb complex is initiating this recruitment is still a matter of debate although recent work has implicated the variant PRC1-KDM2B-containing complex in initiation 106–108. Polycomb target sites overlap with regions of dense unmethylated DNA, CpG islands, and the CxxC domain of KDM2B is able to recognize unmethylated CpGs, providing a plausible mechanism for PRC1 recruitment. Further work is needed to fully understand how both PRC1 and PRC2 complexes are initially recruited to CpG islands. However, once this is established, the positive feedback mechanisms described above involving the histone modifications that these enzymes place will be important to maintain and reinforce their activity at these target sites.
Figure 5
Crosstalk between the Polycomb complexes PRC1 and PRC2
PRC2 reinforces its own mark through binding of EED to H3K27me3. PRC1 may also reinforce its own mark through binding of RYBP to H2AK119u1. Establishment of PRC1 can be reinforced by the presence of PRC2, through recognition of H3K27me3 by the CBX subunit of PRC1. PRC2 establishment can also be reinforced by PRC1 through the recognition of H2AK119u1 by the JARID2/AEBP2 PRC2.2 complex.
Crosstalk between the Polycomb complexes PRC1 and PRC2
PRC2 reinforces its own mark through binding of EED to H3K27me3. PRC1 may also reinforce its own mark through binding of RYBP to H2AK119u1. Establishment of PRC1 can be reinforced by the presence of PRC2, through recognition of H3K27me3 by the CBX subunit of PRC1. PRC2 establishment can also be reinforced by PRC1 through the recognition of H2AK119u1 by the JARID2/AEBP2 PRC2.2 complex.
H3K27me3 and H3K9me2/3 crosstalk
Generally, methylation of H3K9 is associated with constitutive heterochromatin formation and transcriptional silencing. Recently, there has been some evidence that H3K9 methylation can crosstalk with the Polycomb H3K27me3 modification to cooperate in gene repression or as mutually exclusive pathways present at constitutive heterochromatin.The heterodimeric complex of G9a and GLP catalyzes H3K9me1 and H3K9me2 modifications 109, which are mainly associated with transcriptional silencing but also occur in euchromatic regions 110. Both proteins contain ankyrin repeat domains that can bind to H3K9me1/2 modifications, allowing the enzymes to read their own marks and therefore allow spreading of the H3K9me2 modification 111. The enzyme SETDB1 can place both H3K9me2/me3 and has roles in repression of transposons, in gene silencing and at pericentric heterochromain 112–115. The SUV3-9H1/H2 enzymes deposit H3K9me2 and H3K9me3 modifications and contain a chromodomain which can recognize these marks 116. A major site of the H3K9me3 modification is at pericentric heterochromatin, where there are self-reinforcing feedback loops involving the chromodomain-containing protein HP1 which can bind to H3K9me3, and recruit de novo DNA methyltransferases (DNMT3A/B) 117,118. The resulting DNA methylation can be recognized by MECP2, a protein containing a MBD (methyl binding domain) which can also bind to and recruit SUV3-9 enzymes to pericentric heterochromain 119 (Fig6A). Interestingly at mitosis, H3 is phosophorylated by the kinase Aurora B at H3S10, and this modification next to the H3K9me3 mark causes HP1 to be displaced from the pericentric heterochromatin during this phase of the cell cycle 120.
Figure 6
Interplay between H3K9me3, DNA methylation, and H3K27me3
(A) At the pericentric heterochromatin, DNA methylation and H3K9me3 positively reinforce each other. HP1 binds H3K9me3 and recruits the de novo DNA methyltransferases DNMT3A/B. MECP2 can bind methylated DNA and recruit the H3K9me3 methyltransferase SUV3-9. (B) H3K27me3 and H3K9me2 coexist on the inactive X chromosome. CDYL may recruit G9a to the inactive X chromosome through its ability to recognize H3K9me2 and H3K27me3. CDYL may reinforce the propagation of H3K9me2 at the Xi.
Interplay between H3K9me3, DNA methylation, and H3K27me3
(A) At the pericentric heterochromatin, DNA methylation and H3K9me3 positively reinforce each other. HP1 binds H3K9me3 and recruits the de novo DNA methyltransferases DNMT3A/B. MECP2 can bind methylated DNA and recruit the H3K9me3 methyltransferase SUV3-9. (B) H3K27me3 and H3K9me2 coexist on the inactive X chromosome. CDYL may recruit G9a to the inactive X chromosome through its ability to recognize H3K9me2 and H3K27me3. CDYL may reinforce the propagation of H3K9me2 at the Xi.Several reports have demonstrated that H3K9me3 and H3K27me3 modifications are mutually exclusive 87,103,121,122. In differentiated cells, H3K27me3 BLOCS, which form over silent gene loci, are mutually exclusive with H3K9me3 domains over features such as transposons 87. In SUV3-9H1/H2 knockout cells, there is a loss of H3K9me3 at the pericentric heterochromatin, and a subsequent gain of H3K27me3 103,121, suggesting that not only can these two marks compensate for each other, but that normally H3K9me3 prevents H3K27me3 establishment. A recent study isolating proteins associated with pericentric heterochromatin has shown that a chromatin-associated protein, BEND3, is recruited to pericentric chromatin in the absence of H3K9me3 (or DNA methylation) and is important for recruiting H3K27me3 122. Lack of DNA methylation can also cause H3K27me3 to be recruited to pericentric heterochromatin, but in this case H3K9me3 is still present and forms mutually exclusive domains with H3K27me3, despite both modifications now being enriched at DAPI-dense pericentric regions 103. Recruitment of H3K27me3 to pericentric heterochromatin has also been shown to occur during early mouse development. In the one-cell stage embryo, H3K27me3 can be visualized specifically on the male pericentric heterochromatin, but not the female heterochromatin, which contains H3K9me3 123. In this system, it has recently been shown that it is not the presence of H3K9me3 on the maternal pericentric heterochromatin itself that prevents H3K27me3 recruitment, but rather the presence of HP1b which binds to H3K9me3 124.Despite reports that H3K27me3 and H3K9me3 are mutually exclusive, a number of ChIP-sequencing studies in ES cells 115, extra-embryonic lineages 125, and differentiated cells 126 have shown that both H3K9me2 and H3K9me3 modifications can coexist with the PRC2 modification H3K27me3 at developmentally repressed genes. Given that both marks are associated with gene repression, it has been suggested that they may cooperate with each other. A mass spectrometry study, in which H3K27me3-containing nucleosomes were purified from HeLa cells, showed that H3K9me2 modifications, and to a lesser extent H3K9me3, were also present with H3K27me3 127. Large-scale proteomic screens have identified several Polycomb proteins as readers of H3K9me3 modifications 31,69. However, the authors suggest that this may be due to the affinity of CBX proteins to H3K27me3 and the high degree of sequence identity surrounding H3K9 and H3K27 (TARKST and AARKSA, respectively). In vitro there is no difference in PRC2 activity on nucleosomes that contain H3K9 methylation compared to WT nucleosomes 76. However, a recent paper has found a direct interaction of PRC2 with the G9a/GLP complex and that G9a enzymatic activity (H3K9me2) modulates PRC2 genomic recruitment 128. In addition, studies have reported that PRC2 is necessary for the binding of HP1 to chromatin 129–131. Both H3K27me3 and H3K9me2 modifications have been shown to be present on the inactive X chromosome where the two modifications play complementary roles 132. Here, the molecular mechanisms of crosstalk have been elucidated by the discovery of CDYL, a protein that can bind both H3K9me2 and H3K27me3. CDYL can interact with G9a to propagate the H3K9me2 modification 133. Interestingly, the loss of PRC2, and subsequent loss of H3K27me3, reduces the amount of H3K9me2 present on the Xi, suggesting that CDYL is a link between these two enzymatic activities allowing the combinatorial reading and writing of both modifications (Fig6B).The mutually exclusive distribution of H3K27me3 and H3K9me3 described above has predominantly been shown in studies of constitutive heterochromatin loci, and in all cases, this clear separation of domains has been seen for trimethylation marks. There are also many examples in which the H3K9me2 modification, and in some cases H3K9me3, can act in concert with H3K27me3, suggesting positive crosstalk between these two mechanisms of heterochromatin formation. In the best-documented examples, this positive crosstalk appears to be between H3K27me3 and H3K9me2. Crosstalk may also vary in different cell types or different differentiation states, in which chromatin structures or the balance of different enzymes may be altered. What is clear is that at least in some circumstances through, for example, CDYL, HP1, or DNA methylation, both the Polycomb system and the H3K9 methylation systems are able to read the chromatin state placed by each other and to write their own modifications accordingly.As discussed above, both H3K27me3 and H3K9me3 repressive marks can exist as extensive domains within cells. These domains can spread through positive feedback of the writers and their ability to recognize and propagate their marks. Given this, the establishment of boundaries is critical to isolate heterochromatin from euchromatin domains. These boundary elements have been found in multiple eukaryotic organisms, ranging from yeast to human (recently reviewed in 134). Boundaries can be formed by equilibrium between heterochromatin-promoting factors (remodelers, Polycomb proteins, H3K9me3 machinery) and euchromatin-promoting factors (remodelers, transcriptional machinery, Trithorax proteins). Such boundaries could vary in position and this concept forms the basis of position effect variegation (PEV) in which the spread of heterochromatin, for example, domains of H3K9me3, results in the stochastic silencing of a neighboring gene (reviewed in 135). Additionally, cis-regulatory elements and the binding of insulator proteins such as CTCF can also determine boundaries. For instance, H3K27me3 domain boundaries within the Hox gene cluster are bound by CTCF; deletion of these CTCF-binding sites results in the infringement of Pol II and H3K4me3 into adjacent heterochromatin territories and disruption of Hox gene silencing 136.
Interplay between repressive and active chromatin modifications
The main theme that emerges from the data we have described is that chromatin writers, associated with either active or repressed states, are positively regulated by their own marks or marks associated with the same transcriptional state. However, there are clear data that show these chromatin writers can also be negatively influenced by marks associated with the opposite transcriptional state. These negative feedback loops reinforce the maintenance of distinct chromatin states, and may play an important role for switching of gene expression during differentiation and development by creating and reinforcing a bistable state.Historically, the best-characterized example of an interplay between chromatin complexes in regulating gene expression is the antagonism between Polycomb and Trithorax complexes. Drosophila genetics first established Polycomb and Trithorax proteins as two groups having opposing function on Hox gene expression, and subsequently on the regulation of many important developmental genes. Histone crosstalk is important in this interplay, and some of the molecular mechanisms that govern this mutual antagonism between Polycomb and Trithorax proteins and marks have been elucidated.Schmitges et al
76 were the first to show a mechanism of direct inhibition of PRC2 by the TrxG modifications H3K4me2/3 and H3K36me2/3. The catalytic activity of the PRC2 core complex was greatly reduced on recombinant nucleosomes carrying trimethyllysine analogs at H3K4 and H3K36 on the H3 tail. This study and further work demonstrated inhibition of PRC2 activity when the marks H3K4me2/3 and H3K36me2/3 exist in cis (on the same histone tail) as the target H3K27. It has been suggested that the allosteric inhibition of the catalytic subunit EZH2 occurs through the recognition of the H3K4me2/3 and H3K36me2/3 marks by the SUZ12 VEFS domain 76. Inhibition of PRC2 by H3K36me2/3 is consistent with mass spectrometry data of the histone H3 peptide fragment K27-R40 isolated from total chromatin from mESCs and transformed cell lines 73,77,137. This shows that trimethylation at H3K27 and H3K36 do not coexist on the same H3 tail or are present at very low levels. Furthermore, removal of SETD2, the only HMT capable of placing H3K36me3, leads to an increase of H3K27me2 over bodies of active genes and reduces levels of expression 83.It is important to note that although PRC2 is inhibited in cis by H3K4me2/3 and H3K36me3, PRC2 is active on nucleosomes harboring these modifications on only one of the two H3 tails, thereby allowing the formation of asymmetrically modified nucleosomes 127. Such nucleosomes have been identified in vivo and may represent the nucleosomes present at bivalent promoters in mESCs (promoters that harbor both active and repressive marks, see later).This negative feedback mechanism has also been shown to operate at a chromosomal level in C. elegans. Normally H3K36me3 and H3K27me3 occupy mutually exclusive domains on the autosomes 138. However, removal of the H3K36me1/2 writer MES-4 in the germ line results in a global loss of H3K36me3 leading to the redistribution of H3K27me3 to exogenous sites at germ line-expressed genes, which are normally modified by H3K36me3. This redistribution causes a titration of the H3K27me3 mark from its endogenous sites, including the X chromosomes, and an inability to maintain normal gene expression states 138–140.As well as the inhibition of Polycomb activity by Trithorax marks, Polycomb marks have also been shown to inhibit the activity of some Trithorax proteins. The best-studied examples are the inhibition of H3K36 methyltransferases by the PRC1 modification H2AK119u1. Yuan et al
141 show that the catalytic domain of H3K36 methyltransferases is inhibited by recombinant nucleosomes containing H2AK119u1. Additionally, there is evidence that the PcG PRC1 mark H2AK119u1 inhibits H3K4 methyltransferases MLL1 and possibly MLL3. A study by Endoh et al shows that upon RING1A/B knockout and subsequent depletion of H2AK119u1, H3K4me3 levels at several PcG target genes increase 142. Although not extensively investigated, it is possible that H3K27me3 may also inhibit the deposition of H3K4 methylation by the SETD1 and MLL3/4 complexes 143.These studies provide good evidence that PcG and TrxG marks mutually inhibit the writers associated with the opposing group. It was therefore unexpected when several groups showed that the tudor domains of the PRC2-associated PCL1/2/3 proteins can specifically bind H3K36me3 144–147. Structural and biochemical analyses show that the tudor domain of PCL recognizes H3K36me2/3 with high specificity; however, PCL co-localizes only moderately with H3K36me3 in vivo by ChIP-sequencing 144,147. One interpretation of this observation is that the role of this inter-action may be important in the spreading of PRC2 and H3K27me3 to bodies of active genes, and perhaps during the switching of transcriptional state by allowing the initial recruitment of PRC2 to active genes. Consistent with the latter observation, it has been shown that the H3K36 demethylase NO66 can be recruited by PCL3, which would allow for the removal of H3K36me3 before the subsequent acquisition of H3K27me3 146.H3K27me3 is associated with gene repression, while H3K27ac is associated with gene activation and active enhancers. Since they act on the same lysine residue, these marks are mutually exclusive, and the switch between methylation and acetylation has been well established. The removal of H3K27ac by the NURD complex facilitates the recruitment of PRC2 and accumulation of H3K27me3 at promoters leading to gene repression 148. This process occurs during differentiation of ESCs, when CTBP2 in combination with NURD initiates the silencing of genes that were originally active, through H3K27 deacetylation, allowing deposition of H3K27me3 by PRC2149.Upon loss of H3K9 or DNA methylation, PRC2 accumulates at the pericentric heterochromatin as discussed earlier. In this condition, BEND3 recruits NURD to the pericentric heterochromatin 122, and thus, a similar mechanism of deacetylation of H3K27ac could explain the subsequent PRC2 recruitment and accumulation of H3K27me3 at these sites.Activation of Polycomb-repressed genes requires a methylation to acetylation switch at H3K27. The phosphorylation of H3S28 on the residue neighboring H3K27 has been shown to mediate this switch. H3S28p inhibits H3K27me3, allowing an accumulation of H3K27 acetylation 150,151. Similarly, the loss of PRC2 subunit SUZ12 and H3K27me3 leads to the accumulation of H3K27ac at PcG target genes 84. It has been suggested that one role of H3K27me3 is to exclude the HATs p300 and CBP, preventing accumulation of H3K27ac at enhancers that is important for gene activation 152. H3K27me2 has been suggested to play a similar role to prevent the firing of non-cell-type-specific enhancers. This idea is supported by the increase of H3K27ac at these enhancers when H3K27 methylation is lost 83.
Discussion
It is clear that specific histone modifications are associated with the transcriptional state. For many modifications, it is not well established whether they directly influence transcription or their placement is simply a consequence of the transcriptional state present at a particular gene. There are reports that acetylation can directly alter chromatin structure to a more accessible state allowing the recruitment of transcription factors and the transcription machinery. In addition, ubiquitylation of H2A has been shown to inhibit the elongating form of Pol II, suggesting direct effects on the transcriptional state. Conversely, there is substantial evidence to support the idea that transcription factors determine and initiate gene expression, and writers recognize this state and aid in the maintenance of this state through multiple feedback mechanisms. One such mechanism is the recruitment of the H3K4 HMT SETD1 and the H3K36 HMT SETD2 by Pol II itself to deposit histone modifications across the promoter and gene body. The emerging idea that PcG and TrxG can sample CpG islands genome wide and establish domains of repression or activation depending on the transcriptional state at the target promoter also supports the idea that the transcriptional state defines the chromatin modification landscape. This mechanism requires extensive positive and negative crosstalk between these modifications that we have discussed.Many writers of chromatin modifications are positively regulated by the marks that they place, as well as other marks associated with the same transcriptional state, contributing to the reinforcement of gene expression or silencing. This mechanism could also account for the spreading of marks such as the repressive modifications H3K9me3 and H3K27me3 over large domains in differentiated cells. These reinforcing mechanisms may also play a role in cellular memory by faithful propagation of the histone modifications that allow gene expression profiles to be maintained epigenetically through cell division. There is evidence to suggest that some writers remain associated with chromatin during DNA replication, but the exact molecular mechanisms of this epigenetic memory have yet to be fully elucidated.Negative histone crosstalk plays an equally important role in dictating distinct chromatin environments. Writers, especially in the case of PcG and TrxG proteins, are often negatively regulated by marks associated with the opposing transcriptional state. As we have seen in the case of H3K27 methylation and acetylation, negative crosstalk is also involved in the switching of gene expression states. It is known that in certain circumstances, marks associated with positive and negative transcription can coexist. In ESCs, bivalent promoters contain both H3K27me3 and H3K4me3, albeit on different histone tails of the same nucleosome, and may represent a chromatin profile amenable to switching between transcriptional states. Genes coding for master transcription factors often have bivalent promoters in ESCs, and their expression is dynamically regulated through development.The ability of writers to read and place histone modifications is important for the maintenance and regulation of specific transcriptional states throughout development. This is evident by the fact that mice deficient in chromatin-modifying enzymes display severe developmental defects. Several of these enzymes have been known to be involved in various genetic disorders such as Sotos syndrome and Wolf–Hirschhorn syndrome related to translocation of several members of the NSD family of H3K36 methyltransferases 153–155, and cancers related to translocation or mutation of MLL, NSD2, PRC2, SETD2, and countless others 156–158. Most recently, an H3K27M mutation in H3.3 found in pediatric glioma cancer has been shown to deplete levels of H3K27me3 globally potentially through a dominant negative mechanism 159.In conclusion, the ability of chromatin writers to read preexisting histone modifications contributes in two major ways. First, it allows the maintenance of distinct and robust transcriptional states, which could potentially be propagated through cell division, and therefore act as epigenetically inherited features. Second, crosstalk between modifications and enzymes of opposing transcriptional states can allow the establishment of bistable switches that allow the dynamic regulation of gene expression states. In the future, we hope to understand the extent to which histone crosstalk plays a role in defining the epigenetic landscape of a cell (Sidebar A), and address the role of histone modifications and the crosstalk between them during the processes of development and disease.How do chromatin modifications regulate gene transcription? Although there is a clear correlation between chromatin modifications and gene expression states, it will be important to establish the role that modifications play in regulating transcription and indeed whether they are a cause or an effect of transcription.What is the role of highly prevalent chromatin modifications such as H3K36me2 or H3K27me2 that mark up to 50% of H3 in mESCs and differentiated tissues 84,121,160? Is it possible that these marks are necessary to reduce noise, for example, by blocking inappropriate histone modifications?What is the role of histone variants? H3.3 and H2A.Z are enriched over active genes and may have more specialized regulatory roles compared to their more abundant canonical counterparts 161,162. Histone readers and writers may be sensitive to the histone variant status. For instance, the putative tumor suppressor ZMYND11 is an H3.3 variant-specific reader of H3K36me3 163, while the H3K27 methyltransferase in plants, ATRX5/6, is active on the canonical H3.1 but inhibited by H3.3 164.How are epigenetic profiles established within a cell? What are the relative contributions of direct targeting of histone-modifying activities and crosstalk between histone modifications or transcriptional state? For example, are chromatin writers able to sample and read the preexisting chromatin state to determine their activity or binding profiles, or are they directly recruited to their sites of action by sequence-specific DNA-binding factors?To what extent are epigenetic modifications maintained through cell division, and do self-reinforcing feedback loops provide a model for the mechanism of such inheritance? Unlike writers and readers, the genomic location of histone modifications can be easily transmitted through both mitosis and meiosis because they are an integral part of the packaging of DNA. However, self-reinforcing loops might become essential after replication to overcome the dilution of old, modified nucleosomes with new nucleosomes and maintain an epigenetic code.Do the reciprocal feedback loops between positively and negatively acting histone marks provide the basis for a bistable switch, in which each state is positively reinforced and stable once the initial decision has been made? In order to properly generate and validate such mathematical models, it will be critical to obtain quantitative data on the kinetics and dynamics of the catalytic and binding processes that are involved. This will involve experiments such as FRAP and in vitro binding and enzymatic assays, and importantly determining the changes that occur when the system has been perturbed.
Authors: Francois Fuks; Paul J Hurd; Daniel Wolf; Xinsheng Nan; Adrian P Bird; Tony Kouzarides Journal: J Biol Chem Date: 2002-11-09 Impact factor: 5.157
Authors: Antoine H F M Peters; Stefan Kubicek; Karl Mechtler; Roderick J O'Sullivan; Alwin A H A Derijck; Laura Perez-Burgos; Alexander Kohlmaier; Susanne Opravil; Makoto Tachibana; Yoichi Shinkai; Joost H A Martens; Thomas Jenuwein Journal: Mol Cell Date: 2003-12 Impact factor: 17.970
Authors: Pierre-Marie Dehé; Bernhard Dichtl; Daniel Schaft; Assen Roguev; Mercè Pamblanco; Régine Lebrun; Alfonso Rodríguez-Gil; Msau Mkandawire; Katarina Landsberg; Anna Shevchenko; Andrej Shevchenko; Lorena E Rosaleny; Vicente Tordera; Sebastián Chávez; A Francis Stewart; Vincent Géli Journal: J Biol Chem Date: 2006-08-18 Impact factor: 5.157
Authors: Huck Hui Ng; David N Ciccone; Katrina B Morshead; Marjorie A Oettinger; Kevin Struhl Journal: Proc Natl Acad Sci U S A Date: 2003-02-06 Impact factor: 11.205
Authors: N Kanu; E Grönroos; P Martinez; R A Burrell; X Yi Goh; J Bartkova; A Maya-Mendoza; M Mistrík; A J Rowan; H Patel; A Rabinowitz; P East; G Wilson; C R Santos; N McGranahan; S Gulati; M Gerlinger; N J Birkbak; T Joshi; L B Alexandrov; M R Stratton; T Powles; N Matthews; P A Bates; A Stewart; Z Szallasi; J Larkin; J Bartek; C Swanton Journal: Oncogene Date: 2015-03-02 Impact factor: 9.867
Authors: Grace L Cooper; Carol J Huseby; Claire N Chandler; Jean-Christophe Cocuron; Ana P Alonso; Jeff Kuret Journal: Anal Biochem Date: 2018-03-15 Impact factor: 3.365
Authors: Anna L Boudoures; Jacob J Pfeil; Elizabeth M Steenkiste; Rachel A Hoffman; Elizabeth A Bailey; Sara E Wilkes; Sarah K Higdon; Jeffrey S Thompson Journal: Genetics Date: 2017-05-18 Impact factor: 4.562