| Literature DB >> 26453572 |
Moritz Bauer1, Johanna Trupke1,2, Leonie Ringrose3,4.
Abstract
A long-standing mystery in the field of Polycomb and Trithorax regulation is how these proteins, which are highly conserved between flies and mammals, can regulate several hundred equally highly conserved target genes, but recognise these targets via cis-regulatory elements that appear to show no conservation in their DNA sequence. These elements, termed Polycomb/Trithorax response elements (PRE/TREs or PREs), are relatively well characterised in flies, but their mammalian counterparts have proved to be extremely difficult to identify. Recent progress in this endeavour has generated a wealth of data and raised several intriguing questions. Here, we ask why and to what extent mammalian PREs are so different to those of the fly. We review recent advances, evaluate current models and identify open questions in the quest for mammalian PREs.Entities:
Keywords: Drosophila; Epigenetics; Mammal; Polycomb; Polycomb response element; Trithorax
Mesh:
Substances:
Year: 2015 PMID: 26453572 PMCID: PMC4901126 DOI: 10.1007/s00412-015-0539-4
Source DB: PubMed Journal: Chromosoma ISSN: 0009-5915 Impact factor: 4.316
Fig. 1Composition of PcG Complexes in flies and vertebrates. The Polycomb repressive complex 2 (PRC2) and Polycomb repressive complex 1 (PRC1) family of complexes are shown. Core subunits are shown in green for PRC2 and blue for PRC1. Alternate subunits, derived from multiple genes and if more than two, are shown in grey. Accessory proteins are shown in orange. Top: mouse complexes; bottom: Drosophila complexes. Selected histone modifications are shown: Red hexagons: histone H3 tail trimethylated at lysine 27 (H3K27me3); yellow ovals: histone H2A monoubiquitinated at lysine 119 (vertebrates) or 118 (fly). (H2AK119/118 Ub). a PRC2 consists of four core subunits, EZH1/2 (fly E(Z)), EED (fly ESC), SUZ12 (fly SU(Z)12), and RbAp46/48 (or RBBP7/4; fly NURF55) (Cao et al. 2002; Czermin et al. 2002; Kuzmichev et al. 2002; Muller et al. 2002), and three accessory proteins, PCL (Walker et al. 2010), JARID2 (Herz et al. 2012; Kalb et al. 2014; Landeira et al. 2010; Li et al. 2010; Pasini et al. 2010a; Peng et al. 2009; Shen et al. 2009), and AEBP2 (Cao and Zhang 2004; Kalb et al. 2014). Alternate translation start site usage results in four different EED isoforms (not shown in the figure), which have different preferred histone substrates (Kuzmichev et al. 2004). PRC2 dimethylates and trimethylates histone H3 at Lys27 (H3K27me3) through the SET domain of EZH1/2 (fly E(Z)) (Cao et al. 2002; Czermin et al. 2002; Kuzmichev et al. 2002; Muller et al. 2002). In addition, PRC2 can bind H3K27me3 via EED (Hansen et al. 2008; Margueron et al. 2009). b Canonical PRC1 consists of four core subunits, RING1A/B (fly dRING), CBX (fly PC), PCGF (fly PSC or SU(Z)2), and PHC (fly PH) (Gil and O’Loghlen 2014; Simon and Kingston 2009). PRC1 catalyses H2AK119Ub1 (in flies H2AK118Ub1) through its RING1A/B (fly dRING) subunit (Cao et al. 2005; de Napoles et al. 2004; Scheuermann et al. 2012; Wang et al. 2004a). Canonical PRC1 can bind H3K27me3 via the chromodomain of CBX2 or 7 (fly PC) (Bernstein et al. 2006b; Fischle et al. 2003; Min et al. 2003); however, different CBX proteins have different preferences for modified histone tails (Bernstein et al. 2006b), see main text and Fig. 2 for details. c Top: one class of vertebrate non-canonical PRC1s consists of three core subunits, RING1A/B, PCGF, and RYBP or YAF2 and various accessory proteins. The complexes are distinguished by different PCGF subunits. The complex containing PCGF1 (PRC1.1) also contains the histone H3K36 demethylase KDM2B. Other PCGF subunits copurify with other accessory proteins (orange) (Gao et al. 2012). Bottom: Drosophila dRAF is the most similar to vertebrate PRC1.1 and consists of dRING, PSC, and the histone H3K36 demethylase dKDM2 (Lagarou et al. 2008). Further non-canonical PRC1s exist and are reviewed in Gil and O’Loghlen (2014) and Simon and Kingston (2013). See main text and Table 1 for detail on molecular properties
Fig. 2Evidence for different molecular mechanisms mediated by Drosophila and vertebrate PcG proteins. a Different proteins of the PRC1 complex mediate chromatin compaction in Drosophila and mouse (Grau et al. 2011). Purified PRC1 (see Fig. 1) from both fly and mouse can compact nucleosomal arrays in vitro; however, a different protein mediates this activity in the two species. Fly and mouse homologs of the proteins involved are shown. Red regions show domains required for compaction in each case, which are overrepresented in basic amino acids. Other domains and degree of conservation between mouse and Drosophila are indicated. b Alignment of the chromodomains of Drosophila Polycomb (PC, amino acids 15–77) and five mouse homologs (CBX, amino acids 1–62) redrawn from Bernstein et al. (2006b) and coloured according to the ClustalX colour scheme http://www.jalview.org/help/html/colourSchemes/clustal.html. On the right of the alignment, in vitro binding preferences of the different chromodomains from Bernstein et al. (2006b) are shown. Histone binding was addressed using modified peptides, Kds ranged between 12 and 49 μM. RNA binding was non-sequence specific. RNA-binding activity of the Drosophila PC chromodomain has not been reported to our knowledge
Evidence for common molecular mechanisms mediated by Drosophila and vertebrate PcG and TrxG proteins
| Molecular mechanism | Vertebrate | Fly |
|---|---|---|
| (A) PcG: self-reinforcing mechanisms | ||
| PRC2 trimethylates H3K27 through SET domain of EZH2 /E(Z) | (Cao et al. | (Czermin et al. |
| PRC2 binds H3K27me3 via EED/ESC | • (Hansen et al. | Indirect: mutations in conserved residues of ESC cause developmental defects in |
| PRC1 binds H3K27me3 via CBX/PC chromodomain | Different CBX chromodomains have different preferences (Bernstein et al. | in vitro: (Fischle et al. |
| PRC1 can bind chromatin independently of H3K27me3 in vivo | • PRC1 proteins are recruited to the inactive X chromosome in the absence of EED (Schoeftner et al. | In cultured cells, PC binds the Ubx promoter independently of PHO and E(Z) (Wang et al. |
| PRC1 catalyses monoubiquitylation of H2A on K119 (K118 in fly) through RING1A/1B/dRING | (Cao et al. | (Scheuermann et al. |
| H2A Ub colocalises with poised polymerase (ser5P) | In mouse ESCs (Brookes et al. | Polycomb colocalises with stalled promoters in |
| PRC2 binds H2A Ub via Aebp2 and Jarid2 in vitro | • Extracts from mouse ESCs (Kalb et al. | Extracts from |
| PRC1 blocks the assembly of transcriptional components. | Reconstituted PRC1 prevents assembly of preinitiation complex in vitro (Lehmann et al. | Indirect: PcG reduces accessibility to PolII in |
| PRC1 compacts chromatin | • In vitro: Different proteins in mouse and fly PRC1 mediate compaction (Grau et al. | • In vitro (Francis et al. |
| Compacted chromatin stimulates PRC2 activity in vitro | • (Yuan et al. | (Yuan et al. |
| PH and PSC SAM domains form homopolymers and heteropolymers | • (Kyba and Brock | • (Kim et al. |
| (B) TrxG: self-reinforcing mechanisms | ||
| ASH1L/ASH1 SET domain methylates H3K36 (mono and dimethylation). | • (Tanaka et al. | (Tanaka et al. |
| ASH1 required for TRX recruitment | Indirect: Vertebrate ASH1L shows similar distribution to MLL (Gregory et al. |
|
| MLL/TRX monomethylates H3K4 | (Tie et al. | (Tie et al. |
| Trx interacts with CBP | (Tie et al. | |
| CPB acetylates H3K27 | p300 and CBP (Pasini et al. | (Tie et al. |
| BRD4/FSH(1) binds acetylated lysine | (Devaiah et al. | (Kellner et al. |
| BRD4 phosphorylates Pol II Cter (ser2P)- may promote elongation. | (Devaiah et al. | No data found. FSH shares homology with BRD4 and may also be a PolII kinase. |
| (C) PcG and TrxG: antagonistic mechanisms | ||
| KDM2B/dKDM2 demethylates Ash1 mediated H3K36 and promotes H2A ubiquitination via RING1/dRING | • KDM2B is a H3K36 demethylase (He et al. | (Lagarou et al. |
| H3K4 and H3K36 methylation inhibit PRC2 | (Yuan et al. | (Yuan et al. |
| H3K27Ac antagonises PRC2 | (Jung et al. | (Tie et al. |
Column 1: molecular mechanism. Complex names (PRC1 or 2: Polycomb repressive complex 1 or 2). Individual protein names (vertebrate/ fly) are given. Columns 2 and 3: vertebrate, fly. If evidence for the mechanism exists, the reference is given, with notes where appropriate. A blank indicates that no references were found. Self-reinforcing PcG-mediated silencing mechanisms (A), self-reinforcing TrxG-mediated mechanisms (B), and those that are antagonistic to PcG-mediated silencing (C)
Fig. 3Similarities and differences in Drosophila and vertebrate Hox gene regulation. a The Drosophila Antennapedia (ANT-C) and Bithorax (BX-C) complexes and the mouse HoxD complex are drawn approximately to scale, based on Duboule (2007) and Maeda and Karch (2009). Dark bars indicate exons (introns not shown for HoxD due to scaling); light bars and vertical arrowheads in ANT-C and BX-C indicate experimentally verified PREs (Ringrose and Paro 2004 and references therein). Genes and regulatory regions with a common colour are most closely related in sequence between fly and mouse, and thus belong to the same paralogy group (Duboule 2007). Note that the colour coding is not intended to reflect the different regulatory regions of ANT-C and BX-C as in Maeda and Karch (2009). b Pattern of histone H3 lysine 27 methylation at mouse HoxD (left) and fly BX-C (right) in specific tissues over developmental time. Left: summary of data from Soshnikova and Duboule (2009b). In embryonic stem cells, H3K27me3 covers the entire HoxD locus (top). In tail buds of E8.5 embryos (middle) and E9.5 embryos (bottom), Hox genes are sequentially activated leading to clearing of H3K27me3 from the locus. Right: summary of data from Bowman et al. (2014) and Maeda and Karch (2009). In early (0–2 h) embryos (top), the BX-C very probably lacks H3K27me3 and PcG proteins, based on indirect evidence ((Orlando et al. 1998; Petruk et al. 2012); see main text for details). In parasegment 7 of stage 5 (2–3 h) embryos (middle), appropriate Hox genes are activated and repressed by the gap and pair rule gene products (Maeda and Karch 2009). In the same parasegment of later (post 5 h) embryos, repressed domains gain H3K27me3 (Bowman et al. 2014)
Fly and vertebrate DNA binding proteins
| Species | Protein | Role in PcG or TrxG function | Binding site |
|---|---|---|---|
| a) Proteins with function first defined in fly: vertebrate homologs | |||
| Drosophila | PHO, PHOL | Yes | • PHO, PHOL bind GCCAT ((Brown et al. |
| Vertebrate | YY1 | Disputed | • YY1 binds GCCAT with high nM to low M affinity in vitro (Golebiowski et al. |
| Drosophila | PSQ | Yes | PSQ binds to GA repeats; same motif as GAF (Hodgson et al. |
| Vertebrate | No known homolog | ||
| Drosophila | GAF | Yes | • GAF binds to GA repeats; same motif as PSQ (Pedone et al. |
| Vertebrate | mGAF | Unknown | • c-Krox-Th-POK binds to GA repeats in vitro (Matharu et al. |
| Drosophila | ZESTE | Yes | Zeste binds consensus YGAGYG (Biggin et al. |
| Vertebrate | No known homolog | ||
| Drosophila | SP1/KLF family | Yes | • Sp1/KLF consensus: RRGGYG. |
| Vertebrate | SP1/KLF family | Unknown | • SP1 consensus KRGGCGKRRY; binds with high affinity to GGGGCGGGGC (Briggs et al. |
| Drosophila | GRH | Yes | • Variable. Consensus site defined as ACYGGTT(T) (Mace et al. |
| Vertebrate | CP2 | Yes | • CP2 consensus GCNCNANCCAG (Kim et al. |
| Drosophila | DSP1 | Yes | • Binds GAAAA in Fab-7 PRE. |
| Vertebrate | HMGB2 | Unknown | • Recognises structural features, see above (Stros |
| b) Proteins with function first defined in vertebrates: fly homologs | |||
| Vertebrate | JARID 2 | Yes | • Jarid2 binds DNA with no sequence specificity (Patsialou et al. |
| Drosophila | JARID 2 | Unknown | • DNA binding activity/ specificity of fly Jarid2 has not been evaluated to our knowledge. |
| Vertebrate | AEBP2 | Yes |
|
| Drosophila | JING/ AEBP2 | Unknown | • Direct evidence that Drosophila AEBP2 binds DNA and interacts with PRC2 in vivo is lacking. |
| Vertebrate | REST | Yes | • Binds NRSE/RE1 element TTCAGCACCACGGACAGCGCC (Schoenherr and Anderson |
| Drosophila | Charlatan | Unknown | • N- terminal Zn fingers of Charlatan bind NRSE/RE1 element in vitro TTCAGCACCACGGACAGCGCC (Schoenherr and Anderson |
| Vertebrate | KDM2B (FBXL10) | Yes | • Binds to non methylated CpG dinucleotides via Zf- CxxC domain (Long et al. |
| Drosophila | dKDM2 | Yes | • dKDM2 has a CxxC domain but DNA binding has not been tested. |
| Vertebrate | RUNX1 | Yes | • Runx1 binds TGYGGT (Bowers et al. |
| Drosophila | Lozenge | Unknown | • Lozenge binds TGYGGT (Wildonger et al. |
DNA binding proteins that have been shown to play a role in PcG or TrxG regulation in flies and vertebrates are listed. a) Proteins whose function in PcG or TrxG regulation was first defined in flies are listed. Each fly protein is followed by the vertebrate homolog, if known. b) Proteins whose function in PcG or TrxG regulation was first defined in vertebrate are listed. Each vertebrate protein is followed by the fly homologue, if known. Column 3: Role in PcG or TrxG function. A comment on whether there is evidence for a role in PcG or TrxG regulation is given. Detailed information on the evidence supporting these statements and references are given in the extended version of this table provided as Table S1. Column 4: Binding site. Consensus binding sites are listed, using the IUPAC code for non-conserved nucleotides http://www.bioinformatics.org/sms/iupac.html. R =A/G; Y=C/T; S=G/C; W=A/T; K= G/T; M= A/C; B = C/G/T; D= A/G/T; H = A/C/T; V= A/C/G; N= A/C/G/T
Fig. 4Recruitment of mammalian PcG complexes. a Relationship between occurrence of gene promoters, CpG islands, KDM2B and RING1B, according to Deaton and Bird (2011), Farcas et al. (2012), He et al. (2013), and Wu et al. (2013). b The RING1B subunit is a component of multiple different complexes, including both canonical and non-canonical PRC1 (Gao et al. 2012) see main text for details. c Factors influencing PcG recruitment. A stretch of GC- and CpG-rich DNA is shown (yellow). Various motifs for sequence-specific DNA-binding proteins can exist within this DNA (dark yellow), and several of these are themselves GC-rich (see Table 2). All of these motifs may also exist in otherwise GC-poor DNA. Proteins that can bind directly to DNA and have been shown or suggested to have role in PcG recruitment are shown in orange. PRC1: indicates all versions of PRC1 except the special case of PRC1.1 which is recruited by KDM2B. Arrows indicate that the DNA-binding protein in question does not copurify with the complexes but has been shown to interact by Co-IP. TA: activating transcription factor. See main text for details
Fig. 5Motif occurrences in mammalian PREs. a A selection of mammalian PREs that have been verified to recruit PcG proteins in transgenic assays are shown (see Table 3 for details): HoxC11-12, HoxB4-5 (Woo et al. 2013), HoxD11-12 (Woo et al. 2010), PREkr (Sing et al. 2009), HoxD10 (Schorderet et al. 2013), DBE (Cabianca et al. 2012). Above each element, the % GC is shown, with CpG islands marked in dark grey, according to the following criteria: window size 100; minimum length of an island 200; minimum observed/expected CpG 0.6; minimum % GC 50.0. NB with these settings the HoxD10 PRE scores a short 200 bp GpG island; however, this was not detected by the more stringent settings used by Schorderet et al. (2013) and is designated as having no CpG island in Table 3 according to the authors of that study. Motifs for the DNA-binding proteins shown were scored as regular expressions with no mismatch allowance, as follows: REST: NTCAGCACCNNGGACAGCNCC; CP2: GCNCNANCCAG; RUNX:TGYGGT; YY1: GCCAT; GAF: GAGAGA, using the IUPAC code for non-conserved nucleotides as described in the legend to Table 2. b Occurrence per kb of motifs in the PREs shown and in random sequence (black). To generate random sequence, the total sequence of all elements shown (10.67 kb) was shuffled and searched for motifs. The mean of four iterations is shown
List of transgenic assays identifying vertebrate PREs
| Reference | Element name | Identification of the element | Size | CGI | Transgene type, cell type | Reporter (promoter) | ChIP on transgene (enrichment) | Knockdown |
|---|---|---|---|---|---|---|---|---|
| (Sing et al. | PRE- | Site disrupted by | 3 kb | No | Transient. Mouse F9 embryonic carcinoma cells | LacZ (Hsp68) | BMI1 (+), SUZ12 (+/−) | BMI1 (+), SUZ12 (−) |
| (Meng et al. | BREr p16 promoter (human) | BMI1 represses INK4a locus, which encodes p16 | 210 bp | Yes | Transient. HeLa cells | Luciferase (CMV) | BMI1 (+) | BMI1 (+) |
| (Woo et al. | Hox D11.12 PRE (human) | Site in HOXD cluster with low nucleosome occupancy and flanked by PcG binding | 1.8 kb | Yes | Transient and random integration. Human mesenchymal stem cells and adipocytes | Luciferase (TK) | BMI1 (+), H3K27me3 (+), SUZ12 (+) | BMI1 (+), EED (+), RYBP (+) |
| (Mendenhall et al. | ZFPM2 gene (human) | Locus recruits PRC1/2 and is bivalent. | 1.7 kb | Yes | Random integration. Mouse ESCs | No | Ezh2 (+), H3K27me3 (+), H3K4me3 (+), Ring1B (−) | No |
| (Lynch et al. | HBA2 gene (human) | α Globin locus recruits PcG proteins and is bivalent in humans but not in mice. | 4 kb | Yes | Site-specific integration. Mouse ESCs | No | Cbx7 (+), Ezh2 (+), H3K27me3 (+), H3K4me3 (+) | No |
| (Cuddapah et al. | SLC17A7 and Hox A3 PRE (human) | H3K27me3-enriched regions near silenced genes | 3.1 kb and 2.8 kb | Yes | Transgenic flies, random integration |
| H3K27me3 (+), E(Z) (+), PC (+) | Derepressed in |
| (Cabianca et al. | DBE (human) | Deletion of D4Z4 repeats leads to de-repression of 4q35 genes and loss of PcG | 3.3 kb | Yes | Random integration. CHO cells | No | Ring1B (+), Bmi1 (+), H2Aub1 (+), EZH2 (+), SUZ12 (+), H3K27me3 (+) | No |
| (Vasanthi et al. | HoxD PRE (mouse) | Known element necessary for setting up the early pattern of Hox gene colinear activation | 5 kb | No | Transient. NIH3T3 and HEK293 cells | Luciferase (SV40) | No | No |
| (Arnold et al. | Stmn2, Xkr7, Bdnf, Pgbd5 genes (mouse) | REST predicted to recruit PcG proteins. Testing of promoters with REST sites and high CpG content | 1.2–2 kb | Yes | Site-specific integration. Mouse ESCs and neural progenitors | No | H3K27me3 (+) | No |
| (Bengani et al. | PRE-PIK3C2B (human) | In silico analysis for regions with high density of YY1 consensus motif in the human genome | 1 kb | No | Transient. HEK293 cells | GFP (CMV) | No | No |
| (Woo et al. | Hox B4.5 and C11.12 PREs (human) | Sites in HOXB/C cluster with low nucleosome occupancy and flanked by PcG protein binding | 1.4 kb and 600 bp | No | Transient and random transgenic. Human mesenchymal stem cells | Luciferase (HOXA2) | BMI1 (+), H3K27me3 (+) SUZ12 (+), YY1 (+). | B4.5: BMI1 (+), EED (+), JARID2 (+); C11.12: JARID2 (−) |
| (Schorderet et al. | PREd10 (HoxD10) (mouse) | Region necessary for deposition of H3K27me3 at HoxD locus | 1.4 kb | No | Random integration. Mouse ESCs | No | H3K27me3 (+), Ring1B (+), Suz12 (+). | No |
| (Basu et al. | 3L-L and 3L-S | CpG island shown to have loss of DNA methylation in cancer samples | 486 and 275 bp | Yes | Transient. HEK293 cells | GFP (CMV) | ASHL1 (+), CBX2 (+), EED (+), EZH2 (+), H3K27me3 (+), H3K4me3 (−), MLL (−), PHF1 (+), WDR5 (−) | No |
| (Riising et al. | c-Jun promoter (mouse) | Target of PRC2 in ESCs and contains a CpG island. | 4 kb and 500 bp | Yes | Site-specific integration. Mouse ESCs and neural progenitors | Luciferase (c-Jun or mPGK) | Suz12 (+) | No |
| (Jermann et al. | 8 CpG islands (mouse) | Enrichment for H3K27me3 and SUZ12 in genome-wide ChIP datasets from murine ES cells | 500 to 900 bp | Yes | Site-specific integration. Mouse ESCs and neural progenitors | Luciferase (Utf1) | H3K27me3 (+), Ring1B (+*), Suz12 (+*) | No |
| (van Heeringen et al. | Several sites (Xenopus) | Enrichment for H3K27me3 in Xenopus blastula, and high score in computational prediction | 1 kb | No | Transient. Mouse ESCs and | Luciferase (SV40) | H3K27me3 (+), H3K4me3 (−) | No |
Studies identifying vertebrate PREs are listed chronologically. Column 1: authors. Column 2: element name. The name of the identified element and/or genomic location is given. The species in which the element was identified is given in brackets. Column 3: identification of the element. A brief description is given on how the element was identified. Column 4: size of element. Column 5: CpG island. The occurrence of a CpG island in elements is indicated. Column 6: transgene type, cell type. Indicates whether transgenes were evaluated by transient transfection, random or site-specific integration, and in which cell type(s) the transgenes were evaluated. Column 7: reporter. If the effect of the element on reporter gene expression was analysed, the reporter and promoter (in brackets) are given. Column 8: ChIP (enrichment). ChIP enrichment at the transgenic element is given where this was evaluated. (+) indicates strong binding, (+/−) weak binding, (−) no binding (*) differential binding to multiple constructs. Column 9: knockdown. Genes targeted for knockdown are given. (+) indicates de-repression of the reporter upon knockdown, (−) no de-repression
Fig. 6Different modes of PcG binding and their resulting ChIP-binding profiles. On the left are shown different modes of dynamic binding of PcG proteins to PREs. On the right are shown the ChIP profiles that would result from each mode of binding. a Spreading. PcG proteins are recruited by a PRE and subsequently spread up and downstream (left), resulting in a broad ChIP peak (right) from which the PRE is not identifiable. b Looping. PcG proteins are recruited by a PRE and subsequently loop to the promoter via higher order interactions (left), resulting in two ChIP peaks (right) only one of which is a bona fide PRE. c Dynamic changes. In the example shown, PcG proteins are recruited by a PRE and are subsequently delivered to a different location (left), resulting in a ChIP peak at the site of delivery (in this example, the gene) but not at the site of entry (in this example, the PRE) (right). Variations on this theme include different profiles in different cell types, in which only a subset of multiple PREs may be occupied in different tissues or at different times
Fig. 7Summary of similarities and differences relevant for fly and vertebrate PREs. The figure summarises the main points of this review. For PRC1, PRC2, TRXG, and target genes, key similarities and differences are listed, discussed in detail in the main text. For DNA-binding factors and PREs, open questions are identified, discussed in the conclusion section of the review