| Literature DB >> 35489059 |
Brady M Owen1, Chen Davidovich1,2.
Abstract
Polycomb group proteins predominantly exist in polycomb repressive complexes (PRCs) that cooperate to maintain the repressed state of thousands of cell-type-specific genes. Targeting PRCs to the correct sites in chromatin is essential for their function. However, the mechanisms by which PRCs are recruited to their target genes in mammals are multifactorial and complex. Here we review DNA binding by polycomb group proteins. There is strong evidence that the DNA-binding subunits of PRCs and their DNA-binding activities are required for chromatin binding and CpG targeting in cells. In vitro, CpG-specific binding was observed for truncated proteins externally to the context of their PRCs. Yet, the mere DNA sequence cannot fully explain the subset of CpG islands that are targeted by PRCs in any given cell type. At this time we find very little structural and biophysical evidence to support a model where sequence-specific DNA-binding activity is required or sufficient for the targeting of CpG-dinucleotide sequences by polycomb group proteins while they are within the context of their respective PRCs, either PRC1 or PRC2. We discuss the current knowledge and open questions on how the DNA-binding activities of polycomb group proteins facilitate the targeting of PRCs to chromatin.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35489059 PMCID: PMC9122586 DOI: 10.1093/nar/gkac290
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 19.160
Figure 1.Components of PRC1 and PRC2 complexes in mammals. PcG proteins form two predominant families of complexes; PRC1 and PRC2. Complexes within each family share common core proteins (shown in blue) and interact with accessory subunits (in magenta or yellow) which modulate their enzymatic activity and are required for chromatin binding and correct targeting. Known DNA-binding accessory subunits which may contribute to sequence selective recruitment of PcG proteins are highlighted in yellow. Arrows indicate binding to the core complex, but not necessarily the sites of protein-protein interactions since these are not always known. Mutually exclusive interactions are indicated by ‘or’.
Figure 2.Proposed DNA-binding regions of PRC2 proteins. PRC2 subunits are presented only if DNA-interacting regions were previously identified in them. DNA-binding domains are shown in green. DNA-binding regions outside of defined domains in EZH2 and EED are outlined in green.
DNA-binding by polycomb-group proteins: a quick guide to the biophysical evidence. Included in the table are only proteins that were shown to bind DNA. The table provides information on the protein construct that was assayed and the binding assays used for confirming the DNA-binding activity and specificity, where applicable. Information about the identified DNA motif is indicated only where probes were designed to test for binding specificity for the sequence, nucleotide composition or the base modifications indicated under ‘Motif’. In cases where ‘n.a.’ is indicated under ‘Motifs’, evidence could not support or was not intended to support binding specificity. In cases where ‘n.d.’ is indicated under ‘Motif’, specificity was not determined although experiments were designed to identify it (commonly, these would include size-matched probers that were subjected to specific perturbations required to test for a hypothesized motif; see Table 2 for a summary of probes used in various studies). Importantly, although ‘n.d.’ under ‘Motif’ indicates that specificity was not determined under the assayed conditions, it should not necessarily be interpreted as nonspecific binding. Unless otherwise indicated, nucleotide sequences and base modifications are referred to as DNA bases within the context of double-stranded DNA. All proteins are of mammalian species unless otherwise indicated. Method abbreviations: electrophoretic mobility shift assay (EMSA), microscale thermophoresis assays (MST), isothermal titration calorimetry (ITC), fluorescence anisotropy (FA), filter binding assay (FBA), systematic evolution of ligands by exponential enrichment (SELEX) and cyclic amplification and selection of targets (CASTing)
| Protein | DNA-binding activity | DNA-binding specificity | Comments | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Relevant complex | Assayed subunit | Was full-length protein assayed? [Y/N] | Was protein assayed within its PRC? [Y/N] | Method(s) used for detecting DNA-binding activity | Were quantitative binding assays done? [Y/N] | Were probes designed to test for binding specificity? [Y/N] | Motif | Method(s) used for assaying DNA-binding specificity | |
| PRC2 | PRC2 (EZH2, EED, SUZ12 and RBBP4) | Y | Y | EMSA ( | Y | Y | n.d. | EMSA ( | |
| PRC2.2 | AEBP2 | Y | Y | EMSA and FA ( | Y | Y | Methylated CpG DNA is preferred over unmethylated CpG DNA. | EMSA ( | |
| PRC2.2 | AEBP2 | N | N | EMSA ( | N | Y | CTT[N18-26]GCC motif concluded. | EMSA ( | |
| PRC2.2 | JARID2 | Y | Y | EMSA ( | Y | N | n.a. | ||
| PRC2.2 | JARID2 | Y | N | SELEX ( | N | N | Slight preference for CG-rich sequences. | SELEX ( | |
| PRC2.2 | JARID2 | N* | N | EMSA ( | N | N | n.a. | The JmjN, ARID, JmjC and the C5HC2 Zn finger domains are collectively required for DNA binding. ( | |
| PRC2.2 | JARID2 | N | N | EMSA ( | N | N | 5′-(T)4–6, TATT, or TAAT motifs | CASTing ( | |
| PRC2.1 | PHF1 | Y | Y | EMSA and MST ( | Y | Y | n.d. | MST ( | |
| PRC2.1 | PHF1 | N | N | ITCand EMSA ( | Y | Y | Two CpG dinucleotides are required but not sufficient for binding. | ITC ( | |
| PRC2.1 | MTF2 | Y | Y | EMSA ( | Y | Y | G and C bases are required for binding. | EMSA ( | |
| PRC2.1 | MTF2 | Y | ?* | DNA pulldown from mouse ESC extract ( | N | Y | Preferred binding to unmethylated DNA and helical-shape-selectivity concluded. | DNA pulldown from mouse ESC extract ( | *Immunoblotting for selected subunits carried out. |
| PRC2.1 | MTF2 | Y* | N | EMSA ( | Y | N | n.a. | *Amino acids 1–539 were assayed, which is nearly the full-length protein. | |
| PRC2.1 | MTF2 | N | N | ITC and EMSA ( | Y | Y | Two CpG dinucleotides are required but not sufficient for binding. | ITC ( | |
| PRC2.1 | PHF19 | Y | Y | EMSA ( | Y | Y | Enhanced affinity* to GC-rich sequences. | EMSA ( | *2- to 4-fold |
| PRC2.1 | Pcl (fly) | N | N | FA ( | Y | Y | n.d. | FA ( | |
| PRC2.1 | PALI1 | N | Y | EMSA ( | Y | Y | n.d. | EMSA ( | |
| PRC2.1 | PALI1 | N | N | EMSA ( | Y | N | n.a. | ||
| PRC1 | Pc, Ph, Psc and dRing1 (fly) | Y | Y (cPRC1) | FBA ( | Y | N | n.a. | ||
| PRC1 | Psc (fly) | Y | N | FBA ( | Y | N | n.a. | ||
| PRC1.1 | KDM2B | N | N | EMSA ( | N | N | Non-methylated CpG is preferably bound compared to methylated CpG DNA. | EMSA ( | |
| PRC1.1 | KDM2B | N | N | EMSA ( | N | N | n.a. | ||
| PRC1.1 | KDM2B | N | N | ITC ( | Y | Y | Non-methylated CpG is preferentially bound. | ITC ( | |
| PRC1.2/4 | CBX2 | N* | N | EMSA ( | Y | N | n.a. | * The AT-hook domain identified as sufficient for binding to DNA. | |
| PRC1.2/4 | CBX8 | N* | N | EMSA ( | N | N | n.a. | * The chromodomain identified as sufficient for binding to DNA. | |
| PRC1.4 | RING1B- PCGF4 | N | N | FA ( | Y | Y | n.d. | FA ( | |
| PRC1.6 | E2F6-DP2 heterodimer | Y | N | Gel shift ( | N | Y | DNA from a selected target gene. | Gel shift ( | |
| PRC1.6 | MAX-MGA | N* | N | Gel shift ( | N | N | n.a. | *heterodimers including MGA truncations and the full-length MAX were assayed. | |
| PhoRC and possibly other PRCs | Pho (fly) | Y | Y (cPRC1) | EMSA ( | N | N | n.a. | ||
| PhoRC and possibly other PRCs | Pho (fly) | Y | N | EMSA ( | N | Y | 13-mer motif* derived from PREs is required and sufficient for binding. | EMSA ( | *The motif identified earlier from PRE sequences ( |
Summary of quantitative studies of polycomb-group protein binding to naked DNA. For each protein, records are in descending order of K values. All DNA is double-stranded unless otherwise indicated. ‘-’ indicates that no binding was detected or that the affinity was too weak to quantify. ‘mC’ denotes 5′-methylcytosine. Abbreviations: Electrophoretic mobility shift assay (EMSA), fluorescence anisotropy (FA), filter binding assay (FBA) isothermal titration calorimetry (ITC), microscale thermophoresis (MST). a DNA hairpins: the sequence of the double-stranded regions is given, and the underlined sequence is the hairpin loop. The length, CpG number and GC percentage are given for the double-stranded region only. b The sequence was not given. c These were titration experiments where the probe concentration was close to or above the K concentration. d May have been PHF128-87. e Values in brackets represent the 95% confidence interval. f Hemi-methylated. g phosphorylated in the serine-rich region
| Protein construct | Substrate |
| Length | Number of CpGs | GC (%) | Method | Reference | |
|---|---|---|---|---|---|---|---|---|
| PRC2 | ||||||||
| 1 | (mCpG)24 | 31 ± 5 | 48 | 24 | 100 | EMSA | ( | |
| 2 | (CpG)24 | 32 ± 3 | 48 | 24 | 100 | EMSA | ( | |
| 3 | TCCCTCTCTC | 64 ± 4 | 30 | 5 | 73 | EMSA | ( | |
| 4 | CATAAT | 150 ± 11 | 30 | 1 | 37 | MST | ( | |
| 5 | TCCCTCTCTC | 218 ± 22 | 30 | 5 | 73 | MST | ( | |
| 6 | GG | >4000 | 46 | 4 | 80 | FA | ( | |
| 7 | AATATTTCATTTTATTCTATCTCAATGAGACAAAAGATTGATTAAT | >4000 | 46 | 0 | 20 | FA | ( | |
| PRC2.2 | ||||||||
| 8 | PRC2-AEBP2 | (m | 11 ± 6 | 48 | 24 | 100 | EMSA | ( |
| 9 | PRC2-AEBP2 | ( | 15 ± 2 | 60 | 30 | 100 | FA | ( |
| 10 | PRC2-AEBP2 | 60merb | 55 ± 9 | 60 | EMSA | ( | ||
| 11 | PRC2-AEBP2 | AATGm | 180 ± 40 | 48 | 8 | 69 | EMSA | ( |
| 12 | PRC2-AEBP2 | ( | 562 ± 47 | 48 | 24 | 100 | EMSA | ( |
| 13 | PRC2-AEBP2 | (TA)30 | 725 ± 50 | 60 | 0 | 0 | FA | ( |
| 14 | PRC2-AEBP2 | AATG | 5,900 ± 600 | 48 | 8 | 69 | EMSA | ( |
| 15 | PRC2-AEBP2 | GGGCGGCCGCCC | >2000 | 12 | 2 | 100 | EMSA | ( |
| 16 | PRC2-AEBP2 | ACGCGCGCCGGGAAATTGAAGCGG | >2000 | 24 | 5 | 67 | EMSA | ( |
| 17 | PRC2-AEBP2 | TCCGCCCGCTCGACGCGCGCCGGGAAATTGAAGCGG | >2000 | 36 | 8 | 72 | EMSA | ( |
| 18 | PRC2-AEBP2 | GCGCTAGGAGCATCCGCCCGCTCGACGCGCGCCGGGAAATTGAAGCGG | >2000 | 48 | 9 | 71 | EMSA | ( |
| 19 | PRC2-AEBP2 | TATATAATTATATTATATTAAAAT | >2000 | 24 | 0 | 0 | EMSA | ( |
| 20 | PRC2-AEBP2 | TCCGCCCGCTCGACGCGCGCCGGGAAATTGAAGCGG | >2000 | 36 | 8 | 72 | EMSA | ( |
| 21 | PRC2-AEBP2 | ATATTTTATATATATATAAATTAATATATAATTATATTATATTAAAAT | >2000 | 48 | 0 | 0 | EMSA | ( |
| 22 | PRC2-AEBP2-JARID2 | 60merb | 46 ± 11 | 60 | EMSA | ( | ||
| PRC2.1 | ||||||||
| 23 | PRC2-PHF1 | TCCCTCTCTC | 30 ± 3 | 30 | 5 | 73 | EMSA | ( |
| 24 | PRC2-PHF1(515–567) | TCCCTCTCTC | 49 ± 4 | 30 | 5 | 73 | EMSA | ( |
| 25 | PRC2-PHF1 | CATAAT | 117 ± 5 | 30 | 1 | 37 | MST | ( |
| 26 | PRC2-PHF1 | TCCCTCTCTC | 142 ± 22 | 30 | 5 | 73 | MST | ( |
| 27 | PRC2-PHF1(515–567) | TCCCTCTCTC | 252 ± 8 | 30 | 5 | 73 | MST | ( |
| 28 | PHF1165-360 | GGG | 3,900 ± 500 | 12 | 2 | 83 | ITC | ( |
| 29 | PHF1165-360 | GGG | 11,300 ± 800 | 14 | 2 | 86 | ITC | ( |
| 30 | PHF1165-360 | GGC | 22,000 ± 4,000 | 12 | 2 | 100 | ITC | ( |
| 31 | PHF1165-363 | TCCCTCTCTC | 29,700 ± 500 | 14 | 5 | 73 | FA | ( |
| 32 | PHF1165-360 | GGG | 31,000 ± 2000 | 12 | 2 | 83 | ITC | ( |
| 33 | dPHF114-87 | 10 bpb | 201 | 10 | NMR | ( | ||
| 34 | PHF1165-360 | GGA | - | 12 | 2 | 67 | ITC | ( |
| 35 | PHF1165-360 | GGA | - | 12 | 2 | 67 | ITC | ( |
| 36 | PHF1165-360 | GGT | - | 12 | 2 | 67 | ITC | ( |
| 37 | PRC2-MTF2 | GTGGTGGTGGTGGTTGAGAAAATAAAACCCAGAG | 15 ± 0.3 | 100 | 10 | 57 | EMSA | ( |
| 38 | MTF21-539 | GTGGTGGTGGTGGTTGAGAAAATAAAACCCAGAG | 190 | 100 | 10 | 57 | EMSA | ( |
| 39 | MTF2180-378 | GGG | 2100 ± 300 | 12 | 2 | 100 | ITC | ( |
| 40 | MTF2180-378 | GGT | 6400 ± 1000 | 12 | 2 | 83 | ITC | ( |
| 41 | MTF2180-378 | GGG | 9000 ± 2000 | 14 | 2 | 86 | ITC | ( |
| 42 | MTF2180-378 | GGC | 12 000 ± 1000 | 12 | 2 | 100 | ITC | ( |
| MTF2180-378 | GGG | 22 000 ± 7000 | 12 | 2 | 83 | ITC | ( | |
| 43 | MTF2180-378 | GGG | 25 000 ± 4000 | 12 | 2 | 83 | ITC | ( |
| 44 | MTF2180-378 | GGA | 33 000 ± 6000 | 12 | 2 | 83 | ITC | ( |
| 45 | MTF2180-378 | GGA | - | 12 | 2 | 67 | ITC | ( |
| 46 | MTF2180-378 | GGA | - | 12 | 2 | 67 | ITC | ( |
| 47 | MTF2180-378 | GGT | - | 12 | 2 | 67 | ITC | ( |
| 48 | PRC2-MTF2 | TATATATATTTTATATATATATAAATTAATATATAATTATATTATATTAAAATTAATATTATATTTAAATTATTATATATATATAAATAATTTAATTATA | - | 100 | 0 | 0 | EMSA | ( |
| 49 | PRC2-PHF19 | TCCGCCCGCTCGACGCGCGCCGGGAAATTGAAGCGG | 10 ( | 36 | 8 | 72 | EMSA | ( |
| 50 | PRC2-PHF19 | ATATTTTATATATATATAAATTAATATATAATTATATTATATTAAAAT | 20 ( | 48 | 0 | 0 | EMSA | ( |
| 51 | PRC2-PHF19 | GCGCTAGGAGCATCCGCCCGCTCGACGCGCGCCGGGAAATTGAAGCGG | 40 ( | 48 | 9 | 71 | EMSA | ( |
| 52 | PRC2-PHF19 | TATATAAATTAATATATAATTATATT ATATTAAAAT | 40 ( | 36 | 8 | 72 | EMSA | ( |
| 53 | PRC2-PHF19 | ACGCGCGCCGGGAAATTGAAGCGG | 370 ( | 24 | 5 | 67 | EMSA | ( |
| 54 | PRC2-PHF19 | TATATAATTATATTATATTAAAAT | 1600 (810, 3900) e | 24 | 0 | 0 | EMSA | ( |
| 55 | PRC2-PHF19 | GGGCGGCCGCCC | >2000 | 12 | 2 | 100 | EMSA | ( |
| 56 | Pcl491-694 | TAATGGCTG | 2900 | 20 | 2 | 55 | FA | ( |
| 57 | Pcl491-694 | CTCTC | 3500 | 20 | 4 | 75 | FA | ( |
| 58 | Pcl491-694 |
| 4200 | 20 | 3 | 55 | FA | ( |
| 59 | Pcl491-694 | GC | 4200 | 20 | 3 | 60 | FA | ( |
| 60 | Pcl491-694 | GT | 4200 | 20 | 3 | 55 | FA | ( |
| 61 | Pcl491-694 | GTTTGCTGAATCTGAATGGT | 4200 | 20 | 0 | 40 | FA | ( |
| 62 | Pcl491-694 | CTGT | 4400 | 20 | 3 | 55 | FA | ( |
| 63 | Pcl491-694 |
| 5100 | 20 | 3 | 60 | FA | ( |
| 64 | Pcl491-694 | CCTCATAAT | 5100 | 20 | 1 | 40 | FA | ( |
| 65 | Pcl491-694 | A | 7300 | 20 | 2 | 60 | FA | ( |
| 66 | Pcl491-694 | TTTAAGTG | 7300 | 20 | 1 | 45 | FA | ( |
| 67 | Pcl491-694 | TAATGGC | 7600 | 20 | 2 | 45 | FA | ( |
| 68 | Pcl491-694 | ATCTCTCCCTCTCTC | 7700 | 20 | 1 | 60 | FA | ( |
| 69 | Pcl491-694 | ACTGAGATGGCCTCATAATC | 8000 | 20 | 0 | 45 | FA | ( |
| 70 | Pcl491-694 | GGCTAAC | 8700 | 20 | 1 | 55 | FA | ( |
| 71 | Pcl491-694 | CTGCAGCTC | 9000 | 20 | 2 | 60 | FA | ( |
| 72 | Pcl491-694 | TCCCTCTCTC | 9700 ± 200 | 30 | 5 | 73 | FA | ( |
| 73 | Pcl491-694 | AGATAAGACTA | 9800 | 20 | 2 | 50 | FA | ( |
| 74 | Pcl491-694 | AG | 11400 | 20 | 1 | 40 | FA | ( |
| 75 | Pcl491-694 | GAGAG | 13300 | 20 | 3 | 60 | FA | ( |
| 76 | PRC2-PALI1058-1250 | AATATTTCATTTTATTCTATCTCAATGAGACAAAAGATTGATTAAT | 73.7 ± 10 | 46 | 0 | 20 | EMSA | ( |
| 77 | PRC2-PALI1058-1250 | GG | 155 ± 26 | 46 | 4 | 80 | EMSA | ( |
| PRC1 | ||||||||
| 78 | Pc-Ph-Psc-dRing1 | 155 bpb | 0.13 ± 0.08 | 155 | FBA | ( | ||
| 79 | Psc | 155 bpb | 0.15 ± 0.08 | 155 | FBA | ( | ||
| PRC1.1 | ||||||||
| 80 | KDM2B607–723 | GCCAC | 2200 ± 300 | 12 | 1 | 83 | ITC | ( |
| 81 | KDM2B607–723 | GCCAG | 1600 ± 400 | 12 | 1 | 83 | ITC | ( |
| 82 | KDM2B607–723 | GCCAA | 2500 ± 500 | 12 | 1 | 67 | ITC | ( |
| 83 | KDM2B607–723 | GCCAT | 2200 ± 300 | 12 | 1 | 67 | ITC | ( |
| 84 | KDM2B607–723 | GCCAGTACTGGC | NB | 12 | 0 | 67 | ITC | ( |
| 85 | KDM2B607–723 | GCCAAAmCGTAACCGf | NB | 12 | 0 | 67 | ITC | ( |
| PRC1.4 | ||||||||
| 86 | RING1B1-116-PCGF41-109 | GTGTTACTAGCT | 2800 ± 300 | 12 | 0 | 42 | Competition FA | ( |
| 87 | RING1B1-116-PCGF41-109 | TCAGCTGAACAT | 3200 ± 300 | 12 | 0 | 42 | FA | ( |
| 88 | RING1B1-116-PCGF41-109 | GGACCTGATGAC | 3200 ± 300 | 12 | 0 | 58 | Competition FA | ( |
| 89 | RING1B1-116-PCGF41-109 | AAGACATA | 3700 ± 200 | 12 | 1 | 42 | Competition FA | ( |
| 90 | RING1B1-116-PCGF41-109 | TCAGCTGAACAT | 3800 ± 300 | 12 | 0 | 42 | Competition FA | ( |
| 91 | RING1B1-116-PCGF41-109 | TCAGCTGAACAT | 4100 ± 300 | 12 | 0 | 42 | Competition FA | ( |
| 91 | RING1B1-116-PCGF41-109 | CTAGCCTAG | 4300 ± 200 | 12 | 1 | 58 | Competition FA | ( |
| 93 | RING1B1-116-PCGF41-109 | TCAGCTGAAC | 6400 ± 900 | 10 | 0 | 50 | Competition FA | ( |
| 94 | RING1B1-116-PCGF41-109 | TCAGCTGA | 12000 ± 2000 | 8 | 0 | 50 | Competition FA | ( |
| 95 | RING1B1-116-PCGF41-109 | TCAGCT | 75000 ± 44000 | 6 | 0 | 50 | Competition FA | ( |
| 96 | CBX22-63 | Major satellite | 8110 ± 360 | 130 | EMSA | ( | ||
| 97 | CBX260-82 | Major satellite | 510 ± 50 | 130 | EMSA | ( | ||
| 98 | CBX22-82 | Major satellite | 320 ± 50 | 130 | EMSA | ( | ||
| 99 | CBX22-129 | Major satellite | 210 ± 10 | 130 | EMSA | ( | ||
| 100 | CBX22-129 g | Major satellite | >10000 | 130 | EMSA | ( |
Figure 3.ARID domain mechanism of DNA binding. (A) Hydrogen bonds and salt bridges between the ARID domain of DRI and DNA are shown with black arrows. DRI Thr351 and Ser352 are shown in bold and make direct contacts with DNA bases which may confer AT selectivity. Contacts were mapped based on the structure of the ARID domains from DRI in a complex with DNA (PDB ID: 1KQQ (118)). (B) The structure of the ARID domain of DRI (dark blue, PDB ID: 1KQQ (118)) with DNA was superimposed over the structure of the ARID domain from JARID2 that was solved without DNA (light blue, PDB ID: 2RQ5 (121)). (C) Close contacts between Thr351 and Ser352 (green) of DRI and the DNA bases are indicated in dashed black lines. The equivalent residues in JARID2 are labelled in light blue. (D) Sequence alignment of JARID2, DRI and MRF2 ARID domains with DRI DNA contacts from (A) highlighted. In all panels, amino acids in DRI from the loop between H5 and H6 contact bases in the major groove and are labelled in green. Additional interactions are mediated by amino acids in the loop between H2 and H3 (magenta), a pocket between H4 and H5 (orange) and the end of H8 (red).
Figure 4.Mechanism of DNA binding by PCL proteins. (A) Sequence alignment of WH domains. Amino acids in the conserved Ile-Lys-Lys motif are shown in bold. (B) Top structure: A DNA-binding mode of Drosophila Pcl (light blue, PDB ID: 5OQD) proposed by Choi et al. (34) based on alignment to the DNA bound structure of the FoxO1 WH domain (dark blue) (PDB ID: 3CO6 (228)). Bottom structure: The DNA-bound structure of PHF1 (magenta), which forms non-canonical interactions with the DNA (PDB ID: 5XFQ (31)). (C) Contacts between PHF1 and the CpG dinucleotide. Contacts as reported in Li et al. (31) are represented by arrows (top) and dashed lines (bottom). Amino acids which contact the DNA bases, and hence are proposed to contribute to sequence selective binding (31) are in bold.
Figure 5.The ZF-CxxC domain of KDM2B bind CpG dinucleotides. Sequence alignment of ZF-CxxC domains. Residues highlighted in green were reported to contact the bases of the CpG dinucleotide (PDB: 2KKF (152), PDB: 3PT6 (153)). Residues in gold were reported to contact the DNA backbone. DNA-contacting residues which are conserved in human KDM2B are in bold. Conserved cysteine residues which coordinate zinc ions are highlighted in blue and two of these cysteines which mutating them disrupt KDM2B function in mice (36) are underlined.
Figure 6.Model of DNA-dependent recruitment of PcG proteins. PRC2.1 is seeded on target DNA by the PCL proteins (top left). PRC1.1 is seeded on DNA by KDM2B (top right) and PRC1.6 may also be recruited at some sites by MGA-MAX or E2F6. Once either the H3K27me3 or the H2AK119ub marks were nucleated, they can recruit the canonical PRC1 (middle left) or the PRC2.2 (middle right), respectively. After both the H2AK119Ub and H3K27me3 repressive marks were established, positive feedback loops involving PRC1, PRC2 and their respective chromatin modifications then lead to the maintenance of the repressive marks and possibly contributed to the recruitment of other PcG proteins (bottom). Orange arrows indicate enhancement of enzymatic activity by accessory subunits (in yellow) or the H3K27me3 mark (in orange). Transparent shapes and question marks indicate uncertainty regarding the precise role taken by the indicated protein in the presented pathway. For simplicity, histone tails are only shown for one copy of H3 and H2A for each nucleosome.