| Literature DB >> 35800781 |
Berend Snel1, Sander van den Heuvel2, Michael F Seidl1.
Abstract
Polycomb Repressive Complex 2 (PRC2) catalyzes the mono-, di-, and trimethylation of histone protein H3 on lysine 27 (H3K27), which is strongly associated with transcriptionally silent chromatin. The functional core of PRC2 is highly conserved in animals and consists of four subunits. One of these, SUZ12, has not been identified in the genetic model Caenorhabditis elegans, whereas C. elegans PRC2 contains the clade-specific MES-3 protein. Through unbiased sensitive sequence similarity searches complemented by high-quality structure predictions of monomers and multimers, we here demonstrate that MES-3 is a highly divergent ortholog of SUZ12. MES-3 shares protein folds and conserved residues of key domains with SUZ12 and is predicted to interact with core PRC2 members similar to SUZ12 in human PRC2. Thus, in agreement with previous genetic and biochemical studies, we provide evidence that C. elegans contains a diverged yet evolutionary conserved core PRC2, like other animals.Entities:
Keywords: 3days reconstruction of protein; Biochemistry; Bioinformatics; protein; sequence homology
Year: 2022 PMID: 35800781 PMCID: PMC9253717 DOI: 10.1016/j.isci.2022.104633
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1MES-3 is a highly divergent ortholog of the canonical Polycomb Repressive Complex two component SUZ12
(A). The Polycomb Repressive Complex 2 (PRC2) core components EZH2/1, EED, RBBP4/7, and SUZ12 are conserved in a broad range of metazoans; the presence of orthologs is indicated by filled boxes. Notably, based on sequence similarity searches, an ortholog of SUZ12 is absent in the nematode model species Caenorhabditis elegans, but present in other, closely related nematodes (Brugia malayi and Trichinella spiralis). C. elegans encodes the PRC2 core component MES-3 that lacks obvious motifs or sequence similarity to SUZ12 (Ahringer and Gasser, 2018; Bender et al., 2004; Ketel et al., 2005; Paulsen et al., 1995; Xu et al., 2001).
(B). Schematic representation of the composition of the core PRC2. The zinc finger binding (ZnB; red), WD-domain binding 1 (WDB1; blue), C2 domain (green), zinc finger (Zn; yellow), WD-domain binding 2 (WDB2; pink), and VEFS (orange) motifs or domains involved in SUZ12 protein-interactions are shown in the schematic as well as along the protein sequence (Chammas et al., 2020; Chen et al., 2018; Kasinath et al., 2018). Schematic representation of the protein sequence of MES-3 is shown, and regions of uncovered sequence (c) and structural (e, f) similarity are highlighted.
(C). Protein sequence alignment between the N-terminal region of SUZ12 and MES-3, as identified by sensitive profile-vs-profile sequence similarity searches, covers part of the zinc finger binding (ZnB; red), WD-domain binding 1 (WDB1; blue), and C2 domain (green). The conserved RBBP4/7 binding epitope as well as Gly299 are highlighted (Birve et al., 2001; Rai et al., 2013; Schmitges et al., 2011). Identical amino acids are shown in blue and biochemically similar amino acids are shown in turquoise.
(D–F). The predicted aligned error (in Å; based on model 2 ptm) of the MES-3 structure is shown as a heatmap and reveals two separated globular regions in the N- and C-terminus, the former overlaps with the profile-vs-profile match (c) and corresponds to the C2 domain of SUZ12 (e; Figure S1I; RMSD = 1.607), while the latter overlaps with the region that structurally resembles the VEFS domain (f; Figure S1J; RMSD = 3.676). The black arrows (e, f) highlight regions that differ considerably between SUZ12 and MES-3 (Figures S1I and S1J), and the structure predictions of SUZ12 and MES-3 (e, f) are shown in gray as well as green (C2) and orange (VEFS), respectively.
(G). Sequence-independent structure alignment of the VEFS regions of SUZ12 and MES-3 reveals significantly structural similarity (Dali Z score = 8.3; TM-score = 0.55), especially along the α helices in the C-terminus; a region previously shown to stimulate histone methyltransferase activity in SUZ12 (Birve et al., 2001) (pos. 580 to 612) is highlighted by a black bar, and individual amino acids important for PRC2 assembly (Birve et al., 2001) are shown by black arrows.
Figure 2MES-3 provides a structural scaffold for PRC2 in C. elegans
(A). The predicted aligned error (in Å) of MES-3 co-folded with MES-2 and MES-6 is shown as a heatmap and is consistent with association of MES-3 with MES-2 and MES-6 in the C-terminal regions of MES-3, which overlaps with the predicted VEFS domain.
(B). Experimentally resolved human core PRC2 (rcsbpdb:6WKR (Kasinath et al., 2021)) highlights interactions between SUZ12 and RBBP4 as well as SUZ12 and EZH2 and EED.
(C). Predicted C. elegans core PRC2 is formed by MES-2, MES-3, and MES-6. The corresponding region in human PRC2 is highlighted, as well as the position of the MES-3 VEFS domain (green triangle, see a.) and the MES-2 MCSS/SANT2-like region (red triangle, see a.), as well as a short central region of MES-3 (pos. 530-570) that associates with multiple regions in MES-2 and MES-6 (black triangle, see a.). For clarity, only regions of MES-3 interacting with MES-2 and MES-6 are shown (MES-3, pos. 1-530 and 570-640 are hidden).
(D). The predicted aligned error (in Å) of MES-3 co-folded with LIN-53 is shown as a heatmap and reveals association between the N-terminal region of MES-3 and LIN-53.
(E). Predicted MES-3 with LIN-53 complex. The corresponding region in human PRC2 is highlighted, as well as the MES-3 C2 domain (green triangle, see d.) and regions surrounding the C2 domain that engage in association with LIN-53 (WDB2, black triangle; WDB1, gray triangle; ZnB, light-blue triangle; see d.). For clarity, only regions of MES-3 interacting with LIN-53 are shown (MES-3, pos. 1-80, 365-415, and 470-754 are hidden).
(F). Schematic representation of MES-3 and its predicted interactions with MES-2, MES-6, and LIN-53. The positions of protein domains/motifs in MES-2, MES-6, and LIN-53 were inferred via structural alignments of the predicted structures to those of the experimentally determined human EZH2, EED, and RBBP4 (Chammas et al., 2020; Kasinath et al., 2021). MES-3 domains/motifs are indicated as in Figure 1 with the addition of the central MES-2/MES-6 interacting region; domains/motifs shown are WD-domains (WD40) in MES-6 and LIN-53, and MES-6 binding (MES-6b), Swi3, Ada2, N-CoR and TFIIIB DNA-binding domain 1 like (SANT1), Motif connecting SANT1 and SANT2 (MCSS), SANT2, CXC, and the Su(var)3-9 , EZ and Trx domain (SET) in MES-2 (Chammas et al., 2020). We note that the region around the potential MCSS/SANT2 domains in MES-2 is substantially diverged compared with EZH2, yet still displays considerable structural similarity.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Uniprot DB | Q10665 | |
| Uniprot DB | O17514 | |
| Uniprot DB | Q9GYS1 | |
| Uniprot DB | P90916 | |
| Human SUZ12 sequence | NCBI | NP_056170.2 |
| Human SUZ12 sequence | Uniprot DB | Q15022 |
| Human PRC2-AEBP2-JARID2 structure | PDB | 6WKR |
| Orthofinder (v2.1.2) | ||
| Broccoli (v1.0) | ||
| EggNOG (v4.5.1) | ||
| SonicParanoid (v1.3.0) | ||
| HHPRED (June 2nd, 2021) | ||
| Alphafold2 (v2.1) | ||
| Pymol (v2.5.2) | Schrödinger, LLC. | |
| pdb-tools (v2.4.1) | ||
| TM-align (v20190822) | ||
| Dali (v5) | ||
| IUPRed3 (v3) | ||