Literature DB >> 35059633

Roles of Polycomb complexes in regulating gene expression and chromatin structure in plants.

Fernando Baile1, Ángeles Gómez-Zambrano1, Myriam Calonje1.   

Abstract

The evolutionary conserved Polycomb Group (PcG) repressive system comprises two central protein complexes, PcG repressive complex 1 (PRC1) and PRC2. These complexes, through the incorporation of histone modifications on chromatin, have an essential role in the normal development of eukaryotes. In recent years, a significant effort has been made to characterize these complexes in the different kingdoms, and despite there being remarkable functional and mechanistic conservation, some key molecular principles have diverged. In this review, we discuss current views on the function of plant PcG complexes. We compare the composition of PcG complexes between animals and plants, highlight the role of recently identified plant PcG accessory proteins, and discuss newly revealed roles of known PcG partners. We also examine the mechanisms by which the repression is achieved and how these complexes are recruited to target genes. Finally, we consider the possible role of some plant PcG proteins in mediating local and long-range chromatin interactions and, thus, shaping chromatin 3D architecture.
© 2021 The Author(s).

Entities:  

Keywords:  PRC1; PRC2; Polycomb Group; chromatin organization; gene repression; histone modifications

Mesh:

Substances:

Year:  2021        PMID: 35059633      PMCID: PMC8760139          DOI: 10.1016/j.xplc.2021.100267

Source DB:  PubMed          Journal:  Plant Commun        ISSN: 2590-3462


Introduction

The development of multicellular organisms relies on cells and tissues that establish unique gene expression programs. These programs depend on transcription factors (TFs) that guide the binding of the transcriptional machinery to specific gene promoters and control its subsequent activity. It has become evident that nucleosome organization within chromatin can either expose or hide functionally important regulatory DNA sequences that are required for the binding of TFs and the transcriptional machinery, thus strongly influencing gene expression. Furthermore, nucleosome organization can determine the formation of higher-order structures that maintain transcriptional states through cell division. The organization of nucleosomes within chromatin can be altered in several ways (Rosa and Shaw, 2013), including the incorporation of posttranslational modifications on nucleosomal histone tails (Bannister and Kouzarides, 2011). The importance of histone modifications in the control of developmental processes was revealed a long time ago during investigations of the role of Polycomb Group (PcG) proteins. PcG genes were initially discovered in Drosophila melanogaster (Drosophila), where they are necessary for the repression of homeotic genes and therefore for the specification of the body plan (for review, see Kassis et al., 2017). Subsequently, Drosophila PcG genes were shown to have homologs in vertebrates and plants, where, among other processes, they also regulate development (reviewed in Whitcomb et al., 2007; Mozgova and Hennig, 2015; Xiao and Wagner, 2015). Biochemical analyses have shown that PcG proteins normally assemble into two large multiprotein complexes with different histone-modifying activities. Polycomb Repressive Complex 1 (PRC1) has H2A E3 ubiquitin ligase activity toward lysine 119, 120, and 121 in Drosophila, vertebrates, and Arabidopsis, respectively (Wang et al., 2004; Cao et al., 2005; Bratzel et al., 2010; Yang et al., 2013), and PRC2 has H3 lysine 27 (H3K27) trimethyltransferase activity (Müller et al., 2002; Makarevich et al., 2006; Mozgova and Hennig, 2015). Detailed studies over the last decade have revealed that, despite remarkable functional and mechanistic conservation among the PcG systems of different kingdoms, some key molecular principles have diverged. In this review, we discuss the current view of how the PcG system regulates gene expression in plants. We first present the composition and activities of PcG complexes as a prerequisite to understanding how PRC1 and PRC2 perform their functions. Excellent reviews have recently examined the composition of the core components of these complexes and their roles in the regulation of different developmental processes (Wang and Shen, 2018; Bieluszewski et al., 2021; Hinsch et al., 2021). We therefore focus our survey on the role of recently identified accessory proteins and on the new functions that have been attributed to previously known components. We then examine the mechanisms by which PRC1 and PRC2 regulate gene expression and how they identify their target sites. Finally, we discuss their possible role in the formation of local and long-range chromatin interactions and, thus, their contribution to shaping chromatin 3D structure.

Placing plant PRC1 within the context of animal PRC1 complexes

PRC1 complexes incorporate H2A monoubiquitin marks into chromatin. In Drosophila, the H2A E3 monoubiquitin ligase module (the E3 module hereafter) consists of Sex Comb Extra (Sce, also known as dRing) and Posterior Sex Combs (Psc) or Su(z)2 (Cao et al., 2005) (Figure 1). The E3 module is more diverse in vertebrates than in Drosophila. It comprises RING1A or RING1B and one of the six Polycomb Group RING finger (PCGF1–6) proteins (Barbour et al., 2020; Geng and Gao, 2020; Blackledge and Klose, 2021) (Figure 1).
Figure 1

A diverse repertoire of PRC1s.

Schematic representation of PRC1 E3 modules in Drosophila, vertebrates, and Arabidopsis and accessory proteins found in canonical PRC1s (cPRC1s) and variant PRC1s (vPRC1s) in the different cases. Possible accessory proteins that have not been verified by AP–MS are indicated in a dotted background. The complete names of vertebrate accessory proteins that are not mentioned in the text are: AUTS2, autism susceptibility protein 2; BCOR, BCL-6 co-repressor; CK2, casein kinase 2; FBRS, fibrosin; L3MBTL2, lethal(3)malignant brain tumor-like protein 2; SKP1, S-phase kinase-associated protein 1; DCAF7, DDB1 and CUL4-Associated Factor 7; and WDR5, WD40-repeat protein 5.

A diverse repertoire of PRC1s. Schematic representation of PRC1 E3 modules in Drosophila, vertebrates, and Arabidopsis and accessory proteins found in canonical PRC1s (cPRC1s) and variant PRC1s (vPRC1s) in the different cases. Possible accessory proteins that have not been verified by AP–MS are indicated in a dotted background. The complete names of vertebrate accessory proteins that are not mentioned in the text are: AUTS2, autism susceptibility protein 2; BCOR, BCL-6 co-repressor; CK2, casein kinase 2; FBRS, fibrosin; L3MBTL2, lethal(3)malignant brain tumor-like protein 2; SKP1, S-phase kinase-associated protein 1; DCAF7, DDB1 and CUL4-Associated Factor 7; and WDR5, WD40-repeat protein 5. The RING1 and PCGF proteins have similar domain architectures, including an N-terminal RING finger domain, a C-terminal RING finger, and a WD40-associated ubiquitin-like (RAWUL) domain (Sanchez-Pulido et al., 2008; Merini and Calonje, 2015) (Figure 1). The RING1 proteins provide the catalytic activity, and the PCGF proteins enhance their enzymatic activity (Buchwald et al., 2006; Li et al., 2006). RING1 and PCGF proteins dimerize through their RING domains, and this dimerization facilitates their interaction with an E2-conjugating enzyme to enable histone ubiquitination (Bentley et al., 2011). The RAWUL domain binds to a range of accessory subunits to give rise to different PRC1s (for review see Geng and Gao, 2020; Barbour et al., 2020; Blackledge and Klose, 2021). Drosophila Sce, Psc, and Su(z)2 also contain these domains; however, Psc and Su(z)2 have, in addition, a long, intrinsically disordered and positively charged C-terminal region (CTR) (Lo et al., 2009; Beh et al., 2012) (Figure 1). This CTR is able to bind chromatin and DNA in vitro (King et al., 2005; Emmons et al., 2009; Beh et al., 2012; Kang et al., 2020) and to mediate chromatin compaction (Francis et al., 2004), nucleosome bridging (Lo et al., 2012), transcription inhibition (King et al., 2005), and chromatin remodeling (King et al., 2005; Lo and Francis, 2010). The first characterized PRC1 in animals was the one currently known as canonical PRC1 (cPRC1). In Drosophila, this complex contains the E3 module (Sce and Psc/Su(z)2) and the accessory proteins Polycomb (Pc), Polyhomeotic (Ph), and Sex Comb on Midleg (Scm) (Shao et al., 1999; Saurin et al., 2001) (Figure 1). Similarly, vertebrate cPRC1 contains RING1A/B, PCGF2/4, Chromobox (CBX)2/4/6/7/8, Polyhomeotic homolog (PHC)1/2/3, and Scm homolog (SCMH)1, Scm-like 1 (SCML1), or SCML2 (Gao et al., 2012). Both Pc and the CBXs have a CHROMO domain that binds H3K27me3; however, the different CBXs also exhibit unique functions (Bernstein et al., 2006; Kaustov et al., 2011). For instance, CBX2, but not other CBXs, has an intrinsically disordered region (IDR) enriched in basic amino acids that displays functions similar to those of the Psc-CTR (Grau et al., 2011; Plys et al., 2019; Kent et al., 2020). On the other hand, Ph and PHC1/2/3 have a sterile alpha motif (SAM) domain that mediates oligomerization (Kim et al., 2002; Isono et al., 2013). Finally, the SCM proteins also contain an SAM domain whose architecture is similar to that of the PHCs, but the exact role of the SCM proteins is not clear (Kim et al., 2005) (Figure 1). Several variant PRC1s (vPRC1s) have subsequently been identified. In Drosophila, the dRing-associated factor (dRAF) complex contains Sce, Psc/Su(z)2, and the H3K36-specific demethylase dKDM2 (Lagarou et al., 2008). The Drosophila homolog of the vertebrate Ring1 and Ying Yang (YY)1 binding protein (dRybp) has also been connected to this complex (Fereres et al., 2014) (Figure 1). The dRAF complex promotes both monoubiquitination of H2A and demethylation of H3K36me2 (Lagarou et al., 2008), and most of the H2A monoubiquitination activity in Drosophila has been attributed to this complex. However, loss of Psc/Su(z)2 function decreases total H2A monoubiquitination by 30%–40%, whereas the knockdown of L(3)73Ah, a less-characterized PCGF homolog in Drosophila (Irminger-Finger and Nöthiger, 1995), decreases H2A monoubiquitination by 70%. This result suggests the existence of an alternative L(3)73Ah-dRing/Sce module that is responsible for incorporating most of this modification (Lee et al., 2015). In vertebrates, vPRC1s contain an E3 module composed of RING1A/B, one of the six PCGFs, and RYBP or its paralog YY1-associated factor (YAF) 2, which regulates the catalytic activity of the E3 module (Figure 1) (reviewed in Geng and Gao, 2020; Barbour et al., 2020; Blackledge and Klose, 2021). The associations of CBX proteins and RYBP/YAF2 with the E3 module are mutually exclusive. Whereas RYBP/YAF2 strongly stimulates the activity of the module, CBX proteins have a less stimulatory effect. Accordingly, cPRC1 displays much lower E3 ligase activity than the vPRC1s (Buchwald et al., 2006). In addition, each vPRC1 presents different accessory proteins depending on the PCGF that it contains (Figure 1). Although we are not going to describe the vast array of vPRC1 accessory proteins identified in vertebrates (for review see Barbour et al., 2020; Chetverina et al., 2020; Geng and Gao, 2020; Blackledge and Klose, 2021), we would like to highlight the diversity of their biochemical properties. For instance, these proteins include DNA binding activities that can target PRC1 to specific sites (e.g. KDM2B, the dimer MYC-associated factor X [MAX] and MAX gene-associated protein [MGA], and the dimer E2F6 and dimerization partner [DP]1/2), histone-modifying activities directed to remove counteracting marks (e.g. the histone deacetylases HDAC1 and HDAC2 and the H3K36me2 demethylase KDM2B), and proteins that regulate PRC1 activity (e.g. the ubiquitin C-terminal hydrolase ubiquitin-specific protease 7 [USP7] that prevents self-ubiquitination of RING1 and PCGF to stabilize the complex and thus regulates H2A monoubiquitination levels [de Bie et al., 2010; Maertens et al., 2010; Hu et al., 2014; Maat et al., 2021]) (Figure 1). In Arabidopsis, there are two RING1-like proteins, RING1A and RING1B, and three PCGF-like proteins named BMI1A, BMI1B, and BMI1C after the PCGF4/B lymphoma Mo-MLV insertion region 1 homolog (BMI1) (Merini and Calonje, 2015; Wang and Shen, 2018) (Figure 1). In animals, the E3 module is formed by a RING1/PCGF heterodimer, but in Arabidopsis, RING1A and RING1B can self-interact, cross-interact, and also interact with each of the BMI1s (Xu and Shen, 2008; Bratzel et al., 2010; Chen et al., 2010). Moreover, the five proteins alone have H2A E3 ligase activity in vitro (Bratzel et al., 2010, 2012), and loss of either RING1A/B or BMI1A/B/C activity causes a reduction in H2AK121ub (Bratzel et al., 2010; Yang et al., 2013; Li et al., 2017a). Therefore, it remains unclear whether or not heterodimerization takes place between the RING1s and the BMI1s and whether this association enhances the in vivo H2A E3-ligase activity. Interestingly, a recent work has revealed the existence of BMI1-like truncated genes (BMI1-L) restricted to embryophytes (Figure 1). These genes encode a RAWUL domain but lack the N-terminal RING finger domain, which is the catalytic part of BMI1. They share part of the BMI1 exon–intron structure, suggesting a common evolutionary history. A BLASTP search identified 138 homologs. Whereas Marchantia polymorpha and Selaginella moellendorffii contain a single gene, Physcomitrella patens, Amborella trichopoda, Pinus pinaster, and most dicots contain two genes, and monocots usually contain three (López et al., 2021). The tomato, rice, and maize homologs are involved in shoot apical meristem and axillary meristem development (Tabuchi et al., 2011; Yao et al., 2019; López et al., 2021). One of the tomato homologs, Super determinant1A (Sde1A), and the Marchatia polymorpha homolog, MpBMI1-L, have been shown to genetically interact with the BMI1 proteins (Liu et al., 2021; López et al., 2021). These findings raise the possibility that these genes may be involved in the regulation of PRC1 activity by, for instance, competing with BMI1 for interaction partners or, alternatively, in the formation of non-enzymatically active vPRC1s. In any case, a cPRC1 is not conserved in plants, as there are no plant homologs of the Pc/CBXs, Ph/PHCs, or SCM proteins. Nevertheless, it was proposed that plants have a cPRC1-like complex that contains RING1A/B, BMI1A/B/C, and the plant-specific proteins LIKE-HETEROCHROMATIN PROTEIN 1 (LHP1), also known as TERMINAL FLOWER 2 (TFL2), and EMBRYONIC FLOWER 1 (EMF1). LHP1 was considered to be the functional equivalent of the Pc/CBX proteins because, although LHP1 contains a CHROMO domain and a CHROMO SHADOW domain like animal Heterochromatin Protein 1 (HP1) (Gaudin et al., 2001; Kotake et al., 2003), it localizes in euchromatin and binds H3K27me3 marks (Zhang et al., 2007b; Turck et al., 2007). In support of this notion, LHP1 was shown to interact with the RING1 and BMI1 proteins in yeast two-hybrid assays (Xu and Shen, 2008; Chen et al., 2010). However, recent affinity purification followed by mass spectrometry (AP–MS) experiments have revealed that LHP1 co-purifies with PRC2 (Derkacheva et al., 2013; Liang et al., 2015; Bloomer et al., 2020). Similarly, EMF1 was proposed to be a PRC1 component because of its structural and functional similarities to Drosophila Psc-CTR (Calonje et al., 2008; Beh et al., 2012). However, EMF1 is required for H3K27me3 marking at some PcG target genes (Calonje et al., 2008; Kim et al., 2012; Yin et al., 2021), and it also co-purifies with PRC2 components (Liang et al., 2015; Bloomer et al., 2020). Therefore, LHP1 and EMF1 are now considered PRC2-associated proteins and, as such, they are included in the next section. Plants appear to contain only vPRC1s, although few accessory proteins have been identified to date (Figure 1). The recruitment of an Arabidopsis E3 module has been associated with VIVIPAROUS1/ABI3-LIKE (VAL)1/2 TFs (Yang et al., 2013; Qüesta et al., 2016; Baile et al., 2021; Mikulski et al., 2021). VAL factors contain a B3 DNA binding domain that specifically recognizes RY elements (CATGCA) (Suzuki et al., 2007). Several lines of evidence indicate that the VAL factors are bona fide PRC1 accessory proteins. First, the val1/2 mutant phenocopies the bmi1abc phenotype (Yang et al., 2013). Second, VAL1/2 are required for the incorporation of H2AK121ub marks (Yang et al., 2013; Baile et al., 2021). Third, VAL1 interacts with BMI1 proteins in vitro (Yang et al., 2013); and fourth, BMI1 and RING1 proteins co-purify with VAL1 in AP–MS experiments (Qüesta et al., 2016; Mikulski et al., 2021). Interestingly, VAL1 also co-purifies with SIN3-associated protein 18 (SAP18), SR45, and ACINUS (Qüesta et al., 2016). SAP18 is a component of both the SIN3-HISTONE DEACETYLASE COMPLEXES (HDACs) (Zhang et al., 1997) and, together with SR45 and ACINUS, the APOPTOSIS AND SPLICING-ASSOCIATED PROTEIN (ASAP) complex (Deka and Singh, 2017). In addition, several reports have shown that VAL factors directly or indirectly interact with different HDAs associated with SIN3-HDACs (Zhou et al., 2013; Zeng et al., 2020), suggesting a crosstalk between histone deacetylation and PcG repression. However, it remains to be shown how the interaction of VAL1 with ASAP, SIN3-HDACs, and PRC1 is coordinated spatially and temporally. In addition, it has been shown that ALFIN1-like 6 (AL6) interacts with RING1 and BMI1 proteins (Molitor et al., 2014). AL1–7 form a plant-specific protein family whose members contain a PAL domain at the N terminus and a plant homeodomain (PHD) at the C terminus (Molitor et al., 2014). The PHD domain of AL proteins was reported to bind H3K4me3 (Zhao et al., 2018), and the PAL domain has been involved in the interaction with RING1 and BMI1 (Peng et al., 2018). Based on several results, the ALs have been proposed to target PRC1 to H3K4me3-marked active chromatin in order to promote the transition from the active to the repressed stage (Molitor et al., 2014; Peng et al., 2018). Interestingly, two recent reports have shown that Nodulin Homeobox Factor (NDX) interacts with the RING1s but not with the BMI1 proteins and that it co-purifies with VAL1, BMI1, and RING1 in AP–MS experiments (Zhu et al., 2020; Mikulski et al., 2021). Furthermore, the levels of H2AK121ub are affected in ndx mutants (Zhu et al., 2020; Mikulski et al., 2021), pointing to NDX as a new PRC1 accessory protein. NDX is an atypical PHD-containing protein that has been previously implicated in single-stranded DNA recognition and stabilization of the R-loop formed at the 3′ end of FLOWERING LOCUS C (FLC) (Sun et al., 2013), as well as the regulation of abscisic acid signaling (Zhu et al., 2020); however, its exact role in PRC1 function remains to be investigated. VERNALIZATION 1 (VRN1), a B3 domain-containing plant-specific protein involved in the vernalization response (Levy et al., 2002), has also been proposed to be a PRC1 component (Mylne et al., 2006; Holec and Berger, 2012); however, there is not yet biochemical evidence to support this association. Finally, the H3K4me3 demethylase JMJ14 was shown to interact with BMI1A/B and EMF1 (Wang et al., 2014); however, it has also recently been found to co-purify with PRC2 (see next section). In summary, there are not many confirmed PRC1 accessory proteins in plants. In addition, it is not known whether these or other unknown accessory proteins associate with different E3 modules constituting different plant vPRC1s. It is likely that, as in vertebrates, different specialized complexes act in different tissues or developmental stages.

The number of plant PRC2-associated proteins has recently significantly increased

PRC2 core components are broadly conserved (Figure 2). These core components are essential for the H3K27 trimethyltransferase activity of the complex (reviewed in Blackledge et al., 2015; Chetverina et al., 2020; Bieluszewski et al., 2021; Blackledge and Klose, 2021).
Figure 2

PRC2 cores and accessory proteins.

Schematic representation of PRC2 core components in Drosophila, vertebrates, and Arabidopsis and accessory proteins found in the different PRC2 sub-complexes. Possible accessory proteins that have not been verified by AP–MS are indicated in a dotted background.

PRC2 cores and accessory proteins. Schematic representation of PRC2 core components in Drosophila, vertebrates, and Arabidopsis and accessory proteins found in the different PRC2 sub-complexes. Possible accessory proteins that have not been verified by AP–MS are indicated in a dotted background. The Drosophila PRC2 core is composed of the SET-domain-containing methyltransferase Enhancer of zeste (E(z)), the scaffold protein Suppressor of zeste 12 (Su(z)12), the H3K27me3 binding protein Extra sex combs (Esc) or Esc-like (Escl), and the nucleosome-remodeling factor Nurf55 (reviewed in Chetverina et al., 2020; Blackledge and Klose, 2021). Similarly, the vertebrate core components are EHZ1 or EHZ2 (of which EZH2 displays higher methyltransferase activity [Margueron et al., 2008]), SUZ12, EED, and RBBP4 or RBBP7 (Chetverina et al., 2020; Blackledge and Klose, 2021) (Figure 2). In Arabidopsis, homologs of all four PRC2 core subunits have been identified, but some of them show multiplication (Huang et al., 2017). CURLY LEAF (CLF), MEDEA (MEA), and SWINGER (SWN) are homologs of E(z) (Goodrich et al., 1997; Grossniklaus et al., 1998; Chanvivattana et al., 2004). FERTILIZATION INDEPENDENT SEED 2 (FIS2), EMBRYONIC FLOWER 2 (EMF2), and VERNALIZATION 2 (VRN2) are homologs of Su(z)12 (Luo et al., 1999; Yoshida et al., 2001; Gendall et al., 2001). FERTILIZATION INDEPENDENT ENDOSPERM (FIE) is the unique Esc homolog (Ohad et al., 1999), and MULTIPLE SUPPRESSOR OF IRA 1 (MSI1) is the homolog of Nurf55 (Köhler et al., 2003). Molecular and biochemical characterizations have revealed that Arabidopsis has at least three different PRC2 cores, named VRN2–PRC2 (VRN2, CLF/SWN, FIE, MSI1), EMF2–PRC2 (EMF2, CLF/SWN, FIE, MSI1), and FIS2–PRC2 (FIS2, MEA, FIE, MSI1) (Figure 2). These PRC2 cores display distinct and overlapping functions in the regulation of gene expression throughout plant development (Mozgova and Hennig, 2015; Xiao and Wagner, 2015; Hinsch et al., 2021). In Drosophila and vertebrates, PRC2 diversity is achieved through the association of accessory proteins that can modulate its enzymatic activity or chromatin target sites. There are two PRC2s, PRC2.1 and PRC2.2. PRC2.1 includes Polycomb-like (Pcl) in Drosophila and the mutually exclusive Pcl homologs PCL1, PCL2, and PCL3 in vertebrates. These proteins stimulate the methyltransferase activity of E(z)/EZH (Nekrasov et al., 2007; Sarma et al., 2008). Vertebrate PRC2.1 also contains Elongin BC and PRC2-associated protein (EPOP) and PRC2-associated LCOR isoform 1 (PALI1) or PALI2 (Beringer et al., 2016; Liefke et al., 2016; Conway et al., 2018), which also support methyltransferase activity through an undefined mechanism. These proteins have not been found in Drosophila PRC2.1 (Figure 2). PRC2.2 is defined by the presence of Jardin2 in Drosophila or JARID2 in vertebrates, which bind to H2A monoubiquitination marks through their N-terminal ubiquitin interaction motif, and Jing in Drosophila or AEBP2 in vertebrates, which enhance the enzymatic activity and regulate the chromatin binding of the complex (Figure 2) (Chetverina et al., 2020; Blackledge and Klose, 2021). In Arabidopsis, the number of identified PRC2 accessory proteins has significantly increased in recent years. These proteins co-purify with PRC2 core components and affect H3K27me3 levels (Figure 2). However, with several exceptions, it is not clear whether they preferentially associate with a specific PRC2 core or whether they give rise to sub-complexes through their mutually exclusive association with the same PRC2 core. As mentioned in the previous section, despite the fact that LHP1 and EMF1 were considered PRC1 components, they co-purify with PRC2 core components and affect H3K27me3 levels (Derkacheva et al., 2013; Liang et al., 2015; Zhou et al., 2017; Bloomer et al., 2020; Yin et al., 2021). Thus, although in animals these two activities are associated with PRC1, in plants they are included in the list of PRC2-associated proteins (Figure 2). Similarly, in the yeast Cryptococcus neoformans, the PRC2 components co-purify with a CHROMO domain-containing subunit, Ccc1, that recognizes H3K27me3 (Dumesic et al., 2015) as LHP1 does, suggesting that this could be a more common event than expected based on the information available from Drosophila and vertebrates. The best characterized PRC2 accessory proteins in Arabidopsis are VERNALIZATION INSENSITIVE 3 (VIN3), VIN3-LIKE 1/VERNALIZATION 5 (VIL1/VRN5), VIL2/VEL1, and VIL3/VEL2 (Sung et al., 2006; Greb et al., 2007) (Figure 2), which participate in the vernalization response by enhancing the activity of the VRN2-PRC2 core (Sung et al., 2006; Greb et al., 2007; Costa and Dean, 2019). These proteins share the PHD domain and winged helix DNA contact domain with Pcl proteins (Costa and Dean, 2019), which in vertebrates constitute different PRC2.1s. It has been shown that the VRN2-PRC2 core components VIN3 and VRN5 co-purify with SAP18, SR45, ACINUS, and HDA19 (Qüesta et al., 2016). Interestingly, these proteins also co-purify with the PRC1-associated protein VAL1, suggesting a connection between PRC1 and PRC2 established through VAL proteins. More recently, telomere repeat-binding factors (TRB1–3), which contribute to PRC2 recruitment (Zhou et al., 2018a), and the harbinger transposase-derived ANTAGONISTIC OF LHP1 1 (ALP1) and ALP2 proteins, which antagonize PRC2 activity, have also been shown to co-purify with PRC2 core components (Liang et al., 2015; Velanis et al., 2020) (Figure 2). Interestingly, it seems that LHP1 and EMF1 do not associate with PRC2 when the ALP proteins are present, and vice versa (Velanis et al., 2020), suggesting that they are accessory components of separate sub-complexes (Figure 2). PRC2 core components also co-purify with the ubiquitin C-terminal hydrolases UBIQUITIN-SPECIFIC PROTEASE 12 (UBP12) and UBP13 (Liang et al., 2015; Derkacheva et al., 2016; Bloomer et al., 2020) (Figure 2). Phylogenetic analysis of these proteins showed that they share a similar protein sequence with the vertebrate vPRC1 accessory protein USP7 (March and Farrona, 2017; Derkacheva et al., 2016). USP7 regulates PRC1 activity by maintaining the integrity of the complex. Consequently, USP7 inhibition results in decreased levels of H2A monoubiquitination at PRC1 target loci (de Bie et al., 2010; Maertens et al., 2010; Hu et al., 2014; Maat et al., 2021). A recent report showed that the ubp12/13 mutant displays a higher number of upregulated genes than downregulated genes, suggesting a role for UBP12/13 in repression. Also, a larger number of genes lose H2AK121ub than gain H2AK121ub in the ubp12/13 mutant, and it therefore shows a global reduction in H2AK121ub levels (Kralemann et al., 2020). These results suggest a role of UBP12/13 in regulating PRC1 integrity/activity, similar to that of USP7. However, based on analyses of the less-abundant loci that gained H2AK121ub in the ubp12/13 mutant, it has been proposed that UBP12/13 may directly regulate H2A monoubiquitination levels (Kralemann et al., 2020). EARLY BOLTING IN SHORT DAYS (EBS) and its homolog SHORT LIFE (SHL), which contain an N-terminal Bromo-Adjacent Homology (BAH) domain followed by a PHD domain with a C-terminal extension (Yang et al., 2018), also co-purify with PRC2 core components (Li et al., 2018) (Figure 2). These proteins bind to H3K4me3 and H3K27me3 marks, interact with EMF1, and are required for the maintenance of H3K27me3 levels. ebs/shl/lhp1 triple mutants show a significant global reduction in H3K27me3 levels (Li et al., 2018), leading to propose that, together with LHP1, they are the H3K27me3 readers in Arabidopsis (Krause and Turck, 2018). In line with this proposal, EPR-1, a BAH protein of the filamentous fungus Neurospora crassa, has been show to bind H3K27me2/3 and to be implicated in gene repression. Although PRC1 core components are absent in N. crassa, EPR-1 acts as an H3K27me2/3 reader and is required for the formation of nuclear foci reminiscent of Polycomb bodies. epr-1 mutants do not exhibit appreciably altered H3K27 methylation levels, indicating that EPR-1 acts downstream of PRC2 (Wiles et al., 2020). Moreover, vertebrate BAH and coiled-coil domain-containing protein 1 (BAHCC1) and BAH domain-containing protein 1 (BAHD1) also act as H3K27me3 readers and interact with HDACs, resulting in optimal repression of PcG target genes (Fan et al., 2020, 2021). Phylogenetic analysis has shown that EPR-1 orthologs are widely distributed across the eukaryotes, suggesting an ancient role of EPR-1 homologs in PcG repression (Wiles et al., 2020). Interestingly, a member of a novel family of putative Jumonji-type 2-oxoglutarate/Fe(II)-dependent dioxygenases, INCURVATA 11 (ICU11), robustly co-purified with PRC2 core components and accessory proteins (Bloomer et al., 2020) (Figure 2). Neither ICU11 nor EMF1 where found to interact with VIN3 or its homologs VRN5, VEL1, and VEL2, reinforcing the view that distinct PcG complexes operate over different spatial and temporal scales. Several lines of evidence support a role for ICU11 in mediating H3K36me3 demethylation, suggesting that ICU11 may enable the transition from an H3K36me3-marked active to an H3K27me3-marked inactive chromatin state (Bloomer et al., 2020). The H3K4me3 demethylase JMJ14 was detected in ICU11-enriched peptides and also interacts with EMF1 (Wang et al., 2014), supporting its association with PRC2 (Figure 2). Therefore, a combination of activities in the whole complex would link demethylation of active histone modifications and the incorporation of repressive modifications. Unlike in animals (Blackledge and Klose, 2021), a PRC2-associated protein that can recognize and bind H2A monoubiquitination marks has not yet been identified in plants. Nevertheless, the Arabidopsis ZUOTIN RELATED FACTOR 1 (ZRF1) homologs AtZRF1a and AtZRF1b have been proposed to perform PcG-related and PcG-independent functions (Feng et al., 2016). All homologs of ZRF1 have a zuotin domain at their N terminus, which is composed of a DnaJ domain and a potential ubiquitin-binding domain, and tandem repeats of the SANT domain at their C terminus (Feng et al., 2021). Vertebrate ZRF1 can specifically recognize and bind to monoubiquitinated H2A (Richly et al., 2010; Barbour et al., 2020). Likewise, AtZRF1b can bind ubiquitin in vitro and pull down monoubiquitinated H2A from plant protein extracts, acting as a possible H2AK121ub reader (Feng et al., 2016, 2021). However, in contrast to vertebrate ZRF1, which competes with PRC1 to activate many PcG-repressed target genes, AtZRF1a/b seem to promote H3K27me3 deposition for gene repression (Feng et al., 2016), which may provide a mechanism for PRC2 reading of H2AK121ub after PRC1 deposition. Nevertheless, it remains to be investigated whether ZRF1 proteins associate with PRC2 in Arabidopsis. Finally, although they have not been found to co-purify with PRC2 core components, the PWWP domain-containing proteins PWWP DOMAIN INTERACTOR OF POLYCOMBS 1–4 (PWO1–4), which are H3 readers via their PWWP domain, have been shown to interact with the three H3K27 trimethyltransferases in Arabidopsis and to recruit PcG proteins to subnuclear domains (Hohenstatt et al., 2018). In addition, another subgroup of PWWP domain proteins (PDP1–3) has been proposed to regulate PcG function (Zhou et al., 2018b), suggesting a role for PWWP domain-containing proteins in sustaining PcG function. In summary, the existence of this long list of PRC2 accessory proteins indicates that the regulation of PRC2 activity may be critical for specifying cell identity during development and differentiation.

Role of PcG complexes and their hallmarks in plant transcriptional regulation

PcG complexes play a crucial role in the regulation of gene expression; however, the exact mechanism by which they exert this regulation is not yet clear. Previous studies in Drosophila and vertebrates proposed a PRC2-initiated hierarchical model in which PRC2 establishes H3K27me3, which is recognized by the PRC1 subunit Pc/CBXs. PRC1 in turn incorporates H2A monoubiquitination to maintain stable repression (Wang et al., 2004). However, later studies in vertebrates found that the activity of vPRC1s can recruit PRC2 for H3K27me3 marking, revealing a reverse alternative model for the incorporation of PcG marks (Blackledge et al., 2014; Kalb et al., 2014). In support of this notion, several results indicate that the recognition of H2A monoubiquitination is crucial for PRC2 nucleation at target sites. Moreover, the PRC2 accessory proteins Jarid2/JARID2 are able to bind H2A monoubiquitination marks (Barbour et al., 2020). The incorporation of H3K27me3 is then necessary to reinforce the binding of PRC2 and the spread of H3K27me3, in which EED and possibly other H3K27me3 readers play an important role (Blackledge and Klose, 2021). Nevertheless, because H3K27me3 is also recognized by the Pc/CBXs, PRC2 can in turn recruit cPRC1, indicating that both models, the reverse and the classical, take place in animals. Interestingly, cPRC1 contributes only minimally to H2A monoubiquitination levels in vivo, whereas it seems to be required for the formation of Polycomb bodies (Blackledge and Klose, 2021), as will be discussed below. Despite all these results, the participation of H2A monoubiquitination in gene repression has remained controversial. Some reports have provided evidence that it is indispensable (Kundu et al., 2017), whereas others have shown that it is not (Illingworth et al., 2015; Pengelly et al., 2015). Therefore, additional work is needed to determine which chromatin context acts in concert with this modification to regulate gene expression. Genome-wide distribution analyses of H2AK121ub and H3K27me3 marks in Arabidopsis showed that most H2AK121ub peaks overlap with regions immediately downstream of transcriptional start sites (Zhou et al., 2017; Kralemann et al., 2020). Conversely, H3K27me3 peaks occupy complete gene bodies and thus partially overlap with H2AK121ub (Zhang et al., 2007a; Turck et al., 2007; Lafos et al., 2011; Zhou et al., 2017; Kralemann et al., 2020), unlike their extended co-localization in animals. In addition, H2AK121ub marks in Arabidopsis are widespread, often co-localizing with H3K27me3 but also occupying a set of genes devoid of H3K27me3 (Zhou et al., 2017; Kralemann et al., 2020), revealing three different subsets of PcG targets based on the presence of H2AK121ub, H3K27me3, or both. Transcriptional analyses of these PcG target subsets showed that most H3K27me3-marked genes are not expressed or display low expression levels in WT seedlings, consistent with previous reports (Zhang et al., 2007a; Turck et al., 2007; Lafos et al., 2011). Conversely, a considerable percentage of genes with only H2AK121ub marks are transcriptionally active (Zhou et al., 2017; Kralemann et al., 2020), although their average expression levels are still lower than those of active genes that lack H2AK121ub marks (Yin et al., 2021). These results indicate that the two modifications may have different roles in transcriptional repression. In addition, profiling of H2AK121ub and H3K27me3 marks in the clf28swn7 mutant showed that PRC2 activity was not required for H2AK121ub marking (Zhou et al., 2017), ruling out the classical hierarchical model for the incorporation of PcG marks proposed in animals. By contrast, loss of BMI1 function affects the incorporation of H3K27me3 at the genes in which H2AK121ub and H3K27me3 marks co-localize (Zhou et al., 2017; Kralemann et al., 2020), consistent with the reverse alternative model (Blackledge et al., 2014; Calonje, 2014; Kalb et al., 2014; Merini and Calonje, 2015). PRC1 components have been shown to interact with PRC2-associated proteins in Arabidopsis. For instance, RING1 and BMI1 interact with LHP1 and EMF1 (Xu and Shen, 2008; Bratzel et al., 2010), which may suggest that these interactions, rather than H2AK121ub, promote PRC2 recruitment and H3K27me3 marking. However, recent results in Marchantia polymorpha, which shares many signaling pathways with Arabidopsis but displays low genetic redundancy, revealed that H2A monoubiquitin marks are essential for PRC2 recruitment (Liu et al., 2021). Through mutation of the single gene encoding canonical H2A in M. polymorpha, H2A monoubiquitination was shown to be required for H3K27me3 incorporation (Liu et al., 2021). Consistent with this result, recent works showed that loss of vertebrate RING1B catalytic activity largely phenocopies the complete removal of the RING1B protein (Blackledge et al., 2020; Tamburri et al., 2020). Furthermore, the animal PRC2 accessory protein Jarid2/JARID2 binds H2A monoubiquitination marks, supporting the participation of this modification in PRC2 recruitment (Barbour et al., 2020). Unfortunately, an Arabidopsis PRC2 accessory protein that can bind H2AK121ub marks has not been identified; however, the fact that AtZRF1a/b proteins bind H2AK121ub and promote H3K27me3 marking makes them potential candidates to carry out this function. Alternatively, another unknown H2AK121ub reader associated with PRC2 may perform this function. In any case, recent results indicate that a combination of both H2AK121ub binding and direct interaction between PRC1 and PRC2 components may participate in PRC2 recruitment (Zhou et al., 2017; Baile et al., 2021; Liu et al., 2021). Despite the implication of H2K121ub in PRC2 recruitment, the fact that average transcription levels of genes marked with H2AK121ub only are higher than those of genes marked with H2AK121ub/H3K27me3—and those of H2AK121ub/H3K27me3-marked genes are higher than those of H3K27me3-marked genes—led us to propose that H2AK121ub is not a repressive mark per se (Kralemann et al., 2020). However, in support of a repressive role for this modification, it has recently been shown that BMI1 proteins also mediate monoubiquitination of the H2A variant H2A.Z. H2A.Z can be monoubiquitinated at lysine 129 (H2A.ZK129ub). The incorporation of this modification is required for H2A.Z-mediated transcriptional repression (Gómez-Zambrano et al., 2019). Transcriptomic comparison among the H2A.Z mutant hta9hta11, hta9hta11 complemented with a native form of H2A.Z, and hta9hta11 complemented with a mutated H2A.Z that cannot be monoubiquitinated showed that most genes upregulated in hta9hta11 recovered WT-like expression levels in the presence of native H2A.Z, but not in the presence of mutated H2A.Z (Gómez-Zambrano et al., 2019). Interestingly, a considerable number of the genes upregulated in hta9hta11 are already active in the WT, but their expression levels are further increased in hta9hta11, suggesting that H2A.ZK129ub marks modulate the expression of target genes. Furthermore, a recent report showed that H2AK121ub marks are associated with a less accessible chromatin state at transcriptional regulation hotspots that are enriched for the binding of TFs (Yin et al., 2021), and this presumably interferes with transcription. This report showed that decreased levels of H2AK121ub at both only-H2AK121ub and H2AK121ub/H3K27me3-marked chromatin led to an increase in chromatin accessibility. Nonetheless, the fact that average expression levels of only-H2AK121ub genes are higher than those of H2AK121ub/H3K27me3 genes indicates that chromatin marked only with H2AK121ub is transcriptionally permissive, suggesting a role for this modification in modulating rather than switching off gene expression (Figure 3). Chromatin accessibility is further reduced when H3K27me3 is present. However, although H2AK121ub/H3K27me3-mediated inaccessible chromatin is still transcriptionally responsive, as it can be reactivated when the levels of these modifications are reduced, only-H3K27me3 marked chromatin is less responsive. Interestingly, only-H3K27me3 chromatin is not usually associated with transcriptional regulation hotspots (Yin et al., 2021), indicating that these sites are required for gene responsiveness (Yin et al., 2021) and supporting a role for H2K121ub in the modulation of accessibility at these sites (Figure 3). Accordingly, the presence of H2AK121ub marks has been positively linked to gene responsiveness, whereas the presence of H3K27me3 marks showed a weak negative association (Kralemann et al., 2020).
Figure 3

Model for the two proposed PcG-mediated repressive mechanisms in plants.

Before the recruitment of PcG complexes, PRC1-dependent genes are active, whereas PRC1-independent genes are repressed. Once PRC1 is targeted to active genes, chromatin becomes less accessible, and the transcription is downregulated. PRC2 then recognizes H2AK121ub marks and PRC1 components and mediates the transcriptional repression of these genes by promoting an inaccessible chromatin state, which is responsive to reactivation. Repressed genes are targeted only by PRC2, which maintains an inaccessible chromatin state.

Model for the two proposed PcG-mediated repressive mechanisms in plants. Before the recruitment of PcG complexes, PRC1-dependent genes are active, whereas PRC1-independent genes are repressed. Once PRC1 is targeted to active genes, chromatin becomes less accessible, and the transcription is downregulated. PRC2 then recognizes H2AK121ub marks and PRC1 components and mediates the transcriptional repression of these genes by promoting an inaccessible chromatin state, which is responsive to reactivation. Repressed genes are targeted only by PRC2, which maintains an inaccessible chromatin state. Interestingly, the histone demethylase RELATIVE OF EARLY FLOWERING 6 (REF6) (Lu et al., 2011) binds to and removes H3K27me3 preferentially from genes marked with H2AK121ub (Kralemann et al., 2020), further supporting the possibility that H2A monoubiquitination creates a partially repressed state that enables a quick response to stimuli. All together, these data also indicate that PcG regulation of only-H3K27me3 marked genes may involve a different mechanism. Accordingly, PRC1-independent H3K27me3-marked genes are already repressed before PRC2 recruitment, unlike PRC1-dependent genes that require PRC2 for repression (Kralemann et al., 2020) (Figure 3). On the other hand, it has been proposed that H2AK121ub should be removed from H2AK121ub/H3K27me3-marked genes after recruitment of PRC2 to maintain a stable repression, placing UBP12/13 as key factors to remove these marks (Kralemann et al., 2020; Hinsch et al., 2021). To verify this possibility, it would be necessary to identify direct targets of UBP12/13 and to determine whether these proteins affect PRC1 integrity and activity or whether they directly deubiquitinate H2A.

Targeting PcG complexes in plants

Despite the fact that PcG activities are essential for controlling the transcriptional state of genes at a particular stage, time, or condition, none of the PcG core components are able to bind specific DNA sequences; therefore, this regulation may require different intermediaries that attract PcG complexes to specific target genes. In Drosophila, multiple TFs acting in combination have been shown to recruit PcG complexes to cis-elements in small genomic regions called Polycomb response elements (PREs) (Kassis and Brown, 2013; Steffen and Ringrose, 2014). In vertebrates, the recruitment of PcG complexes is broadly related to unmethylated CpG-rich DNA regions that are proximal to promoters and are known as CpG islands (Deaton and Bird, 2011). These regions are recognized and bound by the zinc-finger-CXXC domain of KDM2B and by the winged helix domain of PCL1/2/3 (Blackledge et al., 2014; Li et al., 2017b). Other TFs and histone-modification binding associated proteins also collaborate in the recruitment of PcG complexes (Blackledge and Klose, 2021). Moreover, certain PcG complexes transiently interact with TFs for target site recognition in certain contexts (Blackledge and Klose, 2021). On the other hand, as an alternative to TFs, long non-coding RNAs (lncRNAs) have been involved in targeting of PcG complexes and, more recently, RNA–DNA hybrid structures, which are known as R-loops (Chetverina et al., 2020; Blackledge and Klose, 2021), suggesting that different mechanisms can contribute to the specific tethering of PcG complexes. In Arabidopsis, the recruitment of PRC1 has been related to VAL1/2 factors. The B3 domain of VAL factors specifically recognizes RY elements (CATGCA) in the regulatory regions of several target genes (Suzuki et al., 2007; Qüesta et al., 2016; Wu et al., 2018a; Sasnauskas et al., 2018). In fact, as mentioned previously, several lines of evidence suggest that VAL factors are PRC1-associated proteins (Yang et al., 2013; Qüesta et al., 2016; Mikulski et al., 2021). In addition to the B3 domain, VAL1/2 factors also contain a plant homeodomain-like (PHD-L) domain, a cysteine- and tryptophan-rich zinc finger domain (CW), and an ethylene-responsive element binding factor-associated amphiphilic repression (EAR) domain (Suzuki et al., 2007). The PHD-L has been shown to participate in the homo- or heterodimerization of VAL1 and VAL2 (Chen et al., 2020) and has also been proposed to act as a reader of H3 methylation states, like the CW domain (Hoppmann et al., 2011; Yuan et al., 2016). The EAR domain is involved in the interaction with TOPLESS (TPL)/TPL-RELATED (TPR) 1–4 corepressors or SAP18, which in turn recruit HDA activities (Kagale and Rozwadowski, 2011). Accordingly, VAL1/2 have been reported to interact with HDA activities (Zhou et al., 2013; Zeng et al., 2020). Together, these data indicate that VAL1/2 binding to chromatin involves, in addition to cis-elements, interaction with histone modifications. Furthermore, VAL factors are able to recruit other histone-modifying activities (Zhou et al., 2013; Zeng et al., 2020; Baile et al., 2021), suggesting that they can act as platforms for the simultaneous assembly of different epigenetic mechanisms. Apart from the VAL factors, the AL proteins have been proposed to bind H3K4me3 through their PHD domain and to recruit PRC1 (Molitor et al., 2014; Peng et al., 2018). Interestingly, most AL proteins can also bind to the conserved cis-element GNGGTG/GTGGNG (ALFIN1 elements; Bastola et al., 1998; Wei et al., 2015), raising the possibility that PRC1 could be tethered to specific sites by the AL proteins independently of VAL1/2. In support of this possibility, the promoters of genes upregulated in the bmi1abc mutant are enriched in ALFIN1 elements (Merini et al., 2017). Whereas few factors have been implicated in PRC1 recruitment to date, the recruitment of PRC2 for H3K27me3 marking, in addition to depending on PRC1 activity (Zhou et al., 2017; Kralemann et al., 2020; Liu et al., 2021), has been linked to a wide diversity of TFs. These include the MYB TF ASYMMETRIC LEAVES 1 (AS1) (Lodha et al., 2013), the MADS-box TFs FLC and SHORT VEGETATIVE PHASE (SVP) (Wang et al., 2014; Richter et al., 2019), the GAGA motif binding proteins BASIC PENTACYSTEINE (BPC) 1–6 (Hecker et al., 2015; Xiao et al., 2017), the TELOBOX motif binding proteins ARABIDOPSIS ZINC FINGER 1 (AZF1) and ZINC FINGER OF ARABIDOPSIS THALIANA 6 (ZAT6) (Xiao et al., 2017), the C2H2 TFs SUPERMAN (SUP) (Xu et al., 2018) and KNUCKLES (KNU) (Sun et al., 2019), and VAL1/2 factors (Yuan et al., 2016, 2021; Chen et al., 2020). Nevertheless, although some of these TFs have been shown to interact with one or more PRC2 components, they may be engaged in more transient interactions than accessory proteins, as they have not been detected in AP–MS experiments. The existence of Arabidopsis PRE-like sequences containing cis-elements for the binding of TFs has been proposed in several independent studies (Berger et al., 2011; Lodha et al., 2013; Mu et al., 2017; Xiao et al., 2017). Moreover, two key families of TFs, the class I BPC TFs and the ZINC FINGER TFs, co-localize with PRC2 at thousands of loci, and loss or reduction of their function causes a loss of PRC2 binding (Xiao et al., 2017). A recent report described the in vivo mediation of TF binding to a synthetic locus whose promoter lacked any of the known cis-elements involved in PcG recruitment. Interestingly, the results implicated some of these factors in recruiting one or the other PcG complex in Arabidopsis (Baile et al., 2021). These findings showed that the binding of VAL1 leads to the incorporation of H2AK121ub and H3K27me3 and the removal of H3 acetylation marks (Figure 4A). Whereas BMI1 proteins directly interact with VAL1, PRC2 marking requires both PRC1 activity and VAL1, indicating that different interactions collaborate in PRC2 recruitment. Interestingly, SAP18 and HDA activities co-purify with VAL1 and the PRC2-associated proteins VIN3 and VRN5 (Qüesta et al., 2016), suggesting a connection between HDACs and PRC2 via some of these proteins. Furthermore, this report showed that PRC2 activity could be recruited independently of PRC1 by the binding of TFs from different families that, like the VALs, contain an EAR domain (Figure 4A). The EAR domains have been shown to bind SAP18 and in turn to recruit HDA activities. The binding of these EAR factors also leads to the removal of H3 acetylation marks (Baile et al., 2021). These results are consistent with the existence of PcG targets in which the PRC2 recruitment is dependent or independent of PRC1 activity. Moreover, they suggest that the EAR–SAP18 interaction acts as a link between PRC2 and HDACs to mediate gene repression.
Figure 4

Factors implicated in target site identification in plants.

(A) TFs recognize their DNA binding motifs at PRE-like sequences of target sites. TFs that contain an EAR domain as a common feature participate in PRC2 and HDAC complex recruitment. While VAL factors act as a platform for the assembly of PRC1, PRC2, and HDAC (left panel), other TFs lead to the recruitment of PRC2 and HDAC independently of PRC1 activity (right panel). The ability of VAL factors and other PRC1 and PRC2 accessory proteins to bind histone modifications, such as H3K4me3, may stabilize the recruitment of the complexes to target genes.

(B) An intronic lncRNA from the floral homeotic AGAMOUS (AG) gene, lncRNA-4, is expressed in leaves and interacts with CLF to deposit H3K27me3 histone marks into the AG locus.

(C) The lncRNA APOLO mediates the formation of R-loops by sequence complementarity with its targets, which attract LHP1 and PRC2 core components.

Factors implicated in target site identification in plants. (A) TFs recognize their DNA binding motifs at PRE-like sequences of target sites. TFs that contain an EAR domain as a common feature participate in PRC2 and HDAC complex recruitment. While VAL factors act as a platform for the assembly of PRC1, PRC2, and HDAC (left panel), other TFs lead to the recruitment of PRC2 and HDAC independently of PRC1 activity (right panel). The ability of VAL factors and other PRC1 and PRC2 accessory proteins to bind histone modifications, such as H3K4me3, may stabilize the recruitment of the complexes to target genes. (B) An intronic lncRNA from the floral homeotic AGAMOUS (AG) gene, lncRNA-4, is expressed in leaves and interacts with CLF to deposit H3K27me3 histone marks into the AG locus. (C) The lncRNA APOLO mediates the formation of R-loops by sequence complementarity with its targets, which attract LHP1 and PRC2 core components. Finally, lncRNAs have been associated with PcG repression in plants. For instance, several lncRNAs generated from the FLC locus have been implicated in the recruitment of PRC2 and FLC repression (Costa and Dean, 2019). COLDAIR sense lncRNA interacts with CLF (Heo and Sung, 2011), COLDWRAP is an FLC promoter-associated lncRNA that also interacts with CLF to form a repressive intragenic chromatin loop (Kim and Sung, 2017), and COOLAIR antisense lncRNAs recruit CLF indirectly via an RNA binding protein called FLOWERING CONTROL LOCUS A (FCA) (Tian et al., 2019). In addition, lncRNA-4, an intronic lncRNA from the floral homeotic AGAMOUS (AG) gene, is expressed in leaves and interacts with CLF to deposit H3K27me3 histone marks into the AG locus (Wu et al., 2018b) (Figure 4B). In addition to lncRNAs, recent studies have indicated that R-loops may also support PcG target site recognition. For example, the lncRNA APOLO mediates the formation of R-loops by sequence complementarity with its targets, which attract LHP1 (Ariel et al., 2020) (Figure 4C). Interestingly, through the analysis of a COOLAIR-induced R-loop at the 3′ end of FLC, a recent report characterized the mechanism by which resolution of a nascent-transcript-induced R-loop promotes chromatin silencing (Xu et al., 2021). Stabilization of a COOLAIR-induced R-loop at the 3′ end of FLC is mediated by NDX, which inhibits further antisense transcription (Sun et al., 2013). This enables the RNA binding protein FCA and other 3′-end processing factors to polyadenylate the nascent antisense transcript, which clears the R-loop and recruits chromatin modifiers to remove H3K4me1 and H3K36me3 and promote H3K27me3 accumulation (Xu et al., 2021). Also, NDX has been recently shown to co-purify with PRC1 components and to be required for H2AK121ub accumulation and H3K27me3 incorporation at the FLC nucleation region (Mikulski et al., 2021). Therefore, it will be important to establish whether this NDX-PRC1 association points to an additional NDX/R-loop interaction in the FLC nucleation region or is the result of the gene loop. In summary, TFs and specific combinations of their binding sites play an important role in the recruitment of PcG complexes to chromatin in plants, and epigenetic modifications of histones and RNA-protein interactions can stabilize interactions between the complexes and chromatin.

The role of PcG proteins in shaping chromatin structure in plants

PcG proteins are also related to a higher-order level of chromatin organization. PcG proteins form nuclear bodies (Polycomb bodies), suggesting that parts of the genome bound by PcG proteins gather together in the nucleus to interact, to share common machinery, and to create local concentrations of specific factors (Pirrotta and Li, 2012). Accordingly, PcG proteins are able to fold chromatin at multiple scales by establishing local loops, compacting chromatin domains, and mediating long-range interactions among H3K27me3-associated domains (Figure 5) (Cheutin and Cavalli, 2019). The formation of chromatin domains covered by H3K27me3 and the interaction among these domains shapes a chromatin network that seems to be crucial for PcG function in animals. Recent reports have shown that, in animals, cPRC1 plays a crucial role in creating these domains and in establishing the interaction among them (for a review, see Guo et al., 2021).
Figure 5

PcG proteins and chromatin structure in plants.

Different PcG proteins, such as EMF1, LHP1, or BAH proteins, may promote the formation of H3K27me3-marked domains, chromatin compaction, long-range chromatin loops, and phase-separated Polycomb bodies, which contribute to the maintenance of genes in a stable repressed state.

PcG proteins and chromatin structure in plants. Different PcG proteins, such as EMF1, LHP1, or BAH proteins, may promote the formation of H3K27me3-marked domains, chromatin compaction, long-range chromatin loops, and phase-separated Polycomb bodies, which contribute to the maintenance of genes in a stable repressed state. This cPRC1 ability relies mainly on Drosophila Psc and Ph and vertebrate CBX2 and PHC (Grau et al., 2011; Isono et al., 2013; Wani et al., 2016), but it seems to be independent of cPRC1 monoubiquitination activity, as it persists when RING1B catalytic activity is impaired (Boyle et al., 2020). The IDR of CBX2, which shares biochemical and functional properties with Drosophila Psc-CTR (Grau et al., 2011), and the SAM domain of Ph/PHC, which mediates head-to-tail oligomerization of cPRC1 (Isono et al., 2013), confer this ability. Accordingly, loss of these cPRC1 proteins leads to the dissolution of Polycomb bodies (Isono et al., 2013; Wani et al., 2016; Tatavosian et al., 2019; Plys et al., 2019) (Figure 5). Furthermore, CBX2 and PHC are able to mediate liquid–liquid phase separation (LLPS) that underlies the formation of condensates (Grau et al., 2011; Seif et al., 2020) and has been proposed to induce chromatin compaction and segregate repressed chromatin from transcriptional machinery. Interestingly, data have suggested that H3K27me3 may not be the seeding site for CBX2-mediated LLPS, as H3K27me3 does not prevent the formation of CBX2 condensates in live cells. However, removal of H3K27me3 greatly reduces the bound level of other CBXs, such as CBX7 or CBX8, to chromatin, indicating that they play a different role (Tatavosian et al., 2019). By integrating these and other results, a model has recently been proposed to explain the function of PcG complexes in the mediation of higher order chromatin organization. The model is known as the scaffold-adaptor-client phase separation: CBX2-PRC1 is the scaffold, CBX7-PRC1 is the adaptor, and H3K27me3-marked chromatin is the client. In this model, CBX7-PRC1 recruits H3K27me3-marked chromatin into established CBX2-PRC1 condensates through interactions between CBX7 and H3K27me3 and polymerization of PHC between CBX2-PRC1 and CBX7-PRC1 (Kent et al., 2020). In addition, the ability of BAH-containing proteins to bind H3K27me3 (for instance, BAHCC1; Fan et al., 2020) suggests that these proteins contribute to establishing bridges between surrounding chromatin domains covered with H3K27me3. However, the interplay between these and other factors remains to be dissected. In plants, several studies have suggested that H3K27me3 is a key contributor to chromatin topology. Local interaction of H3K27me3 domains is reduced in the Arabidopsis clfswn double mutant background (Feng et al., 2014). In addition, H3K27me3 is enriched at long-distance interacting loci across the Arabidopsis genome (Liu et al., 2016; Huang et al., 2021). However, it remains unknown whether any plant PcG proteins help to shape these interactions. EMF1 has been reported to show structural and functional similarities to Drosophila Psc-CTR and the IDR of CXB2 in vitro (Grau et al., 2011; Beh et al., 2012). Moreover, a region in the center of EMF1 is required for the formation of Polycomb bodies (Calonje et al., 2008). Therefore, it may participate in mediating phase-separated condensates, which may help to compact chromatin, promote H3K27me3 marking, and establish interactions between H3K27me3-marked domains. In addition, LHP1 contains a disordered hinge region that, when perturbed, causes the Polycomb bodies to be disrupted (Berry et al., 2017), suggesting that LHP1 participates in PcG condensate formation in plants. Moreover, as in animals, the ability of the CHROMO domain of LHP1 and the BAH domain proteins EBS and SHL to bind H3K27me3 could mediate interactions among H3K27me3 domains in plants. In addition, PWO1–4 proteins are proposed to recruit PcG proteins to subnuclear domains and to participate in chromatin compaction (Hohenstatt et al., 2018). In summary, although further work is required to explore the potential roles of these and other proteins in the mediation of plant 3D chromatin structures, the fact that Pc/CBX2 and Psc-CTR activities are linked to PRC2 instead of PRC1 in plants suggest differences in the way these interactions are established and maintained.

Concluding remarks and perspectives

Although our understanding of the PcG repressive mechanism in plants has increased significantly in recent years, many gaps remain. It is becoming clear that PcG complexes regulate gene expression at multiple levels; however, to understand how these different levels of regulation are achieved in plants, we need to identify all the players involved in the system. On the one hand, it is not known whether different plant PRC1 E3 modules exist and whether the BMI1-L truncated proteins can associate in a PRC1. Because the BMI1-Ls do not contain the RING domain, their role in PcG-mediated repression may be independent of H2A monoubiquitination. An intriguing possibility is that they play a role in creating Polycomb domains and establishing interactions among them, similar to animal cPRC1, which displays this function independently of E3 monoubiquitin ligase activity. In addition, compared with animals, the number of identified PRC1 accessory proteins in plants is still not very large; moreover, it is not known whether these or other unknown accessory proteins associate with different E3 modules constituting different plant vPRC1s. On the other hand, even though many more PRC2 accessory proteins are known now than some years ago, in most cases we still do not know whether they associate with different PRC2 cores constituting different PRC2 sub-complexes and whether these sub-complexes are involved in different levels of regulation. Addressing all these questions would require performing AP–MS experiments in different cell types and conditions. Interestingly, although PRC1 and PRC2 enzymatic activities are conserved between animals and plants, several animal PRC1-associated activities have been linked to PRC2 in Arabidopsis. These activities participate in reading and propagating H3K27me3 marks or in mediating chromatin compaction, such as the activities displayed by LHP1 and EMF1. A possible explanation for this finding could lie in the different genomic distributions of H2AK121ub and H3K27me3 marks between animals and plants. In Arabidopsis, H2AK121ub marks are enriched at what could be considered the PcG nucleation region of target genes, whereas H3K27me3 marks co-localize with H2AK121ub at these regions but also extend over gene bodies. By contrast, these two marks co-localize over large regions in animals. Therefore, the spreading of H3K27me3 and chromatin compaction abilities may well be associated with PRC1 in animals, as the two complexes follow the same pattern and maintain a feedforward loop. However, these activities should be linked to PRC2 in plants, as PRC1 and PRC2 markings do not follow the same pace. Interestingly, a considerable number of PRC1 and PRC2 accessory proteins are plant specific, indicating that in plants the PcG system has been re-invented, incorporating a different array of proteins. Nevertheless, similar activities have been recruited, demonstrating the effectiveness of the combination for the repressive mechanism. Finally, it will be important to extend our understanding of the Arabidopsis PcG system to other plants species with larger genomes to determine whether or not the recruitment of the complexes and their roles in shaping chromatin 3D structure follow the same rules.

Funding

This work was supported by a PID2019-106664GB-I00 grant from the .

Author contributions

F.B., A.G.-Z., and M.C. contributed equally to this work.
  173 in total

1.  The SAM domain of polyhomeotic forms a helical polymer.

Authors:  Chongwoo A Kim; Mari Gingery; Rosemarie M Pilpa; James U Bowie
Journal:  Nat Struct Biol       Date:  2002-06

2.  Molecular genetic analysis of Suppressor 2 of zeste identifies key functional domains.

Authors:  Richard B Emmons; Heather Genetti; Stephen Filandrinos; Jillian Lokere; Chao-ting Wu
Journal:  Genetics       Date:  2009-06-15       Impact factor: 4.562

3.  Alfin1, a novel zinc-finger protein in alfalfa roots that binds to promoter elements in the salt-inducible MsPRP2 gene.

Authors:  D R Bastola; V V Pethe; I Winicov
Journal:  Plant Mol Biol       Date:  1998-12       Impact factor: 4.076

4.  H2A deubiquitinases UBP12/13 are part of the Arabidopsis polycomb group protein system.

Authors:  Maria Derkacheva; Shujing Liu; Duarte D Figueiredo; Matthew Gentry; Iva Mozgova; Paolo Nanni; Min Tang; Mattias Mannervik; Claudia Köhler; Lars Hennig
Journal:  Nat Plants       Date:  2016-08-15       Impact factor: 15.793

Review 5.  PRC2 activity, recruitment, and silencing: a comparative perspective.

Authors:  Tomasz Bieluszewski; Jun Xiao; Yiman Yang; Doris Wagner
Journal:  Trends Plant Sci       Date:  2021-07-20       Impact factor: 18.313

6.  Dynamic regulation of H3K27 trimethylation during Arabidopsis differentiation.

Authors:  Marcel Lafos; Phillip Kroll; Mareike L Hohenstatt; Frazer L Thorpe; Oliver Clarenz; Daniel Schubert
Journal:  PLoS Genet       Date:  2011-04-07       Impact factor: 5.917

7.  Whole-genome analysis of histone H3 lysine 27 trimethylation in Arabidopsis.

Authors:  Xiaoyu Zhang; Oliver Clarenz; Shawn Cokus; Yana V Bernatavichute; Matteo Pellegrini; Justin Goodrich; Steven E Jacobsen
Journal:  PLoS Biol       Date:  2007-05       Impact factor: 8.029

8.  Transcriptional repression by PRC1 in the absence of H2A monoubiquitylation.

Authors:  Ana Raquel Pengelly; Reinhard Kalb; Katja Finkl; Jürg Müller
Journal:  Genes Dev       Date:  2015-07-15       Impact factor: 11.361

9.  R-loop resolution promotes co-transcriptional chromatin silencing.

Authors:  Congyao Xu; Zhe Wu; Hong-Chao Duan; Xiaofeng Fang; Guifang Jia; Caroline Dean
Journal:  Nat Commun       Date:  2021-03-19       Impact factor: 14.919

10.  EAR domain-containing transcription factors trigger PRC2-mediated chromatin marking in Arabidopsis.

Authors:  Fernando Baile; Wiam Merini; Inés Hidalgo; Myriam Calonje
Journal:  Plant Cell       Date:  2021-08-31       Impact factor: 11.277

View more
  2 in total

1.  Transcript profiling of Polycomb gene family in Oryza sativa indicates their abiotic stress-specific response.

Authors:  Nikita Yadav; Preeti Nagar; R Rakhi; Ashish Kumar; Archita Rai; Ananda Mustafiz
Journal:  Funct Integr Genomics       Date:  2022-10-05       Impact factor: 3.674

Review 2.  The Importance of Networking: Plant Polycomb Repressive Complex 2 and Its Interactors.

Authors:  James Godwin; Sara Farrona
Journal:  Epigenomes       Date:  2022-03-03
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.