Literature DB >> 24077223

Promiscuous RNA binding by Polycomb repressive complex 2.

Chen Davidovich1, Leon Zheng, Karen J Goodrich, Thomas R Cech.   

Abstract

Polycomb repressive complex 2 (PRC2) is a histone methyltransferase required for epigenetic silencing during development and cancer. Long noncoding RNAs (lncRNAs) recruit PRC2 to chromatin, but the general role of RNA in maintaining repressed chromatin is unknown. Here we measure the binding constants of human PRC2 to various RNAs and find comparable affinity for human lncRNAs targeted by PRC2 as for irrelevant transcripts from ciliates and bacteria. PRC2 binding is size dependent, with lower affinity for shorter RNAs. In vivo, PRC2 predominantly occupies repressed genes; PRC2 is also associated with active genes, but most of those are not regulated by PRC2. These findings support a model in which PRC2's promiscuous binding to RNA transcripts allows it to scan for target genes that have escaped repression, thus leading to maintenance of the repressed state. Such RNAs may also provide a decoy for PRC2.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24077223      PMCID: PMC3823624          DOI: 10.1038/nsmb.2679

Source DB:  PubMed          Journal:  Nat Struct Mol Biol        ISSN: 1545-9985            Impact factor:   15.369


Polycomb repressive complex-2 (PRC2) is a histone methyltransferase that mono-, di- and tri-methylates lysine 27 of histone H3 (H3K27me3), inducing repressed chromatin[1]. EZH2 is the catalytic subunit of PRC2, and SUZ12 is an essential regulatory subunit[2]. EED is a histone-binding subunit that binds H3K27me3-modified histone tails, resulting in increased affinity to nucleosomes and stimulation of the catalytic activity of PRC2[3,4]. PRC2 is further activated by dense chromatin[5]. Marks for active chromatin, H3K4me3 and H3K36me3, are recognized by RBBP4 together with SUZ12 to inhibit PRC2 catalytically and reduce its affinity to nucleosomes[6,7]. Collectively, these activities allow PRC2 to functionally distinguish repressed from transcriptionally active chromatin. Although PRC2 is typically shown to be associated with repressed chromatin and the H3K27me3 mark[8-10], an additional overlap between some PRC2 target genes and the H3K4me3 mark is observed in embryonic[11,12] and T-cells[13], forming bivalent chromatin suggested to be poised for rapid activation. Other recent evidence also tracks PRC2 to transcriptionally active genes[14,15] and suggests association with RNA polymerase II[16] and with Phf1 protein, which mediates recruitment of PRC2 to the H3K36me3 mark[17-19]. These mechanisms are neither fully understood nor unified. In Drosophila PRC2 is recruited to chromatin through Polycomb Response Elements (PREs, reviewed in ref[20]). Despite the discovery of functionally similar elements in vertebrates[21-23], the understanding of PRC2-specific recruiters is far from complete. Evidence indicates that lncRNAs recruit PRC2 to loci designated for silencing. HOTAIR lncRNA recruits PRC2 in trans to the HOXD locus[24,25] and other loci[26]. RepA lncRNA recruits PRC2 during X-chromosome inactivation[27]. Moreover, PRC2 is associated with thousands of RNAs in various cell lines[25,27-31]. A two-hairpin motif has been suggested to be enriched in a subclass of ncRNA that associates with PRC2[28], inspired by a two-hairpin motif that was originally discovered in RepA RNA[27]. More complex RNA structures have also been proposed[32,33]. Yet, the lack of quantitative data on the affinity of PRC2 for its RNA binding partners has limited the understanding of binding specificity. To measure the binding specificity of PRC2, we performed quantitative electrophoretic mobility shift assays (EMSAs) of reconstituted human PRC2 with various RNAs. We show that PRC2 binds RNA promiscuously in vitro, with sub-micromolar affinity. RNA immunoprecipitation coupled with next generation sequencing (RIP-seq), whole transcriptome shotgun sequencing (RNA-seq), Global Run-On Sequencing (GRO-seq) and chromatin immunoprecipitation combined with massively parallel DNA sequencing (ChIP-seq) datasets generated from various mouse cell lines published in multiple independent studies were subjected to comparative analysis, showing that PRC2 binds RNA promiscuously also in vivo. Furthermore, unlike the H3K27me3 mark, which exclusively associates with repressed chromatin, PRC2 associates with both repressed and active genes. Taken together, these results suggest a model for the maintenance of the repressed chromatin state by PRC2, directed by promiscuous binding to nascent RNA transcripts.

RESULTS

Promiscuous RNA binding by PRC2 in vitro

To quantitatively characterize the affinity of PRC2 to RNA in vitro, we expressed and purified nucleic-acid free human PRC2 with either four subunits (EZH2-SUZ12-EED-RBBP4, namely PRC2 4m complex) or five subunits (EZH2-SUZ12-EED-RBBP4-AEBP2, namely PRC2 5m) using the baculovirus system (Fig. 1a). Recombinant PRC2 was efficient in histone methyltransferase activity (Supplementary Fig. 1) and RNA binding; the apparent dissociation constants (Kd) for in vitro transcribed RNA comprising 400 bases from the 5’ end of HOTAIR lncRNA (HOTAIR 400) were 136 ± 22 nM and 255 ± 3 nM for PRC2 5m and PRC2 4m, respectively (Fig 1b,c and Supplementary Fig. 2 for RNA construct design). This two-fold difference in apparent Kd in the presence or absence of AEBP2 may reflect small differences between protein preparations. Data were fit to binding curves with Hill coefficients of 2.3 ± 0.2 for PRC2 5m and 1.7 ± 0.1 for PRC2 4m, suggesting positive binding cooperativity. The affinity of PRC2 4m was similar for HOTAIR 400 and HOTAIR 1–300, an RNA construct comprising 300 bases from the 5’ domain of HOTAIR that was previously shown to bind PRC2[24,34] (Fig 2a). Increased incubation time did not reduce the apparent Kd (Supplementary Fig. 3), supporting binding equilibrium. In agreement with earlier studies[27,29], PRC2 4m demonstrated similar affinity to the sense and antisense strands of the complete A-region sequence found within the human RepA transcript (Supplementary Fig. 4a).
Figure 1

PRC2 binds the 5’ domain of HOTAIR RNA with sub-micromolar affinity, in the presence or absence of AEBP2. (a) Recombinant human PRC2 complexes reconstituted using the baculovirus system to include SUZ12, EZH2, EED and RBBP4 in the presence or absence of the zinc-finger subunit AEBP2 (PRC2 5m and 4m, respectively), analyzed by SDS-PAGE and stained with Coomassie blue. (b) Electrophoretic Mobility Shift Assays (EMSAs) performed using in vitro transcribed RNA including 400 nucleotides from the 5’ of HOTAIR RNA (HOTAIR 400) in the presence of various concentrations of PRC2 4m and 5m. The two gels were run for different times, so the extent of the mobility shift upon protein binding is not meaningful. (c) Complete binding curves for HOTAIR 400 with PRC2 4m and PRC2 5m. Error bars for Kd, Hill and each data point represent range of two independent experiments performed on different days.

Figure 2

Promiscuous binding of RNA by PRC2 in vitro. (a) Representative EMSAs and the corresponding binding curves show binding of PRC2 to HOTAIR 1–300, HOTAIR 400, antisense (as) HOTAIR 1–300 and E. coli MBP 1–300. Kd and Hill coefficients represented with data-range from two independent experiments performed on different days. (b) Binding competition experiment with unlabeled competitors MBP 1–300 and HOTAIR 400, radiolabeled HOTAIR 400 and PRC2 4m.

While previous studies provided valuable qualitative information regarding the association of PRC2 and its subunits with different RNAs[24,25,27-29,34], the complete binding curves obtained here provide the first measurements of sub-micromolar affinity. Given the similar binding affinities of PRC2 to previously characterized RNA partners, we tested RNAs that would not be expected to bind. The E. coli maltose binding protein mRNA originates in an organism lacking polycomb group proteins. Unexpectedly, the first 300 bases of this mRNA (MBP 1–300) bound PRC2 4m with an apparent Kd of 110 ± 10 nM, comparable to that observed for HOTAIR RNA (Fig 2a). MBP 1–300 is not predicted to include the two-hairpin motif (Supplementary Fig. 5) previously observed in some PRC2-associated RNAs[27-29]. Similar affinities were also observed for the antisense (as) strand of HOTAIR 1–300 (Fig. 2a and Supplementary Fig. 4b), Mus musculus telomerase RNA and the P4–P6 domain of the group I intron from the ciliate Tetrahymena (Supplementary Fig. 4a). Excess MBP 1–300 competed HOTAIR 400 from PRC2 (Fig. 2b), suggesting that both RNAs interact with the same binding site on PRC2. Collectively, these data indicate promiscuous RNA binding by PRC2. On the other hand, the affinity of PRC2 to RNA is quite high and, remarkably, is higher than the affinities of its subunit EED for repressive-mark histone-tail peptides H3K27me3, H3K9me3 and H1K26me3 (Kd > 20 µM[3,4]) or of the PRC2 recruiting factor Phf1 for H3K36me3 tails (Kd > 20 µM[17,18] or Kd > 180 µM[19] by different approaches), suggesting biological significance for promiscuous RNA binding by PRC2.

RNA length dependence of PRC2 binding

To further test the observation that PRC2 binds RNA promiscuously, rather than interacting with an unrecognized encrypted binding motif, we generated RNAs comprising 10, 20, 50, 100, 200, 300 and 800 bases from the 5’ end of E. coli MBP mRNA. A complete binding curve was recorded for each RNA (Fig. 3a,b). The Kd decreased with increasing RNA length up to a size of around 300 bases, where the additional increment to 800 bases resulted in no further increase in affinity. Notably, plotting log(Kd) versus log(RNA length) revealed a linear dependence with slope = −1.04 (Fig. 3c). This linear dependency between the length of the RNA and 1/Kd, namely the association constant (Ka), is expected for a protein binding to an array of multiple binding sites[35-37]. On the contrary, if an encrypted binding motif were present in MBP 1–300, one would expect a step function or a sharp increase in Ka with the first RNA that included the motif. Finally, PRC2 binding exhibited increasing cooperativity for RNAs of 200–800 bases (Fig. 3d).
Figure 3

Binding of PRC2 to RNA is size dependent. (a) Representative EMSAs of different in vitro transcribed RNAs that comprise 10, 20, 50, 100, 300 and 800 bases from the 5’ end of E. coli MBP mRNA. The total size of each RNA includes additional five bases that were added through in vitro transcription. (b) Corresponding binding curves. (c) Linear correlation between log10(Kd) and log10(RNA length (bases)). (d) Dependency between RNA length and binding cooperativity (Hill coefficient). Error bars in panels b, c and d represent range of data from two independent experiments performed on different days.

One plausible hypothesis to explain promiscuous PRC2-RNA binding is that PRC2 interacts electrostatically with the RNA phosphate backbone. To test the dependency of Kd on salt concentration, we performed EMSA experiments of HOTAIR 400 and PRC2 5m at various KCl concentrations (Fig. 4a,b). A linear dependence was observed between log(Kd) and log(KCl) with a slope of 0.7 (Fig. 4c), suggesting that at most one salt bridge is mediating binding[38]. This implies that although promiscuous, the interactions between PRC2 and RNA are not primarily electrostatic. Base stacking with aromatic amino acid side chains is an example of such an interaction that can occur in a sequence-independent manner, as observed in the exosome[39].
Figure 4

Interaction between PRC2 and RNA shows little salt dependence. (a) Representative EMSAs of HOTAIR 400 in the presence PRC2 5m performed under different KCl concentrations. (b) Corresponding binding curves. (c) Linear correlation between log10(Kd) and log10(KCl concentration). The apparent number of salt bridges between the RNA and the protein is calculated from the slope of the fitted line[38] and is less than one. Error bars, same as Fig. 3.

Widespread RNA binding by PRC2 in vivo

While our results clearly indicate promiscuous RNA binding by PRC2 in vitro, factors such as posttranslational modifications and presence of other RNA-binding proteins could modify this intrinsic property of PRC2 in vivo. If the complex bound RNA promiscuously also in vivo, we expected that PRC2 would track with RNA from transcriptionally active chromatin genome-wide. To test this hypothesis we analyzed publically available Ezh2 RIP-seq datasets generated using mouse embryonic stem cells (ESC), either wild type (wt) or an Ezh2 knockout (Ezh2(−/−)) control. Ezh2 fold enrichment (Ezh2-FE) was defined for each transcript separately as the degree of over-representation of a given RNA transcript in the wt dataset relative to the Ezh2(−/−) dataset. These data were originally used to identify the pool of transcripts associated with PRC2, namely the PRC2 transcriptome[29]. We used the same criteria: a given RNA was included in the PRC2 transcriptome only if observed with Ezh2-FE ≥ 3 and expression ≥ 0.4 RPKMe (Reads Per Kilobase of exon per Million fragments mapped[40]). In agreement with previous results[29], we found that these criteria identified key RNAs previously shown to associate with PRC2, including Xist, Kcnq1o1 and Gtl2 (also known as Meg3, Fig. 5a). As previously demonstrated[29], the Hotair transcript is lowly expressed in this cell line and therefore did not meet these criteria and was excluded from the PRC2 transcriptome, thereby serving as a negative control for our analysis. A positive internal control is Ezh2, which is under-represented in the Ezh2(−/−) dataset and therefore observed with Ezh2-FE of 20. Importantly, the pool of genes that meet the criteria defining the PRC2 transcriptome is rich with highly expressed genes. These include mRNAs for nuclear-coded mitochondrial ATP synthase F1 alpha subunit 1, alpha-actin, gamma-actin and three PRC2 subunits: Suz12, Rbbp7 and Aebp2 (Fig. 5a). Accordingly, Gene Ontology (GO) analysis of genes with Ezh2-FE > 3 and high wt Ezh2 RIP-seq coverage (> 10 RPKMe, Supplementary Table 1) identified multiple GO terms, where the highly significant terms were related to metabolic processes (Supplementary Table 2).
Figure 5

Widespread binding of RNA by PRC2 in vivo. (a) Analysis of Ezh2 RIP-seq results from mouse ESC determines the PRC2 transcriptome, using the previous criteria[29]. Diagonal line represents Ezh2 RIP-seq Fold Enrichment (Ezh2-FE) of 3-fold for transcripts associated with Ezh2 in wt cells over Ezh2 knockout cells (Ezh2(−/−)). Vertical line represents a cutoff of 0.4 RPKMe. Transcripts belonging to the PRC2 transcriptome lie under the diagonal line and to the right of the vertical line. Green, selected transcripts previously suggested as associated with PRC2. Orange, negative and positive internal controls, Hotair RNA and Ezh2 mRNA, respectively. Red, selected transcripts present in the PRC2 transcriptome that belong to either highly expressed genes or PRC2 subunit mRNAs. Parentheses, Ezh2-FE values. (b) High throughput sequencing data from 35 published ChIP-seq, RNA-seq and GRO-seq experiments were subjected to different types of data acquisition and analysis using identical pipelines. Analysis resulted in 180 data vectors, each describing any of 27,874 autosomal non-redundant mouse Refseq genes. Next, data in each of these 180 data vectors were correlated against Ezh2-FE values of the corresponding genes. Finally, all 180 data vectors were ranked based on the degree of correlation and plotted as bar graph. Red dots, correlation significance as -log10(p-value). Bar size, Spearman’s rho correlation coefficient. Bar color, data classification. Top: bars were color-coded based on chromatin state recorded by the data. Bottom: the same bar graph as described above was color-coded based on different cell types. See Supplementary Table 6 for full description of each dataset.

If most of the transcripts associated with PRC2 were specifically recruiting PRC2 for repression, one would expect genome-wide positive correlation between ChIP-seq H3K27me3 mark and Ezh2-FE and negative correlation between Ezh2-FE and RNA expression level or the H3K36me3 mark for active chromatin. We tested these assumptions by comparative analysis using multiple independent published datasets (Fig. 5b and Supplementary Table 3). Short reads resulting from 35 ChIP-seq, RNA-seq and GRO-seq experiments performed in 8 independent studies and in multiple mouse cell lines were aligned to the mouse genome. To reduce bias, multiple independent approaches for data acquisition and normalization were performed, resulting in 4 or 9 different data vectors for each ChIP-seq or RNA-seq experiment, respectively; these summed to a total of 180 data vectors, each describing 27,874 autosomal non-redundant mouse Refseq genes (See Methods section for detailed description). Next, each of these 180 data vectors was correlated with the Ezh2-FE dataset, and each pair-wise comparison is represented by one bar in Fig. 5b. Strikingly, RNA-seq, GRO-seq and ChIP-seq for active chromatin marks H3K36me3, H3K79me2/3 and RNA polymerase II phosphorylated at Serine 2 (Pol II Ser2) were positively correlated with Ezh2-FE (blue bars are concentrated on the left part of the top graph of Fig. 5b), and repressed marks H3K27me3 and H3K9me3 were negatively correlated (red bars are concentrated on the right). This trend was consistent for almost all datasets that were subjected to this analysis, which spanned multiple cell lines and data-processing approaches. Interestingly, ChIP-seq datasets for SUZ12 and EZH2 did not follow this trend, and demonstrated either a positive or a negative correlation in different datasets. Classification of datasets based on cell lines showed no bias (Fig. 5b, bottom bar graph). These results suggest that the broad association of PRC2 with transcripts from active genes occurs in a cell-line independent manner. These data cannot exclude regulation in trans or specific recruitment of PRC2 by a subset of transcripts with greater affinity and specificity. Yet, the positive correlation observed between Ezh2-FE and active genes, in a genome-wide context and cell-line independent manner, implies that PRC2 predominantly binds RNA promiscuously in vivo.

PRC2 found at both repressed and active genes in vivo

ChIP-seq provides information complementary to that provided by RIP-seq. ChIP-seq signals could represent recruitment of PRC2 by RNA binding as well as other interactions, while RIP-seq signals represent RNA-protein binding that may or may not occur when the RNP is associated with chromatin. We analyzed ChIP-seq datasets for EZH2, H3K27me3, H3K4me3 and H3K36me3 marks published for five different human cell lines (Fig. 6a,b). We defined genes as associated with a given mark only if a ChIP-seq peak was called in proximity to the corresponding TSS. In agreement with previous studies[8-10,41], our analysis revealed 1328 to 3443 genes that associate with EZH2 in different cell lines. Most of these EZH2-associated genes also included the H3K27me3 mark (Fig. 6a), either in the absence or presence of H3K4me3 marks (i.e. bivalent chromatin). Yet, in most cell lines we could also detect a substantial amount, up to 30%, of EZH2-associated genes that were devoid of the H3K27me3 mark and were associated with an H3K4me3 or H3K36me3 mark. Over 90% of EZH2-associated genes were also associated with other marks, either repressed or active. In contrast, only a small fraction of H3K27me3-associated genes were associated also with the H3K36me3 mark (Fig. 6b). These data suggest that although the dominant pattern in vivo is PRC2 residing at repressed chromatin domains, PRC2 is also present at transcriptionally engaged or active genes to a significant extent.
Figure 6

PRC2 associates with active genes, in addition to its predominant association with repressed chromatin. (a) EZH2-associated genes were classified based on their association with other chromatin marks. Numbers in parentheses represent the number of EZH2-associated genes identified in each cell line. (b) H3K27me3-associated genes were classified based on their association with other chromatin marks. Numbers in parentheses represent the number of H3K27me3-associated genes identified in each cell line. (c) Heatmaps for chromatin marks H3K4me3 and H3K36me3 (ChIP-seq data) and for RNA-seq in mouse E14 cell line were generated using the same datasets, but presented using different types of sorting. Refseq mouse genes were sorted by three different criteria: reads (from 0.5 kb upstream of TSSs to 0.5 kb downstream) of either H3K27me3 (blue), EZH2 (red) or differential coverage obtained by subtracting the number of normalized H3K27me3 reads from the number of normalized EZH2 reads at the same position (EZH2 - H3K27me3, red). Below, top-ranked 20% of genes, resulting from these three different types of sorting, were used to generate enrichment profiles (same color key used). Titles of enrichment profiles represent the corresponding epigenetic mark. ‘Sorting dataset’ indicates the datasets that determined the three types of sorting, namely top 20% of genes ranked according to read occupancy around TSS of either H3K27me3 (blue), EZH2 (red) or EZH2 – H3K27me3 (red).

Analysis of mouse embryonic cell data provided additional support for the conclusion that EZH2 occupancy, unlike the H3K27me3 mark, is not restricted to inactive chromatin. When TSSs of the entire mouse genome were sorted by the occupancy of the repressed mark H3K27me3, active genes (identified by a high density of reads for H3K4me3, H3K36me3 and RNA-seq) were ranked low in the heatmap, as expected (Fig. 6c, in blue). Accordingly, enrichment profiles generated for the top 20% ranked genes, resulting from this sorting, were observed to have a prominent camelback profile for H3K27me3 accumulated reads, with two broad peaks spanning the TSS position (Fig. 6c, bottom left enrichment profile, blue line). The same set of genes had low representation of active chromatin marks (Fig. 6c, blue line within enrichment plots labeled H3K4me3, H3K36me3 and RNA-seq). When mouse genes were sorted based on EZH2 occupancy, repressed genes were still ranked at the very top of the heatmap, but now active genes were ranked much higher (Fig. 6c, green heatmaps and green line within enrichment profiles) relative to sorting on the basis of H3K27me3. This analysis suggests distinct profiles for EZH2 and H3K27me3, where H3K27me3 is associated with repressed chromatin and EZH2 is predominantly associated with H3K27me3 but tracks also with active genes. To emphasize this observation, genes were further sorted based on differential ChIP-seq coverage, obtained by subtracting the number of normalized H3K27me3 reads from the number of normalized EZH2 reads within the same chromosomal location (EZH2 – H3K27me3, Fig. 6c, red heatmap). This operation aimed to identify genes where EZH2 was present in the absence of a significant H3K27me3 mark. Indeed, the top 20% of genes (based on EZH2 – H3K27me3 differential coverage) were observed to have a prominent accumulation of the active histone mark over their TSSs as well as RNA production (Fig. 6c, red line within enrichment plots).

EZH2-associated active genes are not PRC2-regulated

We hypothesized that the association of PRC2 with transcriptionally active genes would not necessarily force repression, as transcriptionally active chromatin marks are known to inhibit PRC2 catalytic activity[6,7]. As a test, we knocked down the PRC2 essential subunit SUZ12[2] in HEK293T/17 cells (Fig. 7a). Of the 736 genes whose expression changed significantly (false discovery rate < 0.05) after SUZ12 knockdown, 338 were up-regulated (Fig. 7b and Supplementary Table 4). Gene Ontology (GO) analysis showed enrichment for regulation and development terms for up-regulated genes (Supplementary Table 5), with a notable specialization in neuron development (Supplementary Fig. 6). This finding correlates with the proposed neuronal origin of HEK293 cells[42].
Figure 7

Knockdown of SUZ12 in HEK293T/17 cells, combined with ChIP-seq and RNA-seq, confirms widespread association of EZH2 with H3K4me2/3 and Pol II Ser5, but expression of the vast majority of these genes does not respond to SUZ12 knockdown. (a) Knockdown of SUZ12 achieved after 48 hours, confirmed by qPCR and RNA-seq (right) and immunoblotting (left) and accompanied by global depletion of the H3K27me3 mark (left). (b) Top Euler diagram demonstrates large overlap between genes associated with EZH2, H3K4me2/3 and Pol II Ser5 in HEK293T/17 cells. Bottom Euler diagram demonstrates the small intersect between genes that responded to SUZ12 knockdown, identified based on two biological replicates, and genes associated with EZH2. (c) Only a small fraction of EZH2-associated genes is transcriptionally regulated by PRC2.

To identify genes that are physically associated with PRC2, we performed ChIP-seq for EZH2. Genes that were engaged in transcription were identified by ChIP-seq for H3K4me2/3 and ChIP-seq for RNA polymerase II phosphorylated at serine 5 (Pol II Ser5, performed using the same cell line within the scope of another study[43]). 2355 genes were associated with EZH2 (Fig. 7b). A large overlap was observed between EZH2-associated genes and H3K4me2/3 and Pol II Ser5 (hypergeometric p-value 5.43e–164 and 9.87e–183, respectively), in accord with previous reports[11-13,28]. Yet, only 51 of the 2355 genes that were physically associated with EZH2 responded to SUZ12 knockdown (Fig. 7b), a statistically insignificant overlap (hypergeometric p-value 0.813). Even if our ChIP-seq analysis is not sensitive enough to detect all EZH2-associated genes within repressed chromatin, we can conclude that the majority of EZH2-associated genes that were engaged in transcription were not regulated by PRC2 (Fig. 7c).

DISCUSSION

Our in vitro PRC2-RNA binding assays provide the first quantitative measurements of sub-micromolar affinity of PRC2 to RNA; such affinity is likely to be biologically meaningful. Yet, our data also show that a fundamental property of PRC2 is to bind RNA promiscuously. Although post-translational modification or partner proteins may modulate this fundamental activity of PRC2 in vivo, we find that it correlates with results obtained by Ezh2 RIP-seq experiments in vivo. The PRC2 transcriptome includes thousands of transcripts[29], among which one can pinpoint selected RNAs that were previously reported to recruit PRC2. Yet our comparative analysis suggests that Ezh2-FE correlates positively with active genes and negatively with repressed genes, in a cell line independent manner. Importantly, this does not rule out other PRC2 recruitment models that were previously suggested but implies that, in parallel to them, a strong driving force directs PRC2 to transcripts of a wide variety of genes, both active and insufficiently repressed. Our ChIP-seq analysis is in good agreement with previous studies[8-10,44] and shows that most of gene promoters that associate with EZH2 are also associated with H3K27me3 (Fig. 6). The differences between association patterns of PRC2 with RNA (RIP-seq) and chromatin (ChIP-seq) suggest that the majority of PRC2 that associates with RNA transcripts is never deposited to promoter regions of active genes. Alternatively, PRC2 may not stay there long enough to be cross-linked in a ChIP-seq experiment. There was a small, but yet notable, fraction of genes that tracked with EZH2 ChIP-seq peaks at promoter regions together with H3K36me3 or H3K4me3 marks and in the absence of H3K27me3; this suggests that such deposition of PRC2 to TSS of active genes does occur to some degree. The lack of H3K27me3 marks in the vicinity of these TSSs indicates that PRC2 is not actively repressing these genes. PRC2 has the remarkable ability to recognize and be regulated by chromatin compaction[5] and the epigenetic marks H3K27me3[3,4], H3K4me3 and H3K36me3[6,7]. This allows the complex to functionally discriminate silent chromatin domains, or those that are in the process of being repressed, from transcriptionally active chromatin. Here we suggest an additional layer to this model, in which promiscuous RNA binding by PRC2 allows it to recognize transcriptionally active chromatin domains and scan them for genuine target genes that are already decorated with some H3K27me3 mark, but are not fully silenced. As both the affinity and cooperativity of promiscuous RNA binding by PRC2 increases with the RNA length (Fig. 3c,d), we speculate that PRC2 has evolved to track nascent RNA transcripts in genes that escaped from repression. Once PRC2 recognizes the H3K27me3 mark within a transcriptionally active gene, it will bind nucleosomes and be catalytically stimulated[3,4], restoring repression. Contrarily, if PRC2 recognizes active marks H3K4me3 and H3K36me3, its histone methyltransferase activity will be inhibited and it will not be deposited from the RNA to nucleosomes[6,7]. Further studies are required to identify and distinguish among different recruitment modes of PRC2 to chromatin for transcriptional reprogramming and repression maintenance. While promiscuous RNA binding by PRC2 can facilitate scanning of chromatin, in highly expressed genes the RNA could additionally serve as a decoy, or natural sink, that can strip PRC2 away from chromatin as transcripts roll off of the DNA template. Similarly, exclusion of PRC2 from chromatin by Tsix lncRNA[27] and removal of CTCF from chromatin by Jpx lncRNA[45] have been suggested. The observations that Ezh2-FE is positively correlated with active genes and negatively correlated with repressed genes (Fig. 5b) together with the fact that EZH2 ChIP-seq tags are devoid of most highly expressed genes (Fig. 6a,c) fit with this idea. In the absence of previously deposited H3K27me3 mark, PRC2 is unlikely to be transferred from nascent transcripts to chromatin, leaving these genes active. Therefore, most highly expressed genes whose transcripts interact with PRC2, based on high Ezh2-FE (Fig. 5a), are unlikely to be repressed by the complex, as deposition to chromatin will never occur. Thus, promiscuous RNA binding is likely to boost the process of polycomb-mediated repression and serve as a checkpoint to prevent escape from silencing. In this model, this simple yet robust process allows PRC2 to use the most direct outcome of transcription, namely RNA, to maintain the repressed chromatin state. Although RNA-bound PRC2 is poised for repression, the actual choice is directed by the local chromatin context. An analogy occurs with commercial organizations that use rough and general criteria, such as annual income[46], to target ‘junk mail’ advertising to mailboxes of people who could potentially, but not necessarily, be their customers. Whether to become functionally involved is the choice of the recipient. In this ‘Junk Mail Model’ (Fig. 8), PRC2 is the junk mail, chromosomal loci are mailboxes, transcriptional activity is the criterion for delivery, and the local chromatin context (active or repressive histone marks and chromatin compaction) dictates the response.
Figure 8

The Junk Mail Model for repressed chromatin maintenance by PRC2 utilizing promiscuous RNA binding. Upon inappropriate transcription of a genuine polycomb target gene, recruitment of PRC2 through promiscuous RNA binding results in recognition of previously deposited H3K27me3 marks, stimulation of PRC2 histone methyltransferase activity and restoration of repression. Contrarily, if PRC2 binds to a nascent transcript of an active gene, in the absence of the H3K27me3 mark and in the presence of H3K36me3 and H3K4me3 marks, the histone methyltransferase activity of PRC2 is inhibited and its affinity to nucleosomes reduced, resulting in inefficient deposition to chromatin.

While earlier studies provided insights for target-specific recruitment of PRC2 for de novo repression by ncRNAs[24-29], our model proposes a new role for promiscuous RNA binding in maintaining the repressed chromatin state. Such functional duality has been well established for promiscuous enzymes that can process various substrates, some with increased affinity and catalytic efficiency (reviewed in ref[47]). Accordingly, previously described RNA motifs may still be preferred sites of PRC2 binding, and specific binding may be enhanced by post-transcriptional modifications of the RNA, by post-translational modifications of PRC2 subunits, as previously suggested[34], or by bridging proteins. As was previously shown for PRC2 recruitment by short ncRNAs[28], association of PRC2 with chromatin is predominantly around TSSs, rather than gene bodies (Figure 6c). While earlier results explained the association of PRC2 with promoters of repressed genes, our data show that a substantial amount of promoters that associate with EZH2 are active (Figure 6). Based on the Junk Mail Model, the association of PRC2 with TSSs of active genes, rather than genes bodies, can be attributed to the relatively open chromatin in these regions or the exclusion of PRC2 from gene bodies by other factors (e.g., elongating polymerases, histone acetylation marks or splicing factors). Above all, the massive amount of elongating RNA can serve as decoy to strip PRC2 away from chromatin while transcription machinery progresses into the gene body. The relative contributions of these various processes should be addressed in future studies. Although our data and analysis are restricted to PRC2, we suggest that the Junk Mail Model may potentially be generalized to other systems. Thus, promiscuous RNA binding may facilitate the recruitment of chromatin remodeling and other factors to transcriptionally active chromatin by direct binding to the most immediate product of the transcriptional machinery, RNA.

METHODS

Methods and associated references are available in the online version of the paper at http://www.nature.com/nsmb/.
  56 in total

Review 1.  Polycomb silencing mechanisms and the management of genomic programmes.

Authors:  Yuri B Schwartz; Vincenzo Pirrotta
Journal:  Nat Rev Genet       Date:  2007-01       Impact factor: 53.242

2.  Jpx RNA activates Xist by evicting CTCF.

Authors:  Sha Sun; Brian C Del Rosario; Attila Szanto; Yuya Ogawa; Yesu Jeon; Jeannie T Lee
Journal:  Cell       Date:  2013-06-20       Impact factor: 41.582

3.  Histone methylation by PRC2 is inhibited by active chromatin marks.

Authors:  Frank W Schmitges; Archana B Prusty; Mahamadou Faty; Alexandra Stützer; Gondichatnahalli M Lingaraju; Jonathan Aiwazian; Ragna Sack; Daniel Hess; Ling Li; Shaolian Zhou; Richard D Bunker; Urs Wirth; Tewis Bouwmeester; Andreas Bauer; Nga Ly-Hartig; Kehao Zhao; Homan Chan; Justin Gu; Heinz Gut; Wolfgang Fischle; Jürg Müller; Nicolas H Thomä
Journal:  Mol Cell       Date:  2011-05-06       Impact factor: 17.970

4.  FUS binds the CTD of RNA polymerase II and regulates its phosphorylation at Ser2.

Authors:  Jacob C Schwartz; Christopher C Ebmeier; Elaine R Podell; Joseph Heimiller; Dylan J Taatjes; Thomas R Cech
Journal:  Genes Dev       Date:  2012-12-15       Impact factor: 11.361

5.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

6.  BEDTools: a flexible suite of utilities for comparing genomic features.

Authors:  Aaron R Quinlan; Ira M Hall
Journal:  Bioinformatics       Date:  2010-01-28       Impact factor: 6.937

7.  The transcriptional and epigenomic foundations of ground state pluripotency.

Authors:  Hendrik Marks; Tüzer Kalkan; Roberta Menafra; Sergey Denissov; Kenneth Jones; Helmut Hofemeister; Jennifer Nichols; Andrea Kranz; A Francis Stewart; Austin Smith; Hendrik G Stunnenberg
Journal:  Cell       Date:  2012-04-27       Impact factor: 41.582

8.  lincRNAs act in the circuitry controlling pluripotency and differentiation.

Authors:  Mitchell Guttman; Julie Donaghey; Bryce W Carey; Manuel Garber; Jennifer K Grenier; Glen Munson; Geneva Young; Anne Bergstrom Lucas; Robert Ach; Laurakay Bruhn; Xiaoping Yang; Ido Amit; Alexander Meissner; Aviv Regev; John L Rinn; David E Root; Eric S Lander
Journal:  Nature       Date:  2011-08-28       Impact factor: 49.962

9.  2-D structure of the A region of Xist RNA and its implication for PRC2 association.

Authors:  Sylvain Maenner; Magali Blaud; Laetitia Fouillen; Anne Savoye; Virginie Marchand; Agnès Dubois; Sarah Sanglier-Cianférani; Alain Van Dorsselaer; Philippe Clerc; Philip Avner; Athanase Visvikis; Christiane Branlant
Journal:  PLoS Biol       Date:  2010-01-05       Impact factor: 8.029

10.  Model-based analysis of ChIP-Seq (MACS).

Authors:  Yong Zhang; Tao Liu; Clifford A Meyer; Jérôme Eeckhoute; David S Johnson; Bradley E Bernstein; Chad Nusbaum; Richard M Myers; Myles Brown; Wei Li; X Shirley Liu
Journal:  Genome Biol       Date:  2008-09-17       Impact factor: 13.583

View more
  238 in total

Review 1.  The roles and regulation of Polycomb complexes in neural development.

Authors:  Matthew Corley; Kristen L Kroll
Journal:  Cell Tissue Res       Date:  2014-11-01       Impact factor: 5.249

Review 2.  The Necessity of Chromatin: A View in Perspective.

Authors:  Vincenzo Pirrotta
Journal:  Cold Spring Harb Perspect Biol       Date:  2016-01-04       Impact factor: 10.005

3.  Poly-combing the genome for RNA.

Authors:  Loyal A Goff; John L Rinn
Journal:  Nat Struct Mol Biol       Date:  2013-12       Impact factor: 15.369

Review 4.  RNA-mediated regulation of heterochromatin.

Authors:  Whitney L Johnson; Aaron F Straight
Journal:  Curr Opin Cell Biol       Date:  2017-06-11       Impact factor: 8.382

5.  Stability of local secondary structure determines selectivity of viral RNA chaperones.

Authors:  Jack P K Bravo; Alexander Borodavka; Anders Barth; Antonio N Calabrese; Peter Mojzes; Joseph J B Cockburn; Don C Lamb; Roman Tuma
Journal:  Nucleic Acids Res       Date:  2018-09-06       Impact factor: 16.971

Review 6.  Long Noncoding RNA Discovery in Cardiovascular Disease: Decoding Form to Function.

Authors:  Tamer Sallam; Jaspreet Sandhu; Peter Tontonoz
Journal:  Circ Res       Date:  2018-01-05       Impact factor: 17.367

Review 7.  The rise of regulatory RNA.

Authors:  Kevin V Morris; John S Mattick
Journal:  Nat Rev Genet       Date:  2014-04-29       Impact factor: 53.242

8.  RNA-DNA strand exchange by the Drosophila Polycomb complex PRC2.

Authors:  Célia Alecki; Victoria Chiwara; Lionel A Sanz; Daniel Grau; Osvaldo Arias Pérez; Elodie L Boulier; Karim-Jean Armache; Frédéric Chédin; Nicole J Francis
Journal:  Nat Commun       Date:  2020-04-14       Impact factor: 14.919

Review 9.  H3K27 methylation: a promiscuous repressive chromatin mark.

Authors:  Elizabeth T Wiles; Eric U Selker
Journal:  Curr Opin Genet Dev       Date:  2016-12-08       Impact factor: 5.578

10.  Topological organization of multichromosomal regions by the long intergenic noncoding RNA Firre.

Authors:  Ezgi Hacisuleyman; Loyal A Goff; Cole Trapnell; Adam Williams; Jorge Henao-Mejia; Lei Sun; Patrick McClanahan; David G Hendrickson; Martin Sauvageau; David R Kelley; Michael Morse; Jesse Engreitz; Eric S Lander; Mitch Guttman; Harvey F Lodish; Richard Flavell; Arjun Raj; John L Rinn
Journal:  Nat Struct Mol Biol       Date:  2014-01-26       Impact factor: 15.369

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.