| Literature DB >> 31289233 |
Justyna Cholewa-Waclaw1, Ruth Shah1, Shaun Webb1, Kashyap Chhatbar1, Bernard Ramsahoye2, Oliver Pusch3, Miao Yu4, Philip Greulich5,6, Bartlomiej Waclaw7, Adrian P Bird8.
Abstract
Patterns of gene expression are primarily determined by proteins that locally enhance or repress transcription. While many transcription factors target a restricted number of genes, others appear to modulate transcription levels globally. An example is MeCP2, an abundant methylated-DNA binding protein that is mutated in the neurological disorder Rett syndrome. Despite much research, the molecular mechanism by which MeCP2 regulates gene expression is not fully resolved. Here, we integrate quantitative, multidimensional experimental analysis and mathematical modeling to indicate that MeCP2 is a global transcriptional regulator whose binding to DNA creates "slow sites" in gene bodies. We hypothesize that waves of slowed-down RNA polymerase II formed behind these sites travel backward and indirectly affect initiation, reminiscent of defect-induced shockwaves in nonequilibrium physics transport models. This mechanism differs from conventional gene-regulation mechanisms, which often involve direct modulation of transcription initiation. Our findings point to a genome-wide function of DNA methylation that may account for the reversibility of Rett syndrome in mice. Moreover, our combined theoretical and experimental approach provides a general method for understanding how global gene-expression patterns are choreographed.Entities:
Keywords: DNA methylation; MeCP2; gene regulation; mathematical modelling
Mesh:
Substances:
Year: 2019 PMID: 31289233 PMCID: PMC6660794 DOI: 10.1073/pnas.1903549116
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.Gene expression strongly correlates with gene body mCG density and MeCP2 abundance. (A) Experimental design (). d0, day 0; d9, day 9. (B) Mean number of MeCP2 molecules per nucleus. (C) Log2 fold change of gene expression (Log2FC) relative to appropriate controls (ctr, unmodified controls; SCR, scrambled shRNA control; OE ctr, overexpression control) for all seven levels of MeCP2, plotted against gene body mCG density. All Log2FC values have been shifted so that Log2FC averaged over all genes is zero. Black line indicates the maximum slope. (D) The maximum slope for gene bodies varies proportionally to MeCP2 abundance. (E) Ratio between luciferase expressions from an unmethylated and gene-body methylated constructs, for three cases: no MeCP2, WT MeCP2, and a methyl-CpG binding domain mutant R111G that is unable to bind mCG. Points show individual replicates. In all panels, error bars represent ±SEM.
Fig. 2.MeCP2 occupancy on the DNA is proportional to mCG density and MeCP2 level. (A) MeCP2 ChIP- and ATAC-seq experimental procedures and their in silico counterparts. p and p are probabilities of background and MeCP2-bound reads, respectively. Tn5 insertion sites (scissors) occur in exposed DNA regions. (B) ChIP-seq enrichment profiles centered at mCG dinucleotides for different cell lines. Black lines represent in silico profiles fitted to the experimental data. (C) MeCP2 ChIP-seq enrichment data in OE 11x/KO (red) as a function of mCG density. (D) Average depletion profiles (logarithm of the ratio between the number of Tn5 insertions in a given cell line and KO1; two to four biological replicates) in the ±100-bp regions surrounding mCG dinucleotides. Black lines represent computer simulations of the model fitted to the data. (E) Predicted fraction of mCGs occupied by MeCP2 vs. MeCP2 level obtained from depletion profiles in D. Error bars represent ±SEM.
Fig. 3.MeCP2 does not regulate transcription via condensation of chromatin or premature termination. (A) A cartoon of the condensation model. Tangles represent regions of condensed chromatin that are inaccessible to RNA Pol II. (B) Chromatin accessibility (measured by ATAC-seq) at promoters rapidly decreases with increasing promoter methylation. In contrast, MeCP2 has a minor effect on accessibility (curves for OEs 4x and 11x are slightly lower than for KO). (C) The condensation model disagrees with Log2FC(OE 11x/KO) obtained from RNA-seq. (D) Schematic representation of the detachment model. (E) Log2FC (gene expression) for KO/ctr (purple) vs. the total number of mCGs per gene. Black lines represent predictions of the detachment model. Error bars represent ±SEM. (F) As in E for OE 11x/OE ctr (red).
Fig. 4.Mathematical modeling indicates that MeCP2 slows down transcriptional elongation. (A) Schematic representation of the dynamical obstacles model. (B) Transcription rate J predicted by the model, plotted as a function of the initiation rate α, for different mean MeCP2 densities in gene bodies. (C) Space–time plots (kymographs) representing Pol II moving along the gene. Queues of Pol II induced by MeCP2 can reach the TSS (red dot) and block initiation if both the initiation rate (α) and the density of MeCP2 (ρ) are sufficiently high (C, Left). (D) Schematic representation of Pol II (gray) density shockwaves forming behind MeCP2 (blue). Black line is the local density of Pol II. (E) Log2FC (gene expression) vs. mCG density in gene bodies obtained in computer simulations of the dynamical obstacles model (black solid lines) fitted to the OE 11x/OE ctr RNA-seq dataset (red) agrees well with experimental data for OE 4x/OE ctr (orange) and KO/ctr (purple) datasets. Error bars represent ±SEM. (F) The maximum slope of Log2FC (gene expression) vs. mCG density in gene bodies, predicted by the dynamical obstacles model (black line). Points are experimental slopes from Fig. 1.
Fig. 5.MeCP2 slows down transcription via a mechanism involving NCoR. (A) Location of two binding domains in MeCP2 that are relevant for the proposed mechanism: methyl-CpG binding domain (MBD) and NCoR-interaction domain (NID). The mutation R111G causes MeCP2 to lose the ability to bind specifically to mCG. The mutation R306C prevents MeCP2 from binding the NCoR complex. (B) Level of MeCP2 (Western blot) in two overexpressed mutant cell lines (R111G and R306C) and the overexpression control cell line (OE ctr). OE 11x is shown for comparison. Values are averaged over three biological replicates and normalized by the level of histone H3. (C) Log2FC (expression) of OE R111G/OE ctr shows almost no dependence on mCG density in gene bodies (black). The gray line shows the maximum slope. (D) Log2FC (expression) of OE R306C/OE ctr shows a small negative correlation with gene body mCG density (brown). The gray line shows the maximum slope. (E) Maximum slopes for all cell lines including OE R111G (black) and OE R306C (brown) from C and D vs. MeCP2 level (Western blot). In all plots, error bars represent ±SEM.