| Literature DB >> 23269663 |
Jonathan I Gent1, Nathanael A Ellis, Lin Guo, Alex E Harkess, Yingyin Yao, Xiaoyu Zhang, R Kelly Dawe.
Abstract
Small RNA-mediated regulation of chromatin structure is an important means of suppressing unwanted genetic activity in diverse plants, fungi, and animals. In plants specifically, 24-nt siRNAs direct de novo methylation to repetitive DNA, both foreign and endogenous, in a process known as RNA-directed DNA methylation (RdDM). Many components of the de novo methylation machinery have been identified recently, including multiple RNA polymerases, but specific genetic features that trigger methylation remain poorly understood. By applying whole-genome bisulfite sequencing to maize, we found that transposons close to cellular genes (particularly within 1 kb of either a gene start or end) are strongly associated with de novo methylation, as evidenced both by 24-nt siRNAs and by methylation specifically in the CHH sequence context. In addition, we found that the major classes of transposons exhibited a gradient of CHH methylation determined by proximity to genes. Our results further indicate that intergenic chromatin in maize exists in two major forms that are distinguished based on proximity to genes-one form marked by dense CG and CHG methylation and lack of transcription, and one marked by CHH methylation and activity of multiple forms of RNA polymerase. The existence of the latter, which we call CHH islands, may have implications for how cellular gene expression could be coordinated with immediately adjacent transposon repression in a large genome with a complex organization of genes interspersed in a landscape of transposons.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23269663 PMCID: PMC3613580 DOI: 10.1101/gr.146985.112
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Example distributions of 24-nt siRNAs and methylation near genes. (A) An example of an ∼20-kb region of chromosome 2 showing clusters of 24-nt siRNAs near genes. Each blue segment represents a single siRNA. The figure was modified from a screenshot from the Genomaize Genome Browser (http://genomaize.org; released 15 March 2012) with the B73 reference genome, version 2. Genomaize is derived from the UCSC Genome Browser (Kent et al. 2002). (B) An example 2-mb region of the genome showing methylation frequency (methylcytosine per total cytosine in each specific sequence context, left axis) and RNA abundance (normalized read count, right axis) for each 10-kb interval. mRNA reads are from a previously published study (Wang et al. 2009). For mRNA in A and siRNA reads in B, repetitively mapping reads were excluded.
Figure 2.Genome-wide summary of 24-nt siRNA and methylation distributions near genes. (A–F) Distributions of methylation in each sequence context near genes. Both relative frequency (5-methylcytosine over total cytosine in the specific sequence context) and absolute frequency (5-methylcytosine over total nucleotides) are shown. The genes were divided into four sets based on expression level. Methylation values were measured for each 100-bp interval in a 2-kb region upstream of and downstream from gene ends for all annotated genes in the filtered gene set (version 5b). Values were also measured for the first 600 bp inside genes on each end. (G) Comparison of gene expression vs. flanking CHH methylation. Genes were split into four quartiles based on the level of CHH methylation in either the upstream or downstream 1 kb. The average expression level for each quartile is shown in RPKM (reads per kilobase per million mapped reads). (Error bars) Standard errors of the means; (*) statistically significant difference between means (P-value < .005). (H) Distributions of 24-nt siRNAs near genes. The average number of 24-nt siRNAs that aligned within each 100-bp interval is displayed for each set of genes. Both uniquely aligning and repetitive siRNAs are included in this plot. For an analysis of just unique siRNAs, see Supplemental Figure S2A.
Correlations between abundance of 24-nt siRNAs and enrichment for locations within 1-kb upstream of genes across diverse transposon superfamilies
Figure 3.Methylation of transposons. (A) Comparison of transposon methylation in 1-kb regions upstream of genes with transposon methylation in the whole genome. In cases where a transposon copy extended beyond the 1-kb region, only the overlapping portion contributed to the analysis. For each superfamily, the difference in CHH methylation between the set of all copies (light blue, whole genome) and the subset upstream of genes (darker blue) was statistically significant (P-value < 0.005). Also shown are genome averages (both for the whole genome and for the regions within 1-kb upstream of genes) and transposon-free comparisons. Since excluding transposons enriched for genes, genes were also excluded. “Local” consists of portions of 1-kb regions that do not correspond to transposons or genes; “whole 1 KB” excludes entire 1-kb regions if they overlap at all with transposons or genes. (B) Comparison of methylation within single transposon copies relative to proximity of nearby genes. Copies that were contained in or overlapped with the region 1-kb upstream of genes were split into two halves, and the levels of methylation for the proximal and distal halves were measured separately. The transposons were also categorized by orientation relative to the genes, and the methylation averages for each orientation are shown separately. (Error bars) Standard errors of the means. For each of the superfamilies except for Tc1/Mariner and L1, the differences in CHH methylation between each half were statistically significant for both orientations (P-value < 0.005).