| Literature DB >> 30651589 |
Robert Horvath1, Benjamin Laenen2, Shohei Takuno3, Tanja Slotte4.
Abstract
Gene-body methylation (gbM) refers to an increased level of methylated cytosines specifically in a CG sequence context within genes. gbM is found in plant genes with intermediate expression level, which evolve slowly, and is often broadly conserved across millions of years of evolution. Intriguingly however, some plants lack gbM, and thus it remains unclear whether gbM has a function. In animals, there is support for a role of gbM in reducing erroneous transcription and transcription noise, but so far most studies in plants have tested for an effect of gbM on expression level, not noise. Here, we therefore tested whether gbM was associated with reduced expression noise in Arabidopsis thaliana, using single-cell transcriptome sequencing data from root quiescent centre cells. We find that gbM genes have lower expression noise levels than unmethylated genes. However, an analysis of covariance revealed that, if other genomic features are taken into account, this association disappears. Nonetheless, gbM genes were more consistently expressed across single-cell samples, supporting previous inference that gbM genes are constitutively expressed. Finally, we observed that fewer RNAseq reads map to introns of gbM genes than to introns of unmethylated genes, which indicates that gbM might be involved in reducing erroneous transcription by reducing intron retention.Entities:
Mesh:
Year: 2019 PMID: 30651589 PMCID: PMC6781109 DOI: 10.1038/s41437-018-0181-z
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Observed differences between gbM and unmethylated genes regarding various genomic features based on a two-sided Mann–Whitney U test
| Genomic features | gbM | Unmethylated | |
|---|---|---|---|
| Estimated stochastic gene expression | 0.96 | 1.11 | <2 × 10−16 |
| Estimated transcriptional noise | 2.94 | 3.10 | 3.16 × 10−14 |
| Expression consistency | 14 | 11 | <2 × 10−16 |
| Mean expression level | 8.64 | 13.85 | <2 × 10−16 |
| Gene length (bp) | 3450 | 1783 | <2 × 10−16 |
| Lethal gene | 23.4% | 13.1% | <2 × 10−16 |
| 0.156 | 0.153 | 0.179 | |
| 0.017 | 0.019 | 3.15 × 10−5 | |
| Co-expression module size | 19 | 14 | <2 × 10−16 |
| Expression breadth | 64 | 64 | <2 × 10−16 |
| Gene duplicates retained from the α WGD | 26.77% | 33.06% | 2.29 × 10−10 |
| Gene duplicates retained from the βγ WGD | 10.29% | 13.56% | 3.72 × 10−6 |
| Tandem duplicated genes | 7.6% | 10% | 1.05 × 10−4 |
The differences are shown as proportions for binary variables and as median values for non-binary variables. P values were corrected for multiple testing using a Benjamini and Hochberg P value adjustment
Fig. 1Proportion of gbM (black), unmethylated (white) and other (highly methylated non-gbM and ambiguous; grey) genes found to be expressed in n number of single-cell samples of A. thaliana QC cells
Best ANCOVA model of gene expression consistency based on a BIC model averaging model selection (R2 = 0.55; n = 5 637)
| Genetic features | Gene expression consistency | |||
|---|---|---|---|---|
| Coefficients | Sum of squares | |||
| Log (mean expression level) | 4.222 | 97,160 | 6 325.4 | <2 × 10−16 |
| Log (gene length) | 1.373 | 6690 | 435.6 | <2 × 10−16 |
| gbM | 0.457 | 180 | 11.7 | 6.34 ×10−4 |
| Expression breadth | 0.531 | 1538 | 100.1 | <2 ×10−16 |
| Gene duplicates retained from the α WGD | 0.513 | 312 | 20.4 | 6.61 ×10−6 |
Best ANCOVA model of log (intron FPKM) based on a BIC model averaging model selection (R2 = 0.731; n = 4376)
| Genetic features | Log (intron FPKM) | |||
|---|---|---|---|---|
| Coefficients | Sum of squares | |||
| Log (gene FPKM) | 1.56 | 9 974.6 | 9 751.4 | <2 × 10−16 |
| Log (gene length) | −0.83 | 909.7 | 889.3 | <2 × 10−16 |
| Log (total intron length) | 0.46 | 256.1 | 250.4 | <2 ×10−16 |
| Log (intron number) | 0.89 | 973.8 | 952 | <2 ×10−16 |
| gbM | −0.13 | 11.8 | 11.6 | 6.81 ×10−4 |
| Expression breadth | −0.08 | 29.5 | 28.9 | 8.17 × 10−8 |