| Literature DB >> 32471342 |
Chin-Sheng Teng1,2, Bing-Heng Wu1,3, Ming-Ren Yen1, Pao-Yang Chen4.
Abstract
BACKGROUND: DNA methylation is a major epigenetic modification involved in regulating gene expression. The effects of DNA methylation on gene expression differ by genomic location and vary across kingdoms, species and environmental conditions. To identify the functional regulatory roles of DNA methylation, the correlation between DNA methylation changes and alterations in gene expression is crucial. With the advance of next-generation sequencing, genome-wide methylation and gene expression profiling have become feasible. Current bioinformatics tools for investigating such correlation are designed to the assessment of DNA methylation at CG sites. The correlation of non-CG methylation and gene expression is very limited. Some bioinformatics databases allow correlation analysis, but they are limited to specific genomes such as that of humans and do not allow user-provided data.Entities:
Keywords: Bioinformatics; Correlation; DNA methylation; Epigenome; Gene expression; Next-generation sequencing; Web server
Mesh:
Year: 2020 PMID: 32471342 PMCID: PMC7257144 DOI: 10.1186/s12864-020-6722-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Schematic diagram of MethGET. The diagram shows the inputs and outputs of single-methylome analyses and multiple-methylome analyses
Fig. 2Correlation analyses of genome-wide DNA methylation and gene expression (human data). a Scatterplot of promoter methylation levels (y-axis) and gene expression values (x-axis). The correlation coefficient (R) and p-value (P) are provided in the top right corner of the plot. b The 2D kernel density plot of (a)
Fig. 3Ordinal association analyses with genes ranked by gene expression level (human data). Scatterplot and fitting curves of DNA methylation and relative gene expression. a Promoter methylation and b gene body methylation. The grey line in the plot separates genes into unexpressed genes on the left side (gene expression value = 0) and expressed genes on the right side (gene expression value > 0)
Fig. 4Distribution of DNA methylation by groups of genes with different expression levels (Arabidopsis data). a The boxplot shows the gene body methylation pattern in 10 different gene expression groups. b Violin plot of (a) with five expression groups
Fig. 5Average methylation level profiling according to different expression groups around genes (Arabidopsis data). a The “region” plot shows the DNA methylation pattern around the gene body region. b The “site” plot shows the methylation pattern around the transcription start site (TSS)
Fig. 6Multiple-methylome analyses (Arabidopsis mutant (Group a) vs. wild type (Group b)). a Gene-level associations between DNA methylation changes and gene expression changes. The red dots represent differential genes of DNA methylation and gene expression (bi-variate Gaussian mixture model; p-value < 10− 6). b Visualization of DNA methylation and gene expression data together
The processing time of Arabidopsis, human, rice, and wheat in MethGET
| Arabidopsis | Rice | Human | Wheat | |
|---|---|---|---|---|
| Genome size (Mb) | 135 | 380 | 3200 | 14,500 |
| Gene number | 27,655 | 39,045 | 20,805 | 107,891 |
| Processing time without metagene analyses (hrs:mins:secs) | 00:32:51 | 01:20:42 | 03:47:11 | 06:38:14 |
| Processing time with metagene analyses (hrs:mins:secs) | 04:21:15 | 07:50:36 | 09:47:52 | 18:31:14 |
The tests are on Intel Xeon E5–2650 processor (384GB RAM; clock speed 2.0GHz)
Fig. 7Demonstration of the application of MethGET with rice data. a The CHH methylation level trend increased with the increase in gene expression in the promoter region. b Most differential genes showed decreases in both CHH methylation and gene expression in gene body regions. The number of differential genes is shown in the four quadrants of the plot
Feature comparison of MethGET with other tools
| MethGET | COHCAP | PiiL | ViewBS | MethHC | iMETHYL | |
|---|---|---|---|---|---|---|
| Inputs with methylation and expression data | ○ | ○ | ○ | X (Using gene region files with different expression values) | No input | No input |
| Customized analyses | ○ | ○ | ○ | ○ | X | X |
| Web interface | ○ | X | X | X | ○ | ○ |
| Multiple-methylome analyses | ○ | ○ | ○ | ○ | ○ | X |
| Supported species | Any species with annotated genome | Species with CpG island annotation | Species with pathway annotation | Species with region file | ||
| Sequence context of cytosine methylation | CG, CHG, CHH | CG | CG | CG, CHG, CHH | CG | CG |
| Target regions | Genome-wide | Selected CpG islands | Genes in the pathway | Genome-wide | Selected genes | Selected genes or regions |
| Genomic locations | Gene body, promoter, exon, intron, TE | CpG islands | Annotated CpG sites | Regions in region files | 8 Gene regions+, 5 CpG Island regions | Only selected cytosines |
| Correlation analyses | 6 analyses | 1 (Scatterplot) | Visualization | 1 (Metagene plot) | 1 (Scatterplot) | Visualization |
| Citation | Teng et al. (2020) | Warden et al. (2013) [ | Moghadam et al. (2017) [ | Huang et al. (2018) [ | Huang et al. (2015) [ | Komaki et al. (2018) [ |