| Literature DB >> 26681520 |
Adam J Bewick1, Brigitte T Hofmeister2, Kevin Lee1, Xiaoyu Zhang3, David W Hall1, Robert J Schmitz4.
Abstract
We describe a suite of predictive models, coined FAST(m)C, for nonreference, cost-effective exploration and comparative analysis of context-specific DNA methylation levels. Accurate estimations of true DNA methylation levels can be obtained from as few as several thousand short-reads generated from whole-genome bisulfite sequencing. These models make high-resolution time course or developmental and large diversity studies practical regardless of species, genome size, and availability of a reference genome.Entities:
Keywords: DNA methylation; epigenetics; methylome; modeling; whole-genome bisulfite sequencing
Mesh:
Year: 2015 PMID: 26681520 PMCID: PMC4751562 DOI: 10.1534/g3.115.025668
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Detection of intraspecific DNA methylation levels by FAST. (A) Linear models (LMs) for estimated levels of methylation, i.e., , (Y-axis) vs. actual levels (X-axis) determined by reference mapping of WGBS reads. Estimated levels of methylation were based on 10,000 random WGBS reads. DNA methylation differences between A. thaliana mutants, mouse mutants/cell-types/tissues, and increasing CH methylation throughout brain development are captured with FAST. Shaded area represents the 95% confidence interval. (B) Environmental (temperature) effects on CHH DNA methylation in A. thaliana is also recapitulated using FAST. Left panel (“FAST”) represents FAST methylation estimates for individual lines based solely on the WGBS reads using the “plant” model from http://fastmc.genetics.uga.edu. Right panel (“True”) represents the methylation values from standard WGBS read alignment to the A. thaliana reference genome. Red lines are averages of all lines. Data from Dubin .
Figure 2Detection of interspecific DNA methylation levels by FAST. Linear models (LMs) for estimator () vs. target (m) CpG, CHG, CHH, and CH DNA methylation levels using 10,000 reads corrected for estimated GC content ().