| Literature DB >> 28056787 |
Hidehiro Toh1, Kenjiro Shirane1, Fumihito Miura2, Naoki Kubo1, Kenji Ichiyanagi1, Katsuhiko Hayashi3, Mitinori Saitou4, Mikita Suyama5, Takashi Ito2, Hiroyuki Sasaki6.
Abstract
BACKGROUND: Methylation of cytosine in genomic DNA is a well-characterized epigenetic modification involved in many cellular processes and diseases. Whole-genome bisulfite sequencing (WGBS), such as MethylC-seq and post-bisulfite adaptor tagging sequencing (PBAT-seq), uses the power of high-throughput DNA sequencers and provides genome-wide DNA methylation profiles at single-base resolution. However, the accuracy and consistency of WGBS outputs in relation to the operating conditions of high-throughput sequencers have not been explored.Entities:
Keywords: DNA methylation; HiSeq control software; Illumina HiSeq platform; Whole-genome bisulfite sequencing
Mesh:
Substances:
Year: 2017 PMID: 28056787 PMCID: PMC5217569 DOI: 10.1186/s12864-016-3392-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Different CpG methylation levels obtained from identical PBAT libraries using different HCS and RTA versions. a CpG methylation levels determined by single-end PBAT-seq. CpG methylation levels in 100 kb windows are shown as a box plot (left). Different proportions of G among the four bases in R1 obtained using different HCS and RTA versions are shown as a line plot (right). b Correlation between the CpG methylation levels determined using HCS v2.0.5, v2.0.10, and v2.0.12. CpG methylation values of 100-kb non-overlapping sliding windows across the autosomes are plotted with a linear regression line (red). c Differences between the CpG methylation levels determined using HCS v2.0.5 and v2.0.12 against the CpG density. CpG methylation values were calculated in 100-kb non-overlapping sliding windows across the autosomes. All 100 kb windows were grouped into nine classes according to the number of contained CpG. d CpG methylation levels determined by paired-end PBAT-seq (IMR-90). CpG methylation levels in 100 kb windows are shown as a box plot (left). Different proportions of G in R1 and C in R2 obtained using different HCS and RTA versions are shown as a line plot (right)
Fig. 2Observed versus predetermined CpG methylation levels of a series of mixture of unmethylated and in vitro methylated lambda DNAs. a The differences between the observed and predetermined CpG methylation levels are plotted against the predetermined CpG methylation levels for each HCS version. R1 and R2 data from paired-end PBAT-seq runs were separately analyzed. b Differences between the R1 and R2 CpG methylation levels are shown for each HCS version
Fig. 3Quality scores assigned to base G. a Quality scores assigned to Gs in the raw reads obtained using different HCS versions. All Gs were grouped into four classes according to the assigned quality score. b Examples of drops in quality score at Gs. Representative sequence reads from the PBAT-seq (IMR-90) and control (PhiX) data generated using HCS v2.0.12 are shown. Gs in the IMR-90 read are shown in red. c Quality scores assigned to the base representing 5mC (G or C) in R1 and R2 of the paired-end PBAT-seq and MethylC-seq using HCS v2.0.12. In the MethylC-seq, 50% w/w PhiX DNA was spiked in (accession no. DRA002280) [14]
Fig. 4Effect of cluster density on paired-end PBAT-seq (IMR-90). a Quality scores assigned to the four bases in R2 generated using HCS v2.2.38 at different cluster densities. The quality scores in R2 of PhiX control on the same flow cell are also shown. b Quality scores assigned to Cs in R2 generated using different HCS versions at different cluster densities. c Quality scores assigned to the four bases in R1 and R2 generated using the latest HCS v2.2.58 at a modest cluster density (483 K per mm2). The quality scores in R2 of PhiX control on the same flow cell are also shown