| Literature DB >> 26140213 |
Shengjie Gao1, Dan Zou2, Likai Mao3, Quan Zhou4, Wenlong Jia5, Yi Huang6, Shancen Zhao4, Gang Chen4, Song Wu6, Dongdong Li4, Fei Xia7, Huafeng Chen4, Maoshan Chen4, Torben F Ørntoft8, Lars Bolund9, Karina D Sørensen8.
Abstract
BACKGROUND: DNA methylation has important roles in the regulation of gene expression and cellular specification. Reduced representation bisulfite sequencing (RRBS) has prevailed in methylation studies due to its cost-effectiveness and single-base resolution. The rapid accumulation of RRBS data demands well designed analytical tools.Entities:
Keywords: Allele-specific DNA methylation (ASM); Differentially methylated region (DMR); Reduced representation bisulfite sequencing (RRBS)
Mesh:
Substances:
Year: 2015 PMID: 26140213 PMCID: PMC4488126 DOI: 10.1186/s13742-015-0070-9
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Fig. 1Brief workflow of SMAP. SMAP filters sequenced raw data and produces ‘clean data’. Clean data is then mapped onto the pretreated reference genome. Finally, SNPs, ASM, DMRs and DMCs are detected and reports created. Existing tools are in green
Fig. 2Effects of bias correction in overlapping regions of PE reads (schematic). Me: Methylated site. a The overlapping region of a pair of PE reads. The C bases in position a and c are not in overlapping regions. However, the C base in position b is in an overlapping region of the pair of reads. b A case assuming methylated reads have the same proportion as unmethylated reads (i.e., methylation rate is 50 %). c The upper panel shows a case assuming that unmethylated samples were sequenced by PE sequencing and methylated samples were sequenced by SE sequencing. The lower panel shows a case in which unmethylated samples were sequenced by SE sequencing whereas methylated samples were sequenced by PE sequencing
Fig. 3ASM detection. Purple Cs are methylated, whereas red Ts are not methylated. Me: Methylated site. a Basic case in which two C bases are methylated. b An example of an ASM region in a monoclonal tumor marked by a heterozygous G/T SNP. c An example of a polyclonal tumor in which the heterozygous SNP and reference allele are present. d An example of another type of polyclonal tumor in which the heterozygous SNP was changed to a homozygous G allele due to loss of heterozygosity
Assessment of the performance of ASM
| PE90 | PE50 | |||||
|---|---|---|---|---|---|---|
| BSMAP | Bismark | Bowtie2 | BSMAP | Bismark | Bowtie2 | |
| Number of simulated ASM SNPs | 158 | 108 | ||||
| Number of simulated ASM events | 1022 | 473 | ||||
| Number of estimated ASM SNPs | 136 | 93 | 55 | 72 | 79 | 30 |
| Number of estimated ASM events | 995 | 726 | 254 | 355 | 386 | 74 |
| Number of validated estimated ASM SNPs | 125 | 89 | 51 | 64 | 72 | 29 |
| Number of validated estimated ASM events | 843 | 719 | 239 | 338 | 372 | 73 |
| FPR (ASM events) | 0.15 | 0.01 | 0.06 | 0.05 | 0.04 | 0.01 |
| FNR (ASM events) | 0.18 | 0.3 | 0.77 | 0.29 | 0.21 | 0.85 |
Abbreviations: ASM allele-specific, DNA methylation, FNR false negative rate, FPR false positive rate, PE50 50 bp read PE data, PE90 90 bp read PE data, SNP single nucleotide polymorphism