Weilong Guo1, Ping Zhu2,3, Matteo Pellegrini4, Michael Q Zhang5,6, Xiangfeng Wang7, Zhongfu Ni1. 1. State Key Laboratory for Agrobiotechnology, Key Laboratory of Crop Heterosis and Utilization, Beijing Key Laboratory of Crop Genetic Improvement, China Agricultural University, Beijing 100193, China. 2. State Key Laboratory of Experimental Hematology, Institute of Hematology and Blood Disease Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Tianjin 300020, China. 3. BIOPIC, Peking-Tsinghua Center for Life Sciences, College of Life Sciences, Peking University, Beijing 100871, China. 4. Department of Molecular, Cell and Developmental Biology, University of California, Los Angeles, CA 90095, USA. 5. Department of Molecular and Cell Biology, Center for Systems Biology, The University of Texas at Dallas, Richardson, TX 75080, USA. 6. Bioinformatics Division and Center for Synthetic & Systems Biology, TNLIST, Tsinghua University, Beijing 100084, China. 7. Beijing Advanced Innovation Center for Food Nutrition and Human health, China Agricultural University, Beijing 100193, China.
Abstract
Motivation: DNA methylation is important for gene silencing and imprinting in both plants and animals. Recent advances in bisulfite sequencing allow detection of single nucleotide variations (SNVs) achieving high sensitivity, but accurately identifying heterozygous SNVs from partially C-to-T converted sequences remains challenging. Results: We designed two methods, BayesWC and BinomWC, that substantially improved the precision of heterozygous SNV calls from ∼80% to 99% while retaining comparable recalls. With these SNV calls, we provided functions for allele-specific DNA methylation (ASM) analysis and visualizing the methylation status on reads. Applying ASM analysis to a previous dataset, we found that an average of 1.5% of investigated regions showed allelic methylation, which were significantly enriched in transposon elements and likely to be shared by the same cell-type. A dynamic fragment strategy was utilized for DMR analysis in low-coverage data and was able to find differentially methylated regions (DMRs) related to key genes involved in tumorigenesis using a public cancer dataset. Finally, we integrated 40 applications into the software package CGmapTools to analyze DNA methylomes. This package uses CGmap as the format interface, and designs binary formats to reduce the file size and support fast data retrieval, and can be applied for context-wise, gene-wise, bin-wise, region-wise and sample-wise analyses and visualizations. Availability and implementation: The CGmapTools software is freely available at https://cgmaptools.github.io/. Contact: guoweilong@cau.edu.cn. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: DNA methylation is important for gene silencing and imprinting in both plants and animals. Recent advances in bisulfite sequencing allow detection of single nucleotide variations (SNVs) achieving high sensitivity, but accurately identifying heterozygous SNVs from partially C-to-T converted sequences remains challenging. Results: We designed two methods, BayesWC and BinomWC, that substantially improved the precision of heterozygous SNV calls from ∼80% to 99% while retaining comparable recalls. With these SNV calls, we provided functions for allele-specific DNA methylation (ASM) analysis and visualizing the methylation status on reads. Applying ASM analysis to a previous dataset, we found that an average of 1.5% of investigated regions showed allelic methylation, which were significantly enriched in transposon elements and likely to be shared by the same cell-type. A dynamic fragment strategy was utilized for DMR analysis in low-coverage data and was able to find differentially methylated regions (DMRs) related to key genes involved in tumorigenesis using a public cancer dataset. Finally, we integrated 40 applications into the software package CGmapTools to analyze DNA methylomes. This package uses CGmap as the format interface, and designs binary formats to reduce the file size and support fast data retrieval, and can be applied for context-wise, gene-wise, bin-wise, region-wise and sample-wise analyses and visualizations. Availability and implementation: The CGmapTools software is freely available at https://cgmaptools.github.io/. Contact: guoweilong@cau.edu.cn. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Justin Chen; Maria K Haanpää; Joshua J Gruber; Natalie Jäger; James M Ford; Michael P Snyder Journal: Clin Cancer Res Date: 2019-06-07 Impact factor: 12.531
Authors: Vitor Onuchic; Eugene Lurie; Ivenise Carrero; Piotr Pawliczek; Ronak Y Patel; Joel Rozowsky; Timur Galeev; Zhuoyi Huang; Robert C Altshuler; Zhizhuo Zhang; R Alan Harris; Cristian Coarfa; Lillian Ashmore; Jessica W Bertol; Walid D Fakhouri; Fuli Yu; Manolis Kellis; Mark Gerstein; Aleksandar Milosavljevic Journal: Science Date: 2018-08-23 Impact factor: 47.728
Authors: Daniel Hormaechea-Agulla; Katie A Matatall; Duy T Le; Bailee Kain; Xiaochen Long; Pawel Kus; Roman Jaksik; Grant A Challen; Marek Kimmel; Katherine Y King Journal: Cell Stem Cell Date: 2021-03-19 Impact factor: 25.269
Authors: Matthew B Hufford; Arun S Seetharam; Margaret R Woodhouse; Kapeel M Chougule; Shujun Ou; Jianing Liu; William A Ricci; Tingting Guo; Andrew Olson; Yinjie Qiu; Rafael Della Coletta; Silas Tittes; Asher I Hudson; Alexandre P Marand; Sharon Wei; Zhenyuan Lu; Bo Wang; Marcela K Tello-Ruiz; Rebecca D Piri; Na Wang; Dong Won Kim; Yibing Zeng; Christine H O'Connor; Xianran Li; Amanda M Gilbert; Erin Baggs; Ksenia V Krasileva; John L Portwood; Ethalinda K S Cannon; Carson M Andorf; Nancy Manchanda; Samantha J Snodgrass; David E Hufnagel; Qiuhan Jiang; Sarah Pedersen; Michael L Syring; David A Kudrna; Victor Llaca; Kevin Fengler; Robert J Schmitz; Jeffrey Ross-Ibarra; Jianming Yu; Jonathan I Gent; Candice N Hirsch; Doreen Ware; R Kelly Dawe Journal: Science Date: 2021-08-06 Impact factor: 47.728
Authors: Kimberly M Davenport; Alisha T Massa; Suraj Bhattarai; Stephanie D McKay; Michelle R Mousel; Maria K Herndon; Stephen N White; Noelle E Cockett; Timothy P L Smith; Brenda M Murdoch Journal: Front Genet Date: 2021-05-20 Impact factor: 4.599