Literature DB >> 28505334

DIRECTION: a machine learning framework for predicting and characterizing DNA methylation and hydroxymethylation in mammalian genomes.

Milos Pavlovic1, Pradipta Ray1,2, Kristina Pavlovic1, Aaron Kotamarti1, Min Chen3,4, Michael Q Zhang1,5.   

Abstract

MOTIVATION: 5-Methylcytosine and 5-Hydroxymethylcytosine in DNA are major epigenetic modifications known to significantly alter mammalian gene expression. High-throughput assays to detect these modifications are expensive, labor-intensive, unfeasible in some contexts and leave a portion of the genome unqueried. Hence, we devised a novel, supervised, integrative learning framework to perform whole-genome methylation and hydroxymethylation predictions in CpG dinucleotides. Our framework can also perform imputation of missing or low quality data in existing sequencing datasets. Additionally, we developed infrastructure to perform in silico, high-throughput hypotheses testing on such predicted methylation or hydroxymethylation maps.
RESULTS: We test our approach on H1 human embryonic stem cells and H1-derived neural progenitor cells. Our predictive model is comparable in accuracy to other state-of-the-art DNA methylation prediction algorithms. We are the first to predict hydroxymethylation in silico with high whole-genome accuracy, paving the way for large-scale reconstruction of hydroxymethylation maps in mammalian model systems. We designed a novel, beam-search driven feature selection algorithm to identify the most discriminative predictor variables, and developed a platform for performing integrative analysis and reconstruction of the epigenome. Our toolkit DIRECTION provides predictions at single nucleotide resolution and identifies relevant features based on resource availability. This offers enhanced biological interpretability of results potentially leading to a better understanding of epigenetic gene regulation.
AVAILABILITY AND IMPLEMENTATION: http://www.pradiptaray.com/direction, under CC-by-SA license. CONTACTS: pradiptaray@gmail.com or mchen@utdallas.edu or michael.zhang@utdallas.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28505334      PMCID: PMC5870843          DOI: 10.1093/bioinformatics/btx316

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  41 in total

1.  Base-resolution analysis of 5-hydroxymethylcytosine in the mammalian genome.

Authors:  Miao Yu; Gary C Hon; Keith E Szulwach; Chun-Xiao Song; Liang Zhang; Audrey Kim; Xuekun Li; Qing Dai; Yin Shen; Beomseok Park; Jung-Hyun Min; Peng Jin; Bing Ren; Chuan He
Journal:  Cell       Date:  2012-05-17       Impact factor: 41.582

2.  Computational prediction of methylation status in human genomic sequences.

Authors:  Rajdeep Das; Nevenka Dimitrova; Zhenyu Xuan; Robert A Rollins; Fatemah Haghighi; John R Edwards; Jingyue Ju; Timothy H Bestor; Michael Q Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  2006-07-03       Impact factor: 11.205

3.  The length of CpG islands is associated with the distribution of Alu and L1 retroelements.

Authors:  Moo-Il Kang; Mun-Gan Rhyu; Young-Ho Kim; Yu-Chae Jung; Seung-Jin Hong; Chul-Soo Cho; Hye-Soo Kim
Journal:  Genomics       Date:  2006-02-20       Impact factor: 5.736

Review 4.  Mammalian cytosine methylation at a glance.

Authors:  Steen K T Ooi; Anne H O'Donnell; Timothy H Bestor
Journal:  J Cell Sci       Date:  2009-08-15       Impact factor: 5.285

5.  Large-scale imputation of epigenomic datasets for systematic annotation of diverse human tissues.

Authors:  Jason Ernst; Manolis Kellis
Journal:  Nat Biotechnol       Date:  2015-02-18       Impact factor: 54.908

6.  Protocadherin 17 regulates presynaptic assembly in topographic corticobasal Ganglia circuits.

Authors:  Naosuke Hoshina; Asami Tanimura; Miwako Yamasaki; Takeshi Inoue; Ryoji Fukabori; Teiko Kuroda; Kazumasa Yokoyama; Tohru Tezuka; Hiroshi Sagara; Shinji Hirano; Hiroshi Kiyonari; Masahiko Takada; Kazuto Kobayashi; Masahiko Watanabe; Masanobu Kano; Takanobu Nakazawa; Tadashi Yamamoto
Journal:  Neuron       Date:  2013-05-16       Impact factor: 17.173

7.  Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications.

Authors:  Felix Krueger; Simon R Andrews
Journal:  Bioinformatics       Date:  2011-04-14       Impact factor: 6.937

8.  Predicting genome-wide DNA methylation using methylation marks, genomic position, and DNA regulatory elements.

Authors:  Weiwei Zhang; Tim D Spector; Panos Deloukas; Jordana T Bell; Barbara E Engelhardt
Journal:  Genome Biol       Date:  2015-01-24       Impact factor: 13.583

9.  5-hmC in the brain is abundant in synaptic genes and shows differences at the exon-intron boundary.

Authors:  Tarang Khare; Shraddha Pai; Karolis Koncevicius; Mrinal Pal; Edita Kriukiene; Zita Liutkeviciute; Manuel Irimia; Peixin Jia; Carolyn Ptak; Menghang Xia; Raymond Tice; Mamoru Tochigi; Solange Moréra; Anaies Nazarians; Denise Belsham; Albert H C Wong; Benjamin J Blencowe; Sun Chong Wang; Philipp Kapranov; Rafal Kustra; Viviane Labrie; Saulius Klimasauskas; Arturas Petronis
Journal:  Nat Struct Mol Biol       Date:  2012-09-09       Impact factor: 15.369

10.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

View more
  6 in total

1.  SRHiC: A Deep Learning Model to Enhance the Resolution of Hi-C Data.

Authors:  Zhilan Li; Zhiming Dai
Journal:  Front Genet       Date:  2020-04-08       Impact factor: 4.599

2.  A Novel Computational Method for Detecting DNA Methylation Sites with DNA Sequence Information and Physicochemical Properties.

Authors:  Gaofeng Pan; Limin Jiang; Jijun Tang; Fei Guo
Journal:  Int J Mol Sci       Date:  2018-02-08       Impact factor: 5.923

Review 3.  A Review of Matched-pairs Feature Selection Methods for Gene Expression Data Analysis.

Authors:  Sen Liang; Anjun Ma; Sen Yang; Yan Wang; Qin Ma
Journal:  Comput Struct Biotechnol J       Date:  2018-02-25       Impact factor: 7.271

4.  Combining DNA methylation and RNA sequencing data of cancer for supervised knowledge extraction.

Authors:  Eleonora Cappelli; Giovanni Felici; Emanuel Weitschek
Journal:  BioData Min       Date:  2018-10-25       Impact factor: 2.522

5.  SDM6A: A Web-Based Integrative Machine-Learning Framework for Predicting 6mA Sites in the Rice Genome.

Authors:  Shaherin Basith; Balachandran Manavalan; Tae Hwan Shin; Gwang Lee
Journal:  Mol Ther Nucleic Acids       Date:  2019-08-16       Impact factor: 8.886

6.  DeepH&M: Estimating single-CpG hydroxymethylation and methylation levels from enrichment and restriction enzyme sequencing methods.

Authors:  Yu He; Hyo Sik Jang; Xiaoyun Xing; Daofeng Li; Michael J Vasek; Joseph D Dougherty; Ting Wang
Journal:  Sci Adv       Date:  2020-07-01       Impact factor: 14.136

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.