Literature DB >> 31672653

Decoding regulatory structures and features from epigenomics profiles: A Roadmap-ENCODE Variational Auto-Encoder (RE-VAE) model.

Ruifeng Hu1, Guangsheng Pei1, Peilin Jia2, Zhongming Zhao3.   

Abstract

The development of chromatin immunoprecipitation (ChIP) with massively parallel DNA sequencing (ChIP-seq) technologies has promoted generation of large-scale epigenomics data, providing us unprecedented opportunities to explore the landscape of epigenomic profiles at scales across both histone marks and tissue types. In addition to many tools directly for data analysis, advanced computational approaches, such as deep learning, have recently become promising to deeply mine the data structures and identify important regulators from complex functional genomics data. We implemented a neural network framework, a Variational Auto-Encoder (VAE) model, to explore the epigenomic data from the Roadmap Epigenomics Project and the Encyclopedia of DNA Elements (ENCODE) project. Our model is applied to 935 reference samples, covering 28 tissues and 12 histone marks. We used the enhancer and promoter regions as the annotation features and ChIP-seq signal values in these regions as the feature values. Through a parameter sweep process, we identified the suitable hyperparameter values and built a VAE model to represent the epigenomics data and to further explore the biological regulation. The resultant Roadmap-ENCODE VAE (RE-VAE) model contained data compression and feature representation. Using the compressed data in the latent space, we found that the majority of histone marks were well clustered but not for tissues or cell types. Tissue or cell specificity was observed only in some histone marks (e.g., H3K4me3 and H3K27ac) and could be characterized when the number of tissue samples is large (e.g., blood and brain). In blood, the contributive regions and genes identified by RE-VAE model were confirmed by tissue-specificity enrichment analysis with an independent tissue expression panel. Finally, we demonstrated that RE-VAE model could detect cancer cell lines with similar epigenomics profiles. In conclusion, we introduced and implemented a VAE model to represent large-scale epigenomics data. The model could be used to explore classifications of histone modifications and tissue/cell specificity and to classify new data with unknown sources.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Epigenomic; Histone mark; Roadmap Epigenomics; Tissue specificity; Variational Auto-Encoder

Mesh:

Year:  2019        PMID: 31672653      PMCID: PMC7431277          DOI: 10.1016/j.ymeth.2019.10.012

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  27 in total

Review 1.  Regulation of chromatin by histone modifications.

Authors:  Andrew J Bannister; Tony Kouzarides
Journal:  Cell Res       Date:  2011-02-15       Impact factor: 25.617

2.  deTS: tissue-specific enrichment analysis to decode tissue specificity.

Authors:  Guangsheng Pei; Yulin Dai; Zhongming Zhao; Peilin Jia
Journal:  Bioinformatics       Date:  2019-10-01       Impact factor: 6.937

3.  An atlas of active enhancers across human cell types and tissues.

Authors:  Robin Andersson; Claudia Gebhard; Michael Rehli; Albin Sandelin; Irene Miguel-Escalada; Ilka Hoof; Jette Bornholdt; Mette Boyd; Yun Chen; Xiaobei Zhao; Christian Schmidl; Takahiro Suzuki; Evgenia Ntini; Erik Arner; Eivind Valen; Kang Li; Lucia Schwarzfischer; Dagmar Glatz; Johanna Raithel; Berit Lilje; Nicolas Rapin; Frederik Otzen Bagger; Mette Jørgensen; Peter Refsing Andersen; Nicolas Bertin; Owen Rackham; A Maxwell Burroughs; J Kenneth Baillie; Yuri Ishizu; Yuri Shimizu; Erina Furuhata; Shiori Maeda; Yutaka Negishi; Christopher J Mungall; Terrence F Meehan; Timo Lassmann; Masayoshi Itoh; Hideya Kawaji; Naoto Kondo; Jun Kawai; Andreas Lennartsson; Carsten O Daub; Peter Heutink; David A Hume; Torben Heick Jensen; Harukazu Suzuki; Yoshihide Hayashizaki; Ferenc Müller; Alistair R R Forrest; Piero Carninci
Journal:  Nature       Date:  2014-03-27       Impact factor: 49.962

4.  Chromatin state signatures associated with tissue-specific gene expression and enhancer activity in the embryonic limb.

Authors:  Justin Cotney; Jing Leng; Sunghee Oh; Laura E DeMare; Steven K Reilly; Mark B Gerstein; James P Noonan
Journal:  Genome Res       Date:  2012-03-15       Impact factor: 9.043

5.  Intermediate DNA methylation is a conserved signature of genome regulation.

Authors:  GiNell Elliott; Chibo Hong; Xiaoyun Xing; Xin Zhou; Daofeng Li; Cristian Coarfa; Robert J A Bell; Cecile L Maire; Keith L Ligon; Mahvash Sigaroudinia; Philippe Gascard; Thea D Tlsty; R Alan Harris; Leonard C Schalkwyk; Misha Bilenky; Jonathan Mill; Peggy J Farnham; Manolis Kellis; Marco A Marra; Aleksandar Milosavljevic; Martin Hirst; Gary D Stormo; Ting Wang; Joseph F Costello
Journal:  Nat Commun       Date:  2015-02-18       Impact factor: 14.919

6.  Integrative analysis of haplotype-resolved epigenomes across human tissues.

Authors:  Danny Leung; Inkyung Jung; Nisha Rajagopal; Anthony Schmitt; Siddarth Selvaraj; Ah Young Lee; Chia-An Yen; Shin Lin; Yiing Lin; Yunjiang Qiu; Wei Xie; Feng Yue; Manoj Hariharan; Pradipta Ray; Samantha Kuan; Lee Edsall; Hongbo Yang; Neil C Chi; Michael Q Zhang; Joseph R Ecker; Bing Ren
Journal:  Nature       Date:  2015-02-19       Impact factor: 49.962

7.  Conserved epigenomic signals in mice and humans reveal immune basis of Alzheimer's disease.

Authors:  Elizabeta Gjoneska; Andreas R Pfenning; Hansruedi Mathys; Gerald Quon; Anshul Kundaje; Li-Huei Tsai; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 49.962

8.  The ensembl regulatory build.

Authors:  Daniel R Zerbino; Steven P Wilder; Nathan Johnson; Thomas Juettemann; Paul R Flicek
Journal:  Genome Biol       Date:  2015-03-24       Impact factor: 13.583

9.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

10.  JQ1 affects BRD2-dependent and independent transcription regulation without disrupting H4-hyperacetylated chromatin states.

Authors:  Lusy Handoko; Bogumil Kaczkowski; Chung-Chau Hon; Marina Lizio; Masatoshi Wakamori; Takayoshi Matsuda; Takuhiro Ito; Prashanti Jeyamohan; Yuko Sato; Kensaku Sakamoto; Shigeyuki Yokoyama; Hiroshi Kimura; Aki Minoda; Takashi Umehara
Journal:  Epigenetics       Date:  2018-08-06       Impact factor: 4.528

View more
  6 in total

1.  Deep4mC: systematic assessment and computational prediction for DNA N4-methylcytosine sites by deep learning.

Authors:  Haodong Xu; Peilin Jia; Zhongming Zhao
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

2.  Predicting regulatory variants using a dense epigenomic mapped CNN model elucidated the molecular basis of trait-tissue associations.

Authors:  Guangsheng Pei; Ruifeng Hu; Yulin Dai; Astrid Marilyn Manuel; Zhongming Zhao; Peilin Jia
Journal:  Nucleic Acids Res       Date:  2021-01-11       Impact factor: 16.971

Review 3.  Machine Learning in Epigenomics: Insights into Cancer Biology and Medicine.

Authors:  Emre Arslan; Jonathan Schulz; Kunal Rai
Journal:  Biochim Biophys Acta Rev Cancer       Date:  2021-07-07       Impact factor: 10.680

4.  Variational autoencoding of gene landscapes during mouse CNS development uncovers layered roles of Polycomb Repressor Complex 2.

Authors:  Ariane Mora; Jonathan Rakar; Ignacio Monedero Cobeta; Behzad Yaghmaeian Salmani; Annika Starkenberg; Stefan Thor; Mikael Bodén
Journal:  Nucleic Acids Res       Date:  2022-02-22       Impact factor: 16.971

5.  DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner.

Authors:  Guangsheng Pei; Ruifeng Hu; Peilin Jia; Zhongming Zhao
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

6.  Integrative computational epigenomics to build data-driven gene regulation hypotheses.

Authors:  Tyrone Chen; Sonika Tyagi
Journal:  Gigascience       Date:  2020-06-01       Impact factor: 6.524

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.