FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data.

Daniel Quang, Xiaohui Xie.

Abstract

Due to the large numbers of transcription factors (TFs) and cell types, querying binding profiles of all valid TF/cell type pairs is not experimentally feasible. To address this issue, we developed a convolutional-recurrent neural network model, called FactorNet, to computationally impute the missing binding data. FactorNet trains on binding data from reference cell types to make predictions on testing cell types by leveraging a variety of features, including genomic sequences, genome annotations, gene expression, and signal data, such as DNase I cleavage. FactorNet implements several convenient strategies to reduce runtime and memory consumption. By visualizing the neural network models, we can interpret how the model predicts binding. We also investigate the variables that affect cross-cell type accuracy, and offer suggestions to improve upon this field. Our method ranked among the top teams in the ENCODE-DREAM in vivo Transcription Factor Binding Site Prediction Challenge, achieving first place on six of the 13 final round evaluation TF/cell type pairs, the most of any competing team. The FactorNet source code is publicly available, allowing users to reproduce our methodology from the ENCODE-DREAM Challenge.
Copyright © 2019 Elsevier Inc. All rights reserved.
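
The abstract says the model combines genomic sequence with per-position signal data such as DNase I cleavage. One common way to present such inputs to a convolutional-recurrent network is to one-hot encode the sequence and append the signal as an extra channel. The following is a minimal sketch of that idea only; the function names (`one_hot`, `build_input`), the A/C/G/T channel order, and the five-channel layout are illustrative assumptions and are not taken from the FactorNet source code.

```python
# Illustrative sketch only: names and channel layout are assumptions,
# not the FactorNet implementation.

ALPHABET = "ACGT"  # assumed channel order


def one_hot(seq):
    """Encode a DNA string as rows of 4 floats (A, C, G, T).

    Bases outside the alphabet (for example N) become all-zero rows.
    """
    return [[1.0 if base == b else 0.0 for b in ALPHABET] for base in seq.upper()]


def build_input(seq, signal):
    """Append a per-base signal value (e.g. a DNase I cleavage track)
    to each one-hot row, giving 5 channels per position."""
    if len(seq) != len(signal):
        raise ValueError("sequence and signal must have the same length")
    return [row + [float(s)] for row, s in zip(one_hot(seq), signal)]


if __name__ == "__main__":
    # Toy values, made up for illustration.
    features = build_input("ACGN", [0.0, 1.5, 2.0, 0.5])
    for row in features:
        print(row)
```

Each position then carries both sequence identity and accessibility signal, so a convolutional layer can scan the two together.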

Keywords:  DREAM; Deep learning; ENCODE; Transcription factors

Year: 2019    PMID: 30922998    PMCID: PMC6708499    DOI: 10.1016/j.ymeth.2019.03.020

Source DB: PubMed    Journal: Methods    ISSN: 1046-2023    Impact factor: 3.608


