Literature DB >> 27896980

DEEP MOTIF DASHBOARD: VISUALIZING AND UNDERSTANDING GENOMIC SEQUENCES USING DEEP NEURAL NETWORKS.

Jack Lanchantin1, Ritambhara Singh, Beilun Wang, Yanjun Qi.   

Abstract

Deep neural network (DNN) models have recently obtained state-of-the-art prediction accuracy for the transcription factor binding (TFBS) site classification task. However, it remains unclear how these approaches identify meaningful DNA sequence signals and give insights as to why TFs bind to certain locations. In this paper, we propose a toolkit called the Deep Motif Dashboard (DeMo Dashboard) which provides a suite of visualization strategies to extract motifs, or sequence patterns from deep neural network models for TFBS classification. We demonstrate how to visualize and understand three important DNN models: convolutional, recurrent, and convolutional-recurrent networks. Our first visualization method is finding a test sequence's saliency map which uses first-order derivatives to describe the importance of each nucleotide in making the final prediction. Second, considering recurrent models make predictions in a temporal manner (from one end of a TFBS sequence to the other), we introduce temporal output scores, indicating the prediction score of a model over time for a sequential input. Lastly, a class-specific visualization strategy finds the optimal input sequence for a given TFBS positive class via stochastic gradient optimization. Our experimental results indicate that a convolutional-recurrent architecture performs the best among the three architectures. The visualization techniques indicate that CNN-RNN makes predictions by modeling both motifs as well as dependencies among them.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 27896980      PMCID: PMC5787355          DOI: 10.1142/9789813207813_0025

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  15 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  An assessment of neural network and statistical approaches for prediction of E. coli promoter sites.

Authors:  P B Horton; M Kanehisa
Journal:  Nucleic Acids Res       Date:  1992-08-25       Impact factor: 16.971

3.  DeepChrome: deep-learning for predicting gene expression from histone modifications.

Authors:  Ritambhara Singh; Jack Lanchantin; Gabriel Robins; Yanjun Qi
Journal:  Bioinformatics       Date:  2016-09-01       Impact factor: 6.937

4.  Predicting effects of noncoding variants with deep learning-based sequence model.

Authors:  Jian Zhou; Olga G Troyanskaya
Journal:  Nat Methods       Date:  2015-08-24       Impact factor: 28.547

5.  MEME-ChIP: motif analysis of large DNA datasets.

Authors:  Philip Machanick; Timothy L Bailey
Journal:  Bioinformatics       Date:  2011-04-12       Impact factor: 6.937

6.  SeqGL Identifies Context-Dependent Binding Signals in Genome-Wide Regulatory Element Maps.

Authors:  Manu Setty; Christina S Leslie
Journal:  PLoS Comput Biol       Date:  2015-05-27       Impact factor: 4.475

7.  Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks.

Authors:  David R Kelley; Jasper Snoek; John L Rinn
Journal:  Genome Res       Date:  2016-05-03       Impact factor: 9.043

8.  DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.

Authors:  Daniel Quang; Xiaohui Xie
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

9.  An integrated encyclopedia of DNA elements in the human genome.

Authors: 
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

10.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Authors:  Anthony Mathelier; Oriol Fornes; David J Arenillas; Chih-Yu Chen; Grégoire Denay; Jessica Lee; Wenqiang Shi; Casper Shyr; Ge Tan; Rebecca Worsley-Hunt; Allen W Zhang; François Parcy; Boris Lenhard; Albin Sandelin; Wyeth W Wasserman
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  32 in total

1.  Modeling multi-species RNA modification through multi-task curriculum learning.

Authors:  Yuanpeng Xiong; Xuan He; Dan Zhao; Tingzhong Tian; Lixiang Hong; Tao Jiang; Jianyang Zeng
Journal:  Nucleic Acids Res       Date:  2021-04-19       Impact factor: 16.971

2.  Application of deep learning in genomics.

Authors:  Jianxiao Liu; Jiying Li; Hai Wang; Jianbing Yan
Journal:  Sci China Life Sci       Date:  2020-10-10       Impact factor: 6.038

Review 3.  Deep learning in pharmacogenomics: from gene regulation to patient stratification.

Authors:  Alexandr A Kalinin; Gerald A Higgins; Narathip Reamaroon; Sayedmohammadreza Soroushmehr; Ari Allyn-Feuer; Ivo D Dinov; Kayvan Najarian; Brian D Athey
Journal:  Pharmacogenomics       Date:  2018-04-26       Impact factor: 2.533

4.  Interpretable detection of novel human viruses from genome sequencing data.

Authors:  Jakub M Bartoszewicz; Anja Seidel; Bernhard Y Renard
Journal:  NAR Genom Bioinform       Date:  2021-02-01

5.  Deep learning for de-convolution of Smad2 versus Smad3 binding sites.

Authors:  Jeremy W K Ng; Esther H Q Ong; Lisa Tucker-Kellogg; Greg Tucker-Kellogg
Journal:  BMC Genomics       Date:  2022-07-20       Impact factor: 4.547

6.  Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin.

Authors:  Ritambhara Singh; Jack Lanchantin; Arshdeep Sekhon; Yanjun Qi
Journal:  Adv Neural Inf Process Syst       Date:  2017-12

Review 7.  A primer on deep learning in genomics.

Authors:  James Zou; Mikael Huss; Abubakar Abid; Pejman Mohammadi; Ali Torkamani; Amalio Telenti
Journal:  Nat Genet       Date:  2018-11-26       Impact factor: 38.330

8.  Deep neural networks identify sequence context features predictive of transcription factor binding.

Authors:  An Zheng; Michael Lamkin; Hanqing Zhao; Cynthia Wu; Hao Su; Melissa Gymrek
Journal:  Nat Mach Intell       Date:  2021-01-18

9.  Neural network modeling of differential binding between wild-type and mutant CTCF reveals putative binding preferences for zinc fingers 1-2.

Authors:  Irene M Kaplow; Abhimanyu Banerjee; Chuan Sheng Foo
Journal:  BMC Genomics       Date:  2022-04-12       Impact factor: 3.969

Review 10.  Learning the Regulatory Code of Gene Expression.

Authors:  Jan Zrimec; Filip Buric; Mariia Kokina; Victor Garcia; Aleksej Zelezniak
Journal:  Front Mol Biosci       Date:  2021-06-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.