Literature DB >> 34322657

Improving representations of genomic sequence motifs in convolutional networks with exponential activations.

Peter K Koo1, Matt Ploenzke2.   

Abstract

Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to build representations in a distributed manner, making it a challenge to extract learned features that are biologically meaningful, such as sequence motifs. Here we perform a comprehensive analysis on synthetic sequences to investigate the role that CNN activations have on model interpretability. We show that employing an exponential activation to first layer filters consistently leads to interpretable and robust representations of motifs compared to other commonly used activations. Strikingly, we demonstrate that CNNs with better test performance do not necessarily imply more interpretable representations with attribution methods. We find that CNNs with exponential activations significantly improve the efficacy of recovering biologically meaningful representations with attribution methods. We demonstrate these results generalise to real DNA sequences across several in vivo datasets. Together, this work demonstrates how a small modification to existing CNNs, i.e. setting exponential activations in the first layer, can significantly improve the robustness and interpretabilty of learned representations directly in convolutional filters and indirectly with attribution methods.

Year:  2021        PMID: 34322657      PMCID: PMC8315445          DOI: 10.1038/s42256-020-00291-x

Source DB:  PubMed          Journal:  Nat Mach Intell        ISSN: 2522-5839


  27 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E. coli.

Authors:  G D Stormo; T D Schneider; L Gold; A Ehrenfeucht
Journal:  Nucleic Acids Res       Date:  1982-05-11       Impact factor: 16.971

Review 3.  Transcription factor-DNA binding: beyond binding site motifs.

Authors:  Sachi Inukai; Kian Hong Kock; Martha L Bulyk
Journal:  Curr Opin Genet Dev       Date:  2017-03-27       Impact factor: 5.578

4.  FIMO: scanning for occurrences of a given motif.

Authors:  Charles E Grant; Timothy L Bailey; William Stafford Noble
Journal:  Bioinformatics       Date:  2011-02-16       Impact factor: 6.937

5.  Predicting Splicing from Primary Sequence with Deep Learning.

Authors:  Kishore Jaganathan; Sofia Kyriazopoulou Panagiotopoulou; Jeremy F McRae; Siavash Fazel Darbandi; David Knowles; Yang I Li; Jack A Kosmicki; Juan Arbelaez; Wenwu Cui; Grace B Schwartz; Eric D Chow; Efstathios Kanterakis; Hong Gao; Amirali Kia; Serafim Batzoglou; Stephan J Sanders; Kyle Kai-How Farh
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

6.  Global importance analysis: An interpretability method to quantify importance of genomic features in deep neural networks.

Authors:  Peter K Koo; Antonio Majdandzic; Matthew Ploenzke; Praveen Anand; Steffan B Paul
Journal:  PLoS Comput Biol       Date:  2021-05-13       Impact factor: 4.475

7.  An integrated encyclopedia of DNA elements in the human genome.

Authors: 
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

8.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

9.  Logomaker: beautiful sequence logos in Python.

Authors:  Ammar Tareen; Justin B Kinney
Journal:  Bioinformatics       Date:  2019-12-10       Impact factor: 6.937

10.  JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.

Authors:  Anthony Mathelier; Oriol Fornes; David J Arenillas; Chih-Yu Chen; Grégoire Denay; Jessica Lee; Wenqiang Shi; Casper Shyr; Ge Tan; Rebecca Worsley-Hunt; Allen W Zhang; François Parcy; Boris Lenhard; Albin Sandelin; Wyeth W Wasserman
Journal:  Nucleic Acids Res       Date:  2015-11-03       Impact factor: 16.971

View more
  6 in total

1.  fastISM: Performant in-silico saturation mutagenesis for convolutional neural networks.

Authors:  Surag Nair; Avanti Shrikumar; Jacob Schreiber; Anshul Kundaje
Journal:  Bioinformatics       Date:  2022-03-03       Impact factor: 6.937

Review 2.  Obtaining genetics insights from deep learning via explainable artificial intelligence.

Authors:  Gherman Novakovsky; Nick Dexter; Maxwell W Libbrecht; Wyeth W Wasserman; Sara Mostafavi
Journal:  Nat Rev Genet       Date:  2022-10-03       Impact factor: 59.581

3.  Genomics enters the deep learning era.

Authors:  Etienne Routhier; Julien Mozziconacci
Journal:  PeerJ       Date:  2022-06-24       Impact factor: 3.061

4.  Multinomial Convolutions for Joint Modeling of Regulatory Motifs and Sequence Activity Readouts.

Authors:  Minjun Park; Salvi Singh; Samin Rahman Khan; Mohammed Abid Abrar; Francisco Grisanti; M Sohel Rahman; Md Abul Hassan Samee
Journal:  Genes (Basel)       Date:  2022-09-08       Impact factor: 4.141

Review 5.  Virtual Gene Concept and a Corresponding Pragmatic Research Program in Genetical Data Science.

Authors:  Łukasz Huminiecki
Journal:  Entropy (Basel)       Date:  2021-12-23       Impact factor: 2.524

6.  Machine learning in medicine: should the pursuit of enhanced interpretability be abandoned?

Authors:  Chang Ho Yoon; Robert Torrance; Naomi Scheinerman
Journal:  J Med Ethics       Date:  2021-05-18       Impact factor: 5.926

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.