Literature DB >> 26213851

Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Babak Alipanahi1, Andrew Delong2, Matthew T Weirauch3, Brendan J Frey4.   

Abstract

Knowing the sequence specificities of DNA- and RNA-binding proteins is essential for developing models of the regulatory processes in biological systems and for identifying causal disease variants. Here we show that sequence specificities can be ascertained from experimental data with 'deep learning' techniques, which offer a scalable, flexible and unified computational approach for pattern discovery. Using a diverse array of experimental data and evaluation metrics, we find that deep learning outperforms other state-of-the-art methods, even when training on in vitro data and testing on in vivo data. We call this approach DeepBind and have built a stand-alone software tool that is fully automatic and handles millions of sequences per experiment. Specificities determined by DeepBind are readily visualized as a weighted ensemble of position weight matrices or as a 'mutation map' that indicates how variations affect binding within a specific sequence.

Mesh:

Substances:

Year:  2015        PMID: 26213851     DOI: 10.1038/nbt.3300

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  45 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

Review 2.  Origins of specificity in protein-DNA recognition.

Authors:  Remo Rohs; Xiangshu Jin; Sean M West; Rohit Joshi; Barry Honig; Richard S Mann
Journal:  Annu Rev Biochem       Date:  2010       Impact factor: 23.643

3.  Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities.

Authors:  Michael F Berger; Anthony A Philippakis; Aaron M Qureshi; Fangxue S He; Preston W Estep; Martha L Bulyk
Journal:  Nat Biotechnol       Date:  2006-09-24       Impact factor: 54.908

4.  RankMotif++: a motif-search algorithm that accounts for relative ranks of K-mers in binding transcription factors.

Authors:  Xiaoyu Chen; Timothy R Hughes; Quaid Morris
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

5.  RBFOX1 regulates both splicing and transcriptional networks in human neuronal development.

Authors:  Brent L Fogel; Eric Wexler; Amanda Wahnich; Tara Friedrich; Chandran Vijayendran; Fuying Gao; Neelroop Parikshak; Genevieve Konopka; Daniel H Geschwind
Journal:  Hum Mol Genet       Date:  2012-06-23       Impact factor: 6.150

6.  Direct measurement of DNA affinity landscapes on a high-throughput sequencing instrument.

Authors:  Razvan Nutiu; Robin C Friedman; Shujun Luo; Irina Khrebtukova; David Silva; Robin Li; Lu Zhang; Gary P Schroth; Christopher B Burge
Journal:  Nat Biotechnol       Date:  2011-06-26       Impact factor: 54.908

7.  Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities.

Authors:  Arttu Jolma; Teemu Kivioja; Jarkko Toivonen; Lu Cheng; Gonghong Wei; Martin Enge; Mikko Taipale; Juan M Vaquerizas; Jian Yan; Mikko J Sillanpää; Martin Bonke; Kimmo Palin; Shaheynoor Talukder; Timothy R Hughes; Nicholas M Luscombe; Esko Ukkonen; Jussi Taipale
Journal:  Genome Res       Date:  2010-04-08       Impact factor: 9.043

8.  Evaluation of methods for modeling transcription factor sequence specificity.

Authors:  Matthew T Weirauch; Atina Cote; Raquel Norel; Matti Annala; Yue Zhao; Todd R Riley; Julio Saez-Rodriguez; Thomas Cokelaer; Anastasia Vedenko; Shaheynoor Talukder; Harmen J Bussemaker; Quaid D Morris; Martha L Bulyk; Gustavo Stolovitzky; Timothy R Hughes
Journal:  Nat Biotechnol       Date:  2013-01-27       Impact factor: 54.908

9.  Evolutionarily dynamic alternative splicing of GPR56 regulates regional cerebral cortical patterning.

Authors:  Byoung-Il Bae; Ian Tietjen; Kutay D Atabay; Gilad D Evrony; Matthew B Johnson; Ebenezer Asare; Peter P Wang; Ayako Y Murayama; Kiho Im; Steven N Lisgo; Lynne Overman; Nenad Šestan; Bernard S Chang; A James Barkovich; P Ellen Grant; Meral Topçu; Jeffrey Politsky; Hideyuki Okano; Xianhua Piao; Christopher A Walsh
Journal:  Science       Date:  2014-02-14       Impact factor: 47.728

10.  Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Authors:  Peter V Kharchenko; Michael Y Tolstorukov; Peter J Park
Journal:  Nat Biotechnol       Date:  2008-11-16       Impact factor: 54.908

View more
  567 in total

1.  RNA structure from deep sequencing.

Authors:  Eric Westhof
Journal:  Nat Biotechnol       Date:  2015-09       Impact factor: 54.908

2.  Deep learning for regulatory genomics.

Authors:  Yongjin Park; Manolis Kellis
Journal:  Nat Biotechnol       Date:  2015-08       Impact factor: 54.908

Review 3.  Small Genetic Circuits and MicroRNAs: Big Players in Polymerase II Transcriptional Control in Plants.

Authors:  Molly Megraw; Jason S Cumbie; Maria G Ivanchenko; Sergei A Filichkin
Journal:  Plant Cell       Date:  2016-02-11       Impact factor: 11.277

4.  DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.

Authors:  Hamid Reza Hassanzadeh; May D Wang
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-01-19

5.  Prediction of condition-specific regulatory genes using machine learning.

Authors:  Qi Song; Jiyoung Lee; Shamima Akter; Matthew Rogers; Ruth Grene; Song Li
Journal:  Nucleic Acids Res       Date:  2020-06-19       Impact factor: 16.971

6.  Predicting functional variants in enhancer and promoter elements using RegulomeDB.

Authors:  Shengcheng Dong; Alan P Boyle
Journal:  Hum Mutat       Date:  2019-06-22       Impact factor: 4.878

7.  PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning.

Authors:  Zhencheng Fang; Jie Tan; Shufang Wu; Mo Li; Congmin Xu; Zhongjie Xie; Huaiqiu Zhu
Journal:  Gigascience       Date:  2019-06-01       Impact factor: 6.524

8.  Prediction of regulatory motifs from human Chip-sequencing data using a deep learning framework.

Authors:  Jinyu Yang; Anjun Ma; Adam D Hoppe; Cankun Wang; Yang Li; Chi Zhang; Yan Wang; Bingqiang Liu; Qin Ma
Journal:  Nucleic Acids Res       Date:  2019-09-05       Impact factor: 16.971

9.  The tuberous sclerosis complex subunit TBC1D7 is stabilized by Akt phosphorylation-mediated 14-3-3 binding.

Authors:  James P Madigan; Feng Hou; Linlei Ye; Jicheng Hu; Aiping Dong; Wolfram Tempel; Marielle E Yohe; Paul A Randazzo; Lisa M Miller Jenkins; Michael M Gottesman; Yufeng Tong
Journal:  J Biol Chem       Date:  2018-08-24       Impact factor: 5.157

10.  Attend and Predict: Understanding Gene Regulation by Selective Attention on Chromatin.

Authors:  Ritambhara Singh; Jack Lanchantin; Arshdeep Sekhon; Yanjun Qi
Journal:  Adv Neural Inf Process Syst       Date:  2017-12
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.