
BEESEM: estimation of binding energy models using HT-SELEX data.

Shuxiang Ruan, S Joshua Swamidass, Gary D Stormo.

Abstract

MOTIVATION: Characterizing the binding specificities of transcription factors (TFs) is crucial to the study of gene expression regulation. Recently developed high-throughput experimental methods, including protein binding microarrays (PBM) and high-throughput SELEX (HT-SELEX), have enabled rapid measurement of the specificities of hundreds of TFs. However, few studies have developed efficient algorithms for estimating binding motifs from HT-SELEX data. In addition, the simple method of constructing a position weight matrix (PWM) by comparing the frequency of the preferred sequence with the frequencies of its single-nucleotide variants risks generating motifs with higher information content than the true binding specificity.
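The simple PWM construction mentioned above can be sketched as follows. This is a minimal illustration, not the procedure of any specific published pipeline: the function names, the pseudocount, and the k-mer counts are all hypothetical. Each column is built from the counts of the preferred sequence and its three single-nucleotide variants at that position, and the information content is computed against a uniform background.

```python
import math

BASES = "ACGT"

def variant_pwm(counts, consensus, pseudocount=1.0):
    """Per-position base frequencies from the consensus and its
    single-nucleotide variants (hypothetical helper)."""
    matrix = []
    for i in range(len(consensus)):
        column = {}
        for b in BASES:
            # Sequence that differs from the consensus only at position i.
            variant = consensus[:i] + b + consensus[i + 1:]
            column[b] = counts.get(variant, 0) + pseudocount
        total = sum(column.values())
        matrix.append({b: column[b] / total for b in BASES})
    return matrix

def information_content(matrix):
    """Total information content in bits, assuming a uniform background."""
    return sum(
        2.0 + sum(p * math.log2(p) for p in col.values() if p > 0)
        for col in matrix
    )

# Made-up counts for the consensus "ACG" and its nine single-nucleotide variants.
counts = {
    "ACG": 900, "CCG": 50, "GCG": 30, "TCG": 20,
    "AAG": 100, "AGG": 60, "ATG": 40,
    "ACA": 10, "ACC": 15, "ACT": 5,
}
pwm = variant_pwm(counts, "ACG")
ic = information_content(pwm)
```

Because every column is normalized against the single most enriched sequence, heavily enriched counts push each column toward the consensus base, which is one way such a matrix can end up with inflated information content.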
RESULTS: We developed an algorithm called BEESEM that builds on a comprehensive biophysical model of protein-DNA interactions, which is trained using the expectation maximization method. BEESEM is capable of selecting the optimal motif length and calculating the confidence intervals of estimated parameters. By comparing BEESEM with the published motifs estimated using the same HT-SELEX data, we demonstrate that BEESEM provides significant improvements. We also evaluate several motif discovery algorithms on independent PBM and ChIP-seq data. BEESEM provides significantly better fits to in vitro data, but its performance is similar to some other methods on in vivo data under the criterion of the area under the receiver operating characteristic curve (AUROC). This highlights the limitations of the purely rank-based AUROC criterion. Using quantitative binding data to assess models, however, demonstrates that BEESEM improves on prior models.
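The point about AUROC being purely rank-based can be illustrated with a short sketch. It assumes a commonly used two-state occupancy form, P = 1 / (1 + exp(E - mu)), with energies in units of kT; this is an illustrative simplification and not necessarily the exact model implemented in BEESEM. The energies, labels, and the value of mu are made up. Any monotonic transformation of the scores leaves AUROC unchanged, so it cannot distinguish a model whose predicted occupancies are quantitatively accurate from one that only gets the ordering right.

```python
import math

def auroc(scores, labels):
    """AUROC as the fraction of (positive, negative) pairs in which the
    positive scores higher; ties count as half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def binding_probability(energy, mu):
    """Two-state occupancy (assumed form), energies in units of kT."""
    return 1.0 / (1.0 + math.exp(energy - mu))

# Hypothetical binding energies (lower = tighter) and bound/unbound labels.
energies = [0.0, 1.0, 3.5, 4.0, 3.0, 6.0]
labels = [1, 1, 1, 0, 0, 0]

raw_scores = [-e for e in energies]                           # linear score
occupancy = [binding_probability(e, mu=2.0) for e in energies]  # nonlinear score

auc_raw = auroc(raw_scores, labels)
auc_occ = auroc(occupancy, labels)
```

Here `auc_raw` and `auc_occ` are identical even though the two score scales differ substantially, whereas a quantitative criterion such as a correlation with measured binding would be sensitive to that difference.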
AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://stormo.wustl.edu/resources.html.
CONTACT: stormo@wustl.edu.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

Year:  2017        PMID: 28379348      PMCID: PMC5860122          DOI: 10.1093/bioinformatics/btx191

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


