Literature DB >> 27812284

An Empirical Prior Improves Accuracy for Bayesian Estimation of Transcription Factor Binding Site Frequencies within Gene Promoters.

Stephen A Ramsey1.   

Abstract

A Bayesian method for sampling from the distribution of matches to a precompiled transcription factor binding site (TFBS) sequence pattern (conditioned on an observed nucleotide sequence and the sequence pattern) is described. The method takes a position frequency matrix as input for a set of representative binding sites for a transcription factor and two sets of noncoding, 5' regulatory sequences for gene sets that are to be compared. An empirical prior on the frequency A (per base pair of gene-vicinal, noncoding DNA) of TFBSs is developed using data from the ENCODE project and incorporated into the method. In addition, a probabilistic model for binding site occurrences conditioned on λ is developed analytically, taking into account the finite-width effects of binding sites. The count of TFBS β (conditioned on the observed sequence) is sampled using Metropolis-Hastings with an information entropy-based move generator. The derivation of the method is presented in a step-by-step fashion, starting from specific conditional independence assumptions. Empirical results show that the newly proposed prior on β improves accuracy for estimating the number of TFBS within a set of promoter sequences.

Entities:  

Keywords:  Bayesian statistics; binding site; enrichment analysis; gene regulation; transcription factor

Year:  2016        PMID: 27812284      PMCID: PMC5081247          DOI: 10.4137/BBI.S29330

Source DB:  PubMed          Journal:  Bioinform Biol Insights        ISSN: 1177-9322


  44 in total

1.  JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

Authors:  Albin Sandelin; Wynand Alkema; Pär Engström; Wyeth W Wasserman; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 2.  Integrated approaches to uncovering transcription regulatory networks in mammalian cells.

Authors:  Kai Tan; Jesper Tegner; Timothy Ravasi
Journal:  Genomics       Date:  2008-01-08       Impact factor: 5.736

3.  MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence data.

Authors:  K Quandt; K Frech; H Karas; E Wingender; T Werner
Journal:  Nucleic Acids Res       Date:  1995-12-11       Impact factor: 16.971

4.  A map of the cis-regulatory sequences in the mouse genome.

Authors:  Yin Shen; Feng Yue; David F McCleary; Zhen Ye; Lee Edsall; Samantha Kuan; Ulrich Wagner; Jesse Dixon; Leonard Lee; Victor V Lobanenkov; Bing Ren
Journal:  Nature       Date:  2012-08-02       Impact factor: 49.962

5.  Selection of DNA binding sites by regulatory proteins: the LexA protein and the arginine repressor use different strategies for functional specificity.

Authors:  O G Berg
Journal:  Nucleic Acids Res       Date:  1988-06-10       Impact factor: 16.971

6.  Architecture of the human regulatory network derived from ENCODE data.

Authors:  Mark B Gerstein; Anshul Kundaje; Manoj Hariharan; Stephen G Landt; Koon-Kiu Yan; Chao Cheng; Xinmeng Jasmine Mu; Ekta Khurana; Joel Rozowsky; Roger Alexander; Renqiang Min; Pedro Alves; Alexej Abyzov; Nick Addleman; Nitin Bhardwaj; Alan P Boyle; Philip Cayting; Alexandra Charos; David Z Chen; Yong Cheng; Declan Clarke; Catharine Eastman; Ghia Euskirchen; Seth Frietze; Yao Fu; Jason Gertz; Fabian Grubert; Arif Harmanci; Preti Jain; Maya Kasowski; Phil Lacroute; Jing Jane Leng; Jin Lian; Hannah Monahan; Henriette O'Geen; Zhengqing Ouyang; E Christopher Partridge; Dorrelyn Patacsil; Florencia Pauli; Debasish Raha; Lucia Ramirez; Timothy E Reddy; Brian Reed; Minyi Shi; Teri Slifer; Jing Wang; Linfeng Wu; Xinqiong Yang; Kevin Y Yip; Gili Zilberman-Schapira; Serafim Batzoglou; Arend Sidow; Peggy J Farnham; Richard M Myers; Sherman M Weissman; Michael Snyder
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

7.  Genome-wide mapping of in vivo protein-DNA interactions.

Authors:  David S Johnson; Ali Mortazavi; Richard M Myers; Barbara Wold
Journal:  Science       Date:  2007-05-31       Impact factor: 47.728

8.  ATF3 protects against atherosclerosis by suppressing 25-hydroxycholesterol-induced lipid body formation.

Authors:  Elizabeth S Gold; Stephen A Ramsey; Mark J Sartain; Jyrki Selinummi; Irina Podolsky; David J Rodriguez; Robert L Moritz; Alan Aderem
Journal:  J Exp Med       Date:  2012-04-02       Impact factor: 14.307

9.  UniPROBE: an online database of protein binding microarray data on protein-DNA interactions.

Authors:  Daniel E Newburger; Martha L Bulyk
Journal:  Nucleic Acids Res       Date:  2008-10-08       Impact factor: 16.971

10.  Design and analysis of ChIP-seq experiments for DNA-binding proteins.

Authors:  Peter V Kharchenko; Michael Y Tolstorukov; Peter J Park
Journal:  Nat Biotechnol       Date:  2008-11-16       Impact factor: 54.908

View more
  2 in total

1.  Identifying Cell Type-Specific Transcription Factors by Integrating ChIP-seq and eQTL Data-Application to Monocyte Gene Regulation.

Authors:  Mudra Choudhury; Stephen A Ramsey
Journal:  Gene Regul Syst Bio       Date:  2016-12-13

2.  Statistical estimates of multiple transcription factors binding in the model plant genomes based on ChIP-seq data.

Authors:  Arthur I Dergilev; Nina G Orlova; Oxana B Dobrovolskaya; Yuriy L Orlov
Journal:  J Integr Bioinform       Date:  2021-12-21
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.