Literature DB >> 22851788

Statistical Mechanics of Transcription-Factor Binding Site Discovery Using Hidden Markov Models.

Pankaj Mehta1, David J Schwab, Anirvan M Sengupta.   

Abstract

Hidden Markov Models (HMMs) are a commonly used tool for inference of transcription factor (TF) binding sites from DNA sequence data. We exploit the mathematical equivalence between HMMs for TF binding and the "inverse" statistical mechanics of hard rods in a one-dimensional disordered potential to investigate learning in HMMs. We derive analytic expressions for the Fisher information, a commonly employed measure of confidence in learned parameters, in the biologically relevant limit where the density of binding sites is low. We then use techniques from statistical mechanics to derive a scaling principle relating the specificity (binding energy) of a TF to the minimum amount of training data necessary to learn it.

Entities:  

Year:  2011        PMID: 22851788      PMCID: PMC3407691          DOI: 10.1007/s10955-010-0102-x

Source DB:  PubMed          Journal:  J Stat Phys        ISSN: 0022-4715            Impact factor:   1.548


  13 in total

1.  A probabilistic method to detect regulatory modules.

Authors:  Saurabh Sinha; Erik van Nimwegen; Eric D Siggia
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

2.  A biophysical approach to transcription factor binding site discovery.

Authors:  Marko Djordjevic; Anirvan M Sengupta; Boris I Shraiman
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

3.  An invariant form for the prior probability in estimation problems.

Authors:  H JEFFREYS
Journal:  Proc R Soc Lond A Math Phys Sci       Date:  1946

4.  Weak pairwise correlations imply strongly correlated network states in a neural population.

Authors:  Elad Schneidman; Michael J Berry; Ronen Segev; William Bialek
Journal:  Nature       Date:  2006-04-09       Impact factor: 49.962

5.  Precise physical models of protein-DNA interaction from high-throughput data.

Authors:  Justin B Kinney; Gasper Tkacik; Curtis G Callan
Journal:  Proc Natl Acad Sci U S A       Date:  2006-12-29       Impact factor: 11.205

6.  Nucleosome switches.

Authors:  David J Schwab; Robijn F Bruinsma; Joseph Rudnick; Jonathan Widom
Journal:  Phys Rev Lett       Date:  2008-06-06       Impact factor: 9.161

7.  Identification of direct residue contacts in protein-protein interaction by message passing.

Authors:  Martin Weigt; Robert A White; Hendrik Szurmant; James A Hoch; Terence Hwa
Journal:  Proc Natl Acad Sci U S A       Date:  2008-12-30       Impact factor: 11.205

8.  Maximum entropy models for antibody diversity.

Authors:  Thierry Mora; Aleksandra M Walczak; William Bialek; Curtis G Callan
Journal:  Proc Natl Acad Sci U S A       Date:  2010-03-08       Impact factor: 11.205

Review 9.  Specificity, free energy and information content in protein-DNA interactions.

Authors:  G D Stormo; D S Fields
Journal:  Trends Biochem Sci       Date:  1998-03       Impact factor: 13.807

10.  OHMM: a Hidden Markov Model accurately predicting the occupancy of a transcription factor with a self-overlapping binding motif.

Authors:  Amar Drawid; Nupur Gupta; Vijayalakshmi H Nagaraj; Céline Gélinas; Anirvan M Sengupta
Journal:  BMC Bioinformatics       Date:  2009-07-07       Impact factor: 3.169

View more
  2 in total

1.  A high-bias, low-variance introduction to Machine Learning for physicists.

Authors:  Pankaj Mehta; Ching-Hao Wang; Alexandre G R Day; Clint Richardson; Marin Bukov; Charles K Fisher; David J Schwab
Journal:  Phys Rep       Date:  2019-03-14       Impact factor: 25.600

2.  The next generation of transcription factor binding site prediction.

Authors:  Anthony Mathelier; Wyeth W Wasserman
Journal:  PLoS Comput Biol       Date:  2013-09-05       Impact factor: 4.475

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.