Literature DB >> 21728506

High-dimensional inference with the generalized Hopfield model: principal component analysis and corrections.

S Cocco1, R Monasson, V Sessak.   

Abstract

We consider the problem of inferring the interactions between a set of N binary variables from the knowledge of their frequencies and pairwise correlations. The inference framework is based on the Hopfield model, a special case of the Ising model where the interaction matrix is defined through a set of patterns in the variable space, and is of rank much smaller than N. We show that maximum likelihood inference is deeply related to principal component analysis when the amplitude of the pattern components ξ is negligible compared to √N. Using techniques from statistical mechanics, we calculate the corrections to the patterns to the first order in ξ/√N. We stress the need to generalize the Hopfield model and include both attractive and repulsive patterns in order to correctly infer networks with sparse and strong interactions. We present a simple geometrical criterion to decide how many attractive and repulsive patterns should be considered as a function of the sampling noise. We moreover discuss how many sampled configurations are required for a good inference, as a function of the system size N and of the amplitude ξ. The inference approach is illustrated on synthetic and biological data.

Mesh:

Substances:

Year:  2011        PMID: 21728506     DOI: 10.1103/PhysRevE.83.051123

Source DB:  PubMed          Journal:  Phys Rev E Stat Nonlin Soft Matter Phys        ISSN: 1539-3755


  6 in total

1.  Statistical mechanics for natural flocks of birds.

Authors:  William Bialek; Andrea Cavagna; Irene Giardina; Thierry Mora; Edmondo Silvestri; Massimiliano Viale; Aleksandra M Walczak
Journal:  Proc Natl Acad Sci U S A       Date:  2012-03-16       Impact factor: 11.205

2.  Predicting protein-ligand affinity with a random matrix framework.

Authors:  Alpha A Lee; Michael P Brenner; Lucy J Colwell
Journal:  Proc Natl Acad Sci U S A       Date:  2016-11-16       Impact factor: 11.205

3.  From principal component to direct coupling analysis of coevolution in proteins: low-eigenvalue modes are needed for structure prediction.

Authors:  Simona Cocco; Remi Monasson; Martin Weigt
Journal:  PLoS Comput Biol       Date:  2013-08-22       Impact factor: 4.475

4.  A general pairwise interaction model provides an accurate description of in vivo transcription factor binding sites.

Authors:  Marc Santolini; Thierry Mora; Vincent Hakim
Journal:  PLoS One       Date:  2014-06-13       Impact factor: 3.240

5.  Improving landscape inference by integrating heterogeneous data in the inverse Ising problem.

Authors:  Pierre Barrat-Charlaix; Matteo Figliuzzi; Martin Weigt
Journal:  Sci Rep       Date:  2016-11-25       Impact factor: 4.379

6.  Revealing evolutionary constraints on proteins through sequence analysis.

Authors:  Shou-Wen Wang; Anne-Florence Bitbol; Ned S Wingreen
Journal:  PLoS Comput Biol       Date:  2019-04-24       Impact factor: 4.475

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.