Literature DB >> 27329863

ACE: adaptive cluster expansion for maximum entropy graphical model inference.

J P Barton1, E De Leonardis2, A Coucke3, S Cocco4.   

Abstract

MOTIVATION: Graphical models are often employed to interpret patterns of correlations observed in data through a network of interactions between the variables. Recently, Ising/Potts models, also known as Markov random fields, have been productively applied to diverse problems in biology, including the prediction of structural contacts from protein sequence data and the description of neural activity patterns. However, inference of such models is a challenging computational problem that cannot be solved exactly. Here, we describe the adaptive cluster expansion (ACE) method to quickly and accurately infer Ising or Potts models based on correlation data. ACE avoids overfitting by constructing a sparse network of interactions sufficient to reproduce the observed correlation data within the statistical error expected due to finite sampling. When convergence of the ACE algorithm is slow, we combine it with a Boltzmann Machine Learning algorithm (BML). We illustrate this method on a variety of biological and artificial datasets and compare it to state-of-the-art approximate methods such as Gaussian and pseudo-likelihood inference.
RESULTS: We show that ACE accurately reproduces the true parameters of the underlying model when they are known, and yields accurate statistical descriptions of both biological and artificial data. Models inferred by ACE more accurately describe the statistics of the data, including both the constrained low-order correlations and unconstrained higher-order correlations, compared to those obtained by faster Gaussian and pseudo-likelihood methods. These alternative approaches can recover the structure of the interaction network but typically not the correct strength of interactions, resulting in less accurate generative models.
AVAILABILITY AND IMPLEMENTATION: The ACE source code, user manual and tutorials with the example data and filtered correlations described herein are freely available on GitHub at https://github.com/johnbarton/ACE CONTACTS: jpbarton@mit.edu, cocco@lps.ens.frSupplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2016        PMID: 27329863     DOI: 10.1093/bioinformatics/btw328

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  24 in total

1.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

2.  Influence of multiple-sequence-alignment depth on Potts statistical models of protein covariation.

Authors:  Allan Haldane; Ronald M Levy
Journal:  Phys Rev E       Date:  2019-03       Impact factor: 2.529

3.  Functional connectivity models for decoding of spatial representations from hippocampal CA1 recordings.

Authors:  Lorenzo Posani; Simona Cocco; Karel Ježek; Rémi Monasson
Journal:  J Comput Neurosci       Date:  2017-05-08       Impact factor: 1.621

4.  Neural assemblies revealed by inferred connectivity-based models of prefrontal cortex recordings.

Authors:  G Tavoni; S Cocco; R Monasson
Journal:  J Comput Neurosci       Date:  2016-07-28       Impact factor: 1.621

Review 5.  Potts Hamiltonian models of protein co-variation, free energy landscapes, and evolutionary fitness.

Authors:  Ronald M Levy; Allan Haldane; William F Flynn
Journal:  Curr Opin Struct Biol       Date:  2016-11-18       Impact factor: 6.809

6.  Coevolutionary Landscape of Kinase Family Proteins: Sequence Probabilities and Functional Motifs.

Authors:  Allan Haldane; William F Flynn; Peng He; Ronald M Levy
Journal:  Biophys J       Date:  2018-01-09       Impact factor: 4.033

7.  Adenovirus-vectored vaccine containing multidimensionally conserved parts of the HIV proteome is immunogenic in rhesus macaques.

Authors:  Dariusz K Murakowski; John P Barton; Lauren Peter; Abishek Chandrashekar; Esther Bondzie; Ang Gao; Dan H Barouch; Arup K Chakraborty
Journal:  Proc Natl Acad Sci U S A       Date:  2021-02-02       Impact factor: 11.205

8.  Epistasis and entrenchment of drug resistance in HIV-1 subtype B.

Authors:  Avik Biswas; Allan Haldane; Eddy Arnold; Ronald M Levy
Journal:  Elife       Date:  2019-10-08       Impact factor: 8.140

9.  Inference of stochastic time series with missing data.

Authors:  Sangwon Lee; Vipul Periwal; Junghyo Jo
Journal:  Phys Rev E       Date:  2021-08       Impact factor: 2.707

10.  Mi3-GPU: MCMC-based Inverse Ising Inference on GPUs for protein covariation analysis.

Authors:  Allan Haldane; Ronald M Levy
Journal:  Comput Phys Commun       Date:  2020-04-17       Impact factor: 4.390

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.