Literature DB >> 23410359

Improved contact prediction in proteins: using pseudolikelihoods to infer Potts models.

Magnus Ekeberg1, Cecilia Lövkvist, Yueheng Lan, Martin Weigt, Erik Aurell.   

Abstract

Spatially proximate amino acids in a protein tend to coevolve. A protein's three-dimensional (3D) structure hence leaves an echo of correlations in the evolutionary record. Reverse engineering 3D structures from such correlations is an open problem in structural biology, pursued with increasing vigor as more and more protein sequences continue to fill the data banks. Within this task lies a statistical inference problem, rooted in the following: correlation between two sites in a protein sequence can arise from firsthand interaction but can also be network-propagated via intermediate sites; observed correlation is not enough to guarantee proximity. To separate direct from indirect interactions is an instance of the general problem of inverse statistical mechanics, where the task is to learn model parameters (fields, couplings) from observables (magnetizations, correlations, samples) in large systems. In the context of protein sequences, the approach has been referred to as direct-coupling analysis. Here we show that the pseudolikelihood method, applied to 21-state Potts models describing the statistical properties of families of evolutionarily related proteins, significantly outperforms existing approaches to the direct-coupling analysis, the latter being based on standard mean-field techniques. This improved performance also relies on a modified score for the coupling strength. The results are verified using known crystal structures of specific sequence instances of various protein families. Code implementing the new method can be found at http://plmdca.csc.kth.se/.

Mesh:

Substances:

Year:  2013        PMID: 23410359     DOI: 10.1103/PhysRevE.87.012707

Source DB:  PubMed          Journal:  Phys Rev E Stat Nonlin Soft Matter Phys        ISSN: 1539-3755


  217 in total

1.  Genome-wide discovery of epistatic loci affecting antibiotic resistance in Neisseria gonorrhoeae using evolutionary couplings.

Authors:  Benjamin Schubert; Rohan Maddamsetti; Jackson Nyman; Maha R Farhat; Debora S Marks
Journal:  Nat Microbiol       Date:  2018-12-03       Impact factor: 17.745

2.  From residue coevolution to protein conformational ensembles and functional dynamics.

Authors:  Ludovico Sutto; Simone Marsili; Alfonso Valencia; Francesco Luigi Gervasio
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-20       Impact factor: 11.205

3.  Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning.

Authors:  Jianzhu Ma; Sheng Wang; Zhiyong Wang; Jinbo Xu
Journal:  Bioinformatics       Date:  2015-08-14       Impact factor: 6.937

4.  Constructing sequence-dependent protein models using coevolutionary information.

Authors:  Ryan R Cheng; Mohit Raghunathan; Jeffrey K Noel; José N Onuchic
Journal:  Protein Sci       Date:  2015-08-10       Impact factor: 6.725

5.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

6.  Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era.

Authors:  Hetunandan Kamisetty; Sergey Ovchinnikov; David Baker
Journal:  Proc Natl Acad Sci U S A       Date:  2013-09-05       Impact factor: 11.205

7.  BCov: a method for predicting β-sheet topology using sparse inverse covariance estimation and integer programming.

Authors:  Castrense Savojardo; Piero Fariselli; Pier Luigi Martelli; Rita Casadio
Journal:  Bioinformatics       Date:  2013-09-23       Impact factor: 6.937

8.  Structured States of Disordered Proteins from Genomic Sequences.

Authors:  Agnes Toth-Petroczy; Perry Palmedo; John Ingraham; Thomas A Hopf; Bonnie Berger; Chris Sander; Debora S Marks
Journal:  Cell       Date:  2016-09-22       Impact factor: 41.582

9.  Structure of the DASH/Dam1 complex shows its role at the yeast kinetochore-microtubule interface.

Authors:  Simon Jenni; Stephen C Harrison
Journal:  Science       Date:  2018-05-04       Impact factor: 47.728

10.  Global analysis of more than 50,000 SARS-CoV-2 genomes reveals epistasis between eight viral genes.

Authors:  Hong-Li Zeng; Vito Dichio; Edwin Rodríguez Horta; Kaisa Thorell; Erik Aurell
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-17       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.