Literature DB >> 22106262

Direct-coupling analysis of residue coevolution captures native contacts across many protein families.

Faruck Morcos1, Andrea Pagnani, Bryan Lunt, Arianna Bertolino, Debora S Marks, Chris Sander, Riccardo Zecchina, José N Onuchic, Terence Hwa, Martin Weigt.   

Abstract

The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced direct-coupling analysis (DCA). Here we develop a computationally efficient implementation of DCA, which allows us to evaluate the accuracy of contact prediction by DCA for a large number of protein domains, based purely on sequence information. DCA is shown to yield a large number of correctly predicted contacts, recapitulating the global structure of the contact map for the majority of the protein domains examined. Furthermore, our analysis captures clear signals beyond intradomain residue contacts, arising, e.g., from alternative protein conformations, ligand-mediated residue couplings, and interdomain interactions in protein oligomers. Our findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, contingent on the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22106262      PMCID: PMC3241805          DOI: 10.1073/pnas.1111471108

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  43 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Two-component and phosphorelay signal transduction.

Authors:  J A Hoch
Journal:  Curr Opin Microbiol       Date:  2000-04       Impact factor: 7.934

3.  Correlations among amino acid sites in bHLH protein domains: an information theoretic analysis.

Authors:  W R Atchley; K R Wollenberg; W M Fitch; W Terhalle; A W Dress
Journal:  Mol Biol Evol       Date:  2000-01       Impact factor: 16.240

4.  Features of protein-protein interactions in two-component signaling deduced from genomic libraries.

Authors:  Robert A White; Hendrik Szurmant; James A Hoch; Terence Hwa
Journal:  Methods Enzymol       Date:  2007       Impact factor: 1.600

5.  The role of water in sequence-independent ligand binding by an oligopeptide transporter protein.

Authors:  J R Tame; S H Sleigh; A J Wilkinson; J E Ladbury
Journal:  Nat Struct Biol       Date:  1996-12

6.  Respective roles of short- and long-range interactions in protein folding.

Authors:  N Go; H Taketomi
Journal:  Proc Natl Acad Sci U S A       Date:  1978-02       Impact factor: 11.205

7.  Phosphonoformate: a minimal transition state analogue inhibitor of the fosfomycin resistance protein, FosA.

Authors:  Rachel E Rigsby; Chris L Rife; Kerry L Fillgrove; Marcia E Newcomer; Richard N Armstrong
Journal:  Biochemistry       Date:  2004-11-02       Impact factor: 3.162

8.  Crystal structures of the response regulator DosR from Mycobacterium tuberculosis suggest a helix rearrangement mechanism for phosphorylation activation.

Authors:  Goragot Wisedchaisri; Meiting Wu; David R Sherman; Wim G J Hol
Journal:  J Mol Biol       Date:  2008-02-26       Impact factor: 5.469

9.  The Genomes On Line Database (GOLD) in 2009: status of genomic and metagenomic projects and their associated metadata.

Authors:  Konstantinos Liolios; I-Min A Chen; Konstantinos Mavromatis; Nektarios Tavernarakis; Philip Hugenholtz; Victor M Markowitz; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2009-11-13       Impact factor: 16.971

10.  Finding coevolving amino acid residues using row and column weighting of mutual information and multi-dimensional amino acid representation.

Authors:  Rodrigo Gouveia-Oliveira; Anders G Pedersen
Journal:  Algorithms Mol Biol       Date:  2007-10-03       Impact factor: 1.405

View more
  460 in total

1.  Accurate de novo structure prediction of large transmembrane protein domains using fragment-assembly and correlated mutation analysis.

Authors:  Timothy Nugent; David T Jones
Journal:  Proc Natl Acad Sci U S A       Date:  2012-05-29       Impact factor: 11.205

2.  Genomics-aided structure prediction.

Authors:  Joanna I Sułkowska; Faruck Morcos; Martin Weigt; Terence Hwa; José N Onuchic
Journal:  Proc Natl Acad Sci U S A       Date:  2012-06-12       Impact factor: 11.205

3.  Structural basis of histidine kinase autophosphorylation deduced by integrating genomics, molecular dynamics, and mutagenesis.

Authors:  Angel E Dago; Alexander Schug; Andrea Procaccini; James A Hoch; Martin Weigt; Hendrik Szurmant
Journal:  Proc Natl Acad Sci U S A       Date:  2012-06-05       Impact factor: 11.205

4.  From residue coevolution to protein conformational ensembles and functional dynamics.

Authors:  Ludovico Sutto; Simone Marsili; Alfonso Valencia; Francesco Luigi Gervasio
Journal:  Proc Natl Acad Sci U S A       Date:  2015-10-20       Impact factor: 11.205

5.  Constructing sequence-dependent protein models using coevolutionary information.

Authors:  Ryan R Cheng; Mohit Raghunathan; Jeffrey K Noel; José N Onuchic
Journal:  Protein Sci       Date:  2015-08-10       Impact factor: 6.725

Review 6.  Epistasis in protein evolution.

Authors:  Tyler N Starr; Joseph W Thornton
Journal:  Protein Sci       Date:  2016-02-28       Impact factor: 6.725

7.  Dynamic architecture of a protein kinase.

Authors:  Christopher L McClendon; Alexandr P Kornev; Michael K Gilson; Susan S Taylor
Journal:  Proc Natl Acad Sci U S A       Date:  2014-10-15       Impact factor: 11.205

8.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

9.  Structured States of Disordered Proteins from Genomic Sequences.

Authors:  Agnes Toth-Petroczy; Perry Palmedo; John Ingraham; Thomas A Hopf; Bonnie Berger; Chris Sander; Debora S Marks
Journal:  Cell       Date:  2016-09-22       Impact factor: 41.582

10.  Structure of the DASH/Dam1 complex shows its role at the yeast kinetochore-microtubule interface.

Authors:  Simon Jenni; Stephen C Harrison
Journal:  Science       Date:  2018-05-04       Impact factor: 47.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.