Literature DB >> 24009338

Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era.

Hetunandan Kamisetty1, Sergey Ovchinnikov, David Baker.   

Abstract

Recently developed methods have shown considerable promise in predicting residue-residue contacts in protein 3D structures using evolutionary covariance information. However, these methods require large numbers of evolutionarily related sequences to robustly assess the extent of residue covariation, and the larger the protein family, the more likely that contact information is unnecessary because a reasonable model can be built based on the structure of a homolog. Here we describe a method that integrates sequence coevolution and structural context information using a pseudolikelihood approach, allowing more accurate contact predictions from fewer homologous sequences. We rigorously assess the utility of predicted contacts for protein structure prediction using large and representative sequence and structure databases from recent structure prediction experiments. We find that contact predictions are likely to be accurate when the number of aligned sequences (with sequence redundancy reduced to 90%) is greater than five times the length of the protein, and that accurate predictions are likely to be useful for structure modeling if the aligned sequences are more similar to the protein of interest than to the closest homolog of known structure. These conditions are currently met by 422 of the protein families collected in the Pfam database.

Keywords:  markov random field; maximum-entropy model; protein coevolution

Mesh:

Substances:

Year:  2013        PMID: 24009338      PMCID: PMC3785744          DOI: 10.1073/pnas.1314045110

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  22 in total

1.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

2.  Mutual information without the influence of phylogeny or entropy dramatically improves residue contact prediction.

Authors:  S D Dunn; L M Wahl; G B Gloor
Journal:  Bioinformatics       Date:  2007-12-05       Impact factor: 6.937

3.  Identification of direct residue contacts in protein-protein interaction by message passing.

Authors:  Martin Weigt; Robert A White; Hendrik Szurmant; James A Hoch; Terence Hwa
Journal:  Proc Natl Acad Sci U S A       Date:  2008-12-30       Impact factor: 11.205

4.  Graphical models of residue coupling in protein families.

Authors:  John Thomas; Naren Ramakrishnan; Chris Bailey-Kellogg
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2008 Apr-Jun       Impact factor: 3.710

5.  The Pfam protein families database.

Authors:  Alex Bateman; Ewan Birney; Lorenzo Cerruti; Richard Durbin; Laurence Etwiller; Sean R Eddy; Sam Griffiths-Jones; Kevin L Howe; Mhairi Marshall; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  PISCES: recent improvements to a PDB sequence culling server.

Authors:  Guoli Wang; Roland L Dunbrack
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

7.  Improved residue contact prediction using support vector machines and a large feature set.

Authors:  Jianlin Cheng; Pierre Baldi
Journal:  BMC Bioinformatics       Date:  2007-04-02       Impact factor: 3.169

8.  The MPI Bioinformatics Toolkit for protein sequence analysis.

Authors:  Andreas Biegert; Christian Mayer; Michael Remmert; Johannes Söding; Andrei N Lupas
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

9.  TM-align: a protein structure alignment algorithm based on the TM-score.

Authors:  Yang Zhang; Jeffrey Skolnick
Journal:  Nucleic Acids Res       Date:  2005-04-22       Impact factor: 16.971

10.  The Protein Model Portal.

Authors:  Konstantin Arnold; Florian Kiefer; Jürgen Kopp; James N D Battey; Michael Podvinec; John D Westbrook; Helen M Berman; Lorenza Bordoli; Torsten Schwede
Journal:  J Struct Funct Genomics       Date:  2008-11-27
View more
  273 in total

1.  Integration of QUARK and I-TASSER for Ab Initio Protein Structure Prediction in CASP11.

Authors:  Wenxuan Zhang; Jianyi Yang; Baoji He; Sara Elizabeth Walker; Hongjiu Zhang; Brandon Govindarajoo; Jouko Virtanen; Zhidong Xue; Hong-Bin Shen; Yang Zhang
Journal:  Proteins       Date:  2015-09-23

2.  Protein contact prediction by integrating joint evolutionary coupling analysis and supervised learning.

Authors:  Jianzhu Ma; Sheng Wang; Zhiyong Wang; Jinbo Xu
Journal:  Bioinformatics       Date:  2015-08-14       Impact factor: 6.937

3.  Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins.

Authors:  Jing Yang; Bao-Ji He; Richard Jang; Yang Zhang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2015-08-07       Impact factor: 6.937

4.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

5.  Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.

Authors:  Chengxin Zhang; S M Mortuza; Baoji He; Yanting Wang; Yang Zhang
Journal:  Proteins       Date:  2017-11-14

6.  Deep-learning contact-map guided protein structure prediction in CASP13.

Authors:  Wei Zheng; Yang Li; Chengxin Zhang; Robin Pearce; S M Mortuza; Yang Zhang
Journal:  Proteins       Date:  2019-08-14

7.  Structure of the Bacterial Cytoskeleton Protein Bactofilin by NMR Chemical Shifts and Sequence Variation.

Authors:  Maher M Kassem; Yong Wang; Wouter Boomsma; Kresten Lindorff-Larsen
Journal:  Biophys J       Date:  2016-06-07       Impact factor: 4.033

8.  Folding Membrane Proteins by Deep Transfer Learning.

Authors:  Sheng Wang; Zhen Li; Yizhou Yu; Jinbo Xu
Journal:  Cell Syst       Date:  2017-09-27       Impact factor: 10.304

9.  Learning sequence determinants of protein:protein interaction specificity with sparse graphical models.

Authors:  Hetunandan Kamisetty; Bornika Ghosh; Christopher James Langmead; Chris Bailey-Kellogg
Journal:  J Comput Biol       Date:  2015-05-14       Impact factor: 1.479

10.  Structure and Assembly of the Enterohemorrhagic Escherichia coli Type 4 Pilus.

Authors:  Benjamin Bardiaux; Gisele Cardoso de Amorim; Areli Luna Rico; Weili Zheng; Ingrid Guilvout; Camille Jollivet; Michael Nilges; Edward H Egelman; Nadia Izadi-Pruneyre; Olivera Francetic
Journal:  Structure       Date:  2019-05-02       Impact factor: 5.006

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.