Literature DB >> 18832340

How well can the accuracy of comparative protein structure models be predicted?

David Eramian1, Narayanan Eswar, Min-Yi Shen, Andrej Sali.   

Abstract

Comparative structure models are available for two orders of magnitude more protein sequences than are experimentally determined structures. These models, however, suffer from two limitations that experimentally determined structures do not: They frequently contain significant errors, and their accuracy cannot be readily assessed. We have addressed the latter limitation by developing a protocol optimized specifically for predicting the Calpha root-mean-squared deviation (RMSD) and native overlap (NO3.5A) errors of a model in the absence of its native structure. In contrast to most traditional assessment scores that merely predict one model is more accurate than others, this approach quantifies the error in an absolute sense, thus helping to determine whether or not the model is suitable for intended applications. The assessment relies on a model-specific scoring function constructed by a support vector machine. This regression optimizes the weights of up to nine features, including various sequence similarity measures and statistical potentials, extracted from a tailored training set of models unique to the model being assessed: If possible, we use similarly sized models with the same fold; otherwise, we use similarly sized models with the same secondary structure composition. This protocol predicts the RMSD and NO3.5A errors for a diverse set of 580,317 comparative models of 6174 sequences with correlation coefficients (r) of 0.84 and 0.86, respectively, to the actual errors. This scoring function achieves the best correlation compared to 13 other tested assessment criteria that achieved correlations ranging from 0.35 to 0.71.

Mesh:

Substances:

Year:  2008        PMID: 18832340      PMCID: PMC2578807          DOI: 10.1110/ps.036061.108

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  69 in total

Review 1.  Comparative protein structure modeling of genes and genomes.

Authors:  M A Martí-Renom; A C Stuart; A Fiser; R Sánchez; F Melo; A Sali
Journal:  Annu Rev Biophys Biomol Struct       Date:  2000

2.  Systematic analysis of added-value in simple comparative models of protein structure.

Authors:  Suvobrata Chakravarty; Roberto Sanchez
Journal:  Structure       Date:  2004-08       Impact factor: 5.006

3.  Prediction of homology model quality with multivariate regression.

Authors:  Kristin Tøndel
Journal:  J Chem Inf Comput Sci       Date:  2004 Sep-Oct

4.  The ModFOLD server for the quality assessment of protein structural models.

Authors:  Liam J McGuffin
Journal:  Bioinformatics       Date:  2008-01-09       Impact factor: 6.937

5.  Ranking predicted protein structures with support vector regression.

Authors:  Jian Qiu; Will Sheffler; David Baker; William Stafford Noble
Journal:  Proteins       Date:  2008-05-15

6.  FragQA: predicting local fragment quality of a sequence-structure alignment.

Authors:  Xin Gao; Dongbo Bu; Shuai Cheng Li; Jinbo Xu; Ming Li
Journal:  Genome Inform       Date:  2007

7.  Comparative protein structure modeling using MODELLER.

Authors:  Narayanan Eswar; Ben Webb; Marc A Marti-Renom; M S Madhusudhan; David Eramian; Min-Yi Shen; Ursula Pieper; Andrej Sali
Journal:  Curr Protoc Protein Sci       Date:  2007-11

8.  The interpretation of protein structures: estimation of static accessibility.

Authors:  B Lee; F M Richards
Journal:  J Mol Biol       Date:  1971-02-14       Impact factor: 5.469

9.  Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features.

Authors:  W Kabsch; C Sander
Journal:  Biopolymers       Date:  1983-12       Impact factor: 2.505

10.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

View more
  62 in total

1.  Protein structure validation by generalized linear model root-mean-square deviation prediction.

Authors:  Anurag Bagaria; Victor Jaravine; Yuanpeng J Huang; Gaetano T Montelione; Peter Güntert
Journal:  Protein Sci       Date:  2012-01-04       Impact factor: 6.725

2.  VITAL NMR: using chemical shift derived secondary structure information for a limited set of amino acids to assess homology model accuracy.

Authors:  Michael C Brothers; Anna E Nesbitt; Michael J Hallock; Sanjeewa G Rupasinghe; Ming Tang; Jason Harris; Jerome Baudry; Mary A Schuler; Chad M Rienstra
Journal:  J Biomol NMR       Date:  2011-11-03       Impact factor: 2.835

3.  Sub-AQUA: real-value quality assessment of protein structure models.

Authors:  Yifeng David Yang; Preston Spratt; Hao Chen; Changsoon Park; Daisuke Kihara
Journal:  Protein Eng Des Sel       Date:  2010-06-04       Impact factor: 1.650

4.  Prediction of protease substrates using sequence and structure features.

Authors:  David T Barkan; Daniel R Hostetter; Sami Mahrus; Ursula Pieper; James A Wells; Charles S Craik; Andrej Sali
Journal:  Bioinformatics       Date:  2010-05-26       Impact factor: 6.937

5.  Unraveling the distinctive features of hemorrhagic and non-hemorrhagic snake venom metalloproteinases using molecular simulations.

Authors:  Raoni Almeida de Souza; Natalia Díaz; Ronaldo Alves Pinto Nagem; Rafaela Salgado Ferreira; Dimas Suárez
Journal:  J Comput Aided Mol Des       Date:  2015-12-16       Impact factor: 3.686

6.  Comparative protein structure modeling using Modeller.

Authors:  Ben Webb; Andrej Sali; Narayanan Eswar; Marc A Marti-Renom; M S Madhusudhan; David Eramian; Min-Yi Shen; Ursula Pieper
Journal:  Curr Protoc Bioinformatics       Date:  2006-10

7.  Functional architecture of MFS D-glucose transporters.

Authors:  M Gregor Madej; Linfeng Sun; Nieng Yan; H Ronald Kaback
Journal:  Proc Natl Acad Sci U S A       Date:  2014-02-03       Impact factor: 11.205

8.  Determinants of substrate and cation transport in the human Na+/dicarboxylate cotransporter NaDC3.

Authors:  Avner Schlessinger; Nina N Sun; Claire Colas; Ana M Pajor
Journal:  J Biol Chem       Date:  2014-05-07       Impact factor: 5.157

9.  GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains.

Authors:  David A Lee; Robert Rentzsch; Christine Orengo
Journal:  Nucleic Acids Res       Date:  2009-11-18       Impact factor: 16.971

10.  QMEANclust: estimation of protein model quality by combining a composite scoring function with structural density information.

Authors:  Pascal Benkert; Torsten Schwede; Silvio Ce Tosatto
Journal:  BMC Struct Biol       Date:  2009-05-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.