Literature DB >> 15446811

Prediction of homology model quality with multivariate regression.

Kristin Tøndel1.   

Abstract

A new method has been developed for prediction of homology model quality directly from the sequence alignment, using multivariate regression. Hence, the expected quality of future homology models can be estimated using only information about the primary structure. This method has been applied to protein kinases and can easily be extended to other protein families. Homology model quality for a reference set of homology models was verified by comparison to experimental structures, by calculation of root-mean-square deviations (RMSDs) and comparison of interresidue contact areas. The homology model quality measures were then used as dependent variables in a Partial Least Squares (PLS) regression, using a matrix of alignment score profiles found from the Point Accepted Mutation (PAM) 250 similarity matrix as independent variables. This resulted in a regression model that can be used to predict the accuracy of future homology models from the sequence alignment. Using this method, one can identify the target-template combinations that are most likely to give homology models of sufficient quality. Hence, this method can be used to effectively choose the optimal templates to use for the homology modeling. The method's ability to guide the choice of homology modeling templates was verified by comparison of success rates to those obtained using BLAST scores and target-template sequence identities, respectively. The results indicate that the method presented here performs best in choosing the optimal homology modeling templates. Using this method, the optimal template was chosen in 86% of the cases, as compared to 62% using BLAST scores, and 57% using sequence identities. The method presented here can also be used to identify regions of the protein structure that are difficult to model, as well as alignment errors. Hence, this method is a useful tool for ensuring that the best possible homology model is generated. Copyright 2004 American Chemical Society

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15446811     DOI: 10.1021/ci049924m

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  4 in total

1.  Protein structure validation by generalized linear model root-mean-square deviation prediction.

Authors:  Anurag Bagaria; Victor Jaravine; Yuanpeng J Huang; Gaetano T Montelione; Peter Güntert
Journal:  Protein Sci       Date:  2012-01-04       Impact factor: 6.725

2.  Sub-AQUA: real-value quality assessment of protein structure models.

Authors:  Yifeng David Yang; Preston Spratt; Hao Chen; Changsoon Park; Daisuke Kihara
Journal:  Protein Eng Des Sel       Date:  2010-06-04       Impact factor: 1.650

3.  How well can the accuracy of comparative protein structure models be predicted?

Authors:  David Eramian; Narayanan Eswar; Min-Yi Shen; Andrej Sali
Journal:  Protein Sci       Date:  2008-10-01       Impact factor: 6.725

4.  Preservation of protein clefts in comparative models.

Authors:  David Piedra; Sergi Lois; Xavier de la Cruz
Journal:  BMC Struct Biol       Date:  2008-01-16
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.