Literature DB >> 15281130

Combining evolutionary and structural information for local protein structure prediction.

Jimin Pei1, Nick V Grishin.   

Abstract

We study the effects of various factors in representing and combining evolutionary and structural information for local protein structural prediction based on fragment selection. We prepare databases of fragments from a set of non-redundant protein domains. For each fragment, evolutionary information is derived from homologous sequences and represented as estimated effective counts and frequencies of amino acids (evolutionary frequencies) at each position. Position-specific amino acid preferences called structural frequencies are derived from statistical analysis of discrete local structural environments in database structures. Our method for local structure prediction is based on ranking and selecting database fragments that are most similar to a target fragment. Using secondary structure type as a local structural property, we test our method in a number of settings. The major findings are: (1) the COMPASS-type scoring function for fragment similarity comparison gives better prediction accuracy than three other tested scoring functions for profile-profile comparison. We show that the COMPASS-type scoring function can be derived both in the probabilistic framework and in the framework of statistical potentials. (2) Using the evolutionary frequencies of database fragments gives better prediction accuracy than using structural frequencies. (3) Finer definition of local environments, such as including more side-chain solvent accessibility classes and considering the backbone conformations of neighboring residues, gives increasingly better prediction accuracy using structural frequencies. (4) Combining evolutionary and structural frequencies of database fragments, either in a linear fashion or using a pseudocount mixture formula, results in improvement of prediction accuracy. Combination at the log-odds score level is not as effective as combination at the frequency level. This suggests that there might be better ways of combining sequence and structural information than the commonly used linear combination of log-odds scores. Our method of fragment selection and frequency combination gives reasonable results of secondary structure prediction tested on 56 CASP5 targets (average SOV score 0.77), suggesting that it is a valid method for local protein structure prediction. Mixture of predicted structural frequencies and evolutionary frequencies improve the quality of local profile-to-profile alignment by COMPASS. Copyright 2004 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15281130     DOI: 10.1002/prot.20158

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  10 in total

1.  New assessment of a structural alphabet.

Authors:  Alexandre G de Brevern
Journal:  In Silico Biol       Date:  2005-03-16

2.  "Pinning strategy": a novel approach for predicting the backbone structure in terms of protein blocks from sequence.

Authors:  A G De Brevern; C Etchebest; C Benros; S Hazout
Journal:  J Biosci       Date:  2007-01       Impact factor: 1.826

3.  A new prediction strategy for long local protein structures using an original description.

Authors:  Aurélie Bornot; Catherine Etchebest; Alexandre G de Brevern
Journal:  Proteins       Date:  2009-08-15

Review 4.  From local structure to a global framework: recognition of protein folds.

Authors:  Agnel Praveen Joseph; Alexandre G de Brevern
Journal:  J R Soc Interface       Date:  2014-04-16       Impact factor: 4.118

5.  SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles.

Authors:  Eshel Faraggi; Tuo Zhang; Yuedong Yang; Lukasz Kurgan; Yaoqi Zhou
Journal:  J Comput Chem       Date:  2011-11-02       Impact factor: 3.376

6.  Convergent evolution in structural elements of proteins investigated using cross profile analysis.

Authors:  Kentaro Tomii; Yoshito Sawada; Shinya Honda
Journal:  BMC Bioinformatics       Date:  2012-01-16       Impact factor: 3.169

7.  MUMMALS: multiple sequence alignment improved by using hidden Markov models with local structural information.

Authors:  Jimin Pei; Nick V Grishin
Journal:  Nucleic Acids Res       Date:  2006-08-26       Impact factor: 16.971

8.  Improving prediction of secondary structure, local backbone angles, and solvent accessible surface area of proteins by iterative deep learning.

Authors:  Rhys Heffernan; Kuldip Paliwal; James Lyons; Abdollah Dehzangi; Alok Sharma; Jihua Wang; Abdul Sattar; Yuedong Yang; Yaoqi Zhou
Journal:  Sci Rep       Date:  2015-06-22       Impact factor: 4.379

9.  Estimates of statistical significance for comparison of individual positions in multiple sequence alignments.

Authors:  Ruslan I Sadreyev; Nick V Grishin
Journal:  BMC Bioinformatics       Date:  2004-08-05       Impact factor: 3.169

10.  In Silico Genetics Revealing 5 Mutations in CEBPA Gene Associated With Acute Myeloid Leukemia.

Authors:  Mujahed I Mustafa; Zainab O Mohammed; Naseem S Murshed; Nafisa M Elfadol; Abdelrahman H Abdelmoneim; Mohamed A Hassan
Journal:  Cancer Inform       Date:  2019-08-19
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.