
Improvement of protein secondary structure prediction using binary word encoding.

T Kawabata, J Doi.

Abstract

We propose a binary word encoding to improve protein secondary structure prediction. A binary word encoding encodes a local amino acid sequence as a binary word, a string of 0s and 1s, using an encoding function that maps each amino acid to 0 or 1. With this encoding, we can statistically extract multiresidue information, that is, information that depends on more than one residue. We combine the binary word encoding with the GOR method, with a modified version of GOR that shows better accuracy, and with the neural network method. The binary word encoding improves the accuracy of GOR by 2.8%. We obtain a similar improvement when we combine it with the modified GOR method and the neural network method. When we use multiple sequence alignment data, the binary word encoding similarly improves the accuracy. The accuracy of our best combined method is 68.2%. Because this paper shows improvement only for the GOR and neural network methods, we cannot say that the encoding improves other methods. Nevertheless, the improvement suggests that multiresidue interaction affects the formation of secondary structure. In addition, we find that the optimal encoding function obtained by simulated annealing relates to nonpolarity. This means that nonpolarity is important to the multiresidue interaction.
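
The encoding step described in the abstract can be illustrated with a minimal Python sketch. Everything named here is an assumption for illustration: the residue set `NONPOLAR`, the window width of 5, and the helper names `encode_residue`, `binary_word`, and `word_state_counts` do not come from the paper. The abstract says only that the optimized encoding function relates to nonpolarity; it does not give the function itself, and the combination with GOR or a neural network is not reproduced.

```python
from collections import Counter

# Hypothetical encoding function: 1 for nonpolar residues, 0 otherwise.
# This residue set is an illustrative assumption, not the paper's
# optimized function.
NONPOLAR = set("AVLIMFWC")


def encode_residue(aa: str) -> int:
    """Map one amino acid (one-letter code) to 0 or 1."""
    return 1 if aa in NONPOLAR else 0


def binary_word(window: str) -> str:
    """Encode a local amino acid sequence as a string of 0s and 1s."""
    return "".join(str(encode_residue(aa)) for aa in window)


def word_state_counts(sequence: str, states: str, width: int = 5) -> Counter:
    """Count how often each binary word co-occurs with the secondary
    structure state of the window's central residue.

    `states` holds one label per residue (for example H, E, or C).
    The window width is an assumed value.
    """
    half = width // 2
    counts = Counter()
    for i in range(half, len(sequence) - half):
        word = binary_word(sequence[i - half:i + half + 1])
        counts[(word, states[i])] += 1
    return counts
```

Counts of this kind are one possible way to collect statistics that depend on several residues at once; how the paper turns such statistics into a prediction is not specified in the abstract.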

Year:  1997        PMID: 9037710     DOI: 10.1002/(sici)1097-0134(199701)27:1<36::aid-prot5>3.0.co;2-l

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  4 in total

1.  Cascaded multiple classifiers for secondary structure prediction.

Authors:  M Ouali; R D King
Journal:  Protein Sci       Date:  2000-06       Impact factor: 6.725

2.  Deciphering the structural code for proteins: helical propensities in domain classes and statistical multiresidue information in alpha-helices.

Authors:  J A Negrete; Y Viñuales; J Palau
Journal:  Protein Sci       Date:  1998-06       Impact factor: 6.725

3.  Grass carp reovirus-GD108 fiber protein is involved in cell attachment.

Authors:  Yuanyuan Tian; Zhenzhen Jiao; Junjian Dong; Chengfei Sun; Xiaoyan Jiang; Xing Ye
Journal:  Virus Genes       Date:  2017-05-26       Impact factor: 2.332

4.  Profile conditional random fields for modeling protein families with structural information.

Authors:  Akira R Kinjo
Journal:  Biophysics (Nagoya-shi)       Date:  2009-05-30
