Literature DB >> 11535203

On preprocessing of protein sequences for neural network prediction of polyproline type II secondary structures.

M Siermala1, M Juhola, M Vihinen.   

Abstract

Polyproline type II stretches are somewhat rare on proteins. The backbone of this secondary structural element folds to a triangular form instead of the normal alpha-helix with 3.6 residues per turn. It is a very challenging task to try to detect them computationally from protein sequence. Here, we have studied the preprocessing phase in particular, which is important for any machine learning method. Preprocessing included selection of relevant data from the Protein Data Bank and investigation of learnability properties. These properties show whether the material is suitable for neural network computing. The complexity of algorithms in connection with preprocessing was briefly considered. We found that feedforward perceptron neural networks were appropriate for the prediction of polyproline type II and also relatively efficient in this task. The problem is very difficult because of the great similarity of the two classes present in the classification. Nevertheless, neural networks were able to recognize and predict about 75% of secondary structures.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11535203     DOI: 10.1016/s0010-4825(01)00013-0

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  1 in total

1.  BERT-PPII: The Polyproline Type II Helix Structure Prediction Model Based on BERT and Multichannel CNN.

Authors:  Chuang Feng; Zhen Wang; Guokun Li; Xiaohan Yang; Nannan Wu; Lei Wang
Journal:  Biomed Res Int       Date:  2022-08-24       Impact factor: 3.246

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.