Literature DB >> 10583400

Prediction of protein (domain) structural classes based on amino-acid index.

W S Bu1, Z P Feng, Z Zhang, C T Zhang.   

Abstract

A protein (domain) is usually classified into one of the following four structural classes: all-alpha, all-beta, alpha/beta and alpha + beta. In this paper, a new formulation is proposed to predict the structural class of a protein (domain) from its primary sequence. Instead of the amino-acid composition used widely in the previous structural class prediction work, the auto-correlation functions based on the profile of amino-acid index along the primary sequence of the query protein (domain) are used for the structural class prediction. Consequently, the overall predictive accuracy is remarkably improved. For the same training database consisting of 359 proteins (domains) and the same component-coupled algorithm [Chou, K.C. & Maggiora, G.M. (1998) Protein Eng. 11, 523-538], the overall predictive accuracy of the new method for the jackknife test is 5-7% higher than the accuracy based only on the amino-acid composition. The overall predictive accuracy finally obtained for the jackknife test is as high as 90.5%, implying that a significant improvement has been achieved by making full use of the information contained in the primary sequence for the class prediction. This improvement depends on the size of the training database, the auto-correlation functions selected and the amino-acid index used. We have found that the amino-acid index proposed by Oobatake and Ooi, i.e. the average nonbonded energy per residue, leads to the optimal predictive result in the case for the database sets studied in this paper. This study may be considered as an alternative step towards making the structural class prediction more practical.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10583400     DOI: 10.1046/j.1432-1327.1999.00947.x

Source DB:  PubMed          Journal:  Eur J Biochem        ISSN: 0014-2956


  8 in total

1.  Accurate prediction of protein structural class.

Authors:  Xia-Yu Xia; Meng Ge; Zhi-Xin Wang; Xian-Ming Pan
Journal:  PLoS One       Date:  2012-06-19       Impact factor: 3.240

2.  Prediction of enzymes and non-enzymes from protein sequences based on sequence derived features and PSSM matrix using artificial neural network.

Authors:  Pradeep Kumar Naik; Viplav Shankar Mishra; Mukul Gupta; Kunal Jaiswal
Journal:  Bioinformation       Date:  2007-12-05

3.  Identification of Cancerlectins Using Support Vector Machines With Fusion of G-Gap Dipeptide.

Authors:  Lili Qian; Yaping Wen; Guosheng Han
Journal:  Front Genet       Date:  2020-04-03       Impact factor: 4.599

Review 4.  Folding by numbers: primary sequence statistics and their use in studying protein folding.

Authors:  Brent Wathen; Zongchao Jia
Journal:  Int J Mol Sci       Date:  2009-04-08       Impact factor: 6.208

5.  Large-scale prediction of long disordered regions in proteins using random forests.

Authors:  Pengfei Han; Xiuzhen Zhang; Raymond S Norton; Zhi-Ping Feng
Journal:  BMC Bioinformatics       Date:  2009-01-07       Impact factor: 3.169

6.  Application of amino acid occurrence for discriminating different folding types of globular proteins.

Authors:  Y-h Taguchi; M Michael Gromiha
Journal:  BMC Bioinformatics       Date:  2007-10-22       Impact factor: 3.169

7.  Protein structural class prediction based on an improved statistical strategy.

Authors:  Fei Gu; Hang Chen; Jun Ni
Journal:  BMC Bioinformatics       Date:  2008-05-28       Impact factor: 3.169

8.  SCPRED: accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences.

Authors:  Lukasz Kurgan; Krzysztof Cios; Ke Chen
Journal:  BMC Bioinformatics       Date:  2008-05-01       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.