Literature DB >> 15932901

HYPROSP II--a knowledge-based hybrid method for protein secondary structure prediction based on local prediction confidence.

Hsin-Nan Lin1, Jia-Ming Chang, Kuen-Pin Wu, Ting-Yi Sung, Wen-Lian Hsu.   

Abstract

MOTIVATION: In our previous approach, we proposed a hybrid method for protein secondary structure prediction called HYPROSP, which combined our proposed knowledge-based prediction algorithm PROSP and PSIPRED. The knowledge base constructed for PROSP contains small peptides together with their secondary structural information. The hybrid strategy of HYPROSP uses a global quantitative measure, match rate, to determine whether PROSP or PSIPRED is to be used for the prediction of a target protein. HYPROSP made slight improvement of Q(3) over PSIPRED because PROSP predicted well for proteins with match rate >80%. As the portion of proteins with match rate >80% is quite small and as the performance of PSIPRED also improves, the advantage of HYPROSP is diluted. To overcome this limitation and further improve the hybrid prediction method, we present in this paper a new hybrid strategy HYPROSP II that is based on a new quantitative measure called local match rate.
RESULTS: Local match rate indicates the amount of structural information that each amino acid can extract from the knowledge base. With the local match rate, we are able to define a confidence level of the PROSP prediction results for each amino acid. Our new hybrid approach, HYPROSP II, is proposed as follows: for each amino acid in a target protein, we combine the prediction results of PROSP and PSIPRED using a hybrid function defined on their respective confidence levels. Two datasets in nrDSSP and EVA are used to perform a 10-fold cross validation. The average Q(3) of HYPROSP II is 81.8% and 80.7% on nrDSSP and EVA datasets, respectively, which is 2.0% and 1.1% better than that of PSIPRED. For local structures with match rate >80%, the average Q(3) improvement is 4.4% on the nrDSSP dataset. The use of local match rate improves the accuracy better than global match rate. There has been a long history of attempts to improve secondary structure prediction. We believe that HYPROSP II has greatly utilized the power of peptide knowledge base and raised the prediction accuracy to a new high. The method we developed in this paper could have a profound effect on the general use of knowledge base techniques for various predictionalgorithms. AVAILABILITY: The Linux executable file of HYPROSP II, as well as both nrDSSP and EVA datasets can be downloaded from http://bioinformatics.iis.sinica.edu.tw/HYPROSPII/.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15932901     DOI: 10.1093/bioinformatics/bti524

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  A new prediction strategy for long local protein structures using an original description.

Authors:  Aurélie Bornot; Catherine Etchebest; Alexandre G de Brevern
Journal:  Proteins       Date:  2009-08-15

Review 2.  Aegerolysins: structure, function, and putative biological role.

Authors:  Sabina Berne; Ljerka Lah; Kristina Sepcić
Journal:  Protein Sci       Date:  2009-04       Impact factor: 6.725

3.  SPINE X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles.

Authors:  Eshel Faraggi; Tuo Zhang; Yuedong Yang; Lukasz Kurgan; Yaoqi Zhou
Journal:  J Comput Chem       Date:  2011-11-02       Impact factor: 3.376

4.  PSP_MCSVM: brainstorming consensus prediction of protein secondary structures using two-stage multiclass support vector machines.

Authors:  Piyali Chatterjee; Subhadip Basu; Mahantapas Kundu; Mita Nasipuri; Dariusz Plewczynski
Journal:  J Mol Model       Date:  2011-05-19       Impact factor: 1.810

5.  Protein subcellular localization prediction of eukaryotes using a knowledge-based approach.

Authors:  Hsin-Nan Lin; Ching-Tai Chen; Ting-Yi Sung; Shinn-Ying Ho; Wen-Lian Hsu
Journal:  BMC Bioinformatics       Date:  2009-12-03       Impact factor: 3.169

Review 6.  Template-based protein modeling: recent methodological advances.

Authors:  Pankaj R Daga; Ronak Y Patel; Robert J Doerksen
Journal:  Curr Top Med Chem       Date:  2010       Impact factor: 3.295

7.  Prediction of protein structural classes for low-homology sequences based on predicted secondary structure.

Authors:  Jian-Yi Yang; Zhen-Ling Peng; Xin Chen
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

8.  Prediction of protein secondary structures with a novel kernel density estimation based classifier.

Authors:  Darby Tien-Hao Chang; Yu-Yen Ou; Hao-Geng Hung; Meng-Han Yang; Chien-Yu Chen; Yen-Jen Oyang
Journal:  BMC Res Notes       Date:  2008-07-23

9.  Inferences from structural comparison: flexibility, secondary structure wobble and sequence alignment optimization.

Authors:  Gaihua Zhang; Zhen Su
Journal:  BMC Bioinformatics       Date:  2012-09-11       Impact factor: 3.169

10.  Comparison study on statistical features of predicted secondary structures for protein structural class prediction: From content to position.

Authors:  Qi Dai; Yan Li; Xiaoqing Liu; Yuhua Yao; Yunjie Cao; Pingan He
Journal:  BMC Bioinformatics       Date:  2013-05-04       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.