Literature DB >> 15769835

Fold recognition by combining profile-profile alignment and support vector machine.

Sangjo Han1, Byung-Chul Lee, Seung Taek Yu, Chan-Seok Jeong, Soyoung Lee, Dongsup Kim.   

Abstract

MOTIVATION: Currently, the most accurate fold-recognition method is to perform profile-profile alignments and estimate the statistical significances of those alignments by calculating Z-score or E-value. Although this scheme is reliable in recognizing relatively close homologs related at the family level, it has difficulty in finding the remote homologs that are related at the superfamily or fold level.
RESULTS: In this paper, we present an alternative method to estimate the significance of the alignments. The alignment between a query protein and a template of length n in the fold library is transformed into a feature vector of length n + 1, which is then evaluated by support vector machine (SVM). The output from SVM is converted to a posterior probability that a query sequence is related to a template, given SVM output. Results show that a new method shows significantly better performance than PSI-BLAST and profile-profile alignment with Z-score scheme. While PSI-BLAST and Z-score scheme detect 16 and 20% of superfamily-related proteins, respectively, at 90% specificity, a new method detects 46% of these proteins, resulting in more than 2-fold increase in sensitivity. More significantly, at the fold level, a new method can detect 14% of remotely related proteins at 90% specificity, a remarkable result considering the fact that the other methods can detect almost none at the same level of specificity.

Mesh:

Substances:

Year:  2005        PMID: 15769835     DOI: 10.1093/bioinformatics/bti384

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Functional annotation prediction: all for one and one for all.

Authors:  Ori Sasson; Noam Kaplan; Michal Linial
Journal:  Protein Sci       Date:  2006-05-02       Impact factor: 6.725

2.  RaptorX: exploiting structure information for protein alignment by statistical inference.

Authors:  Jian Peng; Jinbo Xu
Journal:  Proteins       Date:  2011-10-11

3.  Boosting Protein Threading Accuracy.

Authors:  Jian Peng; Jinbo Xu
Journal:  Res Comput Mol Biol       Date:  2009

4.  eThread: a highly optimized machine learning-based approach to meta-threading and the modeling of protein tertiary structures.

Authors:  Michal Brylinski; Daswanth Lingam
Journal:  PLoS One       Date:  2012-11-21       Impact factor: 3.240

5.  Conotoxin protein classification using free scores of words and support vector machines.

Authors:  Nazar Zaki; Stefan Wolfsheimer; Gregory Nuel; Sawsan Khuri
Journal:  BMC Bioinformatics       Date:  2011-05-29       Impact factor: 3.169

6.  Recognition of 27-class protein folds by adding the interaction of segments and motif information.

Authors:  Zhenxing Feng; Xiuzhen Hu
Journal:  Biomed Res Int       Date:  2014-07-21       Impact factor: 3.411

7.  Predicting and improving the protein sequence alignment quality by support vector regression.

Authors:  Minho Lee; Chan-seok Jeong; Dongsup Kim
Journal:  BMC Bioinformatics       Date:  2007-12-03       Impact factor: 3.169

8.  Application of nonnegative matrix factorization to improve profile-profile alignment features for fold recognition and remote homolog detection.

Authors:  Inkyung Jung; Jaehyung Lee; Soo-Young Lee; Dongsup Kim
Journal:  BMC Bioinformatics       Date:  2008-07-01       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.