Literature DB >> 14630658

Efficient remote homology detection using local structure.

Yuna Hou1, Wynne Hsu, Mong Li Lee, Christopher Bystroff.   

Abstract

MOTIVATION: The function of an unknown biological sequence can often be accurately inferred if we are able to map this unknown sequence to its corresponding homologous family. At present, discriminative methods such as SVM-Fisher and SVM-pairwise, which combine support vector machine (SVM) and sequence similarity, are recognized as the most accurate methods, with SVM-pairwise being the most accurate. However, these methods typically encode sequence information into their feature vectors and ignore the structure information. They are also computationally inefficient. Based on these observations, we present an alternative method for SVM-based protein classification. Our proposed method, SVM-I-sites, utilizes structure similarity for remote homology detection. RESULT: We run experiments on the Structural Classification of Proteins 1.53 data set. The results show that SVM-I-sites is more efficient than SVM-pairwise. Further, we find that SVM-I-sites outperforms sequence-based methods such as PSI-BLAST, SAM, and SVM-Fisher while achieving a comparable performance with SVM-pairwise. AVAILABILITY: I-sites server is accessible through the web at http://www.bioinfo.rpi.edu. Programs are available upon request for academics. Licensing agreements are available for commercial interests. The framework of encoding local structure into feature vector is available upon request.

Mesh:

Substances:

Year:  2003        PMID: 14630658     DOI: 10.1093/bioinformatics/btg317

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  Descriptor-based protein remote homology identification.

Authors:  Ziding Zhang; Sunil Kochhar; Martin G Grigorov
Journal:  Protein Sci       Date:  2005-01-04       Impact factor: 6.725

2.  Global mapping of the protein structure space and application in structure-based inference of protein function.

Authors:  Jingtong Hou; Se-Ran Jun; Chao Zhang; Sung-Hou Kim
Journal:  Proc Natl Acad Sci U S A       Date:  2005-02-10       Impact factor: 11.205

3.  Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Authors:  Bobbie-Jo M Webb-Robertson; Kyle G Ratuiste; Christopher S Oehmen
Journal:  BMC Bioinformatics       Date:  2010-03-19       Impact factor: 3.169

4.  Using amino acid physicochemical distance transformation for fast protein remote homology detection.

Authors:  Bin Liu; Xiaolong Wang; Qingcai Chen; Qiwen Dong; Xun Lan
Journal:  PLoS One       Date:  2012-09-28       Impact factor: 3.240

5.  Classification of protein sequences by means of irredundant patterns.

Authors:  Matteo Comin; Davide Verzotto
Journal:  BMC Bioinformatics       Date:  2010-01-18       Impact factor: 3.169

6.  A discriminative method for family-based protein remote homology detection that combines inductive logic programming and propositional models.

Authors:  Juliana S Bernardes; Alessandra Carbone; Gerson Zaverucha
Journal:  BMC Bioinformatics       Date:  2011-03-23       Impact factor: 3.169

7.  Incorporation of local structural preference potential improves fold recognition.

Authors:  Yun Hu; Xiaoxi Dong; Aiping Wu; Yang Cao; Liqing Tian; Taijiao Jiang
Journal:  PLoS One       Date:  2011-02-18       Impact factor: 3.240

8.  Local protein structure prediction using discriminative models.

Authors:  Oliver Sander; Ingolf Sommer; Thomas Lengauer
Journal:  BMC Bioinformatics       Date:  2006-01-11       Impact factor: 3.169

9.  GISMO--gene identification using a support vector machine for ORF classification.

Authors:  Lutz Krause; Alice C McHardy; Tim W Nattkemper; Alfred Pühler; Jens Stoye; Folker Meyer
Journal:  Nucleic Acids Res       Date:  2006-12-14       Impact factor: 16.971

10.  A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis.

Authors:  Bin Liu; Xiaolong Wang; Lei Lin; Qiwen Dong; Xuan Wang
Journal:  BMC Bioinformatics       Date:  2008-12-01       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.