Literature DB >> 16188929

Profile-based direct kernels for remote homology detection and fold recognition.

Huzefa Rangwala1, George Karypis.   

Abstract

MOTIVATION: Protein remote homology detection is a central problem in computational biology. Supervised learning algorithms based on support vector machines are currently one of the most effective methods for remote homology detection. The performance of these methods depends on how the protein sequences are modeled and on the method used to compute the kernel function between them.
RESULTS: We introduce two classes of kernel functions that are constructed by combining sequence profiles with new and existing approaches for determining the similarity between pairs of protein sequences. These kernels are constructed directly from these explicit protein similarity measures and employ effective profile-to-profile scoring schemes for measuring the similarity between pairs of proteins. Experiments with remote homology detection and fold recognition problems show that these kernels are capable of producing results that are substantially better than those produced by all of the existing state-of-the-art SVM-based methods. In addition, the experiments show that these kernels, even when used in the absence of profiles, produce results that are better than those produced by existing non-profile-based schemes. AVAILABILITY: The programs for computing the various kernel functions are available on request from the authors.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16188929     DOI: 10.1093/bioinformatics/bti687

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

1.  Improved prediction of malaria degradomes by supervised learning with SVM and profile kernel.

Authors:  Rui Kuang; Jianying Gu; Hong Cai; Yufeng Wang
Journal:  Genetica       Date:  2008-12-06       Impact factor: 1.082

2.  A new prediction strategy for long local protein structures using an original description.

Authors:  Aurélie Bornot; Catherine Etchebest; Alexandre G de Brevern
Journal:  Proteins       Date:  2009-08-15

3.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

4.  TASSER_low-zsc: an approach to improve structure prediction using low z-score-ranked templates.

Authors:  Shashi B Pandit; Jeffrey Skolnick
Journal:  Proteins       Date:  2010-10

Review 5.  Machine learning for in silico virtual screening and chemical genomics: new strategies.

Authors:  Jean-Philippe Vert; Laurent Jacob
Journal:  Comb Chem High Throughput Screen       Date:  2008-09       Impact factor: 1.339

6.  Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Authors:  Bobbie-Jo M Webb-Robertson; Kyle G Ratuiste; Christopher S Oehmen
Journal:  BMC Bioinformatics       Date:  2010-03-19       Impact factor: 3.169

7.  Exploiting physico-chemical properties in string kernels.

Authors:  Nora C Toussaint; Christian Widmer; Oliver Kohlbacher; Gunnar Rätsch
Journal:  BMC Bioinformatics       Date:  2010-10-26       Impact factor: 3.169

8.  BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Authors:  Hong-Liang Li; Yi-He Pang; Bin Liu
Journal:  Nucleic Acids Res       Date:  2021-12-16       Impact factor: 16.971

9.  DescFold: a web server for protein fold recognition.

Authors:  Ren-Xiang Yan; Jing-Na Si; Chuan Wang; Ziding Zhang
Journal:  BMC Bioinformatics       Date:  2009-12-14       Impact factor: 3.169

10.  A discriminative method for protein remote homology detection and fold recognition combining Top-n-grams and latent semantic analysis.

Authors:  Bin Liu; Xiaolong Wang; Lei Lin; Qiwen Dong; Xuan Wang
Journal:  BMC Bioinformatics       Date:  2008-12-01       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.