Literature DB >> 14988126

Protein homology detection using string alignment kernels.

Hiroto Saigo1, Jean-Philippe Vert, Nobuhisa Ueda, Tatsuya Akutsu.   

Abstract

MOTIVATION: Remote homology detection between protein sequences is a central problem in computational biology. Discriminative methods involving support vector machines (SVMs) are currently the most effective methods for the problem of superfamily recognition in the Structural Classification Of Proteins (SCOP) database. The performance of SVMs depends critically on the kernel function used to quantify the similarity between sequences.
RESULTS: We propose new kernels for strings adapted to biological sequences, which we call local alignment kernels. These kernels measure the similarity between two sequences by summing up scores obtained from local alignments with gaps of the sequences. When tested in combination with SVM on their ability to recognize SCOP superfamilies on a benchmark dataset, the new kernels outperform state-of-the-art methods for remote homology detection. AVAILABILITY: Software and data available upon request.

Mesh:

Substances:

Year:  2004        PMID: 14988126     DOI: 10.1093/bioinformatics/bth141

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  60 in total

1.  Functional census of mutation sequence spaces: the example of p53 cancer rescue mutants.

Authors:  Samuel A Danziger; S Joshua Swamidass; Jue Zeng; Lawrence R Dearth; Qiang Lu; Jonathan H Chen; Jianlin Cheng; Vinh P Hoang; Hiroto Saigo; Ray Luo; Pierre Baldi; Rainer K Brachmann; Richard H Lathrop
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2006 Apr-Jun       Impact factor: 3.710

2.  Predicting flexible length linear B-cell epitopes.

Authors:  Yasser El-Manzalawy; Drena Dobbs; Vasant Honavar
Journal:  Comput Syst Bioinformatics Conf       Date:  2008

3.  Machine learning assisted design of highly active peptides for drug discovery.

Authors:  Sébastien Giguère; François Laviolette; Mario Marchand; Denise Tremblay; Sylvain Moineau; Xinxia Liang; Éric Biron; Jacques Corbeil
Journal:  PLoS Comput Biol       Date:  2015-04-07       Impact factor: 4.475

Review 4.  Machine learning for in silico virtual screening and chemical genomics: new strategies.

Authors:  Jean-Philippe Vert; Laurent Jacob
Journal:  Comb Chem High Throughput Screen       Date:  2008-09       Impact factor: 1.339

5.  The proteins of intra-nuclear bodies: a data-driven analysis of sequence, interaction and expression.

Authors:  Nurul Mohamad; Mikael Bodén
Journal:  BMC Syst Biol       Date:  2010-04-13

6.  Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Authors:  Bobbie-Jo M Webb-Robertson; Kyle G Ratuiste; Christopher S Oehmen
Journal:  BMC Bioinformatics       Date:  2010-03-19       Impact factor: 3.169

7.  Genome-wide searching with base-pairing kernel functions for noncoding RNAs: computational and expression analysis of snoRNA families in Caenorhabditis elegans.

Authors:  Kensuke Morita; Yutaka Saito; Kengo Sato; Kotaro Oka; Kohji Hotta; Yasubumi Sakakibara
Journal:  Nucleic Acids Res       Date:  2009-01-07       Impact factor: 16.971

8.  Predicting linear B-cell epitopes using string kernels.

Authors:  Yasser El-Manzalawy; Drena Dobbs; Vasant Honavar
Journal:  J Mol Recognit       Date:  2008 Jul-Aug       Impact factor: 2.137

9.  Protein-protein interaction based on pairwise similarity.

Authors:  Nazar Zaki; Sanja Lazarova-Molnar; Wassim El-Hajj; Piers Campbell
Journal:  BMC Bioinformatics       Date:  2009-05-17       Impact factor: 3.169

10.  LipocalinPred: a SVM-based method for prediction of lipocalins.

Authors:  Jayashree Ramana; Dinesh Gupta
Journal:  BMC Bioinformatics       Date:  2009-12-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.