
Performance evaluation of a new algorithm for the detection of remote homologs with sequence comparison.

Maricel G Kann, Richard A Goldstein.

Abstract

A detailed analysis of the performance of hybrid, a new sequence alignment algorithm developed by Yu and coworkers that combines Smith-Waterman local dynamic programming with a local version of the maximum-likelihood approach, was made to assess the applicability of this algorithm to the detection of distant homologs by sequence comparison. We analyzed the statistics of hybrid with a set of nonhomologous protein sequences from the SCOP database and found that the scores from the hybrid algorithm follow an Extreme Value Distribution with lambda approximately 1, as previously shown by Yu et al. for the case of artificially generated sequences. Local dynamic programming was compared to the hybrid algorithm by using two different test data sets of distant homologs from the PFAM and COGs protein sequence databases. The studies were made with several score functions in current use, including OPTIMA, a new score function originally developed to detect remote homologs with the Smith-Waterman algorithm. We found OPTIMA to be the best score function for both dynamic programming and the hybrid algorithm. The ability of dynamic programming to discriminate between homologs and nonhomologs in the two sets of distantly related sequences is slightly better than that of the hybrid algorithm. The advantage of producing accurate score statistics with only a few simulations may outweigh the small differences in performance and make this new algorithm suitable for detection of homologs in conjunction with a wide range of score functions and gap penalties. Copyright 2002 Wiley-Liss, Inc.
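
As an illustration of what an Extreme Value Distribution with lambda approximately 1 implies for score significance, the sketch below converts an alignment score into a P-value using the standard Gumbel-type form P(S >= x) = 1 - exp(-K m n exp(-lambda x)). It is a minimal sketch, not the authors' code: the function name `evd_pvalue`, the default `K = 1.0`, and the sequence lengths in the usage note are hypothetical placeholders, and the abstract reports only that lambda is approximately 1.

```python
import math


def evd_pvalue(score, m, n, lam=1.0, K=1.0):
    """Probability of a chance score >= `score` under an extreme value distribution.

    score : alignment score, in the units in which `lam` was fitted
    m, n  : lengths of the two compared sequences
    lam   : EVD decay parameter (assumed 1.0, following the abstract's "lambda approximately 1")
    K     : EVD scale parameter (placeholder value; must be estimated for real use)
    """
    # Expected number of chance alignments scoring at least `score`.
    expected = K * m * n * math.exp(-lam * score)
    # 1 - exp(-expected), computed with expm1 so tiny P-values keep precision.
    return -math.expm1(-expected)
```

For example, `evd_pvalue(25.0, m=300, n=250)` gives the chance probability for a score of 25 between sequences of lengths 300 and 250 under these assumed parameters. With lambda fixed near 1, only K would remain to be estimated, which is one way to read the abstract's remark about needing only a few simulations.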


Year:  2002        PMID: 12112703     DOI: 10.1002/prot.10117

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  3 in total

1.  Finding Protein and Nucleotide Similarities with FASTA.

Authors:  William R Pearson
Journal:  Curr Protoc Bioinformatics       Date:  2016-03-24

2.  Novel type IV secretion system involved in propagation of genomic islands.

Authors:  Mario Juhas; Derrick W Crook; Ioanna D Dimopoulou; Gerton Lunter; Rosalind M Harding; David J P Ferguson; Derek W Hood
Journal:  J Bacteriol       Date:  2006-11-22       Impact factor: 3.490

3.  morFeus: a web-based program to detect remotely conserved orthologs using symmetrical best hits and orthology network scoring.

Authors:  Ines Wagner; Michael Volkmer; Malvika Sharan; Jose M Villaveces; Felix Oswald; Vineeth Surendranath; Bianca H Habermann
Journal:  BMC Bioinformatics       Date:  2014-08-06       Impact factor: 3.169

