Literature DB >> 9773344

Homology detection via family pairwise search.

W N Grundy1.   

Abstract

The function of an unknown biological sequence can often be accurately inferred by identifying sequences homologous to the original sequence. Given a query set of known homologs, there exist at least three general classes of techniques for finding additional homologs: pairwise sequence comparisons, motif analysis, and hidden Markov modeling. Pairwise sequence comparisons are typically employed when only a single query sequence is known. Hidden Markov models (HMMs), on the other hand, are usually trained with sets of more than 100 sequences. Motif-based methods fall in between these two extremes. The current work introduces a straightforward generalization of pairwise sequence comparison algorithms to the case when multiple query sequences are available. This algorithm, called Family Pairwise Search (FPS), combines pairwise sequence comparison scores from each query sequence. A BLAST implementation of FPS is compared to representative examples of hidden Markov modeling (HMMER) and motif modeling (MEME). The three techniques are compared across a wide range of protein families, using query sets of varying sizes. BLAST FPS significantly outperforms motif-based and HMM methods. Furthermore, FPS is much more efficient than the training algorithms for statistical models.

Mesh:

Substances:

Year:  1998        PMID: 9773344     DOI: 10.1089/cmb.1998.5.479

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  14 in total

1.  The complement of protein phosphatase catalytic subunits encoded in the genome of Arabidopsis.

Authors:  David Kerk; Joshua Bulgrien; Douglas W Smith; Brooke Barsam; Stella Veretnik; Michael Gribskov
Journal:  Plant Physiol       Date:  2002-06       Impact factor: 8.340

2.  Comparative homology agreement search: an effective combination of homology-search methods.

Authors:  Intikhab Alam; Andreas Dress; Marc Rehmsmeier; Georg Fuellen
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-14       Impact factor: 11.205

3.  Infernal 1.0: inference of RNA alignments.

Authors:  Eric P Nawrocki; Diana L Kolbe; Sean R Eddy
Journal:  Bioinformatics       Date:  2009-03-23       Impact factor: 6.937

4.  Defining the Domain Arrangement of the Mammalian Target of Rapamycin Complex Component Rictor Protein.

Authors:  Ping Zhou; Ning Zhang; Ruth Nussinov; Buyong Ma
Journal:  J Comput Biol       Date:  2015-07-15       Impact factor: 1.479

5.  Phamerator: a bioinformatic tool for comparative bacteriophage genomics.

Authors:  Steven G Cresawn; Matt Bogel; Nathan Day; Deborah Jacobs-Sera; Roger W Hendrix; Graham F Hatfull
Journal:  BMC Bioinformatics       Date:  2011-10-12       Impact factor: 3.169

6.  Constructing benchmark test sets for biological sequence analysis using independent set algorithms.

Authors:  Samantha Petti; Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2022-03-07       Impact factor: 4.475

7.  SIB-BLAST: a web server for improved delineation of true and false positives in PSI-BLAST searches.

Authors:  Marianne M Lee; Michael K Chan; Ralf Bundschuh
Journal:  Nucleic Acids Res       Date:  2009-05-08       Impact factor: 16.971

8.  Hidden Markov model speed heuristic and iterative HMM search procedure.

Authors:  L Steven Johnson; Sean R Eddy; Elon Portugaly
Journal:  BMC Bioinformatics       Date:  2010-08-18       Impact factor: 3.169

9.  Accelerated Profile HMM Searches.

Authors:  Sean R Eddy
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

10.  Increasing sequence search sensitivity with transitive alignments.

Authors:  Ketil Malde; Tomasz Furmanek
Journal:  PLoS One       Date:  2013-02-14       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.