A novel approach to remote homology detection: jumping alignments.

Rainer Spang, Marc Rehmsmeier, Jens Stoye.

Abstract

We describe a new algorithm for protein classification and the detection of remote homologs. The rationale is to exploit both vertical and horizontal information of a multiple alignment in a well-balanced manner. This is in contrast to established methods such as profiles and profile hidden Markov models which focus on vertical information as they model the columns of the alignment independently and to family pairwise search which focuses on horizontal information as it treats given sequences separately. In our setting, we want to select from a given database of "candidate sequences" those proteins that belong to a given superfamily. In order to do so, each candidate sequence is separately tested against a multiple alignment of the known members of the superfamily by means of a new jumping alignment algorithm. This algorithm is an extension of the Smith-Waterman algorithm and computes a local alignment of a single sequence and a multiple alignment. In contrast to traditional methods, however, this alignment is not based on a summary of the individual columns of the multiple alignment. Rather, the candidate sequence is at each position aligned to one sequence of the multiple alignment, called the "reference sequence." In addition, the reference sequence may change within the alignment, while each such jump is penalized. To evaluate the discriminative quality of the jumping alignment algorithm, we compare it to profiles, profile hidden Markov models, and family pairwise search on a subset of the SCOP database of protein domains. The discriminative quality is assessed by median false positive counts (med-FP-counts). For moderate med-FP-counts, the number of successful searches with our method is considerably higher than with the competing methods.
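
The recurrence behind such a jumping alignment can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `jumping_alignment_score`, the match/mismatch scores, the linear gap cost, and the flat jump cost are all hypothetical choices, since the abstract does not specify the scoring scheme. The sketch extends the Smith-Waterman table by one index for the current reference row and lets a path switch rows at the price of one jump penalty.

```python
def jumping_alignment_score(candidate, alignment,
                            match=2, mismatch=-1, gap=2, jump=3):
    """Best local score of `candidate` against the rows of `alignment`.

    `alignment` is a list of equal-length strings ('-' marks a gap).
    All scoring parameters are illustrative assumptions.
    """
    n, m, rows = len(candidate), len(alignment[0]), len(alignment)
    # D[i][j][k]: best score of a local alignment ending at candidate
    # position i and alignment column j, with row k as reference sequence.
    D = [[[0] * rows for _ in range(m + 1)] for _ in range(n + 1)]
    # best[i][j]: maximum of D[i][j][k] over all rows k.
    best = [[0] * (m + 1) for _ in range(n + 1)]
    top = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            for k in range(rows):
                # A predecessor cell is entered either on the same row
                # or from the best other row after paying the jump cost.
                diag = max(D[i - 1][j - 1][k], best[i - 1][j - 1] - jump)
                up = max(D[i - 1][j][k], best[i - 1][j] - jump)
                left = max(D[i][j - 1][k], best[i][j - 1] - jump)
                ref = alignment[k][j - 1]
                if ref == '-':
                    sub = -gap          # residue placed against a gap
                elif ref == candidate[i - 1]:
                    sub = match
                else:
                    sub = mismatch
                # Local alignment: scores never drop below zero.
                D[i][j][k] = max(0, diag + sub, up - gap, left - gap)
            best[i][j] = max(D[i][j])
            top = max(top, best[i][j])
    return top


# Toy example: neither row matches the candidate end to end,
# but one jump from row 0 to row 1 joins the two matching halves.
toy_alignment = ["ACDXXX", "XXXEFG"]
score_with_jump = jumping_alignment_score("ACDEFG", toy_alignment)
score_no_jump = jumping_alignment_score("ACDEFG", toy_alignment, jump=100)
```

With a single row in `alignment` the recurrence reduces to ordinary Smith-Waterman with linear gap costs. A prohibitively large `jump` value makes the result the best pairwise local score against any single row, while a jump cost of zero lets the candidate pick the best-matching row at every position.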

Year:  2002        PMID: 12487762     DOI: 10.1089/106652702761034172

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  16 in total

1.  HIV classification using the coalescent theory.

Authors:  Ingo Bulla; Anne-Kathrin Schultz; Fabian Schreiber; Ming Zhang; Thomas Leitner; Bette Korber; Burkhard Morgenstern; Mario Stanke
Journal:  Bioinformatics       Date:  2010-04-16       Impact factor: 6.937

2.  Homology and phylogeny and their automated inference. (Review)

Authors:  Georg Fuellen
Journal:  Naturwissenschaften       Date:  2008-02-21

3.  Pareto optimization in algebraic dynamic programming.

Authors:  Cédric Saule; Robert Giegerich
Journal:  Algorithms Mol Biol       Date:  2015-07-07       Impact factor: 1.405

4.  Probabilistic inference of viral quasispecies subject to recombination.

Authors:  Armin Töpfer; Osvaldo Zagordi; Sandhya Prabhakaran; Volker Roth; Eran Halperin; Niko Beerenwinkel
Journal:  J Comput Biol       Date:  2013-02       Impact factor: 1.479

5.  jpHMM: recombination analysis in viruses with circular genomes such as the hepatitis B virus.

Authors:  Anne-Kathrin Schultz; Ingo Bulla; Mariama Abdou-Chekaraou; Emmanuel Gordien; Burkhard Morgenstern; Fabien Zoaulim; Paul Dény; Mario Stanke
Journal:  Nucleic Acids Res       Date:  2012-05-16       Impact factor: 16.971

6.  TCRep 3D: an automated in silico approach to study the structural properties of TCR repertoires.

Authors:  Antoine Leimgruber; Mathias Ferber; Melita Irving; Hamid Hussain-Kahn; Sébastien Wieckowski; Laurent Derré; Nathalie Rufer; Vincent Zoete; Olivier Michielin
Journal:  PLoS One       Date:  2011-10-28       Impact factor: 3.240

7.  Classification of HIV-1 sequences using profile Hidden Markov Models.

Authors:  Sanjiv K Dwivedi; Supratim Sengupta
Journal:  PLoS One       Date:  2012-05-18       Impact factor: 3.240

8.  A sequence sub-sampling algorithm increases the power to detect distant homologues.

Authors:  Catrióna R Johnston; Denis C Shields
Journal:  Nucleic Acids Res       Date:  2005-07-08       Impact factor: 16.971

9.  Improving accuracy of multiple sequence alignment algorithms based on alignment of neighboring residues.

Authors:  Yue Lu; Sing-Hoi Sze
Journal:  Nucleic Acids Res       Date:  2008-12-04       Impact factor: 16.971

10.  jpHMM: improving the reliability of recombination prediction in HIV-1.

Authors:  Anne-Kathrin Schultz; Ming Zhang; Ingo Bulla; Thomas Leitner; Bette Korber; Burkhard Morgenstern; Mario Stanke
Journal:  Nucleic Acids Res       Date:  2009-05-14       Impact factor: 16.971
