Literature DB >> 10383814

Local sequence alignments with monotonic gap penalties.

R Mott1.   

Abstract

MOTIVATION: Sequence alignments obtained using affine gap penalties are not always biologically correct, because the insertion of long gaps is over-penalised. There is a need for an efficient algorithm which can find local alignments using non-linear gap penalties.
RESULTS: A dynamic programming algorithm is described which computes optimal local sequence alignments for arbitrary, monotonically increasing gap penalties, i.e. where the cost g(k) of inserting a gap of k symbols is such that g(k) >/= g(k-1). The running time of the algorithm is dependent on the scoring scheme; if the expected score of an alignment between random, unrelated sequences of lengths m, n is proportional to log mn, then with one exception, the algorithm has expected running time O(mn). Elsewhere, the running time is no greater than O(mn(m+n)). Optimisations are described which appear to reduce the worst-case run-time to O(mn) in many cases. We show how using a non-affine gap penalty can dramatically increase the probability of detecting a similarity containing a long gap. AVAILABILITY: The source code is available to academic collaborators under licence.

Mesh:

Year:  1999        PMID: 10383814     DOI: 10.1093/bioinformatics/15.6.455

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  7 in total

1.  Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function.

Authors:  Nalin C W Goonesekere; Byungkook Lee
Journal:  Nucleic Acids Res       Date:  2004-05-20       Impact factor: 16.971

2.  Ngila: global pairwise alignments with logarithmic and affine gap costs.

Authors:  Reed A Cartwright
Journal:  Bioinformatics       Date:  2007-03-25       Impact factor: 6.937

3.  ESTIMATING THE GUMBEL SCALE PARAMETER FOR LOCAL ALIGNMENT OF RANDOM SEQUENCES BY IMPORTANCE SAMPLING WITH STOPPING TIMES.

Authors:  Yonil Park; Sergey Sheetlin; John L Spouge
Journal:  Ann Stat       Date:  2009-12-01       Impact factor: 4.028

4.  The construction and use of log-odds substitution scores for multiple sequence alignment.

Authors:  Stephen F Altschul; John C Wootton; Elena Zaslavsky; Yi-Kuo Yu
Journal:  PLoS Comput Biol       Date:  2010-07-15       Impact factor: 4.475

5.  The Gumbel pre-factor k for gapped local alignment can be estimated from simulations of global alignment.

Authors:  Sergey Sheetlin; Yonil Park; John L Spouge
Journal:  Nucleic Acids Res       Date:  2005-09-06       Impact factor: 16.971

Review 6.  Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.

Authors:  Marco Pellegrini
Journal:  Front Bioeng Biotechnol       Date:  2015-09-24

7.  Modular and configurable optimal sequence alignment software: Cola.

Authors:  Neda Zamani; Görel Sundström; Marc P Höppner; Manfred G Grabherr
Journal:  Source Code Biol Med       Date:  2014-06-09
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.