Literature DB >> 11301301

A new approach to sequence comparison: normalized sequence alignment.

A N Arslan1, O Eğecioğlu , P A Pevzner.   

Abstract

The Smith-Waterman algorithm for local sequence alignment is one of the most important techniques in computational molecular biology. This ingenious dynamic programming approach was designed to reveal the highly conserved fragments by discarding poorly conserved initial and terminal segments. However, the existing notion of local similarity has a serious flaw: it does not discard poorly conserved intermediate segments. The Smith-Waterman algorithm finds the local alignment with maximal score but it is unable to find local alignment with maximum degree of similarity (e.g. maximal percent of matches). Moreover, there is still no efficient algorithm that answers the following natural question: do two sequences share a (sufficiently long) fragment with more than 70% of similarity? As a result, the local alignment sometimes produces a mosaic of well-conserved fragments artificially connected by poorly-conserved or even unrelated fragments. This may lead to problems in comparison of long genomic sequences and comparative gene prediction as recently pointed out by Zhang et al. (Bioinformatics, 15, 1012-1019, 1999). In this paper we propose a new sequence comparison algorithm (normalized local alignment ) that reports the regions with maximum degree of similarity. The algorithm is based on fractional programming and its running time is O(n2log n). In practice, normalized local alignment is only 3-5 times slower than the standard Smith-Waterman algorithm.

Mesh:

Year:  2001        PMID: 11301301     DOI: 10.1093/bioinformatics/17.4.327

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  Minimal entropy probability paths between genome families.

Authors:  Calvin Ahlbrandt; Gary Benson; William Casey
Journal:  J Math Biol       Date:  2003-12-02       Impact factor: 2.259

2.  Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs.

Authors:  Bastien Chevreux; Thomas Pfisterer; Bernd Drescher; Albert J Driesel; Werner E G Müller; Thomas Wetter; Sándor Suhai
Journal:  Genome Res       Date:  2004-05-12       Impact factor: 9.043

3.  A pattern matching approach for the estimation of alignment between any two given DNA sequences.

Authors:  K Basu; N Sriraam; R J A Richard
Journal:  J Med Syst       Date:  2007-08       Impact factor: 4.460

Review 4.  The bioinformatics challenges in comparative analysis of cereal genomes-an overview.

Authors:  M Bellgard; Jia Ye; T Gojobori; R Appels
Journal:  Funct Integr Genomics       Date:  2004-02-10       Impact factor: 3.410

5.  iPARTS: an improved tool of pairwise alignment of RNA tertiary structures.

Authors:  Chih-Wei Wang; Kun-Tze Chen; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2010-05-27       Impact factor: 16.971

6.  A new measurement of sequence conservation.

Authors:  Xiaohui Cai; Haiyan Hu; Xiaoman Li
Journal:  BMC Genomics       Date:  2009-12-22       Impact factor: 3.969

7.  Accurate statistics for local sequence alignment with position-dependent scoring by rare-event sampling.

Authors:  Stefan Wolfsheimer; Inke Herms; Sven Rahmann; Alexander K Hartmann
Journal:  BMC Bioinformatics       Date:  2011-02-03       Impact factor: 3.169

8.  SARSA: a web tool for structural alignment of RNA using a structural alphabet.

Authors:  Yen-Fu Chang; Yen-Lin Huang; Chin Lung Lu
Journal:  Nucleic Acids Res       Date:  2008-05-23       Impact factor: 16.971

9.  Adjusting scoring matrices to correct overextended alignments.

Authors:  Lauren J Mills; William R Pearson
Journal:  Bioinformatics       Date:  2013-08-31       Impact factor: 6.937

10.  Analysis of 5' gene regions reveals extraordinary conservation of novel non-coding sequences in a wide range of animals.

Authors:  Nathaniel J Davies; Peter Krusche; Eran Tauber; Sascha Ott
Journal:  BMC Evol Biol       Date:  2015-10-19       Impact factor: 3.260

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.