Literature DB >> 8289235

Sequence alignment and penalty choice. Review of concepts, case studies and implications.

M Vingron1, M S Waterman.   

Abstract

Alignment algorithms to compare DNA or amino acid sequences are widely used tools in molecular biology. The algorithms depend on the setting of various parameters, most notably gap penalties. The effect that such parameters have on the resulting alignments is still poorly understood. This paper begins by reviewing two recent advances in algorithms and probability that enable us to take a new approach to this question. The first tool we introduce is a newly developed method to delineate efficiently all optimal alignments arising under all choices of parameters. The second tool comprises insights into the statistical behavior of optimal alignment scores. From this we gain a better understanding of the dependence of alignments on parameters in general. We propose novel criteria to detect biologically good alignments and highlight some specific features about the interaction between similarity matrices and gap penalties. To illustrate our analysis we present a detailed study of the comparison of two immunoglobulin sequences.

Mesh:

Substances:

Year:  1994        PMID: 8289235     DOI: 10.1016/s0022-2836(05)80006-3

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  40 in total

1.  The estimation of statistical parameters for local alignment score distributions.

Authors:  S F Altschul; R Bundschuh; R Olsen; T Hwa
Journal:  Nucleic Acids Res       Date:  2001-01-15       Impact factor: 16.971

Review 2.  Evolution of genes and taxa: a primer.

Authors:  J J Doyle; B S Gaut
Journal:  Plant Mol Biol       Date:  2000-01       Impact factor: 4.076

3.  Fold recognition by combining sequence profiles derived from evolution and from depth-dependent structural alignment of fragments.

Authors:  Hongyi Zhou; Yaoqi Zhou
Journal:  Proteins       Date:  2005-02-01

4.  An information theoretic approach to macromolecular modeling: I. Sequence alignments.

Authors:  Tiba Aynechi; Irwin D Kuntz
Journal:  Biophys J       Date:  2005-11       Impact factor: 4.033

5.  ProbCons: Probabilistic consistency-based multiple sequence alignment.

Authors:  Chuong B Do; Mahathi S P Mahabhashyam; Michael Brudno; Serafim Batzoglou
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

6.  Ab initio protein structure prediction using chunk-TASSER.

Authors:  Hongyi Zhou; Jeffrey Skolnick
Journal:  Biophys J       Date:  2007-05-11       Impact factor: 4.033

7.  Aligning sequences by minimum description length.

Authors:  John S Conery
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

8.  Protein structure prediction by pro-Sp3-TASSER.

Authors:  Hongyi Zhou; Jeffrey Skolnick
Journal:  Biophys J       Date:  2009-03-18       Impact factor: 4.033

9.  Fast multiple alignment of ungapped DNA sequences using information theory and a relaxation method.

Authors:  Thomas D Schneider; David N Mastronarde
Journal:  Discrete Appl Math       Date:  1996-12-01       Impact factor: 1.139

10.  Comparison of methods for searching protein sequence databases.

Authors:  W R Pearson
Journal:  Protein Sci       Date:  1995-06       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.