Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function.

Nalin C W Goonesekere, Byungkook Lee.

Abstract

Gap penalty is an important component of the scoring scheme that is needed when searching for homologous proteins and for accurate alignment of protein sequences. Most homology search and sequence alignment algorithms employ a heuristic 'affine gap penalty' scheme q + r × n, in which q is the penalty for opening a gap, r the penalty for extending it and n the gap length. In order to devise a more rational scoring scheme, we examined the pattern of gaps that occur in a database of structurally aligned protein domain pairs. We find that the logarithm of the frequency of gaps varies linearly with the length of the gap, but with a break at a gap of length 3, and is well approximated by two linear regression lines with R² values of 1.0 and 0.99. The bilinear behavior is retained when gaps are categorized by secondary structures of the two residues flanking the gap. Similar results were obtained when another, totally independent, structurally aligned protein pair database was used. These results suggest a modification of the affine gap penalty function.
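
The abstract does not spell out the modified function, so the following Python sketch is only one possible reading of it. It compares the standard affine penalty q + r × n with a hypothetical two-slope ("bilinear") penalty whose slope changes at gap length 3. The function names, the parameters `r_short` and `r_long`, the choice to make the two segments meet at the break, and all numeric values are illustrative assumptions, not values or definitions taken from the paper.

```python
def affine_gap_penalty(n, q, r):
    """Standard affine penalty q + r * n for a gap of length n >= 1."""
    if n < 1:
        raise ValueError("gap length must be at least 1")
    return q + r * n


def bilinear_gap_penalty(n, q, r_short, r_long, break_len=3):
    """Hypothetical two-slope penalty (an assumption, not the paper's formula).

    Gaps up to break_len are extended at cost r_short per residue;
    each residue beyond break_len costs r_long. The two segments are
    assumed to meet at break_len, so the penalty has no jump there.
    """
    if n < 1:
        raise ValueError("gap length must be at least 1")
    if n <= break_len:
        return q + r_short * n
    return q + r_short * break_len + r_long * (n - break_len)


if __name__ == "__main__":
    # Illustrative parameter values only; they are not fitted to any data.
    q, r, r_short, r_long = 10.0, 1.0, 2.0, 0.5
    for n in range(1, 8):
        print(n, affine_gap_penalty(n, q, r),
              bilinear_gap_penalty(n, q, r_short, r_long))
```

With `r_short` equal to `r_long`, the two-slope function reduces to the affine one, which makes it easy to compare the two schemes inside an existing alignment routine.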

Year:  2004        PMID: 15155852      PMCID: PMC419611          DOI: 10.1093/nar/gkh610

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


