
Distribution of Indel lengths.

B Qian, R A Goldstein.

Abstract

Protein sequence alignment has become a widely used method in the study of newly sequenced proteins. Most sequence alignment methods use an affine gap penalty to assign scores to insertions and deletions. Although affine gap penalties represent the relative ease of extending a gap compared with initiating one, they are still an obvious oversimplification of the real processes that occur during sequence evolution. To improve the efficiency of sequence alignment methods and to obtain a better understanding of the process of sequence evolution, we wanted to find a more accurate model of insertions and deletions in homologous proteins. In this work, we extract the probability of a gap occurrence and the resulting gap length distribution in distantly related proteins (sequence identity < 25%) using alignments based on their common structures. We observe a distribution of gaps that can be fitted with a multiexponential with four distinct components. The results suggest new approaches to modeling insertions and deletions in sequence alignments. Copyright 2001 Wiley-Liss, Inc.
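To make the contrast in the abstract concrete, the sketch below compares an affine gap penalty with a penalty derived from a four-component multiexponential gap-length distribution. It is only an illustration under stated assumptions: the function names, the opening and extension costs, and the weights and decay lengths are all hypothetical placeholders, not the values fitted in the paper, and treating the penalty as the negative log of the length probability is one common convention rather than the authors' stated method.

```python
import math


def affine_gap_penalty(length, gap_open=10.0, gap_extend=1.0):
    """Affine penalty: a fixed opening cost plus a constant cost per extra residue.

    The default costs are illustrative; conventions for counting the first
    gapped position differ between alignment programs.
    """
    if length < 1:
        raise ValueError("gap length must be >= 1")
    return gap_open + gap_extend * (length - 1)


def multiexp_gap_probability(length, weights, decay_lengths):
    """Probability of a gap of the given length under a mixture of exponentials.

    Each component is a discretised exponential (a geometric distribution)
    with ratio r = exp(-1 / decay_length). The mixture is normalised over
    length = 1, 2, ... when the weights sum to 1.
    """
    if length < 1:
        raise ValueError("gap length must be >= 1")
    total = 0.0
    for weight, decay_length in zip(weights, decay_lengths):
        r = math.exp(-1.0 / decay_length)
        total += weight * (1.0 - r) * r ** (length - 1)
    return total


def multiexp_gap_penalty(length, weights, decay_lengths):
    """Negative log-probability of the gap length, usable as a length-dependent cost."""
    return -math.log(multiexp_gap_probability(length, weights, decay_lengths))


# Hypothetical parameters chosen only for illustration (NOT the paper's fit):
# four components, from short frequent gaps to rare long ones.
WEIGHTS = (0.55, 0.30, 0.10, 0.05)
DECAY_LENGTHS = (1.0, 3.0, 10.0, 40.0)

if __name__ == "__main__":
    for length in (1, 2, 5, 10, 20, 50):
        affine = affine_gap_penalty(length)
        multiexp = multiexp_gap_penalty(length, WEIGHTS, DECAY_LENGTHS)
        print(f"L={length:3d}  affine={affine:6.2f}  multiexp={multiexp:6.2f}")
```

With a single component, the negative log-probability grows by the same amount for every added residue, which is exactly an affine penalty. With several components, the cost of each additional residue shrinks as the gap lengthens, so long gaps are penalised less harshly than an affine scheme would penalise them.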

Year:  2001        PMID: 11536366     DOI: 10.1002/prot.1129

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  22 in total

1.  Evolution's cauldron: duplication, deletion, and rearrangement in the mouse and human genomes.

Authors:  W James Kent; Robert Baertsch; Angie Hinrichs; Webb Miller; David Haussler
Journal:  Proc Natl Acad Sci U S A       Date:  2003-09-19       Impact factor: 11.205

2.  Frequency of gaps observed in a structurally aligned protein pair database suggests a simple gap penalty function.

Authors:  Nalin C W Goonesekere; Byungkook Lee
Journal:  Nucleic Acids Res       Date:  2004-05-20       Impact factor: 16.971

3.  Analysis of protein homology by assessing the (dis)similarity in protein loop regions.

Authors:  Anna R Panchenko; Thomas Madej
Journal:  Proteins       Date:  2004-11-15

4.  The interface of protein structure, protein biophysics, and molecular evolution. (Review)

Authors:  David A Liberles; Sarah A Teichmann; Ivet Bahar; Ugo Bastolla; Jesse Bloom; Erich Bornberg-Bauer; Lucy J Colwell; A P Jason de Koning; Nikolay V Dokholyan; Julian Echave; Arne Elofsson; Dietlind L Gerloff; Richard A Goldstein; Johan A Grahnen; Mark T Holder; Clemens Lakner; Nicholas Lartillot; Simon C Lovell; Gavin Naylor; Tina Perica; David D Pollock; Tal Pupko; Lynne Regan; Andrew Roger; Nimrod Rubinstein; Eugene Shakhnovich; Kimmen Sjölander; Shamil Sunyaev; Ashley I Teufel; Jeffrey L Thorne; Joseph W Thornton; Daniel M Weinreich; Simon Whelan
Journal:  Protein Sci       Date:  2012-04-23       Impact factor: 6.725

5.  An information theoretic approach to macromolecular modeling: I. Sequence alignments.

Authors:  Tiba Aynechi; Irwin D Kuntz
Journal:  Biophys J       Date:  2005-11       Impact factor: 4.033

6.  The nature of protein domain evolution: shaping the interaction network.

Authors:  Christoph P Bagowski; Wouter Bruins; Aartjan J W Te Velthuis
Journal:  Curr Genomics       Date:  2010-08       Impact factor: 2.236

7.  Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features.

Authors:  Timo Lassmann; Oliver Frings; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2008-12-22       Impact factor: 16.971

8.  INDELible: a flexible simulator of biological sequence evolution.

Authors:  William Fletcher; Ziheng Yang
Journal:  Mol Biol Evol       Date:  2009-05-07       Impact factor: 16.240

9.  Biological sequence simulation for testing complex evolutionary hypotheses: indel-Seq-Gen version 2.0.

Authors:  Cory L Strope; Kevin Abel; Stephen D Scott; Etsuko N Moriyama
Journal:  Mol Biol Evol       Date:  2009-08-03       Impact factor: 16.240

10.  Linking fold, function and phylogeny: a comparative genomics view on protein (domain) evolution.

Authors:  Aartjan J W Te Velthuis; Christoph P Bagowski
Journal:  Curr Genomics       Date:  2008-04       Impact factor: 2.236
