Literature DB >> 14871874

A metric model of amino acid substitution.

Weijia Xu1, Daniel P Miranker.   

Abstract

MOTIVATION: We address the question of whether there exists an effective evolutionary model of amino-acid substitution that forms a metric-distance function. There is always a trade-off between speed and sensitivity among competing computational methods of determining sequence homology. A metric model of evolution is a prerequisite for the development of an entire class of fast sequence analysis algorithms that are both scalable, O(log n) and sensitive.
RESULTS: We have reworked the mathematics of the point accepted mutation model (PAM) by calculating the expected time between accepted mutations in lieu of calculating log-odds probabilities. The resulting substitution matrix (mPAM) forms a metric. We validate the application of the mPAM evolutionary model for sequence homology by executing sequence queries from a controlled yeast protein homology search benchmark. We compare the accuracy of the results of mPAM and PAM similarity matrices as well as three prior metric models. The experiment shows that mPAM significantly outperforms the other three metrics and sufficiently approaches the sensitivity of PAM250 to make it applicable to the management of protein sequence databases.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 14871874     DOI: 10.1093/bioinformatics/bth065

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  A collection of amino acid replacement matrices derived from clusters of orthologs.

Authors:  Rolf Olsen; William F Loomis
Journal:  J Mol Evol       Date:  2005-10-20       Impact factor: 2.395

2.  The influenza virus resource at the National Center for Biotechnology Information.

Authors:  Yiming Bao; Pavel Bolotov; Dmitry Dernovoy; Boris Kiryutin; Leonid Zaslavsky; Tatiana Tatusova; Jim Ostell; David Lipman
Journal:  J Virol       Date:  2007-10-17       Impact factor: 5.103

3.  A reduced amino acid alphabet for understanding and designing protein adaptation to mutation.

Authors:  C Etchebest; C Benros; A Bornot; A-C Camproux; A G de Brevern
Journal:  Eur Biophys J       Date:  2007-06-13       Impact factor: 1.733

4.  Ab initio detection of fuzzy amino acid tandem repeats in protein sequences.

Authors:  Marco Pellegrini; Maria Elena Renda; Alessio Vecchio
Journal:  BMC Bioinformatics       Date:  2012-03-21       Impact factor: 3.169

5.  Inconsistent distances in substitution matrices can be avoided by properly handling hydrophobic residues.

Authors:  J Baussand; A Carbone
Journal:  Evol Bioinform Online       Date:  2008-10-09       Impact factor: 1.625

6.  Amino acid "little Big Bang": representing amino acid substitution matrices as dot products of Euclidian vectors.

Authors:  Karel Zimmermann; Jean-François Gibrat
Journal:  BMC Bioinformatics       Date:  2010-01-04       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.