Efficient methods for estimating amino acid replacement rates.

Lars Arvestad

Abstract

Replacement rate matrices describe the process of evolution at one position in a protein and are used in many applications where proteins are studied from an evolutionary perspective. Several general matrices have been suggested and have proved to be good approximations of the real process. However, there are data for which general matrices are inappropriate, for example, special protein families, certain lineages in the tree of life, or particular parts of proteins. Analysis of such data could benefit from adaptation of a data-specific rate matrix. This paper suggests two new methods for estimating replacement rate matrices from independent pairwise protein sequence alignments and also carefully studies Müller-Vingron's resolvent method. Comprehensive tests on synthetic datasets show that both new methods perform better than the resolvent method in a variety of settings. The best method is furthermore demonstrated to be robust on small datasets as well as practical on very large datasets of real data. Neither short nor divergent sequence pairs have to be discarded, making the method economical with data. A generalization to multialignment data is suggested and used in a test on protein-domain family phylogenies, where it is shown that the method offers family-specific rate matrices that often have a significantly better likelihood than a general matrix.
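
For readers unfamiliar with the term, the following is a minimal sketch of what a replacement rate matrix is and how it yields replacement probabilities over an evolutionary distance t via P(t) = exp(Qt). It is not any of the estimation methods discussed in the abstract. It assumes a time-reversible model on a toy 3-letter alphabet instead of the 20 amino acids, and the exchangeabilities, frequencies, and function names are invented for illustration.

```python
# Toy illustration only: 3 states stand in for the 20 amino acids.
# All numbers and names below are assumptions made up for this example.

def build_rate_matrix(exchangeabilities, freqs):
    """Reversible rate matrix Q with q_ij = s_ij * pi_j for i != j."""
    n = len(freqs)
    q = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                q[i][j] = exchangeabilities[i][j] * freqs[j]
        # The diagonal is still 0 here, so this makes the row sum to zero.
        q[i][i] = -sum(q[i])
    return q


def mat_mul(a, b):
    """Plain square-matrix product on lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def transition_probabilities(q, t, squarings=10, terms=12):
    """Approximate P(t) = exp(Q t) by a Taylor series with scaling and squaring."""
    n = len(q)
    scale = t / (2 ** squarings)
    a = [[q[i][j] * scale for j in range(n)] for i in range(n)]
    identity = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    result = [row[:] for row in identity]
    term = [row[:] for row in identity]
    for k in range(1, terms + 1):
        # term becomes a^k / k!
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(n)]
                  for i in range(n)]
    # Undo the scaling: exp(Q t) = (exp(Q t / 2^s))^(2^s)
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


# Symmetric exchangeabilities and equilibrium frequencies (made-up values).
S = [[0.0, 1.0, 2.0],
     [1.0, 0.0, 0.5],
     [2.0, 0.5, 0.0]]
PI = [0.5, 0.3, 0.2]

Q = build_rate_matrix(S, PI)
P = transition_probabilities(Q, 0.5)

for row in P:
    print(["%.4f" % x for x in row])
```

Each row of `P` gives the probabilities that a given state is replaced by each state after distance 0.5. Estimating a data-specific matrix, as the abstract describes, amounts to finding the Q that best explains observed aligned residue pairs; this sketch only covers the forward direction from Q to P(t).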


Year:  2006        PMID: 16752207     DOI: 10.1007/s00239-004-0113-9

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


References (32 in total; 10 listed)

1.  Is there a phylogenetic signal in prokaryote proteins?

Authors:  S A Teichmann; G Mitchison
Journal:  J Mol Evol       Date:  1999-07       Impact factor: 2.395

2.  An expectation maximization algorithm for training hidden substitution models.

Authors:  I Holmes; G M Rubin
Journal:  J Mol Biol       Date:  2002-04-12       Impact factor: 5.469

3.  Rate matrices for analyzing large families of protein sequences.

Authors:  C Devauchelle; A Grossmann; A Hénaut; M Holschneider; M Monnerot; J L Risler; B Torrésani
Journal:  J Comput Biol       Date:  2001       Impact factor: 1.479

4.  Modeling amino acid replacement.

Authors:  T Müller; M Vingron
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

5.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

6.  A novel use of equilibrium frequencies in models of sequence evolution.

Authors:  Nick Goldman; Simon Whelan
Journal:  Mol Biol Evol       Date:  2002-11       Impact factor: 16.240

7.  The compositional adjustment of amino acid substitution matrices.

Authors:  Yi-Kuo Yu; John C Wootton; Stephen F Altschul
Journal:  Proc Natl Acad Sci U S A       Date:  2003-12-08       Impact factor: 11.205

8.  Exhaustive matching of the entire protein sequence database.

Authors:  G H Gonnet; M A Cohen; S A Benner
Journal:  Science       Date:  1992-06-05       Impact factor: 47.728

9.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

10.  A new method of inference of ancestral nucleotide and amino acid sequences.

Authors:  Z Yang; S Kumar; M Nei
Journal:  Genetics       Date:  1995-12       Impact factor: 4.562

Cited by (6 in total)

1.  A novel method for protein-protein interaction site prediction using phylogenetic substitution models.

Authors:  David La; Daisuke Kihara
Journal:  Proteins       Date:  2011-10-12

2.  Evolution of general transcription factors.

Authors:  K V Gunbin; A Ruvinsky
Journal:  J Mol Evol       Date:  2012-12-11       Impact factor: 2.395

3.  Molecular evolution of cyclin proteins in animals and fungi.

Authors:  Konstantin V Gunbin; Valentin V Suslov; Igor I Turnaev; Dmitry A Afonnikov; Nikolay A Kolchanov
Journal:  BMC Evol Biol       Date:  2011-07-28       Impact factor: 3.260

4.  Stress-induced changes in the expression of antioxidant system genes for rice (Oryza sativa L.) and bread wheat (Triticum aestivum L.).

Authors:  Anton Ermakov; Alexey Doroshkov; Aleksandr Bobrovskikh; Ulyana Zubairova; Dmitrii Konstantinov
Journal:  PeerJ       Date:  2019-11-29       Impact factor: 2.984

5.  Genome-Wide Insights Into the Organelle Translocation of Photosynthetic NDH-1 Genes During Evolution.

Authors:  Jie Yu; Zhaoxing Ran; Jingsong Zhang; Lanzhen Wei; Weimin Ma
Journal:  Front Microbiol       Date:  2022-07-13       Impact factor: 6.064

6.  Past and present giant viruses diversity explored through permafrost metagenomics.

Authors:  Sofia Rigou; Sébastien Santini; Chantal Abergel; Jean-Michel Claverie; Matthieu Legendre
Journal:  Nat Commun       Date:  2022-10-07       Impact factor: 17.694
