Literature DB >> 11319253

A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

S Whelan1, N Goldman.   

Abstract

Phylogenetic inference from amino acid sequence data uses mainly empirical models of amino acid replacement and is therefore dependent on those models. Two of the more widely used models, the Dayhoff and JTT models, are estimated using similar methods that can utilize large numbers of sequences from many unrelated protein families but are somewhat unsatisfactory because they rely on assumptions that may lead to systematic error and discard a large amount of the information within the sequences. The alternative method of maximum-likelihood estimation may utilize the information in the sequence data more efficiently and suffers from no systematic error, but it has previously been applicable to relatively few sequences related by a single phylogenetic tree. Here, we combine the best attributes of these two methods using an approximate maximum-likelihood method. We implemented this approach to estimate a new model of amino acid replacement from a database of globular protein sequences comprising 3,905 amino acid sequences split into 182 protein families. While the new model has an overall structure similar to those of other commonly used models, there are significant differences. The new model outperforms the Dayhoff and JTT models with respect to maximum-likelihood values for a large majority of the protein families in our database. This suggests that it provides a better overall fit to the evolutionary process in globular proteins and may lead to more accurate phylogenetic tree estimates. Potentially, this matrix, and the methods used to generate it, may also be useful in other areas of research, such as biological sequence database searching, sequence alignment, and protein structure prediction, for which an accurate description of amino acid replacement is required.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11319253     DOI: 10.1093/oxfordjournals.molbev.a003851

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  1127 in total

1.  Origin and evolution of circadian clock genes in prokaryotes.

Authors:  Volodymyr Dvornyk; Oxana Vinogradova; Eviatar Nevo
Journal:  Proc Natl Acad Sci U S A       Date:  2003-02-25       Impact factor: 11.205

2.  FastML: a web server for probabilistic reconstruction of ancestral sequences.

Authors:  Haim Ashkenazy; Osnat Penn; Adi Doron-Faigenboim; Ofir Cohen; Gina Cannarozzi; Oren Zomer; Tal Pupko
Journal:  Nucleic Acids Res       Date:  2012-05-31       Impact factor: 16.971

3.  Cross talk between the KNOX and ethylene pathways is mediated by intron-binding transcription factors in barley.

Authors:  Michela Osnato; Maria Rosaria Stile; Yamei Wang; Donaldo Meynard; Serena Curiale; Emmanuel Guiderdoni; Yongxiu Liu; David S Horner; Pieter B F Ouwerkerk; Carlo Pozzi; Kai J Müller; Francesco Salamini; Laura Rossini
Journal:  Plant Physiol       Date:  2010-10-04       Impact factor: 8.340

4.  Putative phenoloxidases in the tunicate Ciona intestinalis and the origin of the arthropod hemocyanin superfamily.

Authors:  A Immesberger; T Burmester
Journal:  J Comp Physiol B       Date:  2003-12-11       Impact factor: 2.200

5.  Second- and third-hand chloroplasts in dinoflagellates: phylogeny of oxygen-evolving enhancer 1 (PsbO) protein reveals replacement of a nuclear-encoded plastid gene by that of a haptophyte tertiary endosymbiont.

Authors:  Ken-ichiro Ishida; Beverley R Green
Journal:  Proc Natl Acad Sci U S A       Date:  2002-06-27       Impact factor: 11.205

6.  Phylogenetic analysis of Sec7-domain-containing Arf nucleotide exchangers.

Authors:  Randal Cox; Roberta J Mason-Gamer; Catherine L Jackson; Nava Segev
Journal:  Mol Biol Cell       Date:  2004-01-23       Impact factor: 4.138

7.  Molecular evolution of FtsZ protein sequences encoded within the genomes of archaea, bacteria, and eukaryota.

Authors:  Sue Vaughan; Bill Wickstead; Keith Gull; Stephen G Addinall
Journal:  J Mol Evol       Date:  2004-01       Impact factor: 2.395

8.  Evidence for the presence of a cellulase gene in the last common ancestor of bilaterian animals.

Authors:  Nathan Lo; Hirofumi Watanabe; Masahiro Sugimura
Journal:  Proc Biol Sci       Date:  2003-08-07       Impact factor: 5.349

9.  Respiratory Selenite Reductase from Bacillus selenitireducens Strain MLS10.

Authors:  Michael Wells; Jennifer McGarry; Maissa M Gaye; Partha Basu; Ronald S Oremland; John F Stolz
Journal:  J Bacteriol       Date:  2019-03-13       Impact factor: 3.490

10.  Sequential duplications of an ancient member of the DnaJ-family expanded the functional chaperone network in the eukaryotic cytosol.

Authors:  Chandan Sahi; Jacek Kominek; Thomas Ziegelhoffer; Hyun Young Yu; Maciej Baranowski; Jaroslaw Marszalek; Elizabeth A Craig
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.