Literature DB >> 22491036

Modeling protein evolution with several amino acid replacement matrices depending on site rates.

Si Quang Le1, Cuong Cao Dang, Olivier Gascuel.   

Abstract

Most protein substitution models use a single amino acid replacement matrix summarizing the biochemical properties of amino acids. However, site evolution is highly heterogeneous and depends on many factors that influence the substitution patterns. In this paper, we investigate the use of different substitution matrices for different site evolutionary rates. Indeed, the variability of evolutionary rates corresponds to one of the most apparent heterogeneity factors among sites, and there is no reason to assume that the substitution patterns remain identical regardless of the evolutionary rate. We first introduce LG4M, which is composed of four matrices, each corresponding to one discrete gamma rate category (of four). These matrices differ in their amino acid equilibrium distributions and in their exchangeabilities, contrary to the standard gamma model where only the global rate differs from one category to another. Next, we present LG4X, which also uses four different matrices, but leaves aside the gamma distribution and follows a distribution-free scheme for the site rates. All these matrices are estimated from a very large alignment database, and our two models are tested using a large sample of independent alignments. Detailed analysis of resulting matrices and models shows the complexity of amino acid substitutions and the advantage of flexible models such as LG4M and LG4X. Both significantly outperform single-matrix models, providing gains of dozens to hundreds of log-likelihood units for most data sets. LG4X obtains substantial gains compared with LG4M, thanks to its distribution-free scheme for site rates. Since LG4M and LG4X display such advantages but require the same memory space and have comparable running times to standard models, we believe that LG4M and LG4X are relevant alternatives to single replacement matrices. Our models, data, and software are available from http://www.atgc-montpellier.fr/models/lg4x.

Mesh:

Substances:

Year:  2012        PMID: 22491036     DOI: 10.1093/molbev/mss112

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  78 in total

Review 1.  Probabilistic models of eukaryotic evolution: time for integration.

Authors:  Nicolas Lartillot
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2015-09-26       Impact factor: 6.237

2.  Reconstruction of cyclooxygenase evolution in animals suggests variable, lineage-specific duplications, and homologs with low sequence identity.

Authors:  Justin C Havird; Kevin M Kocot; Pamela M Brannock; Johanna T Cannon; Damien S Waits; David A Weese; Scott R Santos; Kenneth M Halanych
Journal:  J Mol Evol       Date:  2015-03-11       Impact factor: 2.395

3.  Disentangling the aging gene expression network of termite queens.

Authors:  José Manuel Monroy Kuhn; Karen Meusemann; Judith Korb
Journal:  BMC Genomics       Date:  2021-05-11       Impact factor: 3.969

4.  Chromerid genomes reveal the evolutionary path from photosynthetic algae to obligate intracellular parasites.

Authors:  Yong H Woo; Hifzur Ansari; Thomas D Otto; Christen M Klinger; Martin Kolisko; Jan Michálek; Alka Saxena; Dhanasekaran Shanmugam; Annageldi Tayyrov; Alaguraj Veluchamy; Shahjahan Ali; Axel Bernal; Javier del Campo; Jaromír Cihlář; Pavel Flegontov; Sebastian G Gornik; Eva Hajdušková; Aleš Horák; Jan Janouškovec; Nicholas J Katris; Fred D Mast; Diego Miranda-Saavedra; Tobias Mourier; Raeece Naeem; Mridul Nair; Aswini K Panigrahi; Neil D Rawlings; Eriko Padron-Regalado; Abhinay Ramaprasad; Nadira Samad; Aleš Tomčala; Jon Wilkes; Daniel E Neafsey; Christian Doerig; Chris Bowler; Patrick J Keeling; David S Roos; Joel B Dacks; Thomas J Templeton; Ross F Waller; Julius Lukeš; Miroslav Oborník; Arnab Pain
Journal:  Elife       Date:  2015-07-15       Impact factor: 8.140

5.  An ancestral bacterial division system is widespread in eukaryotic mitochondria.

Authors:  Michelle M Leger; Markéta Petrů; Vojtěch Žárský; Laura Eme; Čestmír Vlček; Tommy Harding; B Franz Lang; Marek Eliáš; Pavel Doležal; Andrew J Roger
Journal:  Proc Natl Acad Sci U S A       Date:  2015-03-23       Impact factor: 11.205

6.  Phylogenomic resolution of scorpions reveals multilevel discordance with morphological phylogenetic signal.

Authors:  Prashant P Sharma; Rosa Fernández; Lauren A Esposito; Edmundo González-Santillán; Lionel Monod
Journal:  Proc Biol Sci       Date:  2015-04-07       Impact factor: 5.349

7.  Molecular evolution of type II MAGE genes from ancestral MAGED2 gene and their phylogenetic resolution of basal mammalian clades.

Authors:  Marcos De Donato; Sunday O Peters; Tanveer Hussain; Hectorina Rodulfo; Bolaji N Thomas; Masroor E Babar; Ikhide G Imumorin
Journal:  Mamm Genome       Date:  2017-05-17       Impact factor: 2.957

8.  RNA Binding Motif Protein 48 Is Required for U12 Splicing and Maize Endosperm Differentiation.

Authors:  Fang Bai; Jacob Corll; Donya N Shodja; Ruth Davenport; Guanqiao Feng; Janaki Mudunkothge; Christian J Brigolin; Federico Martin; Gertraud Spielbauer; Chi-Wah Tseung; Amy E Siebert; W Brad Barbazuk; Shailesh Lal; A Mark Settles
Journal:  Plant Cell       Date:  2019-02-13       Impact factor: 11.277

9.  W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis.

Authors:  Jana Trifinopoulos; Lam-Tung Nguyen; Arndt von Haeseler; Bui Quang Minh
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

10.  The multiple evolutionary origins of the eukaryotic N-glycosylation pathway.

Authors:  Jonathan Lombard
Journal:  Biol Direct       Date:  2016-08-04       Impact factor: 4.540

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.