Empirical analysis of the most relevant parameters of codon substitution models.

Stefan Zoller, Adrian Schneider.

Abstract

Traditionally, codon models of evolution have been parametric, meaning that the 61 × 61 substitution rate matrix was derived from only a handful of parameters, typically the equilibrium frequencies, the ratio of nonsynonymous to synonymous substitution rates, and the ratio between transition and transversion rates. These parameters are reasonable choices and are based on observations of what aspects of evolution often vary in coding DNA. However, the choices are relatively arbitrary, and no systematic empirical search has ever been performed to identify the best parameters for a codon model. Even for the empirical or semi-empirical models that have been presented recently, only the average substitution rates have been estimated from databases of real coding DNA, while the parameters used were essentially the same as before. In this study, we attempted to investigate empirically what the most relevant parameters for a codon model are. By performing a principal component analysis (PCA) on 3666 substitution rate matrices estimated from single gene families, the sets of the most co-varying substitution rates were determined. Interestingly, the two most significant principal components (PCs) describe clearly identifiable parameters: the first PC separates synonymous and nonsynonymous substitutions, while the second PC distinguishes between substitutions where only one nucleotide changes and substitutions with two or three nucleotide changes. For the third and subsequent PCs, no simple descriptions could be found.
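The two contrasts that the abstract attributes to the first two PCs can be made concrete by labelling every pair of sense codons. The following is a minimal sketch, not the authors' code: it assumes the standard genetic code, and the names `classify` and `tally_pairs` are hypothetical helpers introduced only for illustration. It does not perform the PCA itself; it only shows the categories (synonymous versus nonsynonymous, and one versus two or three nucleotide changes) against which PC loadings could be compared.

```python
from itertools import combinations, product

BASES = "TCAG"
# Standard genetic code in TCAG order (assumption); '*' marks stop codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# The 61 sense codons index the rows and columns of the rate matrix.
SENSE = [codon for codon, aa in CODE.items() if aa != "*"]


def classify(c1, c2):
    """Return (is_synonymous, number_of_nucleotide_differences)."""
    n_diff = sum(a != b for a, b in zip(c1, c2))
    return CODE[c1] == CODE[c2], n_diff


def tally_pairs():
    """Count unordered sense-codon pairs in each (synonymous, n_diff) class."""
    counts = {}
    for c1, c2 in combinations(SENSE, 2):
        key = classify(c1, c2)
        counts[key] = counts.get(key, 0) + 1
    return counts


counts = tally_pairs()
for (synonymous, n_diff), n in sorted(counts.items()):
    label = "synonymous" if synonymous else "nonsynonymous"
    print(f"{label:>13}, {n_diff} nucleotide change(s): {n} pairs")
```

Under the standard code, 263 of the 1830 unordered sense-codon pairs differ at a single nucleotide, and 67 of those are synonymous, so each contrast groups a large block of matrix entries that can vary together across gene families.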

Year: 2010   PMID: 20526712   DOI: 10.1007/s00239-010-9356-9

Source DB: PubMed   Journal: J Mol Evol   ISSN: 0022-2844   Impact factor: 2.395


