Literature DB >> 18852105

Basing population genetic inferences and models of molecular evolution upon desired stationary distributions of DNA or protein sequences.

Sang Chul Choi1, Benjamin D Redelings, Jeffrey L Thorne.   

Abstract

Models of molecular evolution tend to be overly simplistic caricatures of biology that are prone to assigning high probabilities to biologically implausible DNA or protein sequences. Here, we explore how to construct time-reversible evolutionary models that yield stationary distributions of sequences that match given target distributions. By adopting comparatively realistic target distributions,evolutionary models can be improved. Instead of focusing on estimating parameters, we concentrate on the population genetic implications of these models. Specifically, we obtain estimates of the product of effective population size and relative fitness difference of alleles. The approach is illustrated with two applications to protein-coding DNA. In the first, a codon-based evolutionary model yields a stationary distribution of sequences, which, when the sequences are translated,matches a variable-length Markov model trained on human proteins. In the second, we introduce an insertion-deletion model that describes selectively neutral evolutionary changes to DNA. We then show how to modify the neutral model so that its stationary distribution at the amino acid level can match a profile hidden Markov model, such as the one associated with the Pfam database.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18852105      PMCID: PMC2607412          DOI: 10.1098/rstb.2008.0167

Source DB:  PubMed          Journal:  Philos Trans R Soc Lond B Biol Sci        ISSN: 0962-8436            Impact factor:   6.237


  37 in total

1.  A "Long Indel" model for evolutionary sequence alignment.

Authors:  I Miklós; G A Lunter; I Holmes
Journal:  Mol Biol Evol       Date:  2003-12-23       Impact factor: 16.240

2.  Algorithms for variable length Markov chain modeling.

Authors:  Gill Bejerano
Journal:  Bioinformatics       Date:  2004-01-29       Impact factor: 6.937

3.  Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolution.

Authors:  Dick G Hwang; Phil Green
Journal:  Proc Natl Acad Sci U S A       Date:  2004-08-03       Impact factor: 11.205

Review 4.  Inching toward reality: an improved likelihood model of sequence evolution.

Authors:  J L Thorne; H Kishino; J Felsenstein
Journal:  J Mol Evol       Date:  1992-01       Impact factor: 2.395

5.  An evolutionary model for maximum likelihood alignment of DNA sequences.

Authors:  J L Thorne; H Kishino; J Felsenstein
Journal:  J Mol Evol       Date:  1991-08       Impact factor: 2.395

6.  Inferring pattern and process: maximum-likelihood implementation of a nonhomogeneous model of DNA sequence evolution for phylogenetic analysis.

Authors:  N Galtier; M Gouy
Journal:  Mol Biol Evol       Date:  1998-07       Impact factor: 16.240

7.  Pfam: a comprehensive database of protein domain families based on seed alignments.

Authors:  E L Sonnhammer; S R Eddy; R Durbin
Journal:  Proteins       Date:  1997-07

8.  A likelihood approach for comparing synonymous and nonsynonymous nucleotide substitution rates, with application to the chloroplast genome.

Authors:  S V Muse; B S Gaut
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

9.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

10.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA.

Authors:  M Hasegawa; H Kishino; T Yano
Journal:  J Mol Evol       Date:  1985       Impact factor: 2.395

View more
  8 in total

1.  The role of epistasis in protein evolution.

Authors:  David M McCandlish; Etienne Rajon; Premal Shah; Yang Ding; Joshua B Plotkin
Journal:  Nature       Date:  2013-05-30       Impact factor: 49.962

2.  Mutation-selection models of coding sequence evolution with site-heterogeneous amino acid fitness profiles.

Authors:  Nicolas Rodrigue; Hervé Philippe; Nicolas Lartillot
Journal:  Proc Natl Acad Sci U S A       Date:  2010-02-22       Impact factor: 11.205

3.  History can matter: non-Markovian behavior of ancestral lineages.

Authors:  Reed A Cartwright; Nicolas Lartillot; Jeffrey L Thorne
Journal:  Syst Biol       Date:  2011-03-11       Impact factor: 15.683

4.  Introduction. Statistical and computational challenges in molecular phylogenetics and evolution.

Authors:  Nick Goldman; Ziheng Yang
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-12-27       Impact factor: 6.237

5.  A formal perturbation equation between genotype and phenotype determines the Evolutionary Action of protein-coding variations on fitness.

Authors:  Panagiotis Katsonis; Olivier Lichtarge
Journal:  Genome Res       Date:  2014-09-12       Impact factor: 9.043

6.  Roles of solvent accessibility and gene expression in modeling protein sequence evolution.

Authors:  Kuangyu Wang; Shuhui Yu; Xiang Ji; Clemens Lakner; Alexander Griffing; Jeffrey L Thorne
Journal:  Evol Bioinform Online       Date:  2015-04-29       Impact factor: 1.625

7.  Fast optimization of statistical potentials for structurally constrained phylogenetic models.

Authors:  Cécile Bonnard; Claudia L Kleinman; Nicolas Rodrigue; Nicolas Lartillot
Journal:  BMC Evol Biol       Date:  2009-09-09       Impact factor: 3.260

8.  Inferring selection on amino acid preference in protein domains.

Authors:  Alan M Moses; Richard Durbin
Journal:  Mol Biol Evol       Date:  2008-12-18       Impact factor: 16.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.