Literature DB >> 16551582

Inferring complex DNA substitution processes on phylogenies using uniformization and data augmentation.

Ligia Mateiu1, Bruce Rannala.   

Abstract

A new method is developed for calculating sequence substitution probabilities using Markov chain Monte Carlo (MCMC) methods. The basic strategy is to use uniformization to transform the original continuous time Markov process into a Poisson substitution process and a discrete Markov chain of state transitions. An efficient MCMC algorithm for evaluating substitution probabilities by this approach using a continuous gamma distribution to model site-specific rates is outlined. The method is applied to the problem of inferring branch lengths and site-specific rates from nucleotide sequences under a general time-reversible (GTR) model and a computer program BYPASSR is developed. Simulations are used to examine the performance of the new program relative to an existing program BASEML that uses a discrete approximation for the gamma distributed prior on site-specific rates. It is found that BASEML and BYPASSR are in close agreement when inferring branch lengths, regardless of the number of rate categories used, but that BASEML tends to underestimate high site-specific substitution rates, and to overestimate intermediate rates, when fewer than 50 rate categories are used. Rate estimates obtained using BASEML agree more closely with those of BYPASSR as the number of rate categories increases. Analyses of the posterior distributions of site-specific rates from BYPASSR suggest that a large number of taxa are needed to obtain precise estimates of site-specific rates, especially when rates are very high or very low. The method is applied to analyze 45 sequences of the alpha 2B adrenergic receptor gene (A2AB) from a sample of eutherian taxa. In general, the pattern expected for regions under negative selection is observed with third codon positions having the highest inferred rates, followed by first codon positions and with second codon positions having the lowest inferred rates. Several sites show exceptionally high substitution rates at second codon positions that may represent the effects of positive selection.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16551582     DOI: 10.1080/10635150500541599

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  10 in total

1.  Maximum-likelihood estimation of site-specific mutation rates in human mitochondrial DNA from partial phylogenetic classification.

Authors:  Saharon Rosset; R Spencer Wells; David F Soria-Hernanz; Chris Tyler-Smith; Ajay K Royyuru; Doron M Behar
Journal:  Genetics       Date:  2008-09-14       Impact factor: 4.562

2.  Rapid likelihood analysis on large phylogenies using partial sampling of substitution histories.

Authors:  A P Jason de Koning; Wanjun Gu; David D Pollock
Journal:  Mol Biol Evol       Date:  2009-09-25       Impact factor: 16.240

3.  Relaxing the Molecular Clock to Different Degrees for Different Substitution Types.

Authors:  Hui-Jie Lee; Nicolas Rodrigue; Jeffrey L Thorne
Journal:  Mol Biol Evol       Date:  2015-04-29       Impact factor: 16.240

4.  On the statistical interpretation of site-specific variables in phylogeny-based substitution models.

Authors:  Nicolas Rodrigue
Journal:  Genetics       Date:  2012-12-05       Impact factor: 4.562

5.  Monte Carlo algorithms for Brownian phylogenetic models.

Authors:  Benjamin Horvilleur; Nicolas Lartillot
Journal:  Bioinformatics       Date:  2014-07-22       Impact factor: 6.937

6.  SIMULATION FROM ENDPOINT-CONDITIONED, CONTINUOUS-TIME MARKOV CHAINS ON A FINITE STATE SPACE, WITH APPLICATIONS TO MOLECULAR EVOLUTION.

Authors:  Asger Hobolth; Eric A Stone
Journal:  Ann Appl Stat       Date:  2009-09-01       Impact factor: 2.083

7.  A Bayesian Approach for Inferring the Impact of a Discrete Character on Rates of Continuous-Character Evolution in the Presence of Background-Rate Variation.

Authors:  Michael R May; Brian R Moore
Journal:  Syst Biol       Date:  2020-05-01       Impact factor: 15.683

8.  Peripheral arterial disease screening for hemodialysis patients using a fractional-order integrator and transition probability decision-making model.

Authors:  Jian-Xing Wu; Chien-Ming Li; Guan-Chun Chen; Yueh-Ren Ho; Chia-Hung Lin
Journal:  IET Syst Biol       Date:  2017-04       Impact factor: 1.615

9.  Comparison of methods for calculating conditional expectations of sufficient statistics for continuous time Markov chains.

Authors:  Paula Tataru; Asger Hobolth
Journal:  BMC Bioinformatics       Date:  2011-12-05       Impact factor: 3.169

10.  The tangled bank of amino acids.

Authors:  Richard A Goldstein; David D Pollock
Journal:  Protein Sci       Date:  2016-05-12       Impact factor: 6.725

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.