Literature DB >> 15483327

A simple hierarchical approach to modeling distributions of substitution rates.

Sergei L Kosakovsky Pond1, Simon D W Frost.   

Abstract

Genetic sequence data typically exhibit variability in substitution rates across sites. In practice, there is often too little variation to fit a different rate for each site in the alignment, but the distribution of rates across sites may not be well modeled using simple parametric families. Mixtures of different distributions can capture more complex patterns of rate variation, but are often parameter-rich and difficult to fit. We present a simple hierarchical model in which a baseline rate distribution, such as a gamma distribution, is discretized into several categories, the quantiles of which are estimated using a discretized beta distribution. Although this approach involves adding only two extra parameters to a standard distribution, a wide range of rate distributions can be captured. Using simulated data, we demonstrate that a "beta-" model can reproduce the moments of the rate distribution more accurately than the distribution used to simulate the data, even when the baseline rate distribution is misspecified. Using hepatitis C virus and mammalian mitochondrial sequences, we show that a beta- model can fit as well or better than a model with multiple discrete rate categories, and compares favorably with a model which fits a separate rate category to each site. We also demonstrate this discretization scheme in the context of codon models specifically aimed at identifying individual sites undergoing adaptive or purifying evolution.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15483327     DOI: 10.1093/molbev/msi009

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  37 in total

1.  Likelihoods from summary statistics: recent divergence between species.

Authors:  Scotland C Leman; Yuguo Chen; Jason E Stajich; Mohamed A F Noor; Marcy K Uyenoyama
Journal:  Genetics       Date:  2005-09-02       Impact factor: 4.562

2.  A Dirichlet process model for detecting positive selection in protein-coding DNA sequences.

Authors:  John P Huelsenbeck; Sonia Jain; Simon W D Frost; Sergei L Kosakovsky Pond
Journal:  Proc Natl Acad Sci U S A       Date:  2006-04-10       Impact factor: 11.205

3.  Site-specific evolutionary rates in proteins are better modeled as non-independent and strictly relative.

Authors:  Andrew D Fernandes; William R Atchley
Journal:  Bioinformatics       Date:  2008-07-28       Impact factor: 6.937

4.  Positive selection on HIV accessory proteins and the analysis of molecular adaptation after interspecies transmission.

Authors:  André E R Soares; Marcelo A Soares; Carlos G Schrago
Journal:  J Mol Evol       Date:  2008-05-09       Impact factor: 2.395

5.  Phylogenetic analysis of population-based and deep sequencing data to identify coevolving sites in the nef gene of HIV-1.

Authors:  Art F Y Poon; Luke C Swenson; Winnie W Y Dong; Wenjie Deng; Sergei L Kosakovsky Pond; Zabrina L Brumme; James I Mullins; Douglas D Richman; P Richard Harrigan; Simon D W Frost
Journal:  Mol Biol Evol       Date:  2009-12-02       Impact factor: 16.240

6.  Function and evolution of a gene family encoding odorant binding-like proteins in a social insect, the honey bee (Apis mellifera).

Authors:  Sylvain Forêt; Ryszard Maleszka
Journal:  Genome Res       Date:  2006-10-25       Impact factor: 9.043

7.  Estimating selection pressures on HIV-1 using phylogenetic likelihood models.

Authors:  S L Kosakovsky Pond; A F Y Poon; S Zárate; D M Smith; S J Little; S K Pillai; R J Ellis; J K Wong; A J Leigh Brown; D D Richman; S D W Frost
Journal:  Stat Med       Date:  2008-10-15       Impact factor: 2.373

8.  Comparative molecular evolution of trichoderma chitinases in response to mycoparasitic interactions.

Authors:  Katarina Ihrmark; Nashwan Asmail; Wimal Ubhayasekera; Petter Melin; Jan Stenlid; Magnus Karlsson
Journal:  Evol Bioinform Online       Date:  2010-03-15       Impact factor: 1.625

9.  Cross-sectional dating of novel haplotypes of HERV-K 113 and HERV-K 115 indicate these proviruses originated in Africa before Homo sapiens.

Authors:  Aashish R Jha; Satish K Pillai; Vanessa A York; Elizabeth R Sharp; Emily C Storm; Douglas J Wachter; Jeffrey N Martin; Steven G Deeks; Michael G Rosenberg; Douglas F Nixon; Keith E Garrison
Journal:  Mol Biol Evol       Date:  2009-08-10       Impact factor: 16.240

10.  An evolutionary model-based algorithm for accurate phylogenetic breakpoint mapping and subtype prediction in HIV-1.

Authors:  Sergei L Kosakovsky Pond; David Posada; Eric Stawiski; Colombe Chappey; Art F Y Poon; Gareth Hughes; Esther Fearnhill; Mike B Gravenor; Andrew J Leigh Brown; Simon D W Frost
Journal:  PLoS Comput Biol       Date:  2009-11-26       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.