Literature DB >> 9242594

Hadamard conjugations and modeling sequence evolution with unequal rates across sites.

P J Waddell1, D Penny, T Moore.   

Abstract

This paper considers the many different distributions that may approximate the distribution of site rates in DNA sequences and shows how the Hadamard conjugation may be modified to take these into account. This is done for both 2-state and 4-state data. Distributions which give simple closed forms include the gamma (gamma) distribution, the inverse Gaussian distribution (which is similar to the lognormal), and a mixture of either of these with a proportion of sites which cannot change (invariant sites). It is seen that the tail of a distribution can have major effects upon the coefficient of variation of site rates. Because the Hadamard conjugation can be used to either correct data or predict the data given the model (i.e., the likelihood of site patterns), light is shed on properties of maximum likelihood tree selection with unequal site rates. Analysis of rRNA shows how unequal rates across sites can change the optimal tree. Maximum likelihood analysis also shows that distinct distributions fit each data set, with the gamma often not being the best. Analyzing both these data and a long stretch of primate mtDNA reveals evidence of many "hidden" multiple substitutions, while signals not corresponding to the preferred biological tree generally decrease an unequal rates are allowed for. Last, we discuss the expected behavior of sequences evolving by models where stabilizing selection alone explains unequal site rates. Such models do not explain "synapomorphies" or informative changes in ancient molecules, because while stabilizing selection can vastly decrease change at a site, it will also vastly accelerate back-substitution (leaving only a covarion model to explain old synapomorphies). When and why models allowing a continuous distribution of site rates (e.g., gamma) will approximate covarion evolution requires further study.

Mesh:

Substances:

Year:  1997        PMID: 9242594     DOI: 10.1006/mpev.1997.0405

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  4 in total

1.  Measuring fit of sequence data to phylogenetic model: gain of power using marginal tests.

Authors:  Peter J Waddell; Rissa Ota; David Penny
Journal:  J Mol Evol       Date:  2009-10-23       Impact factor: 2.395

2.  Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.

Authors:  X Gu; W H Li
Journal:  Proc Natl Acad Sci U S A       Date:  1998-05-26       Impact factor: 11.205

3.  Rooting gene trees without outgroups: EP rooting.

Authors:  Janet S Sinsheimer; Roderick J A Little; James A Lake
Journal:  Genome Biol Evol       Date:  2012-05-16       Impact factor: 3.416

4.  Can comprehensive background knowledge be incorporated into substitution models to improve phylogenetic analyses? A case study on major arthropod relationships.

Authors:  Björn M von Reumont; Karen Meusemann; Nikolaus U Szucsich; Emiliano Dell'Ampio; Vivek Gowri-Shankar; Daniela Bartel; Sabrina Simon; Harald O Letsch; Roman R Stocsits; Yun-xia Luan; Johann Wolfgang Wägele; Günther Pass; Heike Hadrys; Bernhard Misof
Journal:  BMC Evol Biol       Date:  2009-05-27       Impact factor: 3.260

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.