Literature DB >> 15590908

Impact of taxon sampling on the estimation of rates of evolution at sites.

C Blouin1, D Butt, A J Roger.   

Abstract

The function of individual sites within a protein influences their rate of accepted point mutation. During the computation of phylogenetic likelihoods, rate heterogeneity can be modeled on a site-per-site basis with relative rates drawn from a discretized Gamma-distribution. Site-rate estimates (e.g., the rate of highest posterior probability given the data at a site) can then be used as a measure of evolutionary constraints imposed by function. However, if the sequence availability is limited, the estimation of rates is subject to sampling error. This article presents a simulation study that evaluates the robustness of evolutionary site-rate estimates for both small and phylogenetically unbalanced samples. The sampling error on rate estimates was first evaluated for alignments that included 5-45 sequences, sampled by jackknifing, from a master alignment containing 968 sequences. We observed that the potentially enhanced resolution among site rates due to the inclusion of a larger number of rate categories is negated by the difficulty in correctly estimating intermediate rates. This effect is marked for data sets with less than 30 sequences. Although the computation of likelihood theoretically accounts for phylogenetic distances through branch lengths, the introduction of a single long-branch outlier sequence had a significant negative effect on site-rate estimates. Finally, the presence of a shift in rates of evolution between related lineages can be diagnostic of a gain/loss of function within a protein family. Our analyses indicate that detecting these rate shifts is a harder problem than estimating rates. This is so, partially, because the difference in rates depends on two rate estimates, each with an intrinsic uncertainty. The performances of four methods to detect these site-rate shifts are evaluated and compared. Guidelines are suggested for preparing data sets minimally influenced by error introduced by sequence sampling.

Mesh:

Substances:

Year:  2004        PMID: 15590908     DOI: 10.1093/molbev/msi065

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  9 in total

1.  Using confidence set heuristics during topology search improves the robustness of phylogenetic inference.

Authors:  Shirley L Pepke; Davin Butt; Isabelle Nadeau; Andrew J Roger; Christian Blouin
Journal:  J Mol Evol       Date:  2006-12-09       Impact factor: 2.395

2.  Characterization and phylogenetic analysis of a cnidarian LMP X-like cDNA.

Authors:  Larry J Dishaw; Manuel L Herrera; Charles H Bigger
Journal:  Immunogenetics       Date:  2006-03-22       Impact factor: 2.846

3.  Are rates of molecular evolution in mammals substantially accelerated in warmer environments?

Authors:  Jason T Weir; Dolph Schluter
Journal:  Proc Biol Sci       Date:  2011-02-02       Impact factor: 5.349

4.  Evolutionary history and functional diversification of phosphomannomutase genes.

Authors:  Rita Quental; Ana Moleirinho; Luísa Azevedo; António Amorim
Journal:  J Mol Evol       Date:  2010-07-27       Impact factor: 2.395

5.  The nature of protein domain evolution: shaping the interaction network.

Authors:  Christoph P Bagowski; Wouter Bruins; Aartjan J W Te Velthuis
Journal:  Curr Genomics       Date:  2010-08       Impact factor: 2.236

6.  libcov: a C++ bioinformatic library to manipulate protein structures, sequence alignments and phylogeny.

Authors:  Davin Butt; Andrew J Roger; Christian Blouin
Journal:  BMC Bioinformatics       Date:  2005-06-06       Impact factor: 3.169

7.  Phylogenomic Perspective on the Relationships and Evolutionary History of the Major Otocephalan Lineages.

Authors:  Wei Dai; Ming Zou; Liandong Yang; Kang Du; Weitao Chen; Yanjun Shen; Richard L Mayden; Shunping He
Journal:  Sci Rep       Date:  2018-01-09       Impact factor: 4.379

8.  Linking fold, function and phylogeny: a comparative genomics view on protein (domain) evolution.

Authors:  Aartjan J W Te Velthuis; Christoph P Bagowski
Journal:  Curr Genomics       Date:  2008-04       Impact factor: 2.236

9.  Framing the Salmonidae family phylogenetic portrait: a more complete picture from increased taxon sampling.

Authors:  Alexis Crête-Lafrenière; Laura K Weir; Louis Bernatchez
Journal:  PLoS One       Date:  2012-10-05       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.