Literature DB >> 9066796

Evolutionary distances between nucleotide sequences based on the distribution of substitution rates among sites as estimated by parsimony.

N J Tourasse1, M Gouy.   

Abstract

The rate of evolution of macromolecules such as ribosomal RNAs and proteins varies along the molecule because structural and functional constraints differ between sites. Many studies have shown that ignoring this variation in computing evolutionary distances leads to severe underestimation of sequence divergences, and thus can lead to misleading evolutionary tree inferences. We propose here a new parsimony-based method for computing evolutionary distances between pairs of sequences that takes into account this variation and estimates it from the data. This method applies to the number of substitutions per site in ribosomal RNA genes as well as to the number of nonsynonymous substitutions per codon for protein-coding genes and is especially suitable when large data sets (> or = 100 sequences) are analyzed. First, starting from a phylogeny constructed with usual distances, the maximum-parsimony method is used to infer the distribution of the number of substitutions that have occurred at each site (or codon) along this tree. This distribution is then fitted to an "invariant + truncated negative binomial" distribution that allows for invariant sites. Maximum-likelihood fitting of this distribution to different data sets showed that it agreed very well with real data. Noticeably, allowing for invariant sites seemed to be very important. Finally, two distance estimates were developed by introducing the distribution of site variability into the substitution models of Jukes and Cantor and of Kimura. The use of different numbers of aligned sequences (up to 1,000 rRNA sequences) showed that the parameters of the model are very sensitive to the number of sequences used to estimate them. However, if at least 100 sequences are considered, the two new distance estimates are quite stable with respect to the number of sequences used to fit the distribution. This stability is true for low as well as for high evolutionary distances. These new distances appeared to be much better estimates of the number of substitutions per site than the classical distances of Jukes and Cantor and of Kimura, which both greatly underestimate this number, so that they can serve as indexes to detect saturation. We conclude that the new distances are particularly suitable for phylogenetic analysis when very distantly related species and relatively large data sets are considered. Trees reconstructed using these distances are generally different from those constructed by means of the classical estimates. Using this new method, we showed that the mean evolutionary distance between Prokaryotes and Eukaryotes is substantially higher for the small-subunit than for the large-subunit rRNAs. This suggests than the former might have experienced a drastic change during the early evolution of Eukaryotes.

Mesh:

Substances:

Year:  1997        PMID: 9066796     DOI: 10.1093/oxfordjournals.molbev.a025764

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  6 in total

1.  Mutation rate variation at human dinucleotide microsatellites.

Authors:  Hongyan Xu; Ranajit Chakraborty; Yun-Xin Fu
Journal:  Genetics       Date:  2005-02-16       Impact factor: 4.562

2.  Estimation of evolutionary distances under stationary and nonstationary models of nucleotide substitution.

Authors:  X Gu; W H Li
Journal:  Proc Natl Acad Sci U S A       Date:  1998-05-26       Impact factor: 11.205

3.  Long branch effects distort maximum likelihood phylogenies in simulations despite selection of the correct model.

Authors:  Patrick Kück; Christoph Mayer; Johann-Wolfgang Wägele; Bernhard Misof
Journal:  PLoS One       Date:  2012-05-09       Impact factor: 3.240

4.  The impact of modelling rate heterogeneity among sites on phylogenetic estimates of intraspecific evolutionary rates and timescales.

Authors:  Fangzhi Jia; Nathan Lo; Simon Y W Ho
Journal:  PLoS One       Date:  2014-05-05       Impact factor: 3.240

5.  Uncovering the Horseshoe Effect in Microbial Analyses.

Authors:  James T Morton; Liam Toran; Anna Edlund; Jessica L Metcalf; Christian Lauber; Rob Knight
Journal:  mSystems       Date:  2017-02-21       Impact factor: 6.496

6.  Comparisons of mutation rate variation at genome-wide microsatellites: evolutionary insights from two cultivated rice and their wild relatives.

Authors:  Li-Zhi Gao; Hongyan Xu
Journal:  BMC Evol Biol       Date:  2008-01-16       Impact factor: 3.260

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.