Literature DB >> 11371591

Effects of nucleotide composition bias on the success of the parsimony criterion in phylogenetic inference.

G C Conant1, P O Lewis.   

Abstract

Convergence in nucleotide composition (CNC) in unrelated lineages is a factor potentially affecting the performance of most phylogeny reconstruction methods. Such convergence has deleterious effects because unrelated lineages show similarities due to similar nucleotide compositions and not shared histories. While some methods (such as the LogDet/paralinear distance measure) avoid this pitfall, the amount of convergence in nucleotide composition necessary to deceive other phylogenetic methods has never been quantified. We examined analytically the relationship between convergence in nucleotide composition and the consistency of parsimony as a phylogenetic estimator for four taxa. Our results show that rather extreme amounts of convergence are necessary before parsimony begins to prefer the incorrect tree. Ancillary observations are that (for unweighted Fitch parsimony) transition/transversion bias contributes to the impact of CNC and, for a given amount of CNC and fixed branch lengths, data sets exhibiting substantial site-to-site rate heterogeneity present fewer difficulties than data sets in which rates are homogeneous. We conclude by reexamining a data set originally used to illustrate the problems caused by CNC. Using simulations, we show that in this case the convergence in nucleotide composition alone is insufficient to cause any commonly used methods to fail, and accounting for other evolutionary factors (such as site-to-site rate heterogeneity) can give a correct inference without accounting for CNC.

Mesh:

Year:  2001        PMID: 11371591     DOI: 10.1093/oxfordjournals.molbev.a003874

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  5 in total

Review 1.  The impact of taxon sampling on phylogenetic inference: a review of two decades of controversy.

Authors:  Ahmed Ragab Nabhan; Indra Neil Sarkar
Journal:  Brief Bioinform       Date:  2011-03-23       Impact factor: 11.622

2.  Evolution of prokaryotic SPFH proteins.

Authors:  Markus Hinderhofer; Christina A Walker; Anke Friemel; Claudia A O Stuermer; Heiko M Möller; Alexander Reuter
Journal:  BMC Evol Biol       Date:  2009-01-12       Impact factor: 3.260

3.  Sources of signal in 62 protein-coding nuclear genes for higher-level phylogenetics of arthropods.

Authors:  Jerome C Regier; Andreas Zwick
Journal:  PLoS One       Date:  2011-08-04       Impact factor: 3.240

4.  A Genome-Scale Investigation of How Sequence, Function, and Tree-Based Gene Properties Influence Phylogenetic Inference.

Authors:  Xing-Xing Shen; Leonidas Salichos; Antonis Rokas
Journal:  Genome Biol Evol       Date:  2016-09-02       Impact factor: 3.416

5.  RY-Coding and Non-Homogeneous Models Can Ameliorate the Maximum-Likelihood Inferences From Nucleotide Sequence Data with Parallel Compositional Heterogeneity.

Authors:  Sohta A Ishikawa; Yuji Inagaki; Tetsuo Hashimoto
Journal:  Evol Bioinform Online       Date:  2012-06-25       Impact factor: 1.625

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.