Literature DB >> 20547780

Optimal selection of gene and ingroup taxon sampling for resolving phylogenetic relationships.

Jeffrey P Townsend1, Francesc Lopez-Giraldez.   

Abstract

A controversial topic that underlies much of phylogenetic experimental design is the relative utility of increased taxonomic versus character sampling. Conclusions about the relative utility of adding characters or taxa to a current phylogenetic study have subtly hinged upon the appropriateness of the rate of evolution of the characters added for resolution of the phylogeny in question. Clearly, the addition of characters evolving at optimal rates will have much greater impact upon accurate phylogenetic analysis than will the addition of characters with an inappropriate rate of evolution. Development of practical analytical predictions of the asymptotic impact of adding additional taxa would complement computational investigations of the relative utility of these two methods of expanding acquired data. Accordingly, we here formulate a measure of the phylogenetic informativeness of the additional sampling of character states from a new taxon added to the canonical phylogenetic quartet. We derive the optimal rate of evolution for characters assessed in taxa to be sampled and a metric of informativeness based on the rate of evolution of the characters assessed in the new taxon and the distance of the new taxon from the internode of interest. Calculation of the informativeness per base pair of additional character sampling for included taxa versus additional character sampling for novel taxa can be used to estimate cost-effectiveness and optimal efficiency of phylogenetic experimental design. The approach requires estimation of rates of evolution of individual sites based on an alignment of genes orthologous to those to be sequenced, which may be identified in a well-established clade of sister taxa or of related taxa diverging at a deeper phylogenetic scale. Some approximate idea of the potential phylogenetic relationships of taxa to be sequenced is also desirable, such as may be obtained from ribosomal RNA sequence alone. Application to the solution of recalcitrant unresolved nodes in an otherwise well-known phylogeny is the most obvious application. We validate the theory by analysis of its predictions regarding the phylogenetic informativeness for taxon addition of 46 amino acid alignments of 21 fungal taxa. Gene and taxon sampling according to the theory herein and following a "deepest ingroup" heuristic are shown to provide significantly improved resolution of specified deep internodes.

Mesh:

Year:  2010        PMID: 20547780     DOI: 10.1093/sysbio/syq025

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  21 in total

Review 1.  The impact of taxon sampling on phylogenetic inference: a review of two decades of controversy.

Authors:  Ahmed Ragab Nabhan; Indra Neil Sarkar
Journal:  Brief Bioinform       Date:  2011-03-23       Impact factor: 11.622

2.  Phylogenetic informativeness reconciles ray-finned fish molecular divergence times.

Authors:  Alex Dornburg; Jeffrey P Townsend; Matt Friedman; Thomas J Near
Journal:  BMC Evol Biol       Date:  2014-08-08       Impact factor: 3.260

Review 3.  The archaeal legacy of eukaryotes: a phylogenomic perspective.

Authors:  Lionel Guy; Jimmy H Saw; Thijs J G Ettema
Journal:  Cold Spring Harb Perspect Biol       Date:  2014-07-03       Impact factor: 10.005

4.  maxAlike: maximum likelihood-based sequence reconstruction with application to improved primer design for unknown sequences.

Authors:  Peter Menzel; Peter F Stadler; Jan Gorodkin
Journal:  Bioinformatics       Date:  2010-12-01       Impact factor: 6.937

5.  PhyDesign: an online application for profiling phylogenetic informativeness.

Authors:  Francesc López-Giráldez; Jeffrey P Townsend
Journal:  BMC Evol Biol       Date:  2011-05-31       Impact factor: 3.260

6.  Utility of characters evolving at diverse rates of evolution to resolve quartet trees with unequal branch lengths: analytical predictions of long-branch effects.

Authors:  Zhuo Su; Jeffrey P Townsend
Journal:  BMC Evol Biol       Date:  2015-05-14       Impact factor: 3.260

7.  A MINE alternative to D-optimal designs for the linear model.

Authors:  Amanda M Bouffier; Jonathan Arnold; H Bernd Schüttler
Journal:  PLoS One       Date:  2014-10-30       Impact factor: 3.240

8.  Plastome structure and phylogenetic relationships of Styracaceae (Ericales).

Authors:  Xiu-Lian Cai; Jacob B Landis; Hong-Xin Wang; Jian-Hua Wang; Zhi-Xin Zhu; Hua-Feng Wang
Journal:  BMC Ecol Evol       Date:  2021-05-28

9.  How to handle speciose clades? Mass taxon-sampling as a strategy towards illuminating the natural history of Campanula (Campanuloideae).

Authors:  Guilhem Mansion; Gerald Parolly; Andrew A Crowl; Evgeny Mavrodiev; Nico Cellinese; Marine Oganesian; Katharina Fraunhofer; Georgia Kamari; Dimitrios Phitos; Rosemarie Haberle; Galip Akaydin; Nursel Ikinci; Thomas Raus; Thomas Borsch
Journal:  PLoS One       Date:  2012-11-28       Impact factor: 3.240

10.  AST: an automated sequence-sampling method for improving the taxonomic diversity of gene phylogenetic trees.

Authors:  Chan Zhou; Fenglou Mao; Yanbin Yin; Jinling Huang; Johann Peter Gogarten; Ying Xu
Journal:  PLoS One       Date:  2014-06-03       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.