Literature DB >> 25124098

Should genes with missing data be excluded from phylogenetic analyses?

Wei Jiang1, Si-Yun Chen2, Hong Wang3, De-Zhu Li4, John J Wiens5.   

Abstract

Phylogeneticists often design their studies to maximize the number of genes included but minimize the overall amount of missing data. However, few studies have addressed the costs and benefits of adding characters with missing data, especially for likelihood analyses of multiple loci. In this paper, we address this topic using two empirical data sets (in yeast and plants) with well-resolved phylogenies. We introduce varying amounts of missing data into varying numbers of genes and test whether the benefits of excluding genes with missing data outweigh the costs of excluding the non-missing data that are associated with them. We also test if there is a proportion of missing data in the incomplete genes at which they cease to be beneficial or harmful, and whether missing data consistently bias branch length estimates. Our results indicate that adding incomplete genes generally increases the accuracy of phylogenetic analyses relative to excluding them, especially when there is a high proportion of incomplete genes in the overall dataset (and thus few complete genes). Detailed analyses suggest that adding incomplete genes is especially helpful for resolving poorly supported nodes. Given that we find that excluding genes with missing data often decreases accuracy relative to including these genes (and that decreases are generally of greater magnitude than increases), there is little basis for assuming that excluding these genes is necessarily the safer or more conservative approach. We also find no evidence that missing data consistently bias branch length estimates.
Copyright © 2014 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Accuracy; Maximum likelihood; Missing data; Phylogeny

Mesh:

Year:  2014        PMID: 25124098     DOI: 10.1016/j.ympev.2014.08.006

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  25 in total

1.  Evolutionary history of the river frog genus Amietia (Anura: Pyxicephalidae) reveals extensive diversification in Central African highlands.

Authors:  Thornton R Larson; Delilah Castro; Mathias Behangana; Eli Greenbaum
Journal:  Mol Phylogenet Evol       Date:  2016-03-26       Impact factor: 4.286

2.  Molecular phylogeny of Panaspis and Afroablepharus skinks (Squamata: Scincidae) in the savannas of sub-Saharan Africa.

Authors:  Maria F Medina; Aaron M Bauer; William R Branch; Andreas Schmitz; Werner Conradie; Zoltán T Nagy; Toby J Hibbitts; Raffael Ernst; Daniel M Portik; Stuart V Nielsen; Timothy J Colston; Chifundera Kusamba; Mathias Behangana; Mark-Oliver Rödel; Eli Greenbaum
Journal:  Mol Phylogenet Evol       Date:  2016-04-23       Impact factor: 4.286

3.  Combining sequence data from multiple studies: Impact of analysis strategies on rare variant calling and association results.

Authors:  Zhongsheng Chen; Michael Boehnke; Christian Fuchsberger
Journal:  Genet Epidemiol       Date:  2019-09-14       Impact factor: 2.135

4.  DNA Barcodes Combined with Multilocus Data of Representative Taxa Can Generate Reliable Higher-Level Phylogenies.

Authors:  Gerard Talavera; Vladimir Lukhtanov; Naomi E Pierce; Roger Vila
Journal:  Syst Biol       Date:  2022-02-10       Impact factor: 15.683

5.  Concatenation and Species Tree Methods Exhibit Statistically Indistinguishable Accuracy under a Range of Simulated Conditions.

Authors:  João Tonini; Andrew Moore; David Stern; Maryia Shcheglovitova; Guillermo Ortí
Journal:  PLoS Curr       Date:  2015-03-09

6.  Genotyping-by-sequencing provides the discriminating power to investigate the subspecies of Daucus carota (Apiaceae).

Authors:  Carlos I Arbizu; Shelby L Ellison; Douglas Senalik; Philipp W Simon; David M Spooner
Journal:  BMC Evol Biol       Date:  2016-10-28       Impact factor: 3.260

7.  Chloroplast Genome Evolution in Actinidiaceae: clpP Loss, Heterogenous Divergence and Phylogenomic Practice.

Authors:  Wen-Cai Wang; Si-Yun Chen; Xian-Zhi Zhang
Journal:  PLoS One       Date:  2016-09-02       Impact factor: 3.240

8.  Resolving the Phylogenetic Position of Coelacanth: The Closest Relative Is Not Always the Most Appropriate Outgroup.

Authors:  Naoko Takezaki; Hidenori Nishihara
Journal:  Genome Biol Evol       Date:  2016-04-25       Impact factor: 3.416

9.  Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution.

Authors:  Michelle Kendall; Caroline Colijn
Journal:  Mol Biol Evol       Date:  2016-06-24       Impact factor: 16.240

10.  Diagnostics for a troubled backbone: testing topological hypotheses of trapelioid lichenized fungi in a large-scale phylogeny of Ostropomycetidae (Lecanoromycetes).

Authors:  Philipp Resl; Kevin Schneider; Martin Westberg; Christian Printzen; Zdeněk Palice; Göran Thor; Alan Fryday; Helmut Mayrhofer; Toby Spribille
Journal:  Fungal Divers       Date:  2015-05-13       Impact factor: 20.372

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.