Literature DB >> 12857643

Missing data, incomplete taxa, and phylogenetic accuracy.

John J Wiens1.   

Abstract

The problem of missing data is often considered to be the most important obstacle in reconstructing the phylogeny of fossil taxa and in combining data from diverse characters and taxa for phylogenetic analysis. Empirical and theoretical studies show that including highly incomplete taxa can lead to multiple equally parsimonious trees, poorly resolved consensus trees, and decreased phylogenetic accuracy. However, the mechanisms that cause incomplete taxa to be problematic have remained unclear. It has been widely assumed that incomplete taxa are problematic because of the proportion or amount of missing data that they bear. In this study, I use simulations to show that the reduced accuracy associated with including incomplete taxa is caused by these taxa bearing too few complete characters rather than too many missing data cells. This seemingly subtle distinction has a number of important implications. First, the so-called missing data problem for incomplete taxa is, paradoxically, not directly related to their amount or proportion of missing data. Thus, the level of completeness alone should not guide the exclusion of taxa (contrary to common practice), and these results may explain why empirical studies have sometimes found little relationship between the completeness of a taxon and its impact on an analysis. These results also (1) suggest a more effective strategy for dealing with incomplete taxa, (2) call into question a justification of the controversial phylogenetic supertree approach, and (3) show the potential for the accurate phylogenetic placement of highly incomplete taxa, both when combining diverse data sets and when analyzing relationships of fossil taxa.

Mesh:

Year:  2003        PMID: 12857643     DOI: 10.1080/10635150390218330

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  98 in total

1.  Tangled in a sparse spider web: single origin of orb weavers and their spinning work unravelled by denser taxonomic sampling.

Authors:  Dimitar Dimitrov; Lara Lopardo; Gonzalo Giribet; Miquel A Arnedo; Fernando Alvarez-Padilla; Gustavo Hormiga
Journal:  Proc Biol Sci       Date:  2011-11-02       Impact factor: 5.349

2.  The timing of eukaryotic evolution: does a relaxed molecular clock reconcile proteins and fossils?

Authors:  Emmanuel J P Douzery; Elizabeth A Snell; Eric Bapteste; Frédéric Delsuc; Hervé Philippe
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-19       Impact factor: 11.205

3.  Comparative phylogenetic analyses of the adaptive radiation of Lake Tanganyika cichlid fish: nuclear sequences are less homoplasious but also less informative than mitochondrial DNA.

Authors:  Céline Clabaut; Walter Salzburger; Axel Meyer
Journal:  J Mol Evol       Date:  2005-10-13       Impact factor: 2.395

4.  A hierarchical model for incomplete alignments in phylogenetic inference.

Authors:  Fuxia Cheng; Stefanie Hartmann; Mayetri Gupta; Joseph G Ibrahim; Todd J Vision
Journal:  Bioinformatics       Date:  2009-01-15       Impact factor: 6.937

5.  Resolving the evolution of extant and extinct ruminants with high-throughput phylogenomics.

Authors:  Jared E Decker; J Chris Pires; Gavin C Conant; Stephanie D McKay; Michael P Heaton; Kefei Chen; Alan Cooper; Johanna Vilkki; Christopher M Seabury; Alexandre R Caetano; Gary S Johnson; Rick A Brenneman; Olivier Hanotte; Lori S Eggert; Pamela Wiener; Jong-Joo Kim; Kwan Suk Kim; Tad S Sonstegard; Curt P Van Tassell; Holly L Neibergs; John C McEwan; Rudiger Brauning; Luiz L Coutinho; Masroor E Babar; Gregory A Wilson; Matthew C McClure; Megan M Rolf; Jaewoo Kim; Robert D Schnabel; Jeremy F Taylor
Journal:  Proc Natl Acad Sci U S A       Date:  2009-10-21       Impact factor: 11.205

Review 6.  The impact of taxon sampling on phylogenetic inference: a review of two decades of controversy.

Authors:  Ahmed Ragab Nabhan; Indra Neil Sarkar
Journal:  Brief Bioinform       Date:  2011-03-23       Impact factor: 11.622

7.  Species delimitation and phylogenetic relationships of Chinese Leishmania isolates reexamined using kinetoplast cytochrome oxidase II gene sequences.

Authors:  De-Ping Cao; Xian-Guang Guo; Da-Li Chen; Jian-Ping Chen
Journal:  Parasitol Res       Date:  2011-01-11       Impact factor: 2.289

8.  A new effective method for estimating missing values in the sequence data prior to phylogenetic analysis.

Authors:  Abdoulaye Baniré Diallo; François-Joseph Lapointe; Vladimir Makarenkov
Journal:  Evol Bioinform Online       Date:  2007-02-01       Impact factor: 1.625

9.  Inferring phylogenies with incomplete data sets: a 5-gene, 567-taxon analysis of angiosperms.

Authors:  J Gordon Burleigh; Khidir W Hilu; Douglas E Soltis
Journal:  BMC Evol Biol       Date:  2009-03-17       Impact factor: 3.260

10.  A simple, fast, and accurate method of phylogenomic inference.

Authors:  Martin Wu; Jonathan A Eisen
Journal:  Genome Biol       Date:  2008-10-13       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.