Literature DB >> 23769751

Effects of missing data on species tree estimation under the coalescent.

Rasmus Hovmöller1, L Lacey Knowles, Laura S Kubatko.   

Abstract

With recent advances in genomic sequencing, the importance of taking the effects of the processes that can cause discord between the speciation history and the individual gene histories into account has become evident. For multilocus datasets, it is difficult to achieve complete coverage of all sampled loci across all sample specimens, a problem that also arises when combining incompletely overlapping datasets. Here we examine how missing data affects the accuracy of species tree reconstruction. In our study, 10- and 100-locus sequence datasets were simulated under the coalescent model from shallow and deep speciation histories, and species trees were estimated using the maximum likelihood and Bayesian frameworks (with STEM and (*)BEAST, respectively). The accuracy of the estimated species trees was evaluated using the symmetric difference and the SPR distance. We examine the effects of sampling more than one individual per species, as well as the effects of different patterns of missing data (i.e., different amounts of missing data, which is represented among random taxa as opposed to being concentrated in specific taxa, as is often the case for empirical studies). Our general conclusion is that the species tree estimates are remarkably resilient to the effects of missing data. We find that for datasets with more limited numbers of loci, sampling more than one individual per species has the strongest effect on improving species tree accuracy when there is missing data, especially at higher degrees of missing data. For larger multilocus datasets (e.g., 25-100 loci), the amount of missing data has a negligible effect on species tree reconstruction, even at 50% missing data and a single sampled individual per species.
Copyright © 2013 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Coalescent model; Multilocus data; Species tree inference

Mesh:

Year:  2013        PMID: 23769751     DOI: 10.1016/j.ympev.2013.06.004

Source DB:  PubMed          Journal:  Mol Phylogenet Evol        ISSN: 1055-7903            Impact factor:   4.286


  13 in total

1.  DNA Barcodes Combined with Multilocus Data of Representative Taxa Can Generate Reliable Higher-Level Phylogenies.

Authors:  Gerard Talavera; Vladimir Lukhtanov; Naomi E Pierce; Roger Vila
Journal:  Syst Biol       Date:  2022-02-10       Impact factor: 15.683

2.  Application of the phylogenetic species concept to Wallemia sebi from house dust and indoor air revealed by multi-locus genealogical concordance.

Authors:  Hai D T Nguyen; Sašo Jančič; Martin Meijer; Joey B Tanney; Polona Zalar; Nina Gunde-Cimerman; Keith A Seifert
Journal:  PLoS One       Date:  2015-03-23       Impact factor: 3.240

3.  Concatenation and Species Tree Methods Exhibit Statistically Indistinguishable Accuracy under a Range of Simulated Conditions.

Authors:  João Tonini; Andrew Moore; David Stern; Maryia Shcheglovitova; Guillermo Ortí
Journal:  PLoS Curr       Date:  2015-03-09

4.  Effect of diversity and missing data on genetic assignment with RAD-Seq markers.

Authors:  Balaji Chattopadhyay; Kritika M Garg; Uma Ramakrishnan
Journal:  BMC Res Notes       Date:  2014-11-25

5.  Genotyping-by-sequencing provides the discriminating power to investigate the subspecies of Daucus carota (Apiaceae).

Authors:  Carlos I Arbizu; Shelby L Ellison; Douglas Senalik; Philipp W Simon; David M Spooner
Journal:  BMC Evol Biol       Date:  2016-10-28       Impact factor: 3.260

Review 6.  Multilocus inference of species trees and DNA barcoding.

Authors:  Diego Mallo; David Posada
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2016-09-05       Impact factor: 6.237

7.  Coalescent-based delimitation outperforms distance-based methods for delineating less divergent species: the case of Kurixalus odontotarsus species group.

Authors:  Guohua Yu; Dingqi Rao; Masafumi Matsui; Junxing Yang
Journal:  Sci Rep       Date:  2017-11-23       Impact factor: 4.379

8.  Phylogeny of Hepatocystis parasites of Australian flying foxes reveals distinct parasite clade.

Authors:  Juliane Schaer; Lee McMichael; Anita N Gordon; Daniel Russell; Kai Matuschewski; Susan L Perkins; Hume Field; Michelle Power
Journal:  Int J Parasitol Parasites Wildl       Date:  2018-06-06       Impact factor: 2.674

9.  Uneven Missing Data Skew Phylogenomic Relationships within the Lories and Lorikeets.

Authors:  Brian Tilston Smith; William M Mauck; Brett W Benz; Michael J Andersen
Journal:  Genome Biol Evol       Date:  2020-07-01       Impact factor: 3.416

10.  Reconstructing the Backbone of the Saccharomycotina Yeast Phylogeny Using Genome-Scale Data.

Authors:  Xing-Xing Shen; Xiaofan Zhou; Jacek Kominek; Cletus P Kurtzman; Chris Todd Hittinger; Antonis Rokas
Journal:  G3 (Bethesda)       Date:  2016-12-07       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.