Literature DB >> 16364968

Accuracy of coalescent likelihood estimates: do we need more sites, more sequences, or more loci?

Joseph Felsenstein1.   

Abstract

A computer simulation study has been made of the accuracy of estimates of Theta = 4Nemu from a sample from a single isolated population of finite size. The accuracies turn out to be well predicted by a formula developed by Fu and Li, who used optimistic assumptions. Their formulas are restated in terms of accuracy, defined here as the reciprocal of the squared coefficient of variation. This should be proportional to sample size when the entities sampled provide independent information. Using these formulas for accuracy, the sampling strategy for estimation of Theta can be investigated. Two models for cost have been used, a cost-per-base model and a cost-per-read model. The former would lead us to prefer to have a very large number of loci, each one base long. The latter, which is more realistic, causes us to prefer to have one read per locus and an optimum sample size which declines as costs of sampling organisms increase. For realistic values, the optimum sample size is 8 or fewer individuals. This is quite close to the results obtained by Pluzhnikov and Donnelly for a cost-per-base model, evaluating other estimators of Theta. It can be understood by considering that the resources spent collecting larger samples prevent us from considering more loci. An examination of the efficiency of Watterson's estimator of Theta was also made, and it was found to be reasonably efficient when the number of mutants per generation in the sequence in the whole population is less than 2.5.

Mesh:

Year:  2005        PMID: 16364968     DOI: 10.1093/molbev/msj079

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  84 in total

1.  High-resolution species trees without concatenation.

Authors:  Scott V Edwards; Liang Liu; Dennis K Pearl
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-28       Impact factor: 11.205

2.  The limits of fine-scale mapping.

Authors:  Lucian P Smith; Mary K Kuhner
Journal:  Genet Epidemiol       Date:  2009-05       Impact factor: 2.135

Review 3.  Multilocus phylogeography and phylogenetics using sequence-based markers.

Authors:  Patrícia H Brito; Scott V Edwards
Journal:  Genetica       Date:  2008-07-24       Impact factor: 1.082

4.  Molecular systematics and the diatom species.

Authors:  Andrew J Alverson
Journal:  Protist       Date:  2008-06-09

5.  Investigating species boundaries using DNA and morphology in the mite Tyrophagus curvipenis (Acari: Acaridae), an emerging invasive pest, with a molecular phylogeny of the genus Tyrophagus.

Authors:  Pamela Murillo; Pavel Klimov; Jan Hubert; Barry OConnor
Journal:  Exp Appl Acarol       Date:  2018-04-26       Impact factor: 2.132

6.  How robust are "isolation with migration" analyses to violations of the im model? A simulation study.

Authors:  Jared L Strasburg; Loren H Rieseberg
Journal:  Mol Biol Evol       Date:  2009-09-30       Impact factor: 16.240

7.  Isolation with migration models for more than two populations.

Authors:  Jody Hey
Journal:  Mol Biol Evol       Date:  2009-12-02       Impact factor: 16.240

8.  DNA evidence for historic population size and past ecosystem impacts of gray whales.

Authors:  S Elizabeth Alter; Eric Rynes; Stephen R Palumbi
Journal:  Proc Natl Acad Sci U S A       Date:  2007-09-11       Impact factor: 11.205

9.  A reduced representation approach to population genetic analyses and applications to human evolution.

Authors:  Francesca Luca; Richard R Hudson; David B Witonsky; Anna Di Rienzo
Journal:  Genome Res       Date:  2011-05-31       Impact factor: 9.043

10.  Bayesian inference of local trees along chromosomes by the sequential Markov coalescent.

Authors:  Chaozhi Zheng; Mary K Kuhner; Elizabeth A Thompson
Journal:  J Mol Evol       Date:  2014-05-11       Impact factor: 2.395

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.