Inference of population structure under a Dirichlet process model.

John P Huelsenbeck, Peter Andolfatto.

Abstract

Inferring population structure from genetic data sampled from some number of individuals is a formidable statistical problem. One widely used approach considers the number of populations to be fixed and calculates the posterior probability of assigning individuals to each population. More recently, the assignment of individuals to populations and the number of populations have both been considered random variables that follow a Dirichlet process prior. We examined the statistical behavior of assignment of individuals to populations under a Dirichlet process prior. First, we examined a best-case scenario, in which all of the assumptions of the Dirichlet process prior were satisfied, by generating data under a Dirichlet process prior. Second, we examined the performance of the method when the genetic data were generated under a population genetics model with symmetric migration between populations. We examined the accuracy of population assignment using a distance on partitions. The method can be quite accurate with a moderate number of loci. As expected, inferences on the number of populations are more accurate when θ = 4N_e u is large and when the migration rate (4N_e m) is low. We also examined the sensitivity of inferences of population structure to choice of the parameter of the Dirichlet process model. Although inferences could be sensitive to the choice of the prior on the number of populations, this sensitivity occurred when the number of loci sampled was small; inferences are more robust to the prior on the number of populations when the number of sampled loci is large. Finally, we discuss several methods for summarizing the results of a Bayesian Markov chain Monte Carlo (MCMC) analysis of population structure. We develop the notion of the mean population partition, which is the partition of individuals to populations that minimizes the squared partition distance to the partitions sampled by the MCMC algorithm.
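
The abstract notes that the single parameter of the Dirichlet process model acts as a prior on the number of populations. The following is a minimal sketch of that relationship, not the authors' implementation. It assumes the standard Chinese restaurant process representation of the Dirichlet process, in which individual i joins an existing population with probability proportional to its current size and founds a new one with probability proportional to a concentration parameter. The symbol `alpha` and the function names are hypothetical choices made for this illustration.

```python
import random


def expected_num_populations(n, alpha):
    """Prior mean number of populations for n individuals.

    Under the Chinese restaurant process, the (i+1)-th individual founds a
    new population with probability alpha / (alpha + i), so the prior mean
    is the sum of those probabilities over i = 0 .. n-1.
    """
    return sum(alpha / (alpha + i) for i in range(n))


def sample_crp_partition(n, alpha, rng):
    """Draw one assignment of n individuals to populations from the prior.

    Returns a list of population labels (0, 1, 2, ...), one per individual.
    """
    assignment = []
    sizes = []  # sizes[k] = number of individuals currently in population k
    for _ in range(n):
        # Existing population k has weight sizes[k]; a new one has weight alpha.
        weights = sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(sizes):
            sizes.append(1)
        else:
            sizes[k] += 1
        assignment.append(k)
    return assignment


if __name__ == "__main__":
    rng = random.Random(1)  # fixed seed so the run is repeatable
    n = 20                  # hypothetical sample size
    for alpha in (0.1, 1.0, 5.0):
        draws = [len(set(sample_crp_partition(n, alpha, rng)))
                 for _ in range(2000)]
        simulated_mean = sum(draws) / len(draws)
        print(alpha, expected_num_populations(n, alpha), simulated_mean)
```

Running the script prints, for each value of `alpha`, the analytical prior mean next to a simulated one. Larger values place more prior weight on many populations, which is the kind of prior sensitivity the abstract reports when few loci are sampled.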

Year:  2007        PMID: 17237522      PMCID: PMC1855109          DOI: 10.1534/genetics.106.061317

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


