Literature DB >> 26071445

Posterior predictive checks to quantify lack-of-fit in admixture models of latent population structure.

David Mimno1, David M Blei2, Barbara E Engelhardt3.   

Abstract

Admixture models are a ubiquitous approach to capture latent population structure in genetic samples. Despite the widespread application of admixture models, little thought has been devoted to the quality of the model fit or the accuracy of the estimates of parameters of interest for a particular study. Here we develop methods for validating admixture models based on posterior predictive checks (PPCs), a Bayesian method for assessing the quality of fit of a statistical model to a specific dataset. We develop PPCs for five population-level statistics of interest: within-population genetic variation, background linkage disequilibrium, number of ancestral populations, between-population genetic variation, and the downstream use of admixture parameters to correct for population structure in association studies. Using PPCs, we evaluate the quality of the admixture model fit to four qualitatively different population genetic datasets: the population reference sample (POPRES) European individuals, the HapMap phase 3 individuals, continental Indians, and African American individuals. We found that the same model fitted to different genomic studies resulted in highly study-specific results when evaluated using PPCs, illustrating the utility of PPCs for model-based analyses in large genomic studies.

Keywords:  admixture models; genomic data; model checking; population structure; posterior predictive checks

Mesh:

Year:  2015        PMID: 26071445      PMCID: PMC4491772          DOI: 10.1073/pnas.1412301112

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  52 in total

1.  Linkage disequilibrium patterns of the human genome across populations.

Authors:  Sagiv Shifman; Jane Kuypers; Mark Kokoris; Benjamin Yakir; Ariel Darvasi
Journal:  Hum Mol Genet       Date:  2003-04-01       Impact factor: 6.150

2.  Informativeness of genetic markers for inference of ancestry.

Authors:  Noah A Rosenberg; Lei M Li; Ryk Ward; Jonathan K Pritchard
Journal:  Am J Hum Genet       Date:  2003-11-20       Impact factor: 11.025

3.  Control of confounding of genetic associations in stratified populations.

Authors:  Clive J Hoggart; Eteban J Parra; Mark D Shriver; Carolina Bonilla; Rick A Kittles; David G Clayton; Paul M McKeigue
Journal:  Am J Hum Genet       Date:  2003-06       Impact factor: 11.025

4.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

5.  Genetic structure of human populations.

Authors:  Noah A Rosenberg; Jonathan K Pritchard; James L Weber; Howard M Cann; Kenneth K Kidd; Lev A Zhivotovsky; Marcus W Feldman
Journal:  Science       Date:  2002-12-20       Impact factor: 47.728

6.  The effects of human population structure on large genetic association studies.

Authors:  Jonathan Marchini; Lon R Cardon; Michael S Phillips; Peter Donnelly
Journal:  Nat Genet       Date:  2004-03-28       Impact factor: 38.330

7.  Denisova admixture and the first modern human dispersals into Southeast Asia and Oceania.

Authors:  David Reich; Nick Patterson; Martin Kircher; Frederick Delfin; Madhusudan R Nandineni; Irina Pugach; Albert Min-Shan Ko; Ying-Chin Ko; Timothy A Jinam; Maude E Phipps; Naruya Saitou; Andreas Wollstein; Manfred Kayser; Svante Pääbo; Mark Stoneking
Journal:  Am J Hum Genet       Date:  2011-09-22       Impact factor: 11.025

8.  Methods for high-density admixture mapping of disease genes.

Authors:  Nick Patterson; Neil Hattangadi; Barton Lane; Kirk E Lohmueller; David A Hafler; Jorge R Oksenberg; Stephen L Hauser; Michael W Smith; Stephen J O'Brien; David Altshuler; Mark J Daly; David Reich
Journal:  Am J Hum Genet       Date:  2004-04-14       Impact factor: 11.025

9.  The history of African gene flow into Southern Europeans, Levantines, and Jews.

Authors:  Priya Moorjani; Nick Patterson; Joel N Hirschhorn; Alon Keinan; Li Hao; Gil Atzmon; Edward Burns; Harry Ostrer; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2011-04-21       Impact factor: 5.917

10.  Variation in human recombination rates and its genetic determinants.

Authors:  Adi Fledel-Alon; Ellen Miranda Leffler; Yongtao Guan; Matthew Stephens; Graham Coop; Molly Przeworski
Journal:  PLoS One       Date:  2011-06-17       Impact factor: 3.240

View more
  3 in total

1.  Standardized Biogeographic Grouping System for Annotating Populations in Pharmacogenetic Research.

Authors:  Rachel Huddart; Alison E Fohner; Michelle Whirl-Carrillo; Genevieve L Wojcik; Christopher R Gignoux; Alice B Popejoy; Carlos D Bustamante; Russ B Altman; Teri E Klein
Journal:  Clin Pharmacol Ther       Date:  2019-01-21       Impact factor: 6.875

2.  Combining Multiple Hypothesis Testing with Machine Learning Increases the Statistical Power of Genome-wide Association Studies.

Authors:  Bettina Mieth; Marius Kloft; Juan Antonio Rodríguez; Sören Sonnenburg; Robin Vobruba; Carlos Morcillo-Suárez; Xavier Farré; Urko M Marigorta; Ernst Fehr; Thorsten Dickhaus; Gilles Blanchard; Daniel Schunk; Arcadi Navarro; Klaus-Robert Müller
Journal:  Sci Rep       Date:  2016-11-28       Impact factor: 4.379

3.  Efficient analysis of large datasets and sex bias with ADMIXTURE.

Authors:  Suyash S Shringarpure; Carlos D Bustamante; Kenneth Lange; David H Alexander
Journal:  BMC Bioinformatics       Date:  2016-05-23       Impact factor: 3.169

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.