Literature DB >> 22649082

Robust estimation of local genetic ancestry in admixed populations using a nonparametric Bayesian approach.

Kyung-Ah Sohn1, Zoubin Ghahramani, Eric P Xing.   

Abstract

We present a new haplotype-based approach for inferring local genetic ancestry of individuals in an admixed population. Most existing approaches for local ancestry estimation ignore the latent genetic relatedness between ancestral populations and treat them as independent. In this article, we exploit such information by building an inheritance model that describes both the ancestral populations and the admixed population jointly in a unified framework. Based on an assumption that the common hypothetical founder haplotypes give rise to both the ancestral and the admixed population haplotypes, we employ an infinite hidden Markov model to characterize each ancestral population and further extend it to generate the admixed population. Through an effective utilization of the population structural information under a principled nonparametric Bayesian framework, the resulting model is significantly less sensitive to the choice and the amount of training data for ancestral populations than state-of-the-art algorithms. We also improve the robustness under deviation from common modeling assumptions by incorporating population-specific scale parameters that allow variable recombination rates in different populations. Our method is applicable to an admixed population from an arbitrary number of ancestral populations and also performs competitively in terms of spurious ancestry proportions under a general multiway admixture assumption. We validate the proposed method by simulation under various admixing scenarios and present empirical analysis results from a worldwide-distributed dataset from the Human Genome Diversity Project.

Entities:  

Mesh:

Year:  2012        PMID: 22649082      PMCID: PMC3416008          DOI: 10.1534/genetics.112.140228

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  27 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Reconstructing genetic ancestry blocks in admixed individuals.

Authors:  Hua Tang; Marc Coram; Pei Wang; Xiaofeng Zhu; Neil Risch
Journal:  Am J Hum Genet       Date:  2006-05-17       Impact factor: 11.025

3.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

4.  Inference of population structure under a Dirichlet process model.

Authors:  John P Huelsenbeck; Peter Andolfatto
Journal:  Genetics       Date:  2007-01-21       Impact factor: 4.562

5.  A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase.

Authors:  Paul Scheet; Matthew Stephens
Journal:  Am J Hum Genet       Date:  2006-02-17       Impact factor: 11.025

6.  Spectrum: joint Bayesian inference of population structure and recombination events.

Authors:  Kyung-Ah Sohn; Eric P Xing
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

7.  Recent genetic selection in the ancestral admixture of Puerto Ricans.

Authors:  Hua Tang; Shweta Choudhry; Rui Mei; Martin Morgan; William Rodriguez-Cintron; Esteban González Burchard; Neil J Risch
Journal:  Am J Hum Genet       Date:  2007-08-01       Impact factor: 11.025

8.  Methods for high-density admixture mapping of disease genes.

Authors:  Nick Patterson; Neil Hattangadi; Barton Lane; Kirk E Lohmueller; David A Hafler; Jorge R Oksenberg; Stephen L Hauser; Michael W Smith; Stephen J O'Brien; David Altshuler; Mark J Daly; David Reich
Journal:  Am J Hum Genet       Date:  2004-04-14       Impact factor: 11.025

9.  Enhancements to the ADMIXTURE algorithm for individual ancestry estimation.

Authors:  David H Alexander; Kenneth Lange
Journal:  BMC Bioinformatics       Date:  2011-06-18       Impact factor: 3.307

10.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

View more
  11 in total

1.  A comprehensive survey of models for dissecting local ancestry deconvolution in human genome.

Authors:  Ephifania Geza; Jacquiline Mugo; Nicola J Mulder; Ambroise Wonkam; Emile R Chimusa; Gaston K Mazandu
Journal:  Brief Bioinform       Date:  2019-09-27       Impact factor: 11.622

2.  POPSTR: Inference of Admixed Population Structure Based on Single-Nucleotide Polymorphisms and Copy Number Variations.

Authors:  Jaeil Ahn; Brian Conkright; Simina M Boca; Subha Madhavan
Journal:  J Comput Biol       Date:  2018-01-02       Impact factor: 1.479

3.  Differential Evolution approach to detect recent admixture.

Authors:  Konstantin Kozlov; Dmitri Chebotarev; Mehedi Hassan; Martin Triska; Petr Triska; Pavel Flegontov; Tatiana V Tatarinova
Journal:  BMC Genomics       Date:  2015-06-18       Impact factor: 3.969

4.  hiHMM: Bayesian non-parametric joint inference of chromatin state maps.

Authors:  Kyung-Ah Sohn; Joshua W K Ho; Djordje Djordjevic; Hyun-Hwan Jeong; Peter J Park; Ju Han Kim
Journal:  Bioinformatics       Date:  2015-02-27       Impact factor: 6.937

5.  Spatial and temporal simulation of human evolution. Methods, frameworks and applications.

Authors:  Macarena Benguigui; Miguel Arenas
Journal:  Curr Genomics       Date:  2014-08       Impact factor: 2.236

6.  A Continuous Correlated Beta Process Model for Genetic Ancestry in Admixed Populations.

Authors:  Zachariah Gompert
Journal:  PLoS One       Date:  2016-03-11       Impact factor: 3.240

Review 7.  Toward high-resolution population genomics using archaeological samples.

Authors:  Irina Morozova; Pavel Flegontov; Alexander S Mikheyev; Sergey Bruskin; Hosseinali Asgharian; Petr Ponomarenko; Vladimir Klyuchnikov; GaneshPrasad ArunKumar; Egor Prokhortchouk; Yuriy Gankin; Evgeny Rogaev; Yuri Nikolsky; Ancha Baranova; Eran Elhaik; Tatiana V Tatarinova
Journal:  DNA Res       Date:  2016-07-19       Impact factor: 4.458

8.  Supervised machine learning reveals introgressed loci in the genomes of Drosophila simulans and D. sechellia.

Authors:  Daniel R Schrider; Julien Ayroles; Daniel R Matute; Andrew D Kern
Journal:  PLoS Genet       Date:  2018-04-23       Impact factor: 5.917

9.  The Unreasonable Effectiveness of Convolutional Neural Networks in Population Genetic Inference.

Authors:  Lex Flagel; Yaniv Brandvain; Daniel R Schrider
Journal:  Mol Biol Evol       Date:  2019-02-01       Impact factor: 16.240

10.  Computer programs and methodologies for the simulation of DNA sequence data with recombination.

Authors:  Miguel Arenas
Journal:  Front Genet       Date:  2013-02-01       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.