
Fast model-based estimation of ancestry in unrelated individuals.

David H Alexander, John Novembre, Kenneth Lange.

Abstract

Population stratification has long been recognized as a confounding factor in genetic association studies. Estimated ancestries, derived from multi-locus genotype data, can be used to perform a statistical correction for population stratification. One popular technique for estimation of ancestry is the model-based approach embodied by the widely applied program structure. Another approach, implemented in the program EIGENSTRAT, relies on Principal Component Analysis rather than model-based estimation and does not directly deliver admixture fractions. EIGENSTRAT has gained in popularity in part owing to its remarkable speed in comparison to structure. We present a new algorithm and a program, ADMIXTURE, for model-based estimation of ancestry in unrelated individuals. ADMIXTURE adopts the likelihood model embedded in structure. However, ADMIXTURE runs considerably faster, solving problems in minutes that take structure hours. In many of our experiments, we have found that ADMIXTURE is almost as fast as EIGENSTRAT. The runtime improvements of ADMIXTURE rely on a fast block relaxation scheme using sequential quadratic programming for block updates, coupled with a novel quasi-Newton acceleration of convergence. Our algorithm also runs faster and with greater accuracy than the implementation of an Expectation-Maximization (EM) algorithm incorporated in the program FRAPPE. Our simulations show that ADMIXTURE's maximum likelihood estimates of the underlying admixture coefficients and ancestral allele frequencies are as accurate as structure's Bayesian estimates. On real-world data sets, ADMIXTURE's estimates are directly comparable to those from structure and EIGENSTRAT. Taken together, our results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.
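The abstract names the likelihood model shared with structure and the EM algorithm used by FRAPPE, but does not write either out. As a rough illustration only, the sketch below assumes the usual binomial admixture model for biallelic markers: individual i carries g_ij copies (0, 1, or 2) of a reference allele at marker j, with success probability sum_k q_ik f_kj, where q_ik are admixture fractions and f_kj are ancestral allele frequencies. The update shown is a plain EM step of the kind the abstract attributes to FRAPPE. It is not ADMIXTURE's block relaxation with sequential quadratic programming, and it omits the quasi-Newton acceleration. The function names and the toy data are hypothetical.

```python
import math

def log_likelihood(G, Q, F):
    """Binomial admixture log-likelihood (assumed model, biallelic markers).

    G[i][j]: copies (0, 1, 2) of the reference allele, individual i, marker j.
    Q[i][k]: fraction of individual i's ancestry from population k (rows sum to 1).
    F[k][j]: reference-allele frequency at marker j in population k.
    """
    total = 0.0
    for i, row in enumerate(G):
        for j, g in enumerate(row):
            p = sum(Q[i][k] * F[k][j] for k in range(len(F)))
            total += g * math.log(p) + (2 - g) * math.log(1.0 - p)
    return total

def em_step(G, Q, F):
    """One EM update of (Q, F) under the model above; returns new copies."""
    I, J, K = len(G), len(G[0]), len(F)
    new_Q = [[0.0] * K for _ in range(I)]
    num = [[0.0] * J for _ in range(K)]  # expected reference-allele copies from k
    den = [[0.0] * J for _ in range(K)]  # expected total allele copies from k
    for i in range(I):
        for j in range(J):
            g = G[i][j]
            p = sum(Q[i][k] * F[k][j] for k in range(K))
            for k in range(K):
                # Posterior probability that an allele copy came from population k,
                # for a reference copy (a) and for a non-reference copy (b).
                a = Q[i][k] * F[k][j] / p
                b = Q[i][k] * (1.0 - F[k][j]) / (1.0 - p)
                new_Q[i][k] += (g * a + (2 - g) * b) / (2.0 * J)
                num[k][j] += g * a
                den[k][j] += g * a + (2 - g) * b
    new_F = [[num[k][j] / den[k][j] for j in range(J)] for k in range(K)]
    return new_Q, new_F

# Toy data (hypothetical): 4 individuals, 3 markers, K = 2 populations.
G = [[2, 1, 0], [1, 2, 0], [0, 1, 2], [0, 0, 1]]
Q0 = [[0.6, 0.4], [0.5, 0.5], [0.4, 0.6], [0.3, 0.7]]
F0 = [[0.7, 0.6, 0.3], [0.2, 0.4, 0.6]]

Q1, F1 = em_step(G, Q0, F0)
print(log_likelihood(G, Q0, F0), log_likelihood(G, Q1, F1))
```

Each EM step cannot decrease the log-likelihood, which makes it a convenient baseline. Its slow convergence on large marker sets is the kind of cost that the faster scheme described in the abstract is meant to avoid.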

Year: 2009   PMID: 19648217   PMCID: PMC2752134   DOI: 10.1101/gr.094052.109

Source DB: PubMed   Journal: Genome Res   ISSN: 1088-9051   Impact factor: 9.043

