
Genetic ancestry inference using support vector machines, and the active emergence of a unique American population.

Ryan J Haasl, Catherine A McCarty, Bret A Payseur.

Abstract

We use genotype data from the Marshfield Clinic Research Foundation Personalized Medicine Research Project to investigate genetic similarity and divergence between Europeans and the sampled population of European Americans in Central Wisconsin, USA. To infer recent genetic ancestry of the sampled Wisconsinites, we train support vector machines (SVMs) on the positions of Europeans along top principal components (PCs). Our SVM models partition continent-wide European genetic variance into eight regional classes, which is an improvement over the geographically broader categories of recent ancestry reported by personal genomics companies. After correcting for misclassification error associated with the SVMs (<10% in all cases), we observe a >14% discrepancy between insular ancestries reported by Wisconsinites and those inferred by SVM. Values of FST as well as Mantel tests for correlation between genetic and European geographic distances indicate minimal divergence between Europe and the local Wisconsin population. However, we find that individuals from the Wisconsin sample show greater dispersion along higher-order PCs than individuals from Europe. Hypothesizing that this pattern is characteristic of nascent divergence, we run computer simulations that mimic the recent peopling of Wisconsin. Simulations corroborate the pattern in higher-order PCs, demonstrate its transient nature, and show that admixture accelerates the rate of divergence between the admixed population and its parental sources relative to drift alone. Together, empirical and simulation results suggest that genetic divergence between European source populations and European Americans in Central Wisconsin is subtle but already under way.
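
The abstract mentions Mantel tests for correlation between genetic and geographic distances but does not give implementation details. The following is only a minimal sketch of a generic permutation-based Mantel test using the Python standard library. The function names, the toy positions, and the perfectly proportional genetic distances are illustrative assumptions, not the authors' data or code.

```python
import math
import random


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def mantel_test(genetic, geographic, permutations=999, seed=0):
    """Return (r, p) for a one-sided Mantel test of positive correlation.

    Rows and columns of `genetic` are permuted together, while
    `geographic` stays fixed.
    """
    rng = random.Random(seed)
    n = len(genetic)
    geo_flat = upper_triangle(geographic)
    observed = pearson(upper_triangle(genetic), geo_flat)
    at_least_as_large = 0
    order = list(range(n))
    for _ in range(permutations):
        rng.shuffle(order)
        permuted = [[genetic[order[i]][order[j]] for j in range(n)]
                    for i in range(n)]
        if pearson(upper_triangle(permuted), geo_flat) >= observed:
            at_least_as_large += 1
    # Count the observed statistic as one of the permutations.
    p_value = (at_least_as_large + 1) / (permutations + 1)
    return observed, p_value


# Hypothetical toy data: five populations placed along a line, with
# genetic distance assumed exactly proportional to geographic distance.
positions = [0.0, 1.0, 3.0, 6.0, 10.0]
geographic = [[abs(a - b) for b in positions] for a in positions]
genetic = [[0.001 * d for d in row] for row in geographic]

r, p = mantel_test(genetic, geographic)
print(f"Mantel r = {r:.3f}, permutation p = {p:.3f}")
```

With real data, the genetic matrix would hold pairwise distances such as FST between populations, and the geographic matrix would hold great-circle distances; the permutation scheme stays the same.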

Year:  2012        PMID: 23211701      PMCID: PMC3641388          DOI: 10.1038/ejhg.2012.258

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


