Literature DB >> 22095078

Fine-scale estimation of location of birth from genome-wide single-nucleotide polymorphism data.

Clive J Hoggart1, Paul F O'Reilly, Marika Kaakinen, Weihua Zhang, John C Chambers, Jaspal S Kooner, Lachlan J M Coin, Marjo-Riitta Jarvelin.   

Abstract

Systematic nonrandom mating in populations results in genetic stratification and is predominantly caused by geographic separation, providing the opportunity to infer individuals' birthplace from genetic data. Such inference has been demonstrated for individuals' country of birth, but here we use data from the Northern Finland Birth Cohort 1966 (NFBC1966) to investigate the characteristics of genetic structure within a population and subsequently develop a method for inferring location to a finer scale. Principal component analysis (PCA) shows that while the first PCs are particularly informative for location, there is also location information in the higher-order PCs, but it cannot be captured by a linear model. We introduce a new method, pcLOCATE, which is able to exploit this information to improve the accuracy of location inference. pcLOCATE uses individuals' PC values to estimate the probability of birth in each town and then averages over all towns to give an estimated longitude and latitude of birth using a fully Bayesian model. We apply pcLOCATE to the NFBC1966 data to estimate parental birthplace, testing with successively more PCs and finding the model with the top 23 PCs most accurate, with a median distance of 23 km between the estimated and the true location. pcLOCATE predicts the most recent residence of NFBC1966 individuals to a median distance of 47 km. We also apply pcLOCATE to Indian individuals from the London Life Sciences Prospective Population Study (LOLIPOP) data, and find that birthplace is predicated to a median distance of 54 km from the true location. A method with such accuracy is potentially valuable in population genetics and forensics.

Entities:  

Mesh:

Year:  2011        PMID: 22095078      PMCID: PMC3276643          DOI: 10.1534/genetics.111.135657

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  19 in total

Review 1.  Finnish Disease Heritage I: characteristics, causes, background.

Authors:  Reijo Norio
Journal:  Hum Genet       Date:  2003-03-08       Impact factor: 4.132

2.  The interval of linkage disequilibrium (LD) detected with microsatellite and SNP markers in chromosomes of Finnish populations with different histories.

Authors:  Teppo Varilo; Tiina Paunio; Alex Parker; Markus Perola; Joanne Meyer; Joseph D Terwilliger; Leena Peltonen
Journal:  Hum Mol Genet       Date:  2003-01-01       Impact factor: 6.150

3.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

4.  The genome-wide patterns of variation expose significant substructure in a founder population.

Authors:  Eveliina Jakkula; Karola Rehnström; Teppo Varilo; Olli P H Pietiläinen; Tiina Paunio; Nancy L Pedersen; Ulf deFaire; Marjo-Riitta Järvelin; Juha Saharinen; Nelson Freimer; Samuli Ripatti; Shaun Purcell; Andrew Collins; Mark J Daly; Aarno Palotie; Leena Peltonen
Journal:  Am J Hum Genet       Date:  2008-12       Impact factor: 11.025

5.  Principal component analysis under population genetic models of range expansion and admixture.

Authors:  Olivier François; Mathias Currat; Nicolas Ray; Eunjung Han; Laurent Excoffier; John Novembre
Journal:  Mol Biol Evol       Date:  2010-01-21       Impact factor: 16.240

6.  Interpreting principal component analyses of spatial population genetic variation.

Authors:  John Novembre; Matthew Stephens
Journal:  Nat Genet       Date:  2008-04-20       Impact factor: 38.330

7.  Genetic variation in SCN10A influences cardiac conduction.

Authors:  John C Chambers; Jing Zhao; Cesare M N Terracciano; Connie R Bezzina; Weihua Zhang; Riyaz Kaba; Manoraj Navaratnarajah; Amol Lotlikar; Joban S Sehmi; Manraj K Kooner; Guohong Deng; Urszula Siedlecka; Saurabh Parasramka; Ismail El-Hamamsy; Mark N Wass; Lukas R C Dekker; Jonas S S G de Jong; Michael J E Sternberg; William McKenna; Nicholas J Severs; Ranil de Silva; Arthur A M Wilde; Praveen Anand; Magdi Yacoub; James Scott; Paul Elliott; John N Wood; Jaspal S Kooner
Journal:  Nat Genet       Date:  2010-01-10       Impact factor: 38.330

8.  Genes mirror geography within Europe.

Authors:  John Novembre; Toby Johnson; Katarzyna Bryc; Zoltán Kutalik; Adam R Boyko; Adam Auton; Amit Indap; Karen S King; Sven Bergmann; Matthew R Nelson; Matthew Stephens; Carlos D Bustamante
Journal:  Nature       Date:  2008-08-31       Impact factor: 49.962

9.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

10.  Reconstructing Indian population history.

Authors:  David Reich; Kumarasamy Thangaraj; Nick Patterson; Alkes L Price; Lalji Singh
Journal:  Nature       Date:  2009-09-24       Impact factor: 49.962

View more
  4 in total

Review 1.  Benefits and limitations of genome-wide association studies.

Authors:  Vivian Tam; Nikunj Patel; Michelle Turcotte; Yohan Bossé; Guillaume Paré; David Meyre
Journal:  Nat Rev Genet       Date:  2019-08       Impact factor: 53.242

2.  Genome-wide insights into the genetic history of human populations.

Authors:  Irina Pugach; Mark Stoneking
Journal:  Investig Genet       Date:  2015-04-01

3.  Anisotropic isolation by distance: the main orientations of human genetic differentiation.

Authors:  Flora Jay; Per Sjödin; Mattias Jakobsson; Michael G B Blum
Journal:  Mol Biol Evol       Date:  2012-11-20       Impact factor: 16.240

4.  A quantitative comparison of the similarity between genes and geography in worldwide human populations.

Authors:  Chaolong Wang; Sebastian Zöllner; Noah A Rosenberg
Journal:  PLoS Genet       Date:  2012-08-23       Impact factor: 5.917

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.