Literature DB >> 23726367

Enhanced localization of genetic samples through linkage-disequilibrium correction.

Yael Baran1, Inés Quintela, Angel Carracedo, Bogdan Pasaniuc, Eran Halperin.   

Abstract

Characterizing the spatial patterns of genetic diversity in human populations has a wide range of applications, from detecting genetic mutations associated with disease to inferring human history. Current approaches, including the widely used principal-component analysis, are not suited for the analysis of linked markers, and local and long-range linkage disequilibrium (LD) can dramatically reduce the accuracy of spatial localization when unaccounted for. To overcome this, we have introduced an approach that performs spatial localization of individuals on the basis of their genetic data and explicitly models LD among markers by using a multivariate normal distribution. By leveraging external reference panels, we derive closed-form solutions to the optimization procedure to achieve a computationally efficient method that can handle large data sets. We validate the method on empirical data from a large sample of European individuals from the POPRES data set, as well as on a large sample of individuals of Spanish ancestry. First, we show that by modeling LD, we achieve accuracy superior to that of existing methods. Importantly, whereas other methods show decreased performance when dense marker panels are used in the inference, our approach improves in accuracy as more markers become available. Second, we show that accurate localization of genetic data can be achieved with only a part of the genome, and this could potentially enable the spatial localization of admixed samples that have a fraction of their genome originating from a given continent. Finally, we demonstrate that our approach is resistant to distortions resulting from long-range LD regions; such distortions can dramatically bias the results when unaccounted for.
Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23726367      PMCID: PMC3675263          DOI: 10.1016/j.ajhg.2013.04.023

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  31 in total

1.  Fast and accurate inference of local ancestry in Latino populations.

Authors:  Yael Baran; Bogdan Pasaniuc; Sriram Sankararaman; Dara G Torgerson; Christopher Gignoux; Celeste Eng; William Rodriguez-Cintron; Rocio Chapela; Jean G Ford; Pedro C Avila; Jose Rodriguez-Santana; Esteban Gonzàlez Burchard; Eran Halperin
Journal:  Bioinformatics       Date:  2012-04-11       Impact factor: 6.937

2.  Colloquium paper: genome-wide patterns of population structure and admixture among Hispanic/Latino populations.

Authors:  Katarzyna Bryc; Christopher Velez; Tatiana Karafet; Andres Moreno-Estrada; Andy Reynolds; Adam Auton; Michael Hammer; Carlos D Bustamante; Harry Ostrer
Journal:  Proc Natl Acad Sci U S A       Date:  2010-05-05       Impact factor: 11.205

3.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

4.  Long-range LD can confound genome scans in admixed populations.

Authors:  Alkes L Price; Michael E Weale; Nick Patterson; Simon R Myers; Anna C Need; Kevin V Shianna; Dongliang Ge; Jerome I Rotter; Esther Torres; Kent D Taylor; David B Goldstein; David Reich
Journal:  Am J Hum Genet       Date:  2008-07       Impact factor: 11.025

Review 5.  Linkage disequilibrium in humans: models and data.

Authors:  J K Pritchard; M Przeworski
Journal:  Am J Hum Genet       Date:  2001-06-14       Impact factor: 11.025

6.  A model-based approach for analysis of spatial structure in genetic data.

Authors:  Wen-Yun Yang; John Novembre; Eleazar Eskin; Eran Halperin
Journal:  Nat Genet       Date:  2012-05-20       Impact factor: 38.330

7.  USING LINEAR PREDICTORS TO IMPUTE ALLELE FREQUENCIES FROM SUMMARY OR POOLED GENOTYPE DATA.

Authors:  Xiaoquan Wen; Matthew Stephens
Journal:  Ann Appl Stat       Date:  2010-09       Impact factor: 2.083

8.  Discovering genetic ancestry using spectral graph theory.

Authors:  Ann B Lee; Diana Luca; Lambertus Klei; Bernie Devlin; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-01       Impact factor: 2.135

9.  Population structure and eigenanalysis.

Authors:  Nick Patterson; Alkes L Price; David Reich
Journal:  PLoS Genet       Date:  2006-12       Impact factor: 5.917

10.  A quantitative comparison of the similarity between genes and geography in worldwide human populations.

Authors:  Chaolong Wang; Sebastian Zöllner; Noah A Rosenberg
Journal:  PLoS Genet       Date:  2012-08-23       Impact factor: 5.917

View more
  15 in total

1.  A spatial haplotype copying model with applications to genotype imputation.

Authors:  Wen-Yun Yang; Farhad Hormozdiari; Eleazar Eskin; Bogdan Pasaniuc
Journal:  J Comput Biol       Date:  2014-12-19       Impact factor: 1.479

2.  Population Structure of UK Biobank and Ancient Eurasians Reveals Adaptation at Genes Influencing Blood Pressure.

Authors:  Kevin J Galinsky; Po-Ru Loh; Swapan Mallick; Nick J Patterson; Alkes L Price
Journal:  Am J Hum Genet       Date:  2016-10-20       Impact factor: 11.025

3.  The contribution of rare variation to prostate cancer heritability.

Authors:  Nicholas Mancuso; Nadin Rohland; Kristin A Rand; Arti Tandon; Alexander Allen; Dominique Quinque; Swapan Mallick; Heng Li; Alex Stram; Xin Sheng; Zsofia Kote-Jarai; Douglas F Easton; Rosalind A Eeles; Loic Le Marchand; Alex Lubwama; Daniel Stram; Stephen Watya; David V Conti; Brian Henderson; Christopher A Haiman; Bogdan Pasaniuc; David Reich
Journal:  Nat Genet       Date:  2015-11-16       Impact factor: 38.330

4.  Detecting individual ancestry in the human genome.

Authors:  Andreas Wollstein; Oscar Lao
Journal:  Investig Genet       Date:  2015-05-01

5.  HaploPOP: a software that improves population assignment by combining markers into haplotypes.

Authors:  Nicolas Duforet-Frebourg; Lucie M Gattepaille; Michael G B Blum; Mattias Jakobsson
Journal:  BMC Bioinformatics       Date:  2015-07-31       Impact factor: 3.169

6.  Spatial localization of recent ancestors for admixed individuals.

Authors:  Wen-Yun Yang; Alexander Platt; Charleston Wen-Kai Chiang; Eleazar Eskin; John Novembre; Bogdan Pasaniuc
Journal:  G3 (Bethesda)       Date:  2014-11-03       Impact factor: 3.154

Review 7.  Challenges in analysis and interpretation of microsatellite data for population genetic studies.

Authors:  Alexander I Putman; Ignazio Carbone
Journal:  Ecol Evol       Date:  2014-10-30       Impact factor: 2.912

8.  GAGA: a new algorithm for genomic inference of geographic ancestry reveals fine level population substructure in Europeans.

Authors:  Oscar Lao; Fan Liu; Andreas Wollstein; Manfred Kayser
Journal:  PLoS Comput Biol       Date:  2014-02-20       Impact factor: 4.475

9.  Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis.

Authors:  Gancho T Slavov; Rick Nipper; Paul Robson; Kerrie Farrar; Gordon G Allison; Maurice Bosch; John C Clifton-Brown; Iain S Donnison; Elaine Jensen
Journal:  New Phytol       Date:  2013-12-06       Impact factor: 10.151

10.  Apolipoprotein L1 risk variants associate with prevalent atherosclerotic disease in African American systemic lupus erythematosus patients.

Authors:  Ashira Blazer; Binhuan Wang; Danny Simpson; Tomas Kirchhoff; Sean Heffron; Robert M Clancy; Adriana Heguy; Karina Ray; Matija Snuderl; Jill P Buyon
Journal:  PLoS One       Date:  2017-08-29       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.