Literature DB >> 21058333

A generic coalescent-based framework for the selection of a reference panel for imputation.

Bogdan Paşaniuc1, Ram Avinery, Tom Gur, Christine F Skibola, Paige M Bracci, Eran Halperin.   

Abstract

An important component in the analysis of genome-wide association studies involves the imputation of genotypes that have not been measured directly in the studied samples. The imputation procedure uses the linkage disequilibrium (LD) structure in the population to infer the genotype of an unobserved single nucleotide polymorphism. The LD structure is normally learned from a dense genotype map of a reference population that matches the studied population. In many instances there is no reference population that exactly matches the studied population, and a natural question arises as to how to choose the reference population for the imputation. Here we present a Coalescent-based method that addresses this issue. In contrast to the current paradigm of imputation methods, our method assigns a different reference dataset for each sample in the studied population, and for each region in the genome. This allows the flexibility to account for the diversity within populations, as well as across populations. Furthermore, because our approach treats each region in the genome separately, our method is suitable for the imputation of recently admixed populations. We evaluated our method across a large set of populations and found that our choice of reference data set considerably improves the accuracy of imputation, especially for regions with low LD and for populations without a reference population available as well as for admixed populations such as the Hispanic population. Our method is generic and can potentially be incorporated in any of the available imputation methods as an add-on.
© 2010 Wiley-Liss, Inc.

Entities:  

Mesh:

Year:  2010        PMID: 21058333      PMCID: PMC3876740          DOI: 10.1002/gepi.20505

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  23 in total

1.  Estimating the time to the most recent common ancestor for the Y chromosome or mitochondrial DNA for a pair of individuals.

Authors:  B Walsh
Journal:  Genetics       Date:  2001-06       Impact factor: 4.562

2.  GERBIL: Genotype resolution and block identification using likelihood.

Authors:  Gad Kimmel; Ron Shamir
Journal:  Proc Natl Acad Sci U S A       Date:  2004-12-22       Impact factor: 11.205

3.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

4.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

5.  The relationship between imputation error and statistical power in genetic association studies in diverse populations.

Authors:  Lucy Huang; Chaolong Wang; Noah A Rosenberg
Journal:  Am J Hum Genet       Date:  2009-10-22       Impact factor: 11.025

Review 6.  Coalescents and genealogical structure under neutrality.

Authors:  P Donnelly; S Tavaré
Journal:  Annu Rev Genet       Date:  1995       Impact factor: 16.830

7.  The coalescent process and background selection.

Authors:  R R Hudson; N L Kaplan
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  1995-07-29       Impact factor: 6.237

8.  A multiethnic cohort in Hawaii and Los Angeles: baseline characteristics.

Authors:  L N Kolonel; B E Henderson; J H Hankin; A M Nomura; L R Wilkens; M C Pike; D O Stram; K R Monroe; M E Earle; F S Nagamine
Journal:  Am J Epidemiol       Date:  2000-02-15       Impact factor: 4.897

9.  Genetic variants at 6p21.33 are associated with susceptibility to follicular lymphoma.

Authors:  Christine F Skibola; Paige M Bracci; Eran Halperin; Lucia Conde; David W Craig; Luz Agana; Kelly Iyadurai; Nikolaus Becker; Angela Brooks-Wilson; John D Curry; John J Spinelli; Elizabeth A Holly; Jacques Riby; Luoping Zhang; Alexandra Nieters; Martyn T Smith; Kevin M Brown
Journal:  Nat Genet       Date:  2009-07-20       Impact factor: 38.330

10.  A flexible and accurate genotype imputation method for the next generation of genome-wide association studies.

Authors:  Bryan N Howie; Peter Donnelly; Jonathan Marchini
Journal:  PLoS Genet       Date:  2009-06-19       Impact factor: 5.917

View more
  18 in total

1.  A spatial haplotype copying model with applications to genotype imputation.

Authors:  Wen-Yun Yang; Farhad Hormozdiari; Eleazar Eskin; Bogdan Pasaniuc
Journal:  J Comput Biol       Date:  2014-12-19       Impact factor: 1.479

2.  RefRGim: an intelligent reference panel reconstruction method for genotype imputation with convolutional neural networks.

Authors:  Shuo Shi; Qiheng Qian; Shuhuan Yu; Qi Wang; Jinyue Wang; Jingyao Zeng; Zhenglin Du; Jingfa Xiao
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

3.  Genotype imputation in a coalescent model with infinitely-many-sites mutation.

Authors:  Lucy Huang; Erkan O Buzbas; Noah A Rosenberg
Journal:  Theor Popul Biol       Date:  2012-10-16       Impact factor: 1.570

4.  MaCH-admix: genotype imputation for admixed populations.

Authors:  Eric Yi Liu; Mingyao Li; Wei Wang; Yun Li
Journal:  Genet Epidemiol       Date:  2012-10-16       Impact factor: 2.135

5.  Imputation of rare variants in next-generation association studies.

Authors:  Jennifer L Asimit; Eleftheria Zeggini
Journal:  Hum Hered       Date:  2013-04-11       Impact factor: 0.444

6.  Choosing Subsamples for Sequencing Studies by Minimizing the Average Distance to the Closest Leaf.

Authors:  Jonathan T L Kang; Peng Zhang; Sebastian Zöllner; Noah A Rosenberg
Journal:  Genetics       Date:  2015-08-24       Impact factor: 4.562

7.  Haplotype variation and genotype imputation in African populations.

Authors:  Lucy Huang; Mattias Jakobsson; Trevor J Pemberton; Muntaser Ibrahim; Thomas Nyambo; Sabah Omar; Jonathan K Pritchard; Sarah A Tishkoff; Noah A Rosenberg
Journal:  Genet Epidemiol       Date:  2011-12       Impact factor: 2.135

8.  Evaluation of the imputation performance of the program IMPUTE in an admixed sample from Mexico City using several model designs.

Authors:  S Krithika; Adán Valladares-Salgado; Jesus Peralta; Jorge Escobedo-de La Peña; Jesus Kumate-Rodríguez; Miguel Cruz; Esteban J Parra
Journal:  BMC Med Genomics       Date:  2012-05-01       Impact factor: 3.063

9.  A coalescent model for genotype imputation.

Authors:  Ethan M Jewett; Matthew Zawistowski; Noah A Rosenberg; Sebastian Zöllner
Journal:  Genetics       Date:  2012-05-17       Impact factor: 4.562

10.  Enhanced statistical tests for GWAS in admixed populations: assessment using African Americans from CARe and a Breast Cancer Consortium.

Authors:  Bogdan Pasaniuc; Noah Zaitlen; Guillaume Lettre; Gary K Chen; Arti Tandon; W H Linda Kao; Ingo Ruczinski; Myriam Fornage; David S Siscovick; Xiaofeng Zhu; Emma Larkin; Leslie A Lange; L Adrienne Cupples; Qiong Yang; Ermeg L Akylbekova; Solomon K Musani; Jasmin Divers; Joe Mychaleckyj; Mingyao Li; George J Papanicolaou; Robert C Millikan; Christine B Ambrosone; Esther M John; Leslie Bernstein; Wei Zheng; Jennifer J Hu; Regina G Ziegler; Sarah J Nyante; Elisa V Bandera; Sue A Ingles; Michael F Press; Stephen J Chanock; Sandra L Deming; Jorge L Rodriguez-Gil; Cameron D Palmer; Sarah Buxbaum; Lynette Ekunwe; Joel N Hirschhorn; Brian E Henderson; Simon Myers; Christopher A Haiman; David Reich; Nick Patterson; James G Wilson; Alkes L Price
Journal:  PLoS Genet       Date:  2011-04-21       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.