Literature DB >> 23079542

Genotype imputation in a coalescent model with infinitely-many-sites mutation.

Lucy Huang1, Erkan O Buzbas, Noah A Rosenberg.   

Abstract

Empirical studies have identified population-genetic factors as important determinants of the properties of genotype-imputation accuracy in imputation-based disease association studies. Here, we develop a simple coalescent model of three sequences that we use to explore the theoretical basis for the influence of these factors on genotype-imputation accuracy, under the assumption of infinitely-many-sites mutation. Employing a demographic model in which two populations diverged at a given time in the past, we derive the approximate expectation and variance of imputation accuracy in a study sequence sampled from one of the two populations, choosing between two reference sequences, one sampled from the same population as the study sequence and the other sampled from the other population. We show that, under this model, imputation accuracy-as measured by the proportion of polymorphic sites that are imputed correctly in the study sequence-increases in expectation with the mutation rate, the proportion of the markers in a chromosomal region that are genotyped, and the time to divergence between the study and reference populations. Each of these effects derives largely from an increase in information available for determining the reference sequence that is genetically most similar to the sequence targeted for imputation. We analyze as a function of divergence time the expected gain in imputation accuracy in the target using a reference sequence from the same population as the target rather than from the other population. Together with a growing body of empirical investigations of genotype imputation in diverse human populations, our modeling framework lays a foundation for extending imputation techniques to novel populations that have not yet been extensively examined.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Coalescent; Imputation; Population divergence

Mesh:

Year:  2012        PMID: 23079542      PMCID: PMC3587719          DOI: 10.1016/j.tpb.2012.09.006

Source DB:  PubMed          Journal:  Theor Popul Biol        ISSN: 0040-5809            Impact factor:   1.570


  28 in total

1.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

2.  Testing untyped alleles (TUNA)-applications to genome-wide association studies.

Authors:  Dan L Nicolae
Journal:  Genet Epidemiol       Date:  2006-12       Impact factor: 2.135

3.  A new multipoint method for genome-wide association studies by imputation of genotypes.

Authors:  Jonathan Marchini; Bryan Howie; Simon Myers; Gil McVean; Peter Donnelly
Journal:  Nat Genet       Date:  2007-06-17       Impact factor: 38.330

4.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

5.  Progress and challenges in genome-wide association studies in humans.

Authors:  Peter Donnelly
Journal:  Nature       Date:  2008-12-11       Impact factor: 49.962

6.  Genotype-imputation accuracy across worldwide human populations.

Authors:  Lucy Huang; Yun Li; Andrew B Singleton; John A Hardy; Gonçalo Abecasis; Noah A Rosenberg; Paul Scheet
Journal:  Am J Hum Genet       Date:  2009-02       Impact factor: 11.025

7.  Analyses and comparison of accuracy of different genotype imputation methods.

Authors:  Yu-Fang Pei; Jian Li; Lei Zhang; Christopher J Papasian; Hong-Wen Deng
Journal:  PLoS One       Date:  2008-10-29       Impact factor: 3.240

8.  Imputation-based analysis of association studies: candidate regions and quantitative traits.

Authors:  Bertrand Servin; Matthew Stephens
Journal:  PLoS Genet       Date:  2007-05-30       Impact factor: 5.917

9.  Genomewide association studies and human disease.

Authors:  John Hardy; Andrew Singleton
Journal:  N Engl J Med       Date:  2009-04-15       Impact factor: 91.245

10.  Practical issues in imputation-based association mapping.

Authors:  Yongtao Guan; Matthew Stephens
Journal:  PLoS Genet       Date:  2008-12-05       Impact factor: 5.917

View more
  3 in total

1.  Theory and applications of a deterministic approximation to the coalescent model.

Authors:  Ethan M Jewett; Noah A Rosenberg
Journal:  Theor Popul Biol       Date:  2014-01-07       Impact factor: 1.570

2.  Why are rare variants hard to impute? Coalescent models reveal theoretical limits in existing algorithms.

Authors:  Yichen Si; Brett Vanderwerff; Sebastian Zöllner
Journal:  Genetics       Date:  2021-04-15       Impact factor: 4.562

3.  IMPUTOR: Phylogenetically Aware Software for Imputation of Errors in Next-Generation Sequencing.

Authors:  Matthew Jobin; Haiko Schurz; Brenna M Henn
Journal:  Genome Biol Evol       Date:  2018-04-01       Impact factor: 3.416

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.