Literature DB >> 25526526

A spatial haplotype copying model with applications to genotype imputation.

Wen-Yun Yang1, Farhad Hormozdiari, Eleazar Eskin, Bogdan Pasaniuc.   

Abstract

Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.

Entities:  

Keywords:  1000 Genomes; expectation maximization (EM) algorithm; genotype imputation; linkage disequilibrium; polymorphism; single nucleotide; spatial genetics; stochastic gradient descent

Mesh:

Year:  2014        PMID: 25526526      PMCID: PMC4425418          DOI: 10.1089/cmb.2014.0151

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  34 in total

1.  Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data.

Authors:  Na Li; Matthew Stephens
Journal:  Genetics       Date:  2003-12       Impact factor: 4.562

2.  A linear complexity phasing method for thousands of genomes.

Authors:  Olivier Delaneau; Jonathan Marchini; Jean-François Zagury
Journal:  Nat Methods       Date:  2011-12-04       Impact factor: 28.547

3.  Efficiency and power in genetic association studies.

Authors:  Paul I W de Bakker; Roman Yelensky; Itsik Pe'er; Stacey B Gabriel; Mark J Daly; David Altshuler
Journal:  Nat Genet       Date:  2005-10-23       Impact factor: 38.330

4.  Multilocus association mapping using variable-length Markov chains.

Authors:  Sharon R Browning
Journal:  Am J Hum Genet       Date:  2006-04-07       Impact factor: 11.025

5.  Fast and accurate estimation of the population-scaled mutation rate, theta, from microsatellite genotype data.

Authors:  Arindam Roychoudhury; Matthew Stephens
Journal:  Genetics       Date:  2007-06       Impact factor: 4.562

6.  A new multipoint method for genome-wide association studies by imputation of genotypes.

Authors:  Jonathan Marchini; Bryan Howie; Simon Myers; Gil McVean; Peter Donnelly
Journal:  Nat Genet       Date:  2007-06-17       Impact factor: 38.330

7.  Low-coverage sequencing: implications for design of complex trait association studies.

Authors:  Yun Li; Carlo Sidore; Hyun Min Kang; Michael Boehnke; Gonçalo R Abecasis
Journal:  Genome Res       Date:  2011-04-01       Impact factor: 9.043

8.  A model-based approach for analysis of spatial structure in genetic data.

Authors:  Wen-Yun Yang; John Novembre; Eleazar Eskin; Eran Halperin
Journal:  Nat Genet       Date:  2012-05-20       Impact factor: 38.330

9.  Genotype imputation with thousands of genomes.

Authors:  Bryan Howie; Jonathan Marchini; Matthew Stephens
Journal:  G3 (Bethesda)       Date:  2011-11-01       Impact factor: 3.154

10.  Meta-analysis identifies four new loci associated with testicular germ cell tumor.

Authors:  Charles C Chung; Peter A Kanetsky; Zhaoming Wang; Michelle A T Hildebrandt; Roelof Koster; Rolf I Skotheim; Christian P Kratz; Clare Turnbull; Victoria K Cortessis; Anne C Bakken; D Timothy Bishop; Michael B Cook; R Loren Erickson; Sophie D Fosså; Kevin B Jacobs; Larissa A Korde; Sigrid M Kraggerud; Ragnhild A Lothe; Jennifer T Loud; Nazneen Rahman; Eila C Skinner; Duncan C Thomas; Xifeng Wu; Meredith Yeager; Fredrick R Schumacher; Mark H Greene; Stephen M Schwartz; Katherine A McGlynn; Stephen J Chanock; Katherine L Nathanson
Journal:  Nat Genet       Date:  2013-05-12       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.