Literature DB >> 24939589

Improving accuracy of rare variant imputation with a two-step imputation approach.

Eskil Kreiner-Møller1, Carolina Medina-Gomez2, André G Uitterlinden2, Fernando Rivadeneira2, Karol Estrada3.   

Abstract

Genotype imputation has been the pillar of the success of genome-wide association studies (GWAS) for identifying common variants associated with common diseases. However, most GWAS have been run using only 60 HapMap samples as reference for imputation, meaning less frequent and rare variants not being comprehensively scrutinized. Next-generation arrays ensuring sufficient coverage together with new reference panels, as the 1000 Genomes panel, are emerging to facilitate imputation of low frequent single-nucleotide polymorphisms (minor allele frequency (MAF) <5%). In this study, we present a two-step imputation approach improving the quality of the 1000 Genomes imputation by genotyping only a subset of samples to create a local reference population on a dense array with many low-frequency markers. In this approach, the study sample, genotyped with a first generation array, is imputed first to the local reference sample genotyped on a dense array and hereafter to the 1000 Genomes reference panel. We show that mean imputation quality, measured by the r(2) using this approach, increases by 28% for variants with a MAF between 1 and 5% as compared with direct imputation to 1000 Genomes reference. Similarly, the concordance rate between calls of imputed and true genotypes was found to be significantly higher for heterozygotes (P<1e-15) and rare homozygote calls (P<1e-15) in this low frequency range. The two-step approach in our setting improves imputation quality compared with traditional direct imputation noteworthy in the low-frequency spectrum and is a cost-effective strategy in large epidemiological studies.

Mesh:

Year:  2014        PMID: 24939589      PMCID: PMC4326719          DOI: 10.1038/ejhg.2014.91

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


  14 in total

Review 1.  Five years of GWAS discovery.

Authors:  Peter M Visscher; Matthew A Brown; Mark I McCarthy; Jian Yang
Journal:  Am J Hum Genet       Date:  2012-01-13       Impact factor: 11.025

2.  Performance of genotype imputations using data from the 1000 Genomes Project.

Authors:  Yun Ju Sung; Lihua Wang; Tuomo Rankinen; Claude Bouchard; D C Rao
Journal:  Hum Hered       Date:  2011-12-30       Impact factor: 0.444

Review 3.  Genotype imputation for genome-wide association studies.

Authors:  Jonathan Marchini; Bryan Howie
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

4.  Genomics: The search for association.

Authors:  Monya Baker
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

5.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

6.  Practical aspects of imputation-driven meta-analysis of genome-wide association studies.

Authors:  Paul I W de Bakker; Manuel A R Ferreira; Xiaoming Jia; Benjamin M Neale; Soumya Raychaudhuri; Benjamin F Voight
Journal:  Hum Mol Genet       Date:  2008-10-15       Impact factor: 6.150

7.  A two-platform design for next generation genome-wide association studies.

Authors:  Joshua N Sampson; Kevin Jacobs; Zhaoming Wang; Meredith Yeager; Stephen Chanock; Nilanjan Chatterjee
Journal:  Genet Epidemiol       Date:  2012-04-16       Impact factor: 2.135

Review 8.  Finding the missing heritability of complex diseases.

Authors:  Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher
Journal:  Nature       Date:  2009-10-08       Impact factor: 49.962

9.  1000 Genomes-based imputation identifies novel and refined associations for the Wellcome Trust Case Control Consortium phase 1 Data.

Authors:  Jie Huang; David Ellinghaus; Andre Franke; Bryan Howie; Yun Li
Journal:  Eur J Hum Genet       Date:  2012-02-01       Impact factor: 4.246

10.  The Rotterdam Study: 2012 objectives and design update.

Authors:  Albert Hofman; Cornelia M van Duijn; Oscar H Franco; M Arfan Ikram; Harry L A Janssen; Caroline C W Klaver; Ernst J Kuipers; Tamar E C Nijsten; Bruno H Ch Stricker; Henning Tiemeier; André G Uitterlinden; Meike W Vernooij; Jacqueline C M Witteman
Journal:  Eur J Epidemiol       Date:  2011-08-30       Impact factor: 8.082

View more
  14 in total

1.  The Rotterdam Study: 2016 objectives and design update.

Authors:  Albert Hofman; Guy G O Brusselle; Sarwa Darwish Murad; Cornelia M van Duijn; Oscar H Franco; André Goedegebure; M Arfan Ikram; Caroline C W Klaver; Tamar E C Nijsten; Robin P Peeters; Bruno H Ch Stricker; Henning W Tiemeier; André G Uitterlinden; Meike W Vernooij
Journal:  Eur J Epidemiol       Date:  2015-09-19       Impact factor: 8.082

2.  Power Analysis for Genetic Association Test (PAGEANT) provides insights to challenges for rare variant association studies.

Authors:  Andriy Derkach; Haoyu Zhang; Nilanjan Chatterjee
Journal:  Bioinformatics       Date:  2018-05-01       Impact factor: 6.937

3.  Genotype imputation in the domestic dog.

Authors:  S G Friedenberg; K M Meurs
Journal:  Mamm Genome       Date:  2016-04-29       Impact factor: 2.957

4.  Choosing Subsamples for Sequencing Studies by Minimizing the Average Distance to the Closest Leaf.

Authors:  Jonathan T L Kang; Peng Zhang; Sebastian Zöllner; Noah A Rosenberg
Journal:  Genetics       Date:  2015-08-24       Impact factor: 4.562

5.  Impact of genetic similarity on imputation accuracy.

Authors:  Nab Raj Roshyara; Markus Scholz
Journal:  BMC Genet       Date:  2015-07-22       Impact factor: 2.797

6.  A new strategy for enhancing imputation quality of rare variants from next-generation sequencing data via combining SNP and exome chip data.

Authors:  Young Jin Kim; Juyoung Lee; Bong-Jo Kim; Taesung Park
Journal:  BMC Genomics       Date:  2015-12-29       Impact factor: 3.969

7.  Imputing rare variants in families using a two-stage approach.

Authors:  Samantha Lent; Xuan Deng; L Adrienne Cupples; Kathryn L Lunetta; C T Liu; Yanhua Zhou
Journal:  BMC Proc       Date:  2016-10-18

8.  Genome-wide compound heterozygote analysis highlights alleles associated with adult height in Europeans.

Authors:  Kaiyin Zhong; Gu Zhu; Xiaoxi Jing; A Emile J Hendriks; Sten L S Drop; M Arfan Ikram; Scott Gordon; Changqing Zeng; Andre G Uitterlinden; Nicholas G Martin; Fan Liu; Manfred Kayser
Journal:  Hum Genet       Date:  2017-09-18       Impact factor: 4.132

9.  Revisit Population-based and Family-based Genotype Imputation.

Authors:  Ching-Ti Liu; Xuan Deng; Virginia Fisher; Nancy Heard-Costa; Hanfei Xu; Yanhua Zhou; Ramachandran S Vasan; L Adrienne Cupples
Journal:  Sci Rep       Date:  2019-02-12       Impact factor: 4.379

10.  Assessing accuracy of imputation using different SNP panel densities in a multi-breed sheep population.

Authors:  Ricardo V Ventura; Stephen P Miller; Ken G Dodds; Benoit Auvray; Michael Lee; Matthew Bixley; Shannon M Clarke; John C McEwan
Journal:  Genet Sel Evol       Date:  2016-09-23       Impact factor: 4.297

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.