Literature DB >> 21364697

Imputation of low-frequency variants using the HapMap3 benefits from large, diverse reference sets.

Luke Jostins1, Katherine I Morley, Jeffrey C Barrett.   

Abstract

Imputation allows the inference of unobserved genotypes in low-density data sets, and is often used to test for disease association at variants that are poorly captured by standard genotyping chips (such as low-frequency variants). Although much effort has gone into developing the best imputation algorithms, less is known about the effects of reference set choice on imputation accuracy. We assess the improvements afforded by increases in reference size and diversity, specifically comparing the HapMap2 data set, which has been used to date for imputation, and the new HapMap3 data set, which contains more samples from a more diverse range of populations. We find that, for imputation into Western European samples, the HapMap3 reference provides more accurate imputation with better-calibrated quality scores than HapMap2, and that increasing the number of HapMap3 populations included in the reference set grant further improvements. Improvements are most pronounced for low-frequency variants (frequency <5%), with the largest and most diverse reference sets bringing the accuracy of imputation of low-frequency variants close to that of common ones. For low-frequency variants, reference set diversity can improve the accuracy of imputation, independent of reference sample size. HapMap3 reference sets provide significant increases in imputation accuracy relative to HapMap2, and are of particular use if highly accurate imputation of low-frequency variants is required. Our results suggest that, although the sample sizes from the 1000 Genomes Pilot Project will not allow reliable imputation of low-frequency variants, the larger sample sizes of the main project will allow.

Mesh:

Year:  2011        PMID: 21364697      PMCID: PMC3110048          DOI: 10.1038/ejhg.2011.10

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


  15 in total

Review 1.  Genotype imputation for genome-wide association studies.

Authors:  Jonathan Marchini; Bryan Howie
Journal:  Nat Rev Genet       Date:  2010-07       Impact factor: 53.242

2.  Cohort profile: 1958 British birth cohort (National Child Development Study).

Authors:  Chris Power; Jane Elliott
Journal:  Int J Epidemiol       Date:  2005-09-09       Impact factor: 7.196

3.  A new multipoint method for genome-wide association studies by imputation of genotypes.

Authors:  Jonathan Marchini; Bryan Howie; Simon Myers; Gil McVean; Peter Donnelly
Journal:  Nat Genet       Date:  2007-06-17       Impact factor: 38.330

4.  A comprehensive evaluation of SNP genotype imputation.

Authors:  Michael Nothnagel; David Ellinghaus; Stefan Schreiber; Michael Krawczak; Andre Franke
Journal:  Hum Genet       Date:  2008-12-17       Impact factor: 4.132

Review 5.  Detecting shared pathogenesis from the shared genetics of immune-related diseases.

Authors:  Alexandra Zhernakova; Cleo C van Diemen; Cisca Wijmenga
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

6.  A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals.

Authors:  Brian L Browning; Sharon R Browning
Journal:  Am J Hum Genet       Date:  2009-02-05       Impact factor: 11.025

7.  Evaluating the effects of imputation on the power, coverage, and cost efficiency of genome-wide SNP platforms.

Authors:  Carl A Anderson; Fredrik H Pettersson; Jeffrey C Barrett; Joanna J Zhuang; Jiannis Ragoussis; Lon R Cardon; Andrew P Morris
Journal:  Am J Hum Genet       Date:  2008-06-26       Impact factor: 11.025

8.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Am J Hum Genet       Date:  2007-09-21       Impact factor: 11.025

9.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

10.  Genotype-imputation accuracy across worldwide human populations.

Authors:  Lucy Huang; Yun Li; Andrew B Singleton; John A Hardy; Gonçalo Abecasis; Noah A Rosenberg; Paul Scheet
Journal:  Am J Hum Genet       Date:  2009-02       Impact factor: 11.025

View more
  27 in total

1.  Performance of genotype imputations using data from the 1000 Genomes Project.

Authors:  Yun Ju Sung; Lihua Wang; Tuomo Rankinen; Claude Bouchard; D C Rao
Journal:  Hum Hered       Date:  2011-12-30       Impact factor: 0.444

2.  Comprehensive evaluation of imputation performance in African Americans.

Authors:  Pritam Chanda; Naoya Yuhki; Man Li; Joel S Bader; Alex Hartz; Eric Boerwinkle; W H Linda Kao; Dan E Arking
Journal:  J Hum Genet       Date:  2012-05-31       Impact factor: 3.172

3.  Population-specific genotype imputations using minimac or IMPUTE2.

Authors:  Elisabeth M van Leeuwen; Alexandros Kanterakis; Patrick Deelen; Mathijs V Kattenberg; P Eline Slagboom; Paul I W de Bakker; Cisca Wijmenga; Morris A Swertz; Dorret I Boomsma; Cornelia M van Duijn; Lennart C Karssen; Jouke Jan Hottenga
Journal:  Nat Protoc       Date:  2015-07-30       Impact factor: 13.491

4.  Imputation and quality control steps for combining multiple genome-wide datasets.

Authors:  Shefali S Verma; Mariza de Andrade; Gerard Tromp; Helena Kuivaniemi; Elizabeth Pugh; Bahram Namjou-Khales; Shubhabrata Mukherjee; Gail P Jarvik; Leah C Kottyan; Amber Burt; Yuki Bradford; Gretta D Armstrong; Kimberly Derr; Dana C Crawford; Jonathan L Haines; Rongling Li; David Crosslin; Marylyn D Ritchie
Journal:  Front Genet       Date:  2014-12-11       Impact factor: 4.599

5.  Phasing of many thousands of genotyped samples.

Authors:  Amy L Williams; Nick Patterson; Joseph Glessner; Hakon Hakonarson; David Reich
Journal:  Am J Hum Genet       Date:  2012-08-10       Impact factor: 11.025

6.  Improved imputation accuracy in Hispanic/Latino populations with larger and more diverse reference panels: applications in the Hispanic Community Health Study/Study of Latinos (HCHS/SOL).

Authors:  Sarah C Nelson; Adrienne M Stilp; George J Papanicolaou; Kent D Taylor; Jerome I Rotter; Timothy A Thornton; Cathy C Laurie
Journal:  Hum Mol Genet       Date:  2016-06-26       Impact factor: 6.150

Review 7.  Haplotype phasing: existing methods and new developments.

Authors:  Sharon R Browning; Brian L Browning
Journal:  Nat Rev Genet       Date:  2011-09-16       Impact factor: 53.242

8.  Imputation of rare variants in next-generation association studies.

Authors:  Jennifer L Asimit; Eleftheria Zeggini
Journal:  Hum Hered       Date:  2013-04-11       Impact factor: 0.444

9.  Beyond the fourth wave of genome-wide obesity association studies.

Authors:  C H Sandholt; T Hansen; O Pedersen
Journal:  Nutr Diabetes       Date:  2012-07-30       Impact factor: 5.097

10.  Assessment of genotype imputation performance using 1000 Genomes in African American studies.

Authors:  Dana B Hancock; Joshua L Levy; Nathan C Gaddis; Laura J Bierut; Nancy L Saccone; Grier P Page; Eric O Johnson
Journal:  PLoS One       Date:  2012-11-30       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.