
A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals.

Brian L Browning, Sharon R Browning.

Abstract

We present methods for imputing data for ungenotyped markers and for inferring haplotype phase in large data sets of unrelated individuals and parent-offspring trios. Our methods make use of known haplotype phase when it is available, and our methods are computationally efficient so that the full information in large reference panels with thousands of individuals is utilized. We demonstrate that substantial gains in imputation accuracy accrue with increasingly large reference panel sizes, particularly when imputing low-frequency variants, and that unphased reference panels can provide highly accurate genotype imputation. We place our methodology in a unified framework that enables the simultaneous use of unphased and phased data from trios and unrelated individuals in a single analysis. For unrelated individuals, our imputation methods produce well-calibrated posterior genotype probabilities and highly accurate allele-frequency estimates. For trios, our haplotype-inference method is four orders of magnitude faster than the gold-standard PHASE program and has excellent accuracy. Our methods enable genotype imputation to be performed with unphased trio or unrelated reference panels, thus accounting for haplotype-phase uncertainty in the reference panel. We present a useful measure of imputation accuracy, allelic R², and show that this measure can be estimated accurately from posterior genotype probabilities. Our methods are implemented in version 3.0 of the BEAGLE software package.
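To make the last idea concrete, here is a minimal Python sketch of how an allelic R² could be computed. It assumes allelic R² means the squared correlation between the true allele dosage (0, 1, or 2 copies of an allele) and the dosage of the most likely imputed genotype. The estimator replaces the unknown true dosage with its posterior expectation, which is only reasonable if the posterior probabilities are well calibrated. The function names and the array layout (one row per individual, columns for genotypes 0/1/2) are illustrative assumptions, and this is not claimed to be the exact formula implemented in BEAGLE.

```python
import numpy as np


def allelic_r2_true(true_dosage, imputed_dosage):
    """Squared correlation between true and imputed allele dosages.

    Only usable when the true genotypes are known (e.g. masked test data).
    """
    x = np.asarray(true_dosage, dtype=float)
    z = np.asarray(imputed_dosage, dtype=float)
    cov = np.mean(x * z) - x.mean() * z.mean()
    var_x, var_z = x.var(), z.var()
    if var_x == 0 or var_z == 0:
        return float("nan")  # correlation undefined for a constant vector
    return cov ** 2 / (var_x * var_z)


def allelic_r2_estimated(posteriors):
    """Estimate allelic R^2 from posterior genotype probabilities alone.

    posteriors: array of shape (n_individuals, 3); column g holds the
    posterior probability of carrying g copies of the allele.
    Assumption: posteriors are calibrated, so E[X] and E[X^2] under the
    posterior can stand in for the unknown true dosage X.
    """
    p = np.asarray(posteriors, dtype=float)
    z = p.argmax(axis=1).astype(float)   # dosage of most likely genotype
    u = p[:, 1] + 2.0 * p[:, 2]          # posterior mean dosage, E[X]
    w = p[:, 1] + 4.0 * p[:, 2]          # posterior mean of X^2
    cov = np.mean(z * u) - z.mean() * u.mean()
    var_x = w.mean() - u.mean() ** 2
    var_z = z.var()
    if var_x <= 0 or var_z == 0:
        return float("nan")
    return cov ** 2 / (var_x * var_z)
```

When every posterior is a point mass on the true genotype, the two functions agree and return 1; as posterior uncertainty grows, the estimate falls below 1 without requiring the true genotypes.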


Year:  2009        PMID: 19200528      PMCID: PMC2668004          DOI: 10.1016/j.ajhg.2009.01.005

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


