Systematic assessment of imputation performance using the 1000 Genomes reference panels.

Qian Liu, Elizabeth T Cirulli, Yujun Han, Song Yao, Song Liu, Qianqian Zhu.   

Abstract

Genotype imputation has been widely adopted in the post-genome-wide association study (GWAS) era. Owing to its ability to accurately predict the genotypes of untyped variants, imputation greatly boosts variant density, enabling fine-mapping of GWAS loci and large-scale meta-analysis across different genotyping arrays. Using genotype data from 90 deeply whole-genome-sequenced individuals as the evaluation benchmark and the 1000 Genomes Project data as reference panels, we systematically examined four important issues in genotype imputation practice. First, in a study of imputation accuracy, we found that IMPUTE2 and minimac performed best among the three popular imputation programs evaluated and that using a multi-population reference panel is beneficial. Second, the optimal imputation quality cutoff for removing poorly imputed variants varies with the software used. Third, the major factors contributing to consistently poor imputation are low variant heterozygosity, high sequence similarity to other genomic regions, high GC content, segmental duplication and large distance from genotyped markers. Lastly, in an evaluation of the imputability of all known GWAS regions, we found that GWAS loci associated with hematological measurements and immune system diseases are harder to impute than those associated with other human traits. Recommendations based on these findings may provide practical guidance for imputation in future genetic studies.
© The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

Keywords:  genetic association; genotype estimation; haplotyping

Year:  2014        PMID: 25246238      PMCID: PMC4580532          DOI: 10.1093/bib/bbu035

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  34 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

2.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

3.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

4.  Positive selection on the human genome. (Review)

Authors:  Eric J Vallender; Bruce T Lahn
Journal:  Hum Mol Genet       Date:  2004-10-01       Impact factor: 6.150

5.  Genotype imputation. (Review)

Authors:  Yun Li; Cristen Willer; Serena Sanna; Gonçalo Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

6.  Fine mapping of the NRG1 Hirschsprung's disease locus.

Authors:  Clara Sze-Man Tang; Wai-Kiu Tang; Man-Ting So; Xiao-Ping Miao; Brian Man-Chun Leung; Benjamin Hon-Kei Yip; Thomas Yuk-Yu Leon; Elly Sau-Wai Ngan; Vincent Chi-Hang Lui; Yan Chen; Ivy Hau-Yee Chan; Patrick Ho-Yu Chung; Xue-Lai Liu; Xuan-Zhao Wu; Kenneth Kak-Yuen Wong; Pak-Chung Sham; Stacey S Cherny; Paul Kwong-Hang Tam; Maria-Mercè Garcia-Barceló
Journal:  PLoS One       Date:  2011-01-20       Impact factor: 3.240

7.  Multiple common susceptibility variants near BMP pathway loci GREM1, BMP4, and BMP2 explain part of the missing heritability of colorectal cancer.

Authors:  Ian P M Tomlinson; Luis G Carvajal-Carmona; Sara E Dobbins; Albert Tenesa; Angela M Jones; Kimberley Howarth; Claire Palles; Peter Broderick; Emma E M Jaeger; Susan Farrington; Annabelle Lewis; James G D Prendergast; Alan M Pittman; Evropi Theodoratou; Bianca Olver; Marion Walker; Steven Penegar; Ella Barclay; Nicola Whiffin; Lynn Martin; Stephane Ballereau; Amy Lloyd; Maggie Gorman; Steven Lubbe; Bryan Howie; Jonathan Marchini; Clara Ruiz-Ponte; Ceres Fernandez-Rozadilla; Antoni Castells; Angel Carracedo; Sergi Castellvi-Bel; David Duggan; David Conti; Jean-Baptiste Cazier; Harry Campbell; Oliver Sieber; Lara Lipton; Peter Gibbs; Nicholas G Martin; Grant W Montgomery; Joanne Young; Paul N Baird; Steven Gallinger; Polly Newcomb; John Hopper; Mark A Jenkins; Lauri A Aaltonen; David J Kerr; Jeremy Cheadle; Paul Pharoah; Graham Casey; Richard S Houlston; Malcolm G Dunlop
Journal:  PLoS Genet       Date:  2011-06-02       Impact factor: 5.917

8.  Fine mapping of five loci associated with low-density lipoprotein cholesterol detects variants that double the explained heritability.

Authors:  Serena Sanna; Bingshan Li; Antonella Mulas; Carlo Sidore; Hyun M Kang; Anne U Jackson; Maria Grazia Piras; Gianluca Usala; Giuseppe Maninchedda; Alessandro Sassu; Fabrizio Serra; Maria Antonietta Palmas; William H Wood; Inger Njølstad; Markku Laakso; Kristian Hveem; Jaakko Tuomilehto; Timo A Lakka; Rainer Rauramaa; Michael Boehnke; Francesco Cucca; Manuela Uda; David Schlessinger; Ramaiah Nagaraja; Gonçalo R Abecasis
Journal:  PLoS Genet       Date:  2011-07-28       Impact factor: 5.917

9.  Genotype imputation with thousands of genomes.

Authors:  Bryan Howie; Jonathan Marchini; Matthew Stephens
Journal:  G3 (Bethesda)       Date:  2011-11-01       Impact factor: 3.154

10.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

  11 in total

1.  Genome-wide association study of INDELs identified four novel susceptibility loci associated with lung cancer risk.

Authors:  Juncheng Dai; Mingtao Huang; Christopher I Amos; Rayjean J Hung; Adonina Tardon; Angeline Andrew; Chu Chen; David C Christiani; Demetrius Albanes; Gadi Rennert; Jingyi Fan; Gary Goodman; Geoffrey Liu; John K Field; Kjell Grankvist; Lambertus A Kiemeney; Loic Le Marchand; Matthew B Schabath; Mattias Johansson; Melinda C Aldrich; Mikael Johansson; Neil Caporaso; Philip Lazarus; Stephan Lam; Stig E Bojesen; Susanne Arnold; Maria Teresa Landi; Angela Risch; H-Erich Wichmann; Heike Bickeboller; Paul Brennan; Sanjay Shete; Olle Melander; Hans Brunnstrom; Shan Zienolddiny; Penella Woll; Victoria Stevens; Zhibin Hu; Hongbing Shen
Journal:  Int J Cancer       Date:  2019-10-31       Impact factor: 7.396

2.  Extent to which array genotyping and imputation with large reference panels approximate deep whole-genome sequencing.

Authors:  Sarah C Hanks; Lukas Forer; Sebastian Schönherr; Jonathon LeFaive; Taylor Martins; Ryan Welch; Sarah A Gagliano Taliun; David Braff; Jill M Johnsen; Eimear E Kenny; Barbara A Konkle; Markku Laakso; Ruth F J Loos; Steven McCarroll; Carlos Pato; Michele T Pato; Albert V Smith; Michael Boehnke; Laura J Scott; Christian Fuchsberger
Journal:  Am J Hum Genet       Date:  2022-08-17       Impact factor: 11.043

3.  False positive findings during genome-wide association studies with imputation: influence of allele frequency and imputation accuracy.

Authors:  Zhihui Zhang; Xiangjun Xiao; Wen Zhou; Dakai Zhu; Christopher I Amos
Journal:  Hum Mol Genet       Date:  2021-12-17       Impact factor: 5.121

4.  Comparison among three variant callers and assessment of the accuracy of imputation from SNP array data to whole-genome sequence level in chicken.

Authors:  Guiyan Ni; Tim M Strom; Hubert Pausch; Christian Reimer; Rudolf Preisinger; Henner Simianer; Malena Erbe
Journal:  BMC Genomics       Date:  2015-10-21       Impact factor: 3.969

5.  Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture.

Authors:  Guiyan Ni; David Cavero; Anna Fangmann; Malena Erbe; Henner Simianer
Journal:  Genet Sel Evol       Date:  2017-01-16       Impact factor: 4.297

6.  Meta-analysis of sequence-based association studies across three cattle breeds reveals 25 QTL for fat and protein percentages in milk at nucleotide resolution.

Authors:  Hubert Pausch; Reiner Emmerling; Birgit Gredler-Grandl; Ruedi Fries; Hans D Daetwyler; Michael E Goddard
Journal:  BMC Genomics       Date:  2017-11-09       Impact factor: 3.969

7.  Identification of intermediate-sized deletions and inference of their impact on gene expression in a human population.

Authors:  Jing Hao Wong; Daichi Shigemizu; Yukiko Yoshii; Shintaro Akiyama; Azusa Tanaka; Hidewaki Nakagawa; Shu Narumiya; Akihiro Fujimoto
Journal:  Genome Med       Date:  2019-07-24       Impact factor: 11.117

8.  Rare Variants Imputation in Admixed Populations: Comparison Across Reference Panels and Bioinformatics Tools.

Authors:  Sanjeev Sariya; Joseph H Lee; Richard Mayeux; Badri N Vardarajan; Dolly Reyes-Dumeyer; Jennifer J Manly; Adam M Brickman; Rafael Lantigua; Martin Medrano; Ivonne Z Jimenez-Velazquez; Giuseppe Tosto
Journal:  Front Genet       Date:  2019-04-03       Impact factor: 4.599

9.  Imputation accuracy to whole-genome sequence in Nellore cattle.

Authors:  Gerardo A Fernandes Júnior; Roberto Carvalheiro; Henrique N de Oliveira; Mehdi Sargolzaei; Roy Costilla; Ricardo V Ventura; Larissa F S Fonseca; Haroldo H R Neves; Ben J Hayes; Lucia G de Albuquerque
Journal:  Genet Sel Evol       Date:  2021-03-12       Impact factor: 4.297

10.  Gene-Based Variant Analysis of Whole-Exome Sequencing in Relation to Eosinophil Count.

Authors:  Julia Höglund; Fatemeh Hadizadeh; Weronica E Ek; Torgny Karlsson; Åsa Johansson
Journal:  Front Immunol       Date:  2022-07-22       Impact factor: 8.786
