| Literature DB >> 28861891 |
Wei Zhou1, Lars G Fritsche2,3, Sayantan Das3, He Zhang4, Jonas B Nielsen4, Oddgeir L Holmen2,5, Jin Chen4, Maoxuan Lin4, Maiken B Elvestad2, Kristian Hveem6,7, Goncalo R Abecasis3, Hyun Min Kang3, Cristen J Willer1,4,8.
Abstract
The accuracy of genotype imputation depends upon two factors: the sample size of the reference panel and the genetic similarity between the reference panel and the target samples. When multiple reference panels are not consented to combine together, it is unclear how to combine the imputation results to optimize the power of genetic association studies. We compared the accuracy of 9,265 Norwegian genomes imputed from three reference panels-1000 Genomes phase 3 (1000G), Haplotype Reference Consortium (HRC), and a reference panel containing 2,201 Norwegian participants from the population-based Nord Trøndelag Health Study (HUNT) from low-pass genome sequencing. We observed that the population-matched reference panel allowed for imputation of more population-specific variants with lower frequency (minor allele frequency (MAF) between 0.05% and 0.5%). The overall imputation accuracy from the population-specific panel was substantially higher than 1000G and was comparable with HRC, despite HRC being 15-fold larger. These results recapitulate the value of population-specific reference panels for genotype imputation. We also evaluated different strategies to utilize multiple sets of imputed genotypes to increase the power of association studies. We observed that testing association for all variants imputed from any panel results in higher power to detect association than the alternative strategy of including only one version of each genetic variant, selected for having the highest imputation quality metric. This was particularly true for lower frequency variants (MAF < 1%), even after adjusting for the additional multiple testing burden.Entities:
Keywords: GWAS; genotype imputation; multiple reference panels; population-specific; study power
Mesh:
Year: 2017 PMID: 28861891 PMCID: PMC6324190 DOI: 10.1002/gepi.22067
Source DB: PubMed Journal: Genet Epidemiol ISSN: 0741-0395 Impact factor: 2.135