| Literature DB >> 35981533 |
Sarah C Hanks1, Lukas Forer2, Sebastian Schönherr2, Jonathon LeFaive1, Taylor Martins1, Ryan Welch1, Sarah A Gagliano Taliun3, David Braff4, Jill M Johnsen5, Eimear E Kenny6, Barbara A Konkle7, Markku Laakso8, Ruth F J Loos9, Steven McCarroll10, Carlos Pato11, Michele T Pato11, Albert V Smith1, Michael Boehnke1, Laura J Scott1, Christian Fuchsberger12.
Abstract
Understanding the genetic basis of human diseases and traits is dependent on the identification and accurate genotyping of genetic variants. Deep whole-genome sequencing (WGS), the gold standard technology for SNP and indel identification and genotyping, remains very expensive for most large studies. Here, we quantify the extent to which array genotyping followed by genotype imputation can approximate WGS in studies of individuals of African, Hispanic/Latino, and European ancestry in the US and of Finnish ancestry in Finland (a population isolate). For each study, we performed genotype imputation by using the genetic variants present on the Illumina Core, OmniExpress, MEGA, and Omni 2.5M arrays with the 1000G, HRC, and TOPMed imputation reference panels. Using the Omni 2.5M array and the TOPMed panel, ≥90% of bi-allelic single-nucleotide variants (SNVs) are well imputed (r2 > 0.8) down to minor-allele frequencies (MAFs) of 0.14% in African, 0.11% in Hispanic/Latino, 0.35% in European, and 0.85% in Finnish ancestries. There was little difference in TOPMed-based imputation quality among the arrays with >700k variants. Individual-level imputation quality varied widely between and within the three US studies. Imputation quality also varied across genomic regions, producing regions where even common (MAF > 5%) variants were consistently not well imputed across ancestries. The extent to which array genotyping and imputation can approximate WGS therefore depends on reference panel, genotype array, sample ancestry, and genomic location. Imputation quality by variant or genomic region can be queried with our new tool, RsqBrowser, now deployed on the Michigan Imputation Server.Entities:
Keywords: genotype imputation; genotyping array; whole-genome sequencing
Mesh:
Year: 2022 PMID: 35981533 PMCID: PMC9502057 DOI: 10.1016/j.ajhg.2022.07.012
Source DB: PubMed Journal: Am J Hum Genet ISSN: 0002-9297 Impact factor: 11.043