Literature DB >> 31977056

Accuracy of Imputation for Apolipoprotein E ε Alleles in Genome-Wide Genotyping Data.

Eero Vuoksimaa1, Teemu Palviainen1, Noora Lindgren2, Juha O Rinne2,3, Jaakko Kaprio1,4.   

Abstract

Entities:  

Year:  2020        PMID: 31977056      PMCID: PMC6991323          DOI: 10.1001/jamanetworkopen.2019.19960

Source DB:  PubMed          Journal:  JAMA Netw Open        ISSN: 2574-3805


× No keyword cloud information.

Introduction

Given the importance of the apolipoprotein E (APOE) gene for risk of Alzheimer disease, determining this genotype is important in cognitive aging studies. Before the genome-wide genotyping era, the APOE gene (alleles ε2, ε3, ε4) was directly genotyped and defined by 2 single-nucleotide polymorphisms (SNPs), rs429358 and rs7412, in chromosome 19. Owing to rapid development of genotyping technology, the price of genome-wide arrays covering approximately 500 000 SNPs has decreased to less than $100, making it more cost-effective compared with direct genotyping of a single gene, such as APOE. In addition to genotyped SNPs, the information can be used to impute common nonmeasured variants such as rs429358 and rs7412 that are not directly genotyped on many chips. However, imputation accuracy depends on the genome-wide arrays, quality control, and reference samples.[1,2] In this diagnostic study, we evaluated the association of reference panels with imputation quality of rs429358 and rs7412 by comparing imputation based on 3 different reference panels: 1000 Genomes (1000G),[3] Haplotype Reference Consortium (HRC),[4] and the Finnish-specific Sequencing Initiative Suomi (SISu).[5]

Methods

We used a population-based older Finnish Twin Cohort[6] study to examine the correspondence between rs429358 and rs7412 directly genotyped using a Sequenom (Taqman) and imputed rs429358 and rs7412 using 1000G Phase III version 5, HRC release 1.1, and SISu reference panels. Raw genotype data in a larger sample (5343 participants) using 5 array versions (12 v1.0 A, 12 v1.1 A, 24 v1.0 A, 24 v1.1 A, and 24 v1.2 A) of HumanCoreExome (Illumina) were merged before the quality control phase. We removed variants with call rate less than 97.5%, samples with call rate less than 95%, variants with minor allele frequency less than 1%, and variants with Hardy-Weinberg equilibrium P < 1.0 × 10−6. We removed samples with heterozygosity test method-of-moments F coefficient estimate values less than −0.03 or greater than 0.05, multidimensional scaling principal component analysis outliers, and samples that failed sex check. The number of genotyped autosomal variants after quality control was 239 894 (5328 participants). We then performed prephasing using Eagle software version 2.3 (Broad Institute) and imputation with Minimac3 software version 2.0.1 (University of Michigan) (Table 1). The study sample (1704 participants) with directly genotyped rs429358 and rs7412 was extracted from each imputed data set. Ethical approval was obtained from the ethical committee of the Hospital District of Southwest Finland, and participants gave written informed consent.
Table 1.

Cross Tabulation of Directly Genotyped and Imputed Single-Nucleotide Polymorphism of rs429358 and rs7412 for 1704 Individuals

Imputation BasedGenotyped, No. (%)Total, No.
1000 Genomes
rs429358TTCTCC
TT1152 (99.74)3 (0.26)0 1155
CT0 503 (100)0 503
CC0 0 46 (100)46
rs7412CCCTTT
CC1505 (100)0 0 1505
CT2 (1.03)192 (98.97)0 194
TT0 1 (20.0)4 (80.0)5
Haplotype Reference Consortium
rs429358TTCTCC
TT1153 (99.83)2 (0.17)0 1155
CT0 503 (100)0 503
CC0 0 46 (100)46
rs7412CCCTTT
CC1505 (100)0 0 1505
CT0 194 (100)0 194
TT0 1 (20.0)4 (80.0)5
Sequencing Initiative Suomi
rs429358TTCTCC
TT1153 (99.83)2 (0.17)0 1155
CT0 503 (100)0 503
CC0 0 46 (100)46
rs7412CCCTTT
CC1505 (100)0 0 1505
CT0 194 (100)0 194
TT0 1 (20.0)4 (80.0)5

Genotypes were imputed to 1000 Genomes Phase III version 5,[3] Haplotype Reference Consortium release 1.1,[4] and Sequencing Initiative Suomi Finnish-only reference panels.[5] Imputation to 1000 Genomes and Haplotype Reference Consortium reference panels was done using the University of Michigan Imputation Server. The Sequencing Initiative Suomi reference panel consists of 16 962 023 variants from 3775 high-pass whole-genome (depth up to 30×) sequences.

Genotypes were imputed to 1000 Genomes Phase III version 5,[3] Haplotype Reference Consortium release 1.1,[4] and Sequencing Initiative Suomi Finnish-only reference panels.[5] Imputation to 1000 Genomes and Haplotype Reference Consortium reference panels was done using the University of Michigan Imputation Server. The Sequencing Initiative Suomi reference panel consists of 16 962 023 variants from 3775 high-pass whole-genome (depth up to 30×) sequences.

Results

Participants were of European ancestry (mean [SD] age, 74.2 [4.9] years; 775 [45%] women). For directly genotyped individuals, 984 (57.7%) had ε3/ε3 genotype, 521 (30.6%) had ε3/ε4 or ε4/ε4, 171 (10%) had ε2/ε2 or ε2/ε3, and 28 (1.6%) had ε2/ε4. Allele frequencies were 0.060 for ε2, 0.765 for ε3, and 0.175 for ε4. Based on 1000G, 1701 individuals (99.82%) had correctly classified alleles of both rs429358 and rs7412 (Table 1). Results were similar for HRC and SISu: 1702 individuals (99.88%) and 1703 individuals (99.94%) had correctly classified rs429358 and rs7412, respectively (Table 1). Using the HRC reference panel, 1702 individuals (99.88%) had correctly classified APOE genotype and ε4 carrier status (Table 2). Two (0.12%) of the ε4 noncarriers based on directly genotyped SNPs were incorrectly classified as ε4 carriers.
Table 2.

Cross Tabulation of Directly Genotyped and Haplotype Reference Consortium Reference Panel Imputation–Based APOE Status in 1704 Individuals

APOE Status (% Individuals)APOE Status Based on Imputed Single-Nucleotide Polymorphisms, No.Total
ε2/ε2ε2/ε3ε2/ε4ε3/ε3ε3/ε4ε4/ε4
ε2/ε2 (0.29%)4010005
ε2/ε3 (9.74%)01660000166
ε2/ε4 (1.64%)002800028
ε3/ε3 (57.75%)00098310984
ε3/ε4 (27.88%)00004750475
ε4/ε4 (2.70%)000004646

Abbreviation: APOE, apolipoprotein E.

Abbreviation: APOE, apolipoprotein E.

Discussion

This study found that by using arrays described in the Methods section, imputation to all 3 reference panels, 1000G, HRC, and SISu, yielded high imputation accuracy of rs429358 and rs7412, 2 SNPs needed to determine polymorphic APOE ε alleles. The number of Finnish samples do vary in different reference panels: 99 in 1000G Phase III, approximately 1900 in the HRC, and 3800 in the Finnish-only SISu. Considering 1000G Phase III yielded improved accuracy compared with Phase I.[1,2] Our results also suggest that determination of APOE can be reached equally well with most recent freely available cosmopolitan reference panels compared with a population-specific reference panel. Still, all Finnish samples and inclusion of only 1 brand of arrays were also limitations, and these results should be confirmed in people with different ancestry.
  6 in total

1.  Accuracy of Inferred APOE Genotypes for a Range of Genotyping Arrays and Imputation Reference Panels.

Authors:  Michelle K Lupton; Sarah E Medland; Scott D Gordon; Tabatha Goncalves; Stuart MacGregor; David A Mackey; Terri L Young; David L Duffy; Peter M Visscher; Naomi R Wray; Dale R Nyholt; Lisa Bain; Manuel A Ferreira; Anjali K Henders; Leanne Wallace; Grant W Montgomery; Margaret J Wright; Nicholas G Martin
Journal:  J Alzheimers Dis       Date:  2018       Impact factor: 4.472

2.  Concordance between direct and imputed APOE genotypes using 1000 Genomes data.

Authors:  Christopher Oldmeadow; Elizabeth G Holliday; Mark McEvoy; Rodney Scott; John B J Kwok; Karen Mather; Perminder Sachdev; Peter Schofield; John Attia
Journal:  J Alzheimers Dis       Date:  2014       Impact factor: 4.472

3.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

4.  A reference panel of 64,976 haplotypes for genotype imputation.

Authors:  Shane McCarthy; Sayantan Das; Warren Kretzschmar; Olivier Delaneau; Andrew R Wood; Alexander Teumer; Hyun Min Kang; Christian Fuchsberger; Petr Danecek; Kevin Sharp; Yang Luo; Carlo Sidore; Alan Kwong; Nicholas Timpson; Seppo Koskinen; Scott Vrieze; Laura J Scott; He Zhang; Anubha Mahajan; Jan Veldink; Ulrike Peters; Carlos Pato; Cornelia M van Duijn; Christopher E Gillies; Ilaria Gandin; Massimo Mezzavilla; Arthur Gilly; Massimiliano Cocca; Michela Traglia; Andrea Angius; Jeffrey C Barrett; Dorrett Boomsma; Kari Branham; Gerome Breen; Chad M Brummett; Fabio Busonero; Harry Campbell; Andrew Chan; Sai Chen; Emily Chew; Francis S Collins; Laura J Corbin; George Davey Smith; George Dedoussis; Marcus Dorr; Aliki-Eleni Farmaki; Luigi Ferrucci; Lukas Forer; Ross M Fraser; Stacey Gabriel; Shawn Levy; Leif Groop; Tabitha Harrison; Andrew Hattersley; Oddgeir L Holmen; Kristian Hveem; Matthias Kretzler; James C Lee; Matt McGue; Thomas Meitinger; David Melzer; Josine L Min; Karen L Mohlke; John B Vincent; Matthias Nauck; Deborah Nickerson; Aarno Palotie; Michele Pato; Nicola Pirastu; Melvin McInnis; J Brent Richards; Cinzia Sala; Veikko Salomaa; David Schlessinger; Sebastian Schoenherr; P Eline Slagboom; Kerrin Small; Timothy Spector; Dwight Stambolian; Marcus Tuke; Jaakko Tuomilehto; Leonard H Van den Berg; Wouter Van Rheenen; Uwe Volker; Cisca Wijmenga; Daniela Toniolo; Eleftheria Zeggini; Paolo Gasparini; Matthew G Sampson; James F Wilson; Timothy Frayling; Paul I W de Bakker; Morris A Swertz; Steven McCarroll; Charles Kooperberg; Annelot Dekker; David Altshuler; Cristen Willer; William Iacono; Samuli Ripatti; Nicole Soranzo; Klaudia Walter; Anand Swaroop; Francesco Cucca; Carl A Anderson; Richard M Myers; Michael Boehnke; Mark I McCarthy; Richard Durbin
Journal:  Nat Genet       Date:  2016-08-22       Impact factor: 38.330

5.  Distribution and medical impact of loss-of-function variants in the Finnish founder population.

Authors:  Elaine T Lim; Peter Würtz; Aki S Havulinna; Priit Palta; Taru Tukiainen; Karola Rehnström; Tõnu Esko; Reedik Mägi; Michael Inouye; Tuuli Lappalainen; Yingleong Chan; Rany M Salem; Monkol Lek; Jason Flannick; Xueling Sim; Alisa Manning; Claes Ladenvall; Suzannah Bumpstead; Eija Hämäläinen; Kristiina Aalto; Mikael Maksimow; Marko Salmi; Stefan Blankenberg; Diego Ardissino; Svati Shah; Benjamin Horne; Ruth McPherson; Gerald K Hovingh; Muredach P Reilly; Hugh Watkins; Anuj Goel; Martin Farrall; Domenico Girelli; Alex P Reiner; Nathan O Stitziel; Sekar Kathiresan; Stacey Gabriel; Jeffrey C Barrett; Terho Lehtimäki; Markku Laakso; Leif Groop; Jaakko Kaprio; Markus Perola; Mark I McCarthy; Michael Boehnke; David M Altshuler; Cecilia M Lindgren; Joel N Hirschhorn; Andres Metspalu; Nelson B Freimer; Tanja Zeller; Sirpa Jalkanen; Seppo Koskinen; Olli Raitakari; Richard Durbin; Daniel G MacArthur; Veikko Salomaa; Samuli Ripatti; Mark J Daly; Aarno Palotie
Journal:  PLoS Genet       Date:  2014-07-31       Impact factor: 5.917

6.  Middle age self-report risk score predicts cognitive functioning and dementia in 20-40 years.

Authors:  Eero Vuoksimaa; Juha O Rinne; Noora Lindgren; Kauko Heikkilä; Markku Koskenvuo; Jaakko Kaprio
Journal:  Alzheimers Dement (Amst)       Date:  2016-09-14
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.