
The effect of genome-wide association scan quality control on imputation outcome for common variants.

Lorraine Southam, Kalliope Panoutsopoulou, N William Rayner, Kay Chapman, Caroline Durrant, Teresa Ferreira, Nigel Arden, Andrew Carr, Panos Deloukas, Michael Doherty, John Loughlin, Andrew McCaskie, William E R Ollier, Stuart Ralston, Timothy D Spector, Ana M Valdes, Gillian A Wallis, J Mark Wilkinson, Jonathan Marchini, Eleftheria Zeggini.

Abstract

Imputation is an extremely valuable tool in conducting and synthesising genome-wide association studies (GWASs). Directly typed SNP quality control (QC) is thought to affect imputation quality. It is, therefore, common practice to use quality-controlled (QCed) data as an input for imputing genotypes. This study aims to determine the effect of commonly applied QC steps on imputation outcomes. We performed several iterations of imputing SNPs across chromosome 22 in a dataset consisting of 3177 samples with Illumina 610k (Illumina, San Diego, CA, USA) GWAS data, applying different QC steps each time. The imputed genotypes were compared with the directly typed genotypes. In addition, we investigated the correlation between alternatively QCed data. We also applied a series of post-imputation QC steps balancing elimination of poorly imputed SNPs and information loss. We found that the difference between the unQCed data and the fully QCed data on imputation outcome was minimal. Our study shows that imputation of common variants is generally very accurate and robust to GWAS QC, which is not a major factor affecting imputation outcome. A minority of common-frequency SNPs with particular properties cannot be accurately imputed regardless of QC stringency. These findings may not generalise to the imputation of low-frequency and rare variants.

Year: 2011 PMID: 21267008 PMCID: PMC3083623 DOI: 10.1038/ejhg.2010.242

Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246

Introduction

Genome-wide association scans (GWASs) have proven to be a successful strategy for detecting common variants exerting modest effects on complex disease risk. Currently available commercial platforms focus on common variants and capture the majority of HapMap[1] SNPs with minor allele frequency (MAF) >0.05 in European populations.[2] Several large-scale consortia have been formed in order to carry out GWAS meta-analyses for various phenotypes, with successful outcomes (eg, Zeggini et al,[3] Prokopenko et al,[4] Franke et al,[5] Barrett et al[6] and Soranzo et al[7]). To enable the combination of data across studies carried out on different platforms, and to enable in silico fine mapping of association signals, imputation approaches were proposed a few years ago[8] as a means of statistically inferring genotypes at untyped loci using a reference set, for example, the HapMap (∼2 500 000 SNPs). An important aspect of any GWAS analysis is the implementation of a series of rigorous quality control (QC) steps before testing for association. These QC procedures help guard against genotyping error, population stratification, sample duplication and other confounders that can affect the analysis results. QC steps are typically applied at the sample- and SNP-specific level. Sample-level QC includes filtering out samples with low call rates, evidence for different ethnic origin, high heterozygosity, relatedness/duplication, gender discrepancies and genotyping batch effects. SNP-level QC includes filtering out SNPs with low call rates and deviation from Hardy–Weinberg equilibrium (HWE) at pre-determined thresholds. It is generally believed that datasets should be stringently quality controlled (QCed) at the marker level before applying imputation approaches. For this reason, lower MAF SNPs tend to also be excluded, as their genotyping accuracy can be hampered by poor clustering properties and incorrect automated genotype calling (at least with currently widely used algorithms).
Even though such weight is placed on pre-imputation SNP QC, the effects of applying different criteria and thresholds to the starting dataset have not been investigated thus far. In this report, we evaluate the effect of GWAS QC on imputation outcome, and find that imputation works very well for common variants irrespective of QC, and that a minority of common-frequency SNPs with particular properties cannot be accurately imputed regardless of QC stringency.

Materials and methods

We used an empirical GWAS dataset to assess the effect of QC on imputation outcome. We focused on chromosome 22 (n=9038 directly typed SNPs) from 3177 osteoarthritis (OA) cases from the United Kingdom, typed on the Illumina 610k quad chip (Illumina) as part of the arcOGEN consortium GWAS (manuscript submitted). Chromosome 22 is representative of the genome in terms of the proportion of directly typed to imputed SNPs. All samples included in our analysis had passed standard sample-level QC (based on call rate, heterozygosity, relatedness, ethnicity and gender discrepancies). We imputed genotypes at variants on the basis of HapMap phase II release 22 CEU data (n=33 815 SNPs on chr22) using IMPUTE v1 (https://mathgen.stats.ox.ac.uk/impute/impute.html).[8] We performed each imputation in duplicate, with and without the IMPUTE v1 predict genotyped SNPs flag, which resulted in one set of imputed data containing the original genotypes at the directly typed SNPs and another containing imputed genotypes at those SNPs. To assess the effect of varying levels of QC, we carried out several rounds of imputation, using differently QCed OA SNP data as the starting point. Initially, we imputed on the basis of no SNP-level QC, including all directly typed SNPs, regardless of MAF, call rate and HWE. We also imputed on the basis of only those SNPs that passed stringent QC thresholds (call rate >95% for SNPs with a MAF ≥5% and call rate >99% for SNPs with a MAF <5%, HWE exact P>0.0001, MAF >0.01 and removing all SNPs with GC or TA alleles; Table 1). Although imputation biases can occur due to poor clustering of SNPs with miscalled genotypes in the starting dataset, cluster plot checking is not feasible at the genome-wide scale and is therefore not implemented in standard GWAS QC.
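The stringent SNP-level thresholds listed above can be sketched as a single pass/fail check. This is a minimal illustration, not the pipeline used in the study: the `SnpStats` record and the function name are hypothetical, the summary statistics are assumed to have been computed beforehand, and "GC or TA alleles" is read here as strand-ambiguous G/C and A/T SNPs.

```python
from dataclasses import dataclass


@dataclass
class SnpStats:
    """Per-SNP summary statistics (hypothetical record layout)."""
    snp_id: str
    allele_a: str
    allele_b: str
    call_rate: float  # fraction of samples with a called genotype, 0..1
    maf: float        # minor allele frequency, 0..0.5
    hwe_p: float      # exact Hardy-Weinberg equilibrium P value


# Allele pairs that read the same on both strands (G/C and A/T SNPs).
STRAND_AMBIGUOUS = {frozenset("GC"), frozenset("AT")}


def passes_stringent_qc(snp: SnpStats) -> bool:
    """Return True if the SNP passes the thresholds quoted in the text."""
    if snp.maf <= 0.01:       # keep only MAF > 0.01
        return False
    if snp.hwe_p <= 1e-4:     # keep only HWE exact P > 0.0001
        return False
    if frozenset((snp.allele_a, snp.allele_b)) in STRAND_AMBIGUOUS:
        return False
    # Call rate must exceed 95% for MAF >= 5% and 99% for MAF < 5%.
    min_call_rate = 0.95 if snp.maf >= 0.05 else 0.99
    return snp.call_rate > min_call_rate


example = SnpStats("rs_example", "A", "G", call_rate=0.97, maf=0.20, hwe_p=0.5)
print(passes_stringent_qc(example))
```

The unQCed starting point corresponds to skipping this check entirely and passing every directly typed SNP to the imputation software.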

Table 1

Summary of QC steps and related SNP number breakdown

		Post-imputation unfiltered SNPs		Post-imputation QC filtered SNPsᵃ
Pre-impute QC threshold applied	Directly typed SNPs also present in HapMap	NS	S	NS	S
None (‘unQCed' dataset)	8064ᵇ	7689	375	6498	77
Typical GWAS QC (‘QCed' dataset)ᶜ	7910	7585	325	6446	67
As above plus 14 significant SNPs removed with poor cluster plotsᵈ	7896	7592	304	6449	61
As above plus 36 additional SNPs removed with poor cluster plotsᵉ	7860	7557	303	6419	58
Typical GWAS QC plus MAF <5%ᶜ	7554	7269	285	6434	65
Typical GWAS QC plus MAF <10%ᶜ	6544	6287	257	5569	53

Abbreviations: GWAS, genome-wide association study; MAF, minor allele frequency; NS, not significant; QC, quality control; QCed, quality controlled; S, significant.

ᵃ Filtering is based on removal of SNPs with an IMPUTE-info score of <0.8 and MAF <5%.

ᵇ There were 8082 SNPs in the unQCed data, of which 18 were monomorphic in the arcOGEN cases but polymorphic in HapMap; these SNPs were removed by IMPUTE.

ᶜ Typical GWAS QC was exclusion of SNPs with MAF ≥5% and call rate <95%, SNPs with MAF <5% and call rate <99%, and SNPs with Hardy–Weinberg equilibrium P<1 × 10⁻⁴, together with exclusion of GC and AT allele SNPs and MAF <1% SNPs, applied as an additional post-association analysis and pre-imputation QC step.

ᵈ Significant SNPs with poor cluster plots removed.

ᵉ Those SNPs flanking the significant SNPs with poor cluster plots removed.

arcOGEN data for chromosome 22 detailing the different pre-imputation QC steps. A breakdown of the SNP number for each QC threshold is indicated both with and without the post-imputation QC.

NS, P≥1 × 10⁻⁶; significant SNPs, P<1 × 10⁻⁶.

We evaluated the accuracy of imputed genotypes by comparing, for each QC-imputation iteration, allele frequencies at the same SNP between the imputed data and the true, directly typed data. Under perfect imputation, we would expect to see alignment with the null hypothesis of no association. We used SNPTEST (http://www.stats.ox.ac.uk/~marchini/software/gwas/snptest.html)[9] to investigate differences between directly typed and imputed genotypes at the same variants within the same samples, taking into account the distribution of genotype probabilities for each individual. For the purposes of our comparison, we used those SNPs that were directly genotyped in OA cases and also present in the HapMap reference samples. Table 1 summarises the number of these SNPs for each QC threshold. When comparing directly typed with imputed allele frequencies at the same variant in the same individuals, we arbitrarily considered P<10⁻⁶ as significantly different. We calculated the correlation between imputed and directly typed MAF, using the expected counts to allow for genotype-associated probabilities. We also applied a series of post-imputation QC steps in order to eliminate unreliably imputed SNPs, aiming to filter out as many of these SNPs as possible while retaining a good proportion of nonsignificant SNPs. We compared two alternative methods for post-imputation QC filtering: first, the IMPUTE-info score, which is associated with the imputed allele frequency estimate and ranges from 1, indicating high confidence, to 0, indicating low confidence; and second, the freq-add-proper-info score provided by SNPTEST, a relative statistical information score ranging from 0 (no information) to 1 (complete information).
The SNPTEST freq-add-proper-info score has been shown to be highly correlated with the IMPUTE-info score under the additive model.[10] In both scenarios, we also filtered out SNPs with MAF <5%. Figure 1 illustrates the effects of altering post-imputation QC filters on the QCed data. On the basis of these results, we chose to use the IMPUTE-info score with a filtering threshold of <0.8 and MAF <5%, which effectively eliminated ∼79% of the significant SNPs while retaining ∼85% of the nonsignificant ones (SNPTEST freq-add-proper-info <0.9 and MAF <5% would be roughly equivalent, eliminating ∼73% of the significant SNPs while retaining ∼89% of the nonsignificant ones). We applied this post-imputation filter to each of our datasets and compared the results. We looked at the unQCed and QCed datasets first, as summarised in Table 1. For each scenario, we examined frequency differences between the directly typed and the imputed genotypes as described above. In addition, we compared the imputed genotypes at imputed SNPs only for the unQCed and the fully QCed (QCed data with all poorly clustered markers removed) strategies.
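The elimination and retention percentages quoted for the chosen IMPUTE-info filter can be checked against the QCed row of Table 1. The short sketch below only redoes that arithmetic; the variable and function names are illustrative.

```python
# SNP counts for the QCed dataset, copied from Table 1.
unfiltered = {"NS": 7585, "S": 325}  # before post-imputation QC
filtered = {"NS": 6446, "S": 67}     # after the IMPUTE-info <0.8 and MAF <5% filter


def percent(part: int, whole: int) -> float:
    """Express part as a percentage of whole."""
    return 100.0 * part / whole


# Share of significantly different SNPs removed by the filter.
significant_eliminated = percent(unfiltered["S"] - filtered["S"], unfiltered["S"])
# Share of nonsignificant SNPs that survive the filter.
nonsignificant_retained = percent(filtered["NS"], unfiltered["NS"])

print(f"significant SNPs eliminated: {significant_eliminated:.1f}%")
print(f"nonsignificant SNPs retained: {nonsignificant_retained:.1f}%")
```

Both values round to the ∼79% and ∼85% given in the text.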

Figure 1

(a) Imputation results for the QCed data indicating the total number of SNPs filtered for different QC thresholds using the IMPUTE-info and freq-add-proper-info scores. The SNPs remaining after the filter (red bar) have been subdivided into SNPs that are significant (green bar) and not significant (yellow bar). (b) The same data as percentage of significant and nonsignificant SNPs removed for each threshold. Both methods of filtering appear to be equivalent, but the freq-add-proper-info is shifted to the right for the same numerical threshold; we chose the IMPUTE-info <0.8 for further analysis (similar to a freq-add-proper-info <0.9).

Results

Table 1 summarises the number of SNPs with significantly (P<10⁻⁶) different allele frequencies between the directly typed and imputed data in the same set of individuals for each of the different QC sets. Correlation plots and R² values for the comparisons of the QCed and unQCed datasets are presented in Figure 2. The difference between the unQCed (R²=0.993) and QCed data (R²=0.994) was minimal. After post-imputation filtering there were 77 SNPs with significantly different (imputed vs directly typed) allele frequencies in the unQCed data compared with 67 significant SNPs in the QCed data. In an attempt to improve imputation for the small subset of poorly imputed SNPs in the QCed data, we excluded all SNPs with MAF <5% and, subsequently, also SNPs with MAF <10%. We found that eliminating these lower MAF SNPs before imputation had little effect overall. The R² for the post-imputation QC filtered comparison with the QCed data was virtually identical both when excluding all SNPs with MAF <5% (R²=0.994) and when excluding all SNPs with MAF <10% (R²=0.991).
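The correlations above compare directly typed MAF with imputed MAF derived from expected allele counts. A minimal sketch of that calculation follows, under stated assumptions: genotype probabilities are given per sample as (P(AA), P(AB), P(BB)) triples, the denominator is normalised by the total probability mass (a choice made here, not taken from the paper), and R² is taken to be the squared Pearson correlation. All names are hypothetical.

```python
from typing import Sequence, Tuple

GenotypeProbs = Tuple[float, float, float]  # P(AA), P(AB), P(BB) for one sample


def expected_maf(probs: Sequence[GenotypeProbs]) -> float:
    """Minor allele frequency from expected allele counts at one SNP."""
    # Expected number of B alleles: 1 per AB genotype, 2 per BB genotype.
    expected_b = sum(p_ab + 2.0 * p_bb for _, p_ab, p_bb in probs)
    # Assumption: normalise by total probability mass so that samples with
    # partly missing probabilities do not pull the frequency towards zero.
    total_alleles = 2.0 * sum(p_aa + p_ab + p_bb for p_aa, p_ab, p_bb in probs)
    freq_b = expected_b / total_alleles
    return min(freq_b, 1.0 - freq_b)


def r_squared(x: Sequence[float], y: Sequence[float]) -> float:
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy * sxy / (sxx * syy)


# Toy data: four samples at one SNP, with one uncertain imputed call.
typed = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]
imputed = [(0.9, 0.1, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]
print(expected_maf(typed), expected_maf(imputed))
```

In practice `expected_maf` would be evaluated once per SNP for the typed and imputed data, and `r_squared` applied to the two resulting lists of frequencies.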

Figure 2

Correlation plots and the associated R² for (a) the unQCed and QCed data with and without post-imputation QC filtering (IMPUTE-info <0.8 and MAF <5%), and (b) the imputed-only markers in the unQCed and fully QCed data (QCed data with all poorly clustered markers removed) without post-imputation QC filtering.

Given this apparent minimal influence of input data QC on imputation outcome, we investigated further the small set of SNPs showing significant allele frequency differences for the presence of a common characteristic that could conceivably be used as a post-imputation filter. To rule out poor genotyping as the cause of these significant differences, we examined all cluster plots for the unfiltered significant SNPs (P<1 × 10⁻⁶, n=325). In all, 14 poorly clustered SNPs were removed and the data were re-imputed. After post-imputation QC, three additional SNPs were not significant and six were less significant. We then inspected the cluster plots for 10 SNPs on either side of the 61 SNPs remaining significantly different to rule out poor imputation due to flanking SNP poor clustering properties. We examined the cluster plots for 1008 SNPs and found that 36 of these were poor; these resided in the proximity of 35 of the significant SNPs. We subsequently removed these SNPs and re-imputed. We found that following post-imputation QC filtering, only 3 of the 61 SNPs were no longer significant, and the R² remained the same as for the QCed data (R²=0.994) for the post-imputation QC filtered data. When we repeated comparisons using IMPUTE v2 with the HapMap3 (CEU, release no. 2, February 2009) and data from the 1000 Genomes Project (Pilot 1 genotypes released March 2010; phased haplotypes released June 2010) as the reference panels, we observed qualitatively similar results. Differences in region-specific recombination rates may account for the few remaining significant SNPs, as variants in areas of especially high recombination rate may be more challenging to impute accurately regardless of QC.
To investigate this, we first examined the QCed unfiltered data and found that when the data were dichotomised into those markers with lower (<1 cM/Mb) and higher (≥1 cM/Mb) recombination rates, there were more significant SNPs present in the higher recombination rate group compared with the lower recombination rate group (P=1.85 × 10⁻²⁷, average recombination rates of 12.8 and 3.04, respectively). When we examined the QCed data post-imputation QC, this difference disappeared (P=0.526). This clearly indicates that application of the post-imputation QC filter successfully identifies the majority of significant SNPs with high recombination rates. It would therefore not be prudent to include recombination rate as an extra filter. For example, using the QCed post-imputation QC filtered data and applying a further filter with a recombination rate threshold of >1 cM/Mb would eliminate 2075 SNPs, only 24 of which are significantly different.
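The cost of a recombination rate filter can be made concrete with the counts given above and in Table 1. The sketch below is arithmetic only and assumes that the 2075 removed SNPs are counted within the QCed, post-imputation QC filtered set (6446 nonsignificant plus 67 significant SNPs); the variable names are illustrative.

```python
# Counts for the QCed, post-imputation QC filtered data (Table 1 and the text).
total_snps = 6446 + 67     # nonsignificant + significant SNPs
significant_snps = 67
removed_snps = 2075        # SNPs with recombination rate >1 cM/Mb
removed_significant = 24   # significantly different SNPs among those removed

# Fraction of the whole dataset that the extra filter would discard.
share_of_all_removed = removed_snps / total_snps
# Fraction of the significantly different SNPs that it would catch.
share_of_significant_removed = removed_significant / significant_snps
# Fraction of the discarded SNPs that were actually significantly different.
hit_rate = removed_significant / removed_snps

print(f"all SNPs removed: {share_of_all_removed:.1%}")
print(f"significant SNPs removed: {share_of_significant_removed:.1%}")
print(f"significant among removed: {hit_rate:.1%}")
```

Roughly a third of all SNPs would be lost in order to remove roughly a third of the significantly different ones, so the filter shows little enrichment for poorly imputed SNPs, which is consistent with the nonsignificant P value reported after post-imputation QC.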

Discussion

The imputation accuracy of common variants does not appear to be substantially affected by GWAS QC steps. Our data demonstrate that there is little difference in imputation accuracy observed in unQCed GWAS data when compared with QCed GWAS data. Furthermore, the implementation of additional QC steps (eg, filtering out variants with MAF <0.05 and <0.10) does not considerably improve overall imputation accuracy. Missing variants and directly typed variants that fail pre-imputation QC checks are imputed and these data are used for downstream analyses. Post-imputation QC successfully eliminates a good proportion of inaccurately imputed SNPs. Specifically, by applying a very stringent post-imputation QC threshold, a smaller set of variants with more accurately predicted genotypes remains. The IMPUTE-info threshold of <0.8 and MAF <5% criterion successfully filtered out the majority of poorly imputed SNPs. However, the application of these strict filters in GWAS data could result in many SNPs being excluded from the data, and thus potential true association signals could be missed. Some of the inaccurately imputed variants were due to poor clustering properties. It is plausible that the handful of variants that still remained inaccurately imputed could be due to differences in ethnicity between our data and the HapMap CEU reference panel from which the genotypes were predicted. We have used IMPUTE, but do not expect our results and conclusions to qualitatively differ with different imputation methods; for example, BEAGLE and MACH exhibit similar imputation accuracy to IMPUTE.[11] Differences in population structure between the reference panel and target dataset can be a source of imputation inaccuracy. Imputation accuracy for common SNPs may be further increased by using larger reference panels with data on denser sets of variants. Our results show that GWAS QC is not of paramount importance for the imputation of common variants.
This may be different for the imputation of low-frequency and rare variants based on emerging reference panels such as the 1000 Genomes (http://www.1000genomes.org) and UK10K (http://www.uk10k.org) projects. In summary, our study demonstrates that imputation of common variants is generally very accurate and robust to GWAS QC, which is not a major factor affecting imputation outcome.

11 in total

1. The International HapMap Project.

Authors:
Journal: Nature Date: 2003-12-18 Impact factor: 49.962

Review 2. Genotype imputation for genome-wide association studies.

Authors: Jonathan Marchini; Bryan Howie
Journal: Nat Rev Genet Date: 2010-07 Impact factor: 53.242

3. Evaluating coverage of genome-wide association studies.

Authors: Jeffrey C Barrett; Lon R Cardon
Journal: Nat Genet Date: 2006-05-21 Impact factor: 38.330

4. A comprehensive evaluation of SNP genotype imputation.

Authors: Michael Nothnagel; David Ellinghaus; Stefan Schreiber; Michael Krawczak; Andre Franke
Journal: Hum Genet Date: 2008-12-17 Impact factor: 4.132

5. Replication of signals from recent studies of Crohn's disease identifies previously unknown disease loci for ulcerative colitis.

Authors: Andre Franke; Tobias Balschun; Tom H Karlsen; Jürgen Hedderich; Sandra May; Tim Lu; Dörthe Schuldt; Susanna Nikolaus; Philip Rosenstiel; Michael Krawczak; Stefan Schreiber
Journal: Nat Genet Date: 2008-04-27 Impact factor: 38.330

6. A genome-wide meta-analysis identifies 22 loci associated with eight hematological parameters in the HaemGen consortium.

Authors: Nicole Soranzo; Tim D Spector; Massimo Mangino; Brigitte Kühnel; Augusto Rendon; Alexander Teumer; Christina Willenborg; Benjamin Wright; Li Chen; Mingyao Li; Perttu Salo; Benjamin F Voight; Philippa Burns; Roman A Laskowski; Yali Xue; Stephan Menzel; David Altshuler; John R Bradley; Suzannah Bumpstead; Mary-Susan Burnett; Joseph Devaney; Angela Döring; Roberto Elosua; Stephen E Epstein; Wendy Erber; Mario Falchi; Stephen F Garner; Mohammed J R Ghori; Alison H Goodall; Rhian Gwilliam; Hakon H Hakonarson; Alistair S Hall; Naomi Hammond; Christian Hengstenberg; Thomas Illig; Inke R König; Christopher W Knouff; Ruth McPherson; Olle Melander; Vincent Mooser; Matthias Nauck; Markku S Nieminen; Christopher J O'Donnell; Leena Peltonen; Simon C Potter; Holger Prokisch; Daniel J Rader; Catherine M Rice; Robert Roberts; Veikko Salomaa; Jennifer Sambrook; Stefan Schreiber; Heribert Schunkert; Stephen M Schwartz; Jovana Serbanovic-Canic; Juha Sinisalo; David S Siscovick; Klaus Stark; Ida Surakka; Jonathan Stephens; John R Thompson; Uwe Völker; Henry Völzke; Nicholas A Watkins; George A Wells; H-Erich Wichmann; David A Van Heel; Chris Tyler-Smith; Swee Lay Thein; Sekar Kathiresan; Markus Perola; Muredach P Reilly; Alexandre F R Stewart; Jeanette Erdmann; Nilesh J Samani; Christa Meisinger; Andreas Greinacher; Panos Deloukas; Willem H Ouwehand; Christian Gieger
Journal: Nat Genet Date: 2009-10-11 Impact factor: 38.330

7. Genome-wide association study and meta-analysis find that over 40 loci affect risk of type 1 diabetes.

Authors: Jeffrey C Barrett; David G Clayton; Patrick Concannon; Beena Akolkar; Jason D Cooper; Henry A Erlich; Cécile Julier; Grant Morahan; Jørn Nerup; Concepcion Nierras; Vincent Plagnol; Flemming Pociot; Helen Schuilenburg; Deborah J Smyth; Helen Stevens; John A Todd; Neil M Walker; Stephen S Rich
Journal: Nat Genet Date: 2009-05-10 Impact factor: 38.330

8. Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes.

Authors: Eleftheria Zeggini; Laura J Scott; Richa Saxena; Benjamin F Voight; Jonathan L Marchini; Tianle Hu; Paul I W de Bakker; Gonçalo R Abecasis; Peter Almgren; Gitte Andersen; Kristin Ardlie; Kristina Bengtsson Boström; Richard N Bergman; Lori L Bonnycastle; Knut Borch-Johnsen; Noël P Burtt; Hong Chen; Peter S Chines; Mark J Daly; Parimal Deodhar; Chia-Jen Ding; Alex S F Doney; William L Duren; Katherine S Elliott; Michael R Erdos; Timothy M Frayling; Rachel M Freathy; Lauren Gianniny; Harald Grallert; Niels Grarup; Christopher J Groves; Candace Guiducci; Torben Hansen; Christian Herder; Graham A Hitman; Thomas E Hughes; Bo Isomaa; Anne U Jackson; Torben Jørgensen; Augustine Kong; Kari Kubalanza; Finny G Kuruvilla; Johanna Kuusisto; Claudia Langenberg; Hana Lango; Torsten Lauritzen; Yun Li; Cecilia M Lindgren; Valeriya Lyssenko; Amanda F Marvelle; Christa Meisinger; Kristian Midthjell; Karen L Mohlke; Mario A Morken; Andrew D Morris; Narisu Narisu; Peter Nilsson; Katharine R Owen; Colin N A Palmer; Felicity Payne; John R B Perry; Elin Pettersen; Carl Platou; Inga Prokopenko; Lu Qi; Li Qin; Nigel W Rayner; Matthew Rees; Jeffrey J Roix; Anelli Sandbaek; Beverley Shields; Marketa Sjögren; Valgerdur Steinthorsdottir; Heather M Stringham; Amy J Swift; Gudmar Thorleifsson; Unnur Thorsteinsdottir; Nicholas J Timpson; Tiinamaija Tuomi; Jaakko Tuomilehto; Mark Walker; Richard M Watanabe; Michael N Weedon; Cristen J Willer; Thomas Illig; Kristian Hveem; Frank B Hu; Markku Laakso; Kari Stefansson; Oluf Pedersen; Nicholas J Wareham; Inês Barroso; Andrew T Hattersley; Francis S Collins; Leif Groop; Mark I McCarthy; Michael Boehnke; David Altshuler
Journal: Nat Genet Date: 2008-03-30 Impact factor: 38.330

9. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls.

Authors:
Journal: Nature Date: 2007-06-07 Impact factor: 49.962

10. Variants in MTNR1B influence fasting glucose levels.

Authors: Inga Prokopenko; Claudia Langenberg; Jose C Florez; Richa Saxena; Nicole Soranzo; Gudmar Thorleifsson; Ruth J F Loos; Alisa K Manning; Anne U Jackson; Yurii Aulchenko; Simon C Potter; Michael R Erdos; Serena Sanna; Jouke-Jan Hottenga; Eleanor Wheeler; Marika Kaakinen; Valeriya Lyssenko; Wei-Min Chen; Kourosh Ahmadi; Jacques S Beckmann; Richard N Bergman; Murielle Bochud; Lori L Bonnycastle; Thomas A Buchanan; Antonio Cao; Alessandra Cervino; Lachlan Coin; Francis S Collins; Laura Crisponi; Eco J C de Geus; Abbas Dehghan; Panos Deloukas; Alex S F Doney; Paul Elliott; Nelson Freimer; Vesela Gateva; Christian Herder; Albert Hofman; Thomas E Hughes; Sarah Hunt; Thomas Illig; Michael Inouye; Bo Isomaa; Toby Johnson; Augustine Kong; Maria Krestyaninova; Johanna Kuusisto; Markku Laakso; Noha Lim; Ulf Lindblad; Cecilia M Lindgren; Owen T McCann; Karen L Mohlke; Andrew D Morris; Silvia Naitza; Marco Orrù; Colin N A Palmer; Anneli Pouta; Joshua Randall; Wolfgang Rathmann; Jouko Saramies; Paul Scheet; Laura J Scott; Angelo Scuteri; Stephen Sharp; Eric Sijbrands; Jan H Smit; Kijoung Song; Valgerdur Steinthorsdottir; Heather M Stringham; Tiinamaija Tuomi; Jaakko Tuomilehto; André G Uitterlinden; Benjamin F Voight; Dawn Waterworth; H-Erich Wichmann; Gonneke Willemsen; Jacqueline C M Witteman; Xin Yuan; Jing Hua Zhao; Eleftheria Zeggini; David Schlessinger; Manjinder Sandhu; Dorret I Boomsma; Manuela Uda; Tim D Spector; Brenda Wjh Penninx; David Altshuler; Peter Vollenweider; Marjo Riitta Jarvelin; Edward Lakatta; Gerard Waeber; Caroline S Fox; Leena Peltonen; Leif C Groop; Vincent Mooser; L Adrienne Cupples; Unnur Thorsteinsdottir; Michael Boehnke; Inês Barroso; Cornelia Van Duijn; Josée Dupuis; Richard M Watanabe; Kari Stefansson; Mark I McCarthy; Nicholas J Wareham; James B Meigs; Gonçalo R Abecasis
Journal: Nat Genet Date: 2008-12-07 Impact factor: 38.330
