Literature DB >> 21849791

Including additional controls from public databases improves the power of a genome-wide association study.

Semanti Mukherjee1, Jennifer Simon, Sharon Bayuga, Emmy Ludwig, Sarah Yoo, Irene Orlow, Agnes Viale, Kenneth Offit, Robert C Kurtz, Sara H Olson, Robert J Klein.   

Abstract

Though genome-wide association studies (GWAS) have identified numerous susceptibility loci for common diseases, their use is limited due to the expense of genotyping large cohorts of individuals. One potential solution is to use 'additional controls', or genotype data from control individuals deposited in public repositories. While this approach has been used by several groups, the genetically heterogeneous nature of the population of the United States makes this approach potentially problematic. We empirically investigated the utility of this approach in a US-based GWAS. In a small GWAS of pancreatic cancer in New York, we observed clear population structure differences relative to controls from the database of Genotypes and Phenotypes (dbGaP). When we conduct the GWAS using these additional controls, we find large inflation of the test statistic that is properly corrected by using eigenvectors from principal components analysis as covariates. To deal with errors introduced due to different sources, we propose simultaneously genotyping a small number of controls along with cases and then comparing this group to the additional controls. We show that removing SNPs that show differences between these control groups reduces false-positive findings. Thus, through an empirical approach, this report provides practical guidance for using additional controls from publicly available datasets.
Copyright © 2011 S. Karger AG, Basel.

Entities:  

Mesh:

Year:  2011        PMID: 21849791      PMCID: PMC3171281          DOI: 10.1159/000330149

Source DB:  PubMed          Journal:  Hum Hered        ISSN: 0001-5652            Impact factor:   0.444


  30 in total

1.  Comment on a simple and improved correction for population stratification.

Authors:  Seunggeun Lee; Patrick F Sullivan; Fei Zou; Fred A Wright
Journal:  Am J Hum Genet       Date:  2008-02       Impact factor: 11.025

2.  A genome-wide association study identifies six susceptibility loci for chronic lymphocytic leukemia.

Authors:  Maria Chiara Di Bernardo; Dalemari Crowther-Swanepoel; Peter Broderick; Emily Webb; Gabrielle Sellick; Ruth Wild; Kate Sullivan; Jayaram Vijayakrishnan; Yufei Wang; Alan M Pittman; Nicola J Sunter; Andrew G Hall; Martin J S Dyer; Estella Matutes; Claire Dearden; Tryfonia Mainou-Fowler; Graham H Jackson; Geoffrey Summerfield; Robert J Harris; Andrew R Pettitt; Peter Hillmen; David J Allsup; James R Bailey; Guy Pratt; Chris Pepper; Chris Fegan; James M Allan; Daniel Catovsky; Richard S Houlston
Journal:  Nat Genet       Date:  2008-08-31       Impact factor: 38.330

3.  European population genetic substructure: further definition of ancestry informative markers for distinguishing among diverse European ethnic groups.

Authors:  Chao Tian; Roman Kosoy; Rami Nassir; Annette Lee; Pablo Villoslada; Lars Klareskog; Lennart Hammarström; Henri-Jean Garchon; Ann E Pulver; Michael Ransom; Peter K Gregersen; Michael F Seldin
Journal:  Mol Med       Date:  2009-08-27       Impact factor: 6.354

4.  Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer's disease.

Authors:  Denise Harold; Richard Abraham; Paul Hollingworth; Rebecca Sims; Amy Gerrish; Marian L Hamshere; Jaspreet Singh Pahwa; Valentina Moskvina; Kimberley Dowzell; Amy Williams; Nicola Jones; Charlene Thomas; Alexandra Stretton; Angharad R Morgan; Simon Lovestone; John Powell; Petroula Proitsi; Michelle K Lupton; Carol Brayne; David C Rubinsztein; Michael Gill; Brian Lawlor; Aoibhinn Lynch; Kevin Morgan; Kristelle S Brown; Peter A Passmore; David Craig; Bernadette McGuinness; Stephen Todd; Clive Holmes; David Mann; A David Smith; Seth Love; Patrick G Kehoe; John Hardy; Simon Mead; Nick Fox; Martin Rossor; John Collinge; Wolfgang Maier; Frank Jessen; Britta Schürmann; Reinhard Heun; Hendrik van den Bussche; Isabella Heuser; Johannes Kornhuber; Jens Wiltfang; Martin Dichgans; Lutz Frölich; Harald Hampel; Michael Hüll; Dan Rujescu; Alison M Goate; John S K Kauwe; Carlos Cruchaga; Petra Nowotny; John C Morris; Kevin Mayo; Kristel Sleegers; Karolien Bettens; Sebastiaan Engelborghs; Peter P De Deyn; Christine Van Broeckhoven; Gill Livingston; Nicholas J Bass; Hugh Gurling; Andrew McQuillin; Rhian Gwilliam; Panagiotis Deloukas; Ammar Al-Chalabi; Christopher E Shaw; Magda Tsolaki; Andrew B Singleton; Rita Guerreiro; Thomas W Mühleisen; Markus M Nöthen; Susanne Moebus; Karl-Heinz Jöckel; Norman Klopp; H-Erich Wichmann; Minerva M Carrasquillo; V Shane Pankratz; Steven G Younkin; Peter A Holmans; Michael O'Donovan; Michael J Owen; Julie Williams
Journal:  Nat Genet       Date:  2009-09-06       Impact factor: 38.330

5.  A germline JAK2 SNP is associated with predisposition to the development of JAK2(V617F)-positive myeloproliferative neoplasms.

Authors:  Outi Kilpivaara; Semanti Mukherjee; Alison M Schram; Martha Wadleigh; Ann Mullally; Benjamin L Ebert; Adam Bass; Sachie Marubayashi; Adriana Heguy; Guillermo Garcia-Manero; Hagop Kantarjian; Kenneth Offit; Richard M Stone; D Gary Gilliland; Robert J Klein; Ross L Levine
Journal:  Nat Genet       Date:  2009-03-15       Impact factor: 38.330

6.  Genome-wide association study identifies five susceptibility loci for glioma.

Authors:  Sanjay Shete; Fay J Hosking; Lindsay B Robertson; Sara E Dobbins; Marc Sanson; Beatrice Malmer; Matthias Simon; Yannick Marie; Blandine Boisselier; Jean-Yves Delattre; Khe Hoang-Xuan; Soufiane El Hallani; Ahmed Idbaih; Diana Zelenika; Ulrika Andersson; Roger Henriksson; A Tommy Bergenheim; Maria Feychting; Stefan Lönn; Anders Ahlbom; Johannes Schramm; Michael Linnebank; Kari Hemminki; Rajiv Kumar; Sarah J Hepworth; Amy Price; Georgina Armstrong; Yanhong Liu; Xiangjun Gu; Robert Yu; Ching Lau; Minouk Schoemaker; Kenneth Muir; Anthony Swerdlow; Mark Lathrop; Melissa Bondy; Richard S Houlston
Journal:  Nat Genet       Date:  2009-07-05       Impact factor: 38.330

7.  Genome-wide association study identifies variants in the ABO locus associated with susceptibility to pancreatic cancer.

Authors:  Laufey Amundadottir; Peter Kraft; Rachael Z Stolzenberg-Solomon; Charles S Fuchs; Gloria M Petersen; Alan A Arslan; H Bas Bueno-de-Mesquita; Myron Gross; Kathy Helzlsouer; Eric J Jacobs; Andrea LaCroix; Wei Zheng; Demetrius Albanes; William Bamlet; Christine D Berg; Franco Berrino; Sheila Bingham; Julie E Buring; Paige M Bracci; Federico Canzian; Françoise Clavel-Chapelon; Sandra Clipp; Michelle Cotterchio; Mariza de Andrade; Eric J Duell; John W Fox; Steven Gallinger; J Michael Gaziano; Edward L Giovannucci; Michael Goggins; Carlos A González; Göran Hallmans; Susan E Hankinson; Manal Hassan; Elizabeth A Holly; David J Hunter; Amy Hutchinson; Rebecca Jackson; Kevin B Jacobs; Mazda Jenab; Rudolf Kaaks; Alison P Klein; Charles Kooperberg; Robert C Kurtz; Donghui Li; Shannon M Lynch; Margaret Mandelson; Robert R McWilliams; Julie B Mendelsohn; Dominique S Michaud; Sara H Olson; Kim Overvad; Alpa V Patel; Petra H M Peeters; Aleksandar Rajkovic; Elio Riboli; Harvey A Risch; Xiao-Ou Shu; Gilles Thomas; Geoffrey S Tobias; Dimitrios Trichopoulos; Stephen K Van Den Eeden; Jarmo Virtamo; Jean Wactawski-Wende; Brian M Wolpin; Herbert Yu; Kai Yu; Anne Zeleniuch-Jacquotte; Stephen J Chanock; Patricia Hartge; Robert N Hoover
Journal:  Nat Genet       Date:  2009-08-02       Impact factor: 38.330

8.  Genes mirror geography within Europe.

Authors:  John Novembre; Toby Johnson; Katarzyna Bryc; Zoltán Kutalik; Adam R Boyko; Adam Auton; Amit Indap; Karen S King; Sven Bergmann; Matthew R Nelson; Matthew Stephens; Carlos D Bustamante
Journal:  Nature       Date:  2008-08-31       Impact factor: 49.962

9.  Tracing sub-structure in the European American population with PCA-informative markers.

Authors:  Peristera Paschou; Petros Drineas; Jamey Lewis; Caroline M Nievergelt; Deborah A Nickerson; Joshua D Smith; Paul M Ridker; Daniel I Chasman; Ronald M Krauss; Elad Ziv
Journal:  PLoS Genet       Date:  2008-07-04       Impact factor: 5.917

10.  Analysis and application of European genetic substructure using 300 K SNP information.

Authors:  Chao Tian; Robert M Plenge; Michael Ransom; Annette Lee; Pablo Villoslada; Carlo Selmi; Lars Klareskog; Ann E Pulver; Lihong Qi; Peter K Gregersen; Michael F Seldin
Journal:  PLoS Genet       Date:  2008-01       Impact factor: 5.917

View more
  9 in total

1.  KAT2B polymorphism identified for drug abuse in African Americans with regulatory links to drug abuse pathways in human prefrontal cortex.

Authors:  Eric O Johnson; Dana B Hancock; Joshua L Levy; Nathan C Gaddis; Grier P Page; Cristie Glasheen; Nancy L Saccone; Laura J Bierut; Alex H Kral
Journal:  Addict Biol       Date:  2015-07-23       Impact factor: 4.280

2.  Hypothesis testing at the extremes: fast and robust association for high-throughput data.

Authors:  Yi-Hui Zhou; Fred A Wright
Journal:  Biostatistics       Date:  2015-03-18       Impact factor: 5.899

3.  Imputation across genotyping arrays for genome-wide association studies: assessment of bias and a correction strategy.

Authors:  Eric O Johnson; Dana B Hancock; Joshua L Levy; Nathan C Gaddis; Nancy L Saccone; Laura J Bierut; Grier P Page
Journal:  Hum Genet       Date:  2013-01-22       Impact factor: 4.132

4.  FOLD: a method to optimize power in meta-analysis of genetic association studies with overlapping subjects.

Authors:  Emma E Kim; Seunghoon Lee; Cue Hyunkyu Lee; Hyunjung Oh; Kyuyoung Song; Buhm Han
Journal:  Bioinformatics       Date:  2017-12-15       Impact factor: 6.937

5.  GAWMerge expands GWAS sample size and diversity by combining array-based genotyping and whole-genome sequencing.

Authors:  Ravi Mathur; Fang Fang; Nathan Gaddis; Dana B Hancock; Michael H Cho; John E Hokanson; Laura J Bierut; Sharon M Lutz; Kendra Young; Albert V Smith; Edwin K Silverman; Grier P Page; Eric O Johnson
Journal:  Commun Biol       Date:  2022-08-11

6.  Syringohydromyelia in Dogs: The Genomic Component Underlying a Complex Neurological Disease.

Authors:  Sandra Andrino; Valentina Lorenzo; Susana Dunner; Elisabeth Contreras; Javier Cañón; Natalia Sevane
Journal:  Animals (Basel)       Date:  2022-09-29       Impact factor: 3.231

7.  Susceptibility loci associated with specific and shared subtypes of lymphoid malignancies.

Authors:  Joseph Vijai; Tomas Kirchhoff; Kasmintan A Schrader; Jennifer Brown; Ana Virginia Dutra-Clarke; Christopher Manschreck; Nichole Hansen; Rohini Rau-Murthy; Kara Sarrel; Jennifer Przybylo; Sohela Shah; Srujana Cheguri; Zsofia Stadler; Liying Zhang; Ora Paltiel; Dina Ben-Yehuda; Agnes Viale; Carol Portlock; David Straus; Steven M Lipkin; Mortimer Lacher; Mark Robson; Robert J Klein; Andrew Zelenetz; Kenneth Offit
Journal:  PLoS Genet       Date:  2013-01-17       Impact factor: 5.917

8.  Genome-wide analysis of the role of copy-number variation in pancreatic cancer risk.

Authors:  Jason A Willis; Semanti Mukherjee; Irene Orlow; Agnes Viale; Kenneth Offit; Robert C Kurtz; Sara H Olson; Robert J Klein
Journal:  Front Genet       Date:  2014-02-13       Impact factor: 4.599

9.  Genome-wide association analysis identifies a meningioma risk locus at 11p15.5.

Authors:  Elizabeth B Claus; Alex J Cornish; Peter Broderick; Joellen M Schildkraut; Sara E Dobbins; Amy Holroyd; Lisa Calvocoressi; Lingeng Lu; Helen M Hansen; Ivan Smirnov; Kyle M Walsh; Johannes Schramm; Per Hoffmann; Markus M Nöthen; Karl-Heinz Jöckel; Anthony Swerdlow; Signe Benzon Larsen; Christoffer Johansen; Matthias Simon; Melissa Bondy; Margaret Wrensch; Richard S Houlston; Joseph L Wiemels
Journal:  Neuro Oncol       Date:  2018-10-09       Impact factor: 12.300

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.