Richard Leslie1, Christopher J O'Donnell1, Andrew D Johnson2. 1. Cardiovascular Epidemiology and Human Genomics Branch, National Heart, Lung and Blood Institute, The Framingham Heart Study, Framingham, MA 01702, University of Massachusetts Medical School, Worcester, MA 01655 and Division of Cardiology, Massachusetts General Hospital, Boston, MA 02114, USACardiovascular Epidemiology and Human Genomics Branch, National Heart, Lung and Blood Institute, The Framingham Heart Study, Framingham, MA 01702, University of Massachusetts Medical School, Worcester, MA 01655 and Division of Cardiology, Massachusetts General Hospital, Boston, MA 02114, USA. 2. Cardiovascular Epidemiology and Human Genomics Branch, National Heart, Lung and Blood Institute, The Framingham Heart Study, Framingham, MA 01702, University of Massachusetts Medical School, Worcester, MA 01655 and Division of Cardiology, Massachusetts General Hospital, Boston, MA 02114, USA.
Abstract
SUMMARY: We created a deeply extracted and annotated database of genome-wide association studies (GWAS) results. GRASP v1.0 contains >6.2 million SNP-phenotype association from among 1390 GWAS studies. We re-annotated GWAS results with 16 annotation sources including some rarely compared to GWAS results (e.g. RNAediting sites, lincRNAs, PTMs). MOTIVATION: To create a high-quality resource to facilitate further use and interpretation of human GWAS results in order to address important scientific questions. RESULTS: GWAS have grown exponentially, with increases in sample sizes and markers tested, and continuing bias toward European ancestry samples. GRASP contains >100 000 phenotypes, roughly: eQTLs (71.5%), metabolite QTLs (21.2%), methylation QTLs (4.4%) and diseases, biomarkers and other traits (2.8%). cis-eQTLs, meQTLs, mQTLs and MHC region SNPs are highly enriched among significant results. After removing these categories, GRASP still contains a greater proportion of studies and results than comparable GWAS catalogs. Cardiovascular disease and related risk factors pre-dominate remaining GWAS results, followed by immunological, neurological and cancer traits. Significant results in GWAS display a highly gene-centric tendency. Sex chromosome X (OR = 0.18[0.16-0.20]) and Y (OR = 0.003[0.001-0.01]) genes are depleted for GWAS results. Gene length is correlated with GWAS results at nominal significance (P ≤ 0.05) levels. We show this gene-length correlation decays at increasingly more stringent P-value thresholds. Potential pleotropic genes and SNPs enriched for multi-phenotype association in GWAS are identified. However, we note possible population stratification at some of these loci. Finally, via re-annotation we identify compelling functional hypotheses at GWAS loci, in some cases unrealized in studies to date. CONCLUSION: Pooling summary-level GWAS results and re-annotating with bioinformatics predictions and molecular features provides a good platform for new insights. AVAILABILITY: The GRASP database is available at http://apps.nhlbi.nih.gov/grasp. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
SUMMARY: We created a deeply extracted and annotated database of genome-wide association studies (GWAS) results. GRASP v1.0 contains >6.2 million SNP-phenotype association from among 1390 GWAS studies. We re-annotated GWAS results with 16 annotation sources including some rarely compared to GWAS results (e.g. RNAediting sites, lincRNAs, PTMs). MOTIVATION: To create a high-quality resource to facilitate further use and interpretation of human GWAS results in order to address important scientific questions. RESULTS: GWAS have grown exponentially, with increases in sample sizes and markers tested, and continuing bias toward European ancestry samples. GRASP contains >100 000 phenotypes, roughly: eQTLs (71.5%), metabolite QTLs (21.2%), methylation QTLs (4.4%) and diseases, biomarkers and other traits (2.8%). cis-eQTLs, meQTLs, mQTLs and MHC region SNPs are highly enriched among significant results. After removing these categories, GRASP still contains a greater proportion of studies and results than comparable GWAS catalogs. Cardiovascular disease and related risk factors pre-dominate remaining GWAS results, followed by immunological, neurological and cancer traits. Significant results in GWAS display a highly gene-centric tendency. Sex chromosome X (OR = 0.18[0.16-0.20]) and Y (OR = 0.003[0.001-0.01]) genes are depleted for GWAS results. Gene length is correlated with GWAS results at nominal significance (P ≤ 0.05) levels. We show this gene-length correlation decays at increasingly more stringent P-value thresholds. Potential pleotropic genes and SNPs enriched for multi-phenotype association in GWAS are identified. However, we note possible population stratification at some of these loci. Finally, via re-annotation we identify compelling functional hypotheses at GWAS loci, in some cases unrealized in studies to date. CONCLUSION: Pooling summary-level GWAS results and re-annotating with bioinformatics predictions and molecular features provides a good platform for new insights. AVAILABILITY: The GRASP database is available at http://apps.nhlbi.nih.gov/grasp. Published by Oxford University Press 2014. This work is written by US Government employees and is in the public domain in the US.
Authors: Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio Journal: Proc Natl Acad Sci U S A Date: 2009-05-27 Impact factor: 11.205
Authors: Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker Journal: Bioinformatics Date: 2008-10-30 Impact factor: 6.937
Authors: L K Davis; N Maltman; M W Mosconi; C Macmillan; L Schmitt; K Moore; S M Francis; S Jacob; J A Sweeney; E H Cook Journal: Am J Med Genet A Date: 2012-06-07 Impact factor: 2.802
Authors: Günter U Höglinger; Nadine M Melhem; Dennis W Dickson; Patrick M A Sleiman; Li-San Wang; Lambertus Klei; Rosa Rademakers; Rohan de Silva; Irene Litvan; David E Riley; John C van Swieten; Peter Heutink; Zbigniew K Wszolek; Ryan J Uitti; Jana Vandrovcova; Howard I Hurtig; Rachel G Gross; Walter Maetzler; Stefano Goldwurm; Eduardo Tolosa; Barbara Borroni; Pau Pastor; Laura B Cantwell; Mi Ryung Han; Allissa Dillman; Marcel P van der Brug; J Raphael Gibbs; Mark R Cookson; Dena G Hernandez; Andrew B Singleton; Matthew J Farrer; Chang-En Yu; Lawrence I Golbe; Tamas Revesz; John Hardy; Andrew J Lees; Bernie Devlin; Hakon Hakonarson; Ulrich Müller; Gerard D Schellenberg Journal: Nat Genet Date: 2011-06-19 Impact factor: 38.330
Authors: Michael R Lang; Larissa B Patterson; Tiffany N Gordon; Stephen L Johnson; David M Parichy Journal: PLoS Genet Date: 2009-11-26 Impact factor: 5.917
Authors: Nathan Pankratz; Jemma B Wilk; Jeanne C Latourelle; Anita L DeStefano; Cheryl Halter; Elizabeth W Pugh; Kimberly F Doheny; James F Gusella; William C Nichols; Tatiana Foroud; Richard H Myers Journal: Hum Genet Date: 2008-11-06 Impact factor: 4.132
Authors: Elisa Majounie; William Cross; Victoria Newsway; Allissa Dillman; Jana Vandrovcova; Christopher M Morris; Michael A Nalls; Luigi Ferrucci; Michael J Owen; Michael C O'Donovan; Mark R Cookson; Andrew B Singleton; Rohan de Silva; Huw R Morris Journal: Neurobiol Aging Date: 2013-02-19 Impact factor: 4.673
Authors: Liang He; Ilya Zhbannikov; Konstantin G Arbeev; Anatoliy I Yashin; Alexander M Kulminski Journal: Genet Epidemiol Date: 2017-06-21 Impact factor: 2.135