Literature DB >> 20634204

LocusZoom: regional visualization of genome-wide association scan results.

Randall J Pruim1, Ryan P Welch, Serena Sanna, Tanya M Teslovich, Peter S Chines, Terry P Gliedt, Michael Boehnke, Gonçalo R Abecasis, Cristen J Willer.   

Abstract

UNLABELLED: Genome-wide association studies (GWAS) have revealed hundreds of loci associated with common human genetic diseases and traits. We have developed a web-based plotting tool that provides fast visual display of GWAS results in a publication-ready format. LocusZoom visually displays regional information such as the strength and extent of the association signal relative to genomic position, local linkage disequilibrium (LD) and recombination patterns and the positions of genes in the region. AVAILABILITY: LocusZoom can be accessed from a web interface at http://csg.sph.umich.edu/locuszoom. Users may generate a single plot using a web form, or many plots using batch mode. The software utilizes LD information from HapMap Phase II (CEU, YRI and JPT+CHB) or 1000 Genomes (CEU) and gene information from the UCSC browser, and will accept SNP identifiers in dbSNP or 1000 Genomes format. Single plots are generated in approximately 20 s. Source code and associated databases are available for download and local installation, and full documentation is available online.

Entities:  

Mesh:

Year:  2010        PMID: 20634204      PMCID: PMC2935401          DOI: 10.1093/bioinformatics/btq419

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Genome-wide association studies (GWAS) have identified hundreds of loci associated with complex human diseases and traits (Manolio et al., 2009). GWAS test for association with dichotomous or quantitative traits at millions of SNPs across the genome and can identify variants many hundreds of kilobases away from any known gene. The next challenge in human genetics will be to identify the causal variants and genes responsible for disease association at the many disease-associated loci identified from GWAS. An associated region may contain only a single strongly associated SNP, or more commonly, a set of SNPs with varying degrees of association due to local linkage disequilibrium (LD) patterns. When examining results from a GWAS, it is important to visually inspect regions showing association to determine the extent of the association signal and the position relative to nearby genes. Genes several hundred kb from an associated SNP may be functionally relevant (Loos et al., 2008). We have developed a web-based tool that provides graphical display of locus-specific association results and gives an overview of the extent of LD and the position relative to nearby genes and local recombination hotspots.

2 IMPLEMENTATION

2.1 Features and functionality

The main panel of a LocusZoom plot shows association P-values on the −log10 scale on the vertical axis, and the chromosomal position along the horizontal axis (Fig. 1). The user can specify the region to display in one of three ways: (i) an index SNP and a window size, (ii) the chromosome together with start and stop positions or (iii) gene name and size of flanking region. We allow for the display of a ‘rug’ above the main panel which gives a tick for any SNP in the results file, or for all SNPs from HapMap Phase II. The plots were designed to display ∼1 Mb windows of the genome, although for regions with several association signals or long-range LD patterns, plots extending further can be drawn.
Fig. 1.

An example LocusZoom plot showing the HDL cholesterol-associated region near the MMAB gene (Kathiresan et al., 2009).

An example LocusZoom plot showing the HDL cholesterol-associated region near the MMAB gene (Kathiresan et al., 2009). To identify SNPs that may be potentially causative, LocusZoom plots show not only the magnitude of association for each SNP, but also the pairwise LD pattern with the most strongly associated SNP or another user-specified SNP. Quick inspection can reveal the extent of the associated region and the location and number of SNPs in strong LD with the index SNP. In addition, a locus may show strongly associated variants that are weakly correlated, suggesting the presence of multiple independent association signals. Users may choose to display LD (r2 or D′) estimates from HapMap Phase II (CEU, YRI or JPT + CHB) or from the 1000 Genomes Project. LocusZoom is compatible with 1000 Genomes SNP naming format (chr:position) and will plot association results for novel SNPs identified by sequencing studies. We provide an option for the data point symbol to reflect genomic annotation (nonsense, non-synonymous, coding, UTR, splice variants, transcription factor binding sites and multi-species conservation), which is available for all SNPs in dbSNP or the 1000 Genomes Project (August 2009 release). The size of the data points can optionally reflect the square root of the sample size. The bottom panel of a LocusZoom plot shows the name and location of genes in the UCSC Genome Browser (Kent et al., 2002). Positions of exons are displayed, and the transcribed strand is indicated with an arrow. This allows the visual comparison of association results relative to coding regions. Gene names are automatically spaced relative to one another to avoid overlap. Currently used plotting tools include regional association plotter SNAP (Johnson et al., 2008) and LD-based viewers such as LD-Plus (Bush et al., 2010), CandiSNPer (Schmitt et al., 2010) and VALID (Jorgenson et al., 2009). LocusZoom provides additional features not currently available in any other single tool, such as: (i) the display of 1000 Genomes or novel SNPs from sequencing studies, (ii) functional annotation of SNPs, (iii) exon/intron distinction and automated gene spacing, (iv) ability to plot regions larger than 500 kb, (v) no pre-selection of input files and (vi) web-based batch mode and availability of source code and databases for download and local installation of LocusZoom.

2.2 Usage

LocusZoom was written in R using the grid and lattice graphics packages and runs within a Python wrapper. SQLite tables with relevant data for recombination rate, SNP position, annotation and gene information can be accessed using Python's built-in SQLite tools. A simple plot can be generated from the web form by uploading a file with SNP names and P-values, and specifying the region to be plotted and optional features using drop-down buttons. Typical run time for a single plot returned to the browser window is ∼20 s, not including time required to upload a meta-analysis file, which varies according to the user's internet upload speed and file size. To reduce upload time, users may choose to restrict data files to the region being plotted. To generate a series of locus plots from the web form, users can submit a specification file where custom specifications for each plot can be listed. When a specification file is used to draw many plots, a single pdf containing all generated plots is returned to the user by e-mail. Finally, users can download our scripts, which require R and Python, and associated databases in SQLite format to enable plot generation on their local unix machine. Full documentation of all features is available on the LocusZoom website. The LocusZoom webpage comes pre-loaded with genome-wide association results for HDL cholesterol, LDL cholesterol and triglycerides in ∼20 000 individuals of European ancestry (Kathiresan et al., 2009).

3 CONCLUSIONS

We have created a user-friendly tool to generate regional plots of association results in their genomic context. LocusZoom allows for quick visual inspection of the strength of association evidence, the extent of the association signal and LD, and the position of the associated SNPs relative to genes in the region. LocusZoom plots provide an option to size the data points relative to sample size and can display functional annotation. LocusZoom can be accessed from a simple web-based form with drop-down menus or by uploading a specification file to generate many plots at once. LocusZoom Python application, source code in R, and associated databases are available for download and we provide instruction for users to create custom database tables. It is anticipated that, in the future, additional publicly available result sets will be available for convenient viewing.
  8 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

2.  CandiSNPer: a web tool for the identification of candidate SNPs for causal variants.

Authors:  Armin O Schmitt; Jens Assmus; Ralf H Bortfeldt; Gudrun A Brockmann
Journal:  Bioinformatics       Date:  2010-02-19       Impact factor: 6.937

3.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

4.  Visualizing SNP statistics in the context of linkage disequilibrium using LD-Plus.

Authors:  William S Bush; Scott M Dudek; Marylyn D Ritchie
Journal:  Bioinformatics       Date:  2010-02-03       Impact factor: 6.937

Review 5.  Finding the missing heritability of complex diseases.

Authors:  Teri A Manolio; Francis S Collins; Nancy J Cox; David B Goldstein; Lucia A Hindorff; David J Hunter; Mark I McCarthy; Erin M Ramos; Lon R Cardon; Aravinda Chakravarti; Judy H Cho; Alan E Guttmacher; Augustine Kong; Leonid Kruglyak; Elaine Mardis; Charles N Rotimi; Montgomery Slatkin; David Valle; Alice S Whittemore; Michael Boehnke; Andrew G Clark; Evan E Eichler; Greg Gibson; Jonathan L Haines; Trudy F C Mackay; Steven A McCarroll; Peter M Visscher
Journal:  Nature       Date:  2009-10-08       Impact factor: 49.962

6.  VALID: visualization of association study results and linkage disequilibrium.

Authors:  Eric Jorgenson; Mark Kvale; John S Witte
Journal:  Genet Epidemiol       Date:  2009-11       Impact factor: 2.135

7.  Common variants near MC4R are associated with fat mass, weight and risk of obesity.

Authors:  Ruth J F Loos; Cecilia M Lindgren; Shengxu Li; Eleanor Wheeler; Jing Hua Zhao; Inga Prokopenko; Michael Inouye; Rachel M Freathy; Antony P Attwood; Jacques S Beckmann; Sonja I Berndt; Kevin B Jacobs; Stephen J Chanock; Richard B Hayes; Sven Bergmann; Amanda J Bennett; Sheila A Bingham; Murielle Bochud; Morris Brown; Stéphane Cauchi; John M Connell; Cyrus Cooper; George Davey Smith; Ian Day; Christian Dina; Subhajyoti De; Emmanouil T Dermitzakis; Alex S F Doney; Katherine S Elliott; Paul Elliott; David M Evans; I Sadaf Farooqi; Philippe Froguel; Jilur Ghori; Christopher J Groves; Rhian Gwilliam; David Hadley; Alistair S Hall; Andrew T Hattersley; Johannes Hebebrand; Iris M Heid; Claudia Lamina; Christian Gieger; Thomas Illig; Thomas Meitinger; H-Erich Wichmann; Blanca Herrera; Anke Hinney; Sarah E Hunt; Marjo-Riitta Jarvelin; Toby Johnson; Jennifer D M Jolley; Fredrik Karpe; Andrew Keniry; Kay-Tee Khaw; Robert N Luben; Massimo Mangino; Jonathan Marchini; Wendy L McArdle; Ralph McGinnis; David Meyre; Patricia B Munroe; Andrew D Morris; Andrew R Ness; Matthew J Neville; Alexandra C Nica; Ken K Ong; Stephen O'Rahilly; Katharine R Owen; Colin N A Palmer; Konstantinos Papadakis; Simon Potter; Anneli Pouta; Lu Qi; Joshua C Randall; Nigel W Rayner; Susan M Ring; Manjinder S Sandhu; André Scherag; Matthew A Sims; Kijoung Song; Nicole Soranzo; Elizabeth K Speliotes; Holly E Syddall; Sarah A Teichmann; Nicholas J Timpson; Jonathan H Tobias; Manuela Uda; Carla I Ganz Vogel; Chris Wallace; Dawn M Waterworth; Michael N Weedon; Cristen J Willer; Xin Yuan; Eleftheria Zeggini; Joel N Hirschhorn; David P Strachan; Willem H Ouwehand; Mark J Caulfield; Nilesh J Samani; Timothy M Frayling; Peter Vollenweider; Gerard Waeber; Vincent Mooser; Panos Deloukas; Mark I McCarthy; Nicholas J Wareham; Inês Barroso; Kevin B Jacobs; Stephen J Chanock; Richard B Hayes; Claudia Lamina; Christian Gieger; Thomas Illig; Thomas Meitinger; H-Erich Wichmann; Peter Kraft; Susan E Hankinson; David J Hunter; Frank B Hu; Helen N Lyon; Benjamin F Voight; Martin Ridderstrale; Leif Groop; Paul Scheet; Serena Sanna; Goncalo R Abecasis; Giuseppe Albai; Ramaiah Nagaraja; David Schlessinger; Anne U Jackson; Jaakko Tuomilehto; Francis S Collins; Michael Boehnke; Karen L Mohlke
Journal:  Nat Genet       Date:  2008-05-04       Impact factor: 38.330

8.  Common variants at 30 loci contribute to polygenic dyslipidemia.

Authors:  Sekar Kathiresan; Cristen J Willer; Gina M Peloso; Serkalem Demissie; Kiran Musunuru; Eric E Schadt; Lee Kaplan; Derrick Bennett; Yun Li; Toshiko Tanaka; Benjamin F Voight; Lori L Bonnycastle; Anne U Jackson; Gabriel Crawford; Aarti Surti; Candace Guiducci; Noel P Burtt; Sarah Parish; Robert Clarke; Diana Zelenika; Kari A Kubalanza; Mario A Morken; Laura J Scott; Heather M Stringham; Pilar Galan; Amy J Swift; Johanna Kuusisto; Richard N Bergman; Jouko Sundvall; Markku Laakso; Luigi Ferrucci; Paul Scheet; Serena Sanna; Manuela Uda; Qiong Yang; Kathryn L Lunetta; Josée Dupuis; Paul I W de Bakker; Christopher J O'Donnell; John C Chambers; Jaspal S Kooner; Serge Hercberg; Pierre Meneton; Edward G Lakatta; Angelo Scuteri; David Schlessinger; Jaakko Tuomilehto; Francis S Collins; Leif Groop; David Altshuler; Rory Collins; G Mark Lathrop; Olle Melander; Veikko Salomaa; Leena Peltonen; Marju Orho-Melander; Jose M Ordovas; Michael Boehnke; Gonçalo R Abecasis; Karen L Mohlke; L Adrienne Cupples
Journal:  Nat Genet       Date:  2008-12-07       Impact factor: 38.330

  8 in total
  1382 in total

1.  Genetic variation of the transthyretin gene in wild-type transthyretin amyloidosis (ATTRwt).

Authors:  Jacquelyn L Sikora; Mark W Logue; Gloria G Chan; Brian H Spencer; Tatiana B Prokaeva; Clinton T Baldwin; David C Seldin; Lawreen H Connors
Journal:  Hum Genet       Date:  2014-11-04       Impact factor: 4.132

2.  Genome-wide association study identifies a susceptibility locus for schizophrenia in Han Chinese at 11p11.2.

Authors:  Wei-Hua Yue; Hai-Feng Wang; Liang-Dan Sun; Fu-Lei Tang; Zhong-Hua Liu; Hong-Xing Zhang; Wen-Qiang Li; Yan-Ling Zhang; Yang Zhang; Cui-Cui Ma; Bo Du; Li-Fang Wang; Yun-Qing Ren; Yong-Feng Yang; Xiao-Feng Hu; Yi Wang; Wei Deng; Li-Wen Tan; Yun-Long Tan; Qi Chen; Guang-Ming Xu; Gui-Gang Yang; Xian-bo Zuo; Hao Yan; Yan-Yan Ruan; Tian-Lan Lu; Xue Han; Xiao-Hong Ma; Yan Wang; Li-Wei Cai; Chao Jin; Hong-Yan Zhang; Jun Yan; Wei-Feng Mi; Xian-Yong Yin; Wen-Bin Ma; Qi Liu; Lan Kang; Wei Sun; Cheng-Ying Pan; Mei Shuang; Fu-De Yang; Chuan-Yue Wang; Jian-Li Yang; Ke-Qing Li; Xin Ma; Ling-Jiang Li; Xin Yu; Qi-Zhai Li; Xun Huang; Lu-Xian Lv; Tao Li; Guo-Ping Zhao; Wei Huang; Xue-Jun Zhang; Dai Zhang
Journal:  Nat Genet       Date:  2011-10-30       Impact factor: 38.330

3.  A GWAS follow-up study reveals the association of the IL12RB2 gene with systemic sclerosis in Caucasian populations.

Authors:  Lara Bossini-Castillo; Jose-Ezequiel Martin; Jasper Broen; Olga Gorlova; Carmen P Simeón; Lorenzo Beretta; Madelon C Vonk; Jose Luis Callejas; Ivan Castellví; Patricia Carreira; Francisco José García-Hernández; Mónica Fernández Castro; Marieke J H Coenen; Gabriela Riemekasten; Torsten Witte; Nicolas Hunzelmann; Alexander Kreuter; Jörg H W Distler; Bobby P Koeleman; Alexandre E Voskuyl; Annemie J Schuerwegh; Øyvind Palm; Roger Hesselstrand; Annika Nordin; Paolo Airó; Claudio Lunardi; Raffaella Scorza; Paul Shiels; Jacob M van Laar; Ariane Herrick; Jane Worthington; Christopher Denton; Filemon K Tan; Frank C Arnett; Sandeep K Agarwal; Shervin Assassi; Carmen Fonseca; Maureen D Mayes; Timothy R D J Radstake; Javier Martin
Journal:  Hum Mol Genet       Date:  2011-11-10       Impact factor: 6.150

4.  Hypoxia-Sensitive COMMD1 Integrates Signaling and Cellular Metabolism in Human Macrophages and Suppresses Osteoclastogenesis.

Authors:  Koichi Murata; Celestia Fang; Chikashi Terao; Eugenia G Giannopoulou; Ye Ji Lee; Min Joon Lee; Se-Hwan Mun; Seyeon Bae; Yu Qiao; Ruoxi Yuan; Moritoshi Furu; Hiromu Ito; Koichiro Ohmura; Shuichi Matsuda; Tsuneyo Mimori; Fumihiko Matsuda; Kyung-Hyun Park-Min; Lionel B Ivashkiv
Journal:  Immunity       Date:  2017-07-18       Impact factor: 31.745

5.  The Psoriasis Risk Allele HLA-C*06:02 Shows Evidence of Association with Chronic or Recurrent Streptococcal Tonsillitis.

Authors:  Karita Haapasalo; Lotta L E Koskinen; Jari Suvilehto; Pekka Jousilahti; Annika Wolin; Sari Suomela; Richard Trembath; Jonathan Barker; Jaana Vuopio; Juha Kere; T Sakari Jokiranta; Päivi Saavalainen
Journal:  Infect Immun       Date:  2018-09-21       Impact factor: 3.441

6.  Genome-wide association study identifies the GLDC/IL33 locus associated with survival of osteosarcoma patients.

Authors:  Roelof Koster; Orestis A Panagiotou; William A Wheeler; Eric Karlins; Julie M Gastier-Foster; Silvia Regina Caminada de Toledo; Antonio S Petrilli; Adrienne M Flanagan; Roberto Tirabosco; Irene L Andrulis; Jay S Wunder; Nalan Gokgoz; Ana Patiño-Garcia; Fernando Lecanda; Massimo Serra; Claudia Hattinger; Piero Picci; Katia Scotlandi; David M Thomas; Mandy L Ballinger; Richard Gorlick; Donald A Barkauskas; Logan G Spector; Margaret Tucker; D Hicks Belynda; Meredith Yeager; Robert N Hoover; Sholom Wacholder; Stephen J Chanock; Sharon A Savage; Lisa Mirabello
Journal:  Int J Cancer       Date:  2017-12-23       Impact factor: 7.396

7.  Inhaled corticosteroid treatment modulates ZNF432 gene variant's effect on bronchodilator response in asthmatics.

Authors:  Ann Chen Wu; Blanca E Himes; Jessica Lasky-Su; Augusto Litonjua; Stephen P Peters; John Lima; Michiaki Kubo; Mayumi Tamari; Yusuke Nakamura; Weiliang Qiu; Scott T Weiss; Kelan Tantisira
Journal:  J Allergy Clin Immunol       Date:  2013-11-23       Impact factor: 10.793

8.  Identification of Genetic Loci Shared Between Attention-Deficit/Hyperactivity Disorder, Intelligence, and Educational Attainment.

Authors:  Kevin S O'Connell; Alexey Shadrin; Olav B Smeland; Shahram Bahrami; Oleksandr Frei; Francesco Bettella; Florian Krull; Chun C Fan; Ragna B Askeland; Gun Peggy S Knudsen; Anne Halmøy; Nils Eiel Steen; Torill Ueland; G Bragi Walters; Katrín Davíðsdóttir; Gyða S Haraldsdóttir; Ólafur Ó Guðmundsson; Hreinn Stefánsson; Ted Reichborn-Kjennerud; Jan Haavik; Anders M Dale; Kári Stefánsson; Srdjan Djurovic; Ole A Andreassen
Journal:  Biol Psychiatry       Date:  2019-11-29       Impact factor: 13.382

9.  Citalopram and escitalopram plasma drug and metabolite concentrations: genome-wide associations.

Authors:  Yuan Ji; Daniel J Schaid; Zeruesenay Desta; Michiaki Kubo; Anthony J Batzler; Karen Snyder; Taisei Mushiroda; Naoyuki Kamatani; Evan Ogburn; Daniel Hall-Flavin; David Flockhart; Yusuke Nakamura; David A Mrazek; Richard M Weinshilboum
Journal:  Br J Clin Pharmacol       Date:  2014-08       Impact factor: 4.335

10.  A common functional regulatory variant at a type 2 diabetes locus upregulates ARAP1 expression in the pancreatic beta cell.

Authors:  Jennifer R Kulzer; Michael L Stitzel; Mario A Morken; Jeroen R Huyghe; Christian Fuchsberger; Johanna Kuusisto; Markku Laakso; Michael Boehnke; Francis S Collins; Karen L Mohlke
Journal:  Am J Hum Genet       Date:  2014-01-16       Impact factor: 11.025

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.