Literature DB >> 33734315

LocusZoom.js: Interactive and embeddable visualization of genetic association study results.

Andrew P Boughton1, Ryan P Welch1, Matthew Flickinger1, Peter VandeHaar1, Daniel Taliun1, Gonçalo R Abecasis1,2, Michael Boehnke1.   

Abstract

LocusZoom.js is a JavaScript library for creating interactive web-based visualizations of genetic association study results. It can display one or more traits in the context of relevant biological data (such as gene models and other genomic annotation), and allows interactive refinement of analysis models (by selecting linkage disequilibrium reference panels, identifying sets of likely causal variants, or comparisons to the GWAS catalog). It can be embedded in web pages to enable data sharing and exploration. Views can be customized and extended to display other data types such as phenome-wide association study (PheWAS) results, chromatin co-accessibility, or eQTL measurements. A new web upload service harmonizes datasets, adds annotations, and makes it easy to explore user-provided result sets. Availability LocusZoom.js is open-source software under a permissive MIT license. Code and documentation are available at: https://github.com/statgen/locuszoom/. Installable packages for all versions are also distributed via NPM. Additional features are provided as standalone libraries to promote reuse. Use with your own GWAS results at https://my.locuszoom.org/. Supplementary information Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Year:  2021        PMID: 33734315      PMCID: PMC8479674          DOI: 10.1093/bioinformatics/btab186

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Genetic association studies are a workhorse technique in the study of complex disease genetics, and in recent years the numbers of reported studies and associations have risen dramatically (Buniello et al., 2019). It has long been recognized that interpreting genetic association studies requires significant context, including linkage disequilibrium (LD) patterns, recombination rate, nearby genes, results for related traits, chromatin accessibility, and other information. LocusZoom (Pruim et al., 2010) is now the standard tool for visualizing GWAS results in the context of relevant genetic annotations, with both command line and web-based versions. Here, we describe a new JavaScript implementation of LocusZoom that significantly expands its capabilities, supporting interactive web-based visualizations with options for both public and private datasets.

2 Key features

The basic LocusZoom style of plot has been described elsewhere (Pruim ). Briefly, the main area presents a scatter plot of association P-values versus chromosomal position, frequently using color to summarize LD with the lead genetic variant. Additional tracks show recombination rate, nearby genes, and relevant annotations such as variants for which associations have been reported in the EBI GWAS catalog (Buniello ). The region size is user selectable, typically < 1 Mb (Fig. 1).
Fig. 1.

A LocusZoom.js region plot. Association summary statistics are presented as a scatter plot, with point colors indicating LD relative to the lead (most significant) variant in the region. Tick marks represent variants with significant hits in the GWAS catalog. Individual points can be labeled or annotated interactively via tooltips that expose additional information. Genes indicate introns and exons (horizontal and vertical bars)

A LocusZoom.js region plot. Association summary statistics are presented as a scatter plot, with point colors indicating LD relative to the lead (most significant) variant in the region. Tick marks represent variants with significant hits in the GWAS catalog. Individual points can be labeled or annotated interactively via tooltips that expose additional information. Genes indicate introns and exons (horizontal and vertical bars) LocusZoom.js is an embeddable JavaScript widget for creating such visualizations, with all data rendering handled client side using web-standard technologies such as HTML, CSS and SVG. This allows significant dynamic behavior. For example, the user may hide certain elements (e.g. pseudogenes), recolor the plot based on a different LD reference population, or toggle to visualize a credible set. To facilitate exploration of a genome-wide dataset with many significant loci, the region in view may be changed either by user gestures (drag and zoom) or programmatically. At any time, the current view may be exported as PNG or high-resolution SVG files. Event callbacks allow the plot to trigger and respond to external events, such as updating a data table to reflect the information shown on the plot. Within the plot, all data elements provide tooltips that expose additional information or functionality. LocusZoom.js most commonly fetches data via REST APIs, and we provide an API server (https://portaldev.sph.umich.edu/docs/api/v1/) with access to standard datasets such as 1000 Genomes Project LD, GENCODE genes, HapMap-based recombination rate, and the EBI GWAS Catalog. All annotations are provided for genome builds GRCh37 and GRCh38. A plugin-based architecture makes it straightforward to add data adapters for other formats, including local and remote tabixed files. In practice, this allows plots to be generated from a static file host such as S3, reducing the barrier to entry. Since rendering is performed in the browser, the only dependency is the embedded code; private datasets do not need to be transmitted to a third-party server. When LocusZoom.js is integrated into existing web applications, significant customization is possible due to a track and plugin-based architecture, with a declarative layout system that can be expanded to support additional data types. LocusZoom.js has already been used to visualize PheWAS, eQTL, chromatin co-accessibility and chromatin state annotations; an example gallery demonstrates many visual and interactive features (https://statgen.github.io/locuszoom/). Track types can be layered, stacked, and dynamically added to foster exploration. LocusZoom.js has been incorporated into numerous third-party sites including the Type 2 Diabetes Knowledge Portal (https://t2d.hugeamp.org/) and PheWeb (Gagliano Taliun et al., 2020).

3 Interactive plots of user-provided data

Although originally developed for use in centralized data sharing portals, LocusZoom.js can be used via our web-based ‘LocalZoom’ tool (https://statgen.github.io/localzoom/) to interactively plot user-provided datasets from a local, tabix-indexed (Li, 2011) file without the need to write code. This occurs in the web browser and no data are uploaded to a central server, thus protecting privacy of sensitive results. Upon selecting the file, a modal dialog prompts the user to identify key fields such as chromosome, position, reference and alternate alleles, (-log10) P-value, effect size, standard error, and alternate allele frequency. Built-in heuristics and data validation allow LocalZoom to parse most user-provided files automatically. To aid exploration of results, LocalZoom exposes a number of advanced features (Supplementary Table S1). A search box allows easy access to genetic regions based on chromosome: position, gene name or rsID syntax. A 95% credible set is calculated for the region in view based on a simple Bayesian refinement method (The Wellcome Trust Case Control Consortium ), and the visualization can be updated to reflect this calculation. Additional GWAS datasets can be added to the plot as vertically stacked panels to allow comparisons between studies, and clicking on a variant of interest will display associations from other phenotypes in a PheWAS of the UK BioBank (Zhou et al., 2018). Data may be viewed and exported via an accompanying tabular view that includes dynamically calculated results such as credible set membership.

3.1 Upload service with additional features

Due to the size and complexity of GWAS summary statistics, some additional features require an asynchronous pre-processing step outside the web browser. We have created a new, fully interactive upload service (https://my.locuszoom.org) that allows both public and private sharing, while incorporating all of the advanced display features from LocalZoom (Supplementary Table S1). By allowing users to upload their files in advance, additional processing and annotations are possible, such as Manhattan and QQ plots, nearest gene and rsID labels, and a convenient batch view mode to browse the most significant hits across the entire genome. At present, uploaded files are limited to ∼1 GB in size. To protect sensitive data, a Google account is required to upload a study, but shared datasets can be viewed anonymously. A range of studies are provided for public browsing. In its first year of operation, this service has generated >102 000 plots across >7700 datasets. In the future, we plan to support additional features such as custom user-defined annotations and comparisons to published studies. Click here for additional data file.
  6 in total

1.  Tabix: fast retrieval of sequence features from generic TAB-delimited files.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-01-05       Impact factor: 6.937

2.  Exploring and visualizing large-scale genetic associations by using PheWeb.

Authors:  Sarah A Gagliano Taliun; Peter VandeHaar; Andrew P Boughton; Ryan P Welch; Daniel Taliun; Ellen M Schmidt; Wei Zhou; Jonas B Nielsen; Cristen J Willer; Seunggeun Lee; Lars G Fritsche; Michael Boehnke; Gonçalo R Abecasis
Journal:  Nat Genet       Date:  2020-06       Impact factor: 38.330

3.  LocusZoom: regional visualization of genome-wide association scan results.

Authors:  Randall J Pruim; Ryan P Welch; Serena Sanna; Tanya M Teslovich; Peter S Chines; Terry P Gliedt; Michael Boehnke; Gonçalo R Abecasis; Cristen J Willer
Journal:  Bioinformatics       Date:  2010-07-15       Impact factor: 6.937

4.  The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019.

Authors:  Annalisa Buniello; Jacqueline A L MacArthur; Maria Cerezo; Laura W Harris; James Hayhurst; Cinzia Malangone; Aoife McMahon; Joannella Morales; Edward Mountjoy; Elliot Sollis; Daniel Suveges; Olga Vrousgou; Patricia L Whetzel; Ridwan Amode; Jose A Guillen; Harpreet S Riat; Stephen J Trevanion; Peggy Hall; Heather Junkins; Paul Flicek; Tony Burdett; Lucia A Hindorff; Fiona Cunningham; Helen Parkinson
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

5.  Bayesian refinement of association signals for 14 loci in 3 common diseases.

Authors:  Julian B Maller; Gilean McVean; Jake Byrnes; Damjan Vukcevic; Kimmo Palin; Zhan Su; Joanna M M Howson; Adam Auton; Simon Myers; Andrew Morris; Matti Pirinen; Matthew A Brown; Paul R Burton; Mark J Caulfield; Alastair Compston; Martin Farrall; Alistair S Hall; Andrew T Hattersley; Adrian V S Hill; Christopher G Mathew; Marcus Pembrey; Jack Satsangi; Michael R Stratton; Jane Worthington; Nick Craddock; Matthew Hurles; Willem Ouwehand; Miles Parkes; Nazneen Rahman; Audrey Duncanson; John A Todd; Dominic P Kwiatkowski; Nilesh J Samani; Stephen C L Gough; Mark I McCarthy; Panagiotis Deloukas; Peter Donnelly
Journal:  Nat Genet       Date:  2012-10-28       Impact factor: 38.330

6.  Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies.

Authors:  Wei Zhou; Jonas B Nielsen; Lars G Fritsche; Rounak Dey; Maiken E Gabrielsen; Brooke N Wolford; Jonathon LeFaive; Peter VandeHaar; Sarah A Gagliano; Aliya Gifford; Lisa A Bastarache; Wei-Qi Wei; Joshua C Denny; Maoxuan Lin; Kristian Hveem; Hyun Min Kang; Goncalo R Abecasis; Cristen J Willer; Seunggeun Lee
Journal:  Nat Genet       Date:  2018-08-13       Impact factor: 38.330

  6 in total
  18 in total

1.  FIVEx: an interactive eQTL browser across public datasets.

Authors:  Alan Kwong; Andrew P Boughton; Mukai Wang; Peter VandeHaar; Michael Boehnke; Gonçalo Abecasis; Hyun Min Kang
Journal:  Bioinformatics       Date:  2021-08-30       Impact factor: 6.931

2.  Circulating miRNAs as Potential Biomarkers for Patient Stratification in Bipolar Disorder: A Combined Review and Data Mining Approach.

Authors:  Alexandra R Clausen; Simon Durand; Rasmus L Petersen; Nicklas H Staunstrup; Per Qvist
Journal:  Genes (Basel)       Date:  2022-06-10       Impact factor: 4.141

3.  Immunity gene IFITM3 variant: Relation to cognition and Alzheimer's disease pathology.

Authors:  Jung-Min Pyun; Young Ho Park; Angela Hodges; Jae-Won Jang; Paula J Bice; SangYun Kim; Andrew J Saykin; Kwangsik Nho
Journal:  Alzheimers Dement (Amst)       Date:  2022-06-21

4.  dbBIP: a comprehensive bipolar disorder database for genetic research.

Authors:  Xiaoyan Li; Shunshuai Ma; Wenhui Yan; Yong Wu; Hui Kong; Mingshan Zhang; Xiongjian Luo; Junfeng Xia
Journal:  Database (Oxford)       Date:  2022-07-02       Impact factor: 4.462

Review 5.  The human genetic epidemiology of COVID-19.

Authors:  Mari E K Niemi; Mark J Daly; Andrea Ganna
Journal:  Nat Rev Genet       Date:  2022-05-02       Impact factor: 59.581

6.  GWAS of Hematuria.

Authors:  Sarah A Gagliano Taliun; Patrick Sulem; Gardar Sveinbjornsson; Daniel F Gudbjartsson; Kari Stefansson; Andrew D Paterson; Moumita Barua
Journal:  Clin J Am Soc Nephrol       Date:  2022-04-26       Impact factor: 10.614

7.  Genetic Contributors of Incident Stroke in 10,700 African Americans With Hypertension: A Meta-Analysis From the Genetics of Hypertension Associated Treatments and Reasons for Geographic and Racial Differences in Stroke Studies.

Authors:  Nicole D Armstrong; Vinodh Srinivasasainagendra; Amit Patki; Rikki M Tanner; Bertha A Hidalgo; Hemant K Tiwari; Nita A Limdi; Ethan M Lange; Leslie A Lange; Donna K Arnett; Marguerite R Irvin
Journal:  Front Genet       Date:  2021-12-21       Impact factor: 4.772

8.  A year of COVID-19 GWAS results from the GRASP portal reveals potential genetic risk factors.

Authors:  Florian Thibord; Melissa V Chan; Ming-Huei Chen; Andrew D Johnson
Journal:  HGG Adv       Date:  2022-02-22

9.  PDE4B Proposed as a High Myopia Susceptibility Gene in Chinese Population.

Authors:  Fuxin Zhao; Wei Chen; Hui Zhou; Peter S Reinach; Yuhan Wang; Suh-Hang H Juo; Zhenglin Yang; Anquan Xue; Yi Shi; Chung-Ling Liang; Changqing Zeng; Jia Qu; Xiangtian Zhou
Journal:  Front Genet       Date:  2022-01-12       Impact factor: 4.599

10.  Multi-Omics Characterization of Early- and Adult-Onset Major Depressive Disorder.

Authors:  Caroline W Grant; Erin F Barreto; Rakesh Kumar; Rima Kaddurah-Daouk; Michelle Skime; Taryn Mayes; Thomas Carmody; Joanna Biernacka; Liewei Wang; Richard Weinshilboum; Madhukar H Trivedi; William V Bobo; Paul E Croarkin; Arjun P Athreya
Journal:  J Pers Med       Date:  2022-03-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.