Literature DB >> 25750659

LocusTrack: Integrated visualization of GWAS results and genomic annotation.

Gabriel Cuellar-Partida1, Miguel E Renteria2, Stuart MacGregor3.   

Abstract

BACKGROUND: Genome-wide association studies (GWAS) are an important tool for the mapping of complex traits and diseases. Visual inspection of genomic annotations may be used to generate insights into the biological mechanisms underlying GWAS-identified loci.
RESULTS: We developed LocusTrack, a web-based application that annotates and creates plots of regional GWAS results and incorporates user-specified tracks that display annotations such as linkage disequilibrium (LD), phylogenetic conservation, chromatin state, and other genomic and regulatory elements. Currently, LocusTrack can integrate annotation tracks from the UCSC genome-browser as well as from any tracks provided by the user.
CONCLUSION: LocusTrack is an easy-to-use application and can be accessed at the following URL: http://gump.qimr.edu.au/general/gabrieC/LocusTrack/. Users can upload and manage GWAS results and select from and/or provide annotation tracks using simple and intuitive menus. LocusTrack scripts and associated data can be downloaded from the website and run locally.

Entities:  

Year:  2015        PMID: 25750659      PMCID: PMC4351846          DOI: 10.1186/s13029-015-0032-8

Source DB:  PubMed          Journal:  Source Code Biol Med        ISSN: 1751-0473


Background

Genome-wide association studies (GWAS) have revolutionised the genetic mapping of complex traits and diseases over the last decade [1-3]. However, a considerable amount of the markers identified to date lie within non-coding regions and/or might be only proxy markers to the actual causal variants [2,3]. Tools that aid the visual inspection of these loci may facilitate the identification of functional elements located near GWAS-associated variants. LocusZoom [4] and SNAP-plot [5] have become widely used tools to generate locus-specific graphical displays of association results in the context of linkage disequilibrium (LD) as well as the position relative to nearby genes and local recombination hotspots. However, it is now becoming increasingly important to also visualise GWAS results in the context of functional annotations beyond genes (e.g. chromatin state, transcription factor binding sites, phylogenetic conservation, etc.) [6]. Thus, we have developed LocusTrack, a web-based application that allows the user to generate regional GWAS results plots that incorporate genomic annotations within the same figure. Currently LocusTrack supports both user-provided custom tracks as well as tracks from the UCSC genome-browser.

Implementation

Features and functionality

LocusTrack plots display regional GWAS results in the top panel (Figure 1a). Here, the user can opt between showing P-values on the –log10 scale (i.e. LocusZoom-like fashion) on the left y-axis or displaying LD (r2) (i.e. SNAP-like fashion), which is often useful for investigating a region in the absence of P-values. Recombination rates are represented on the right y-axis. By default, LocusTrack selects the SNP with the strongest association and generates a plot according to a user-defined window-frame size. However, it is also possible for the user to specify any other SNP(s) if desired. The plot also shows the pairwise (LD) pattern of each SNP with the user specified SNP. Users can choose to compute LD (r2) estimates from different 1000 Genomes Project populations available.
Figure 1

Example of a LocusTrack plot. The plot shows regional GWAS results (a) from publicly available GWAS results of the schizophrenia working group along with the genes within the region (b) and seven genomic annotation tracks (c).

Example of a LocusTrack plot. The plot shows regional GWAS results (a) from publicly available GWAS results of the schizophrenia working group along with the genes within the region (b) and seven genomic annotation tracks (c). The second LocusTrack panel displays symbol and location of genes within that region (Figure 1b). Intron and exon positions are displayed in a similar fashion to LocusZoom. Orientation of the transcribed strand is indicated by differential colouring (blue = plus strand; red = minus strand) and arrows. The position of gene symbols is automatically adjusted to minimise the area occupied in the figure and to avoid overlap with one another. LocusZoom provides the option for the data point to reflect different genomic annotations such as synonymous variants, splice variants, transcription factor binding sites, conservation, and whether they are in the GWAS catalogue. LocusTrack can incorporate any type of annotation in the form of genomic tracks in a third panel (Figure 1c). In this way, the user can specify between 1 and 10 different tracks which can be either custom tracks (i.e. the user must upload the data), LD tracks (i.e. a track displaying LD of the SNPs in another population), or publicly-available UCSC tracks. Note that LocusTrack uses the bioconductor package rtracklayer [7] to retrieve and parse UCSC tables. However, some tables come in a non-parseable form, or are truncated by the UCSC browser if they exceed certain limits (usually around 100,000 records), so they cannot be obtained by the program. This is particularly true for wiggle and big-wiggle format files. However, for these cases, the user can download directly the tracks via UCSC Table browser (http://genome.ucsc.edu/cgi-bin/hgTables) and input them as custom tracks. Our application also allows users to zoom in and focus on a smaller region in the bottom panel, drawn from that shown in the first two panels. This provides a closer look to the annotation tracks at the region of interest, without modifying the plots in the upper panels. This region can be defined either based on an LD cut-off or based on a simple zoom in. In addition, to facilitate inspection, LocusTrack can display every assessed SNP in a track-like fashion which uses the same color-coding of SNPs in the first panel. Finally, our application generates an R object with the annotations requested for each specified loci (e.g. genes located in that region, LD, and the information of the tracks selected), facilitating the GWAS annotation.

Results and discussion

Figure 1 shows an example of a LocusTrack plot. We used the genome wide significant associated SNP rs548181 from the publicly available GWAS results of the PGC (Psychiatric Genomics Consortium) schizophrenia working group [8]. The upper panel of the plot shows the extent of regional LD in the CEU population, the SNP p-values and the genes contained within the region. In addition, unlike similar applications such as LocusZoom [4] our software provides users with the option of displaying genomic tracks in the lower panel, which greatly facilitates the visual inspection of genomic context and annotation. In our example, the information in the lower panel corresponds to a region 5X zoom into the upper panel region. Further, the conservation extent within the region, as well as 3 different annotation tracks illustrating regulatory elements (i.e. transcription factor binding site clusters, DNAseI clusters and inferred chromatin state in human embryonic stem cells) are shown. We also include an LD track that shows the LD pattern in a different population (i.e. Han Chinese from Beijing; CHB). Our plot shows that the region around SNP rs548181 contains a probable transcription factor binding site and a DNaseI hot spot. Finally, the computationally inferred chromatin states by the ENCODE/Broad indicates that the SNP is within a putative active promoter region in human embryonic stem cells (hESC). This information may be useful for functional annotation and support hypothesis generation toward the follow up of GWAS hits.

Conclusions

We created a simple and intuitive application that allows the user to easily generate regional plots of GWAS results that incorporate both custom genomic annotations and tracks produced by the ENCODE project [9] and available from the UCSC genome browser [10]. LocusTrack facilitates visual inspection and annotation of genomic elements neighbouring associated loci. Our web application offers an easy way to handle GWAS and annotation files and adds functionality to popular existing tools LocusZoom and SNAP-plot.

Availability

LocusTrack was written in R and runs within an R script wrapper. It implements R core graphics to generate the figures and the Bioconductor package rtracklayer [7] to extract UCSC tracks. Recombination data was downloaded from http://ebi.edu.au/ftp/software/software/ensembl/encode/users/anshul/temp/chromatinVariation/rawdata/phasing/geneticMaps/ and compressed into an R object for quicker access. Pairwise linkage disequilibrium is computed using PLINK (1.9) [11] and the 1000 Genomes Project [12] data (23/11/2010 version). The web application allows for an easy file management. Users can upload GWAS results files with data organised in columns with SNPs, positions and P-values as well as annotation tracks to the web server. The time needed to generate a single plot depends on the number of tracks selected, the size of the region to be displayed, and number of jobs currently running on the server. However, it is generally within a few minutes. Finally, although LocusTrack is mainly intended as a web application, it is possible to run it locally on any Unix machine. The user only requires to have R and PLINK installed and to download the LocusTrack scripts along with the associated data from http://gump.qimr.edu.au/general/gabrieC/LocusTrack/downloads.html. Documentation of all features and the scripts are available on the LocusTrack website.
  12 in total

1.  Genome-wide association studies and beyond.

Authors:  John S Witte
Journal:  Annu Rev Public Health       Date:  2010       Impact factor: 21.981

2.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

Review 3.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges.

Authors:  Mark I McCarthy; Gonçalo R Abecasis; Lon R Cardon; David B Goldstein; Julian Little; John P A Ioannidis; Joel N Hirschhorn
Journal:  Nat Rev Genet       Date:  2008-05       Impact factor: 53.242

4.  Joint analysis of functional genomic data and genome-wide association studies of 18 human traits.

Authors:  Joseph K Pickrell
Journal:  Am J Hum Genet       Date:  2014-04-03       Impact factor: 11.025

5.  rtracklayer: an R package for interfacing with genome browsers.

Authors:  Michael Lawrence; Robert Gentleman; Vincent Carey
Journal:  Bioinformatics       Date:  2009-05-25       Impact factor: 6.937

6.  LocusZoom: regional visualization of genome-wide association scan results.

Authors:  Randall J Pruim; Ryan P Welch; Serena Sanna; Tanya M Teslovich; Peter S Chines; Terry P Gliedt; Michael Boehnke; Gonçalo R Abecasis; Cristen J Willer
Journal:  Bioinformatics       Date:  2010-07-15       Impact factor: 6.937

7.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

8.  ENCODE data in the UCSC Genome Browser: year 5 update.

Authors:  Kate R Rosenbloom; Cricket A Sloan; Venkat S Malladi; Timothy R Dreszer; Katrina Learned; Vanessa M Kirkup; Matthew C Wong; Morgan Maddren; Ruihua Fang; Steven G Heitner; Brian T Lee; Galt P Barber; Rachel A Harte; Mark Diekhans; Jeffrey C Long; Steven P Wilder; Ann S Zweig; Donna Karolchik; Robert M Kuhn; David Haussler; W James Kent
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

9.  Genome-wide association analysis identifies 13 new risk loci for schizophrenia.

Authors:  Stephan Ripke; Colm O'Dushlaine; Kimberly Chambert; Jennifer L Moran; Anna K Kähler; Susanne Akterin; Sarah E Bergen; Ann L Collins; James J Crowley; Menachem Fromer; Yunjung Kim; Sang Hong Lee; Patrik K E Magnusson; Nick Sanchez; Eli A Stahl; Stephanie Williams; Naomi R Wray; Kai Xia; Francesco Bettella; Anders D Borglum; Brendan K Bulik-Sullivan; Paul Cormican; Nick Craddock; Christiaan de Leeuw; Naser Durmishi; Michael Gill; Vera Golimbet; Marian L Hamshere; Peter Holmans; David M Hougaard; Kenneth S Kendler; Kuang Lin; Derek W Morris; Ole Mors; Preben B Mortensen; Benjamin M Neale; Francis A O'Neill; Michael J Owen; Milica Pejovic Milovancevic; Danielle Posthuma; John Powell; Alexander L Richards; Brien P Riley; Douglas Ruderfer; Dan Rujescu; Engilbert Sigurdsson; Teimuraz Silagadze; August B Smit; Hreinn Stefansson; Stacy Steinberg; Jaana Suvisaari; Sarah Tosato; Matthijs Verhage; James T Walters; Douglas F Levinson; Pablo V Gejman; Kenneth S Kendler; Claudine Laurent; Bryan J Mowry; Michael C O'Donovan; Michael J Owen; Ann E Pulver; Brien P Riley; Sibylle G Schwab; Dieter B Wildenauer; Frank Dudbridge; Peter Holmans; Jianxin Shi; Margot Albus; Madeline Alexander; Dominique Campion; David Cohen; Dimitris Dikeos; Jubao Duan; Peter Eichhammer; Stephanie Godard; Mark Hansen; F Bernard Lerer; Kung-Yee Liang; Wolfgang Maier; Jacques Mallet; Deborah A Nertney; Gerald Nestadt; Nadine Norton; Francis A O'Neill; George N Papadimitriou; Robert Ribble; Alan R Sanders; Jeremy M Silverman; Dermot Walsh; Nigel M Williams; Brandon Wormley; Maria J Arranz; Steven Bakker; Stephan Bender; Elvira Bramon; David Collier; Benedicto Crespo-Facorro; Jeremy Hall; Conrad Iyegbe; Assen Jablensky; Rene S Kahn; Luba Kalaydjieva; Stephen Lawrie; Cathryn M Lewis; Kuang Lin; Don H Linszen; Ignacio Mata; Andrew McIntosh; Robin M Murray; Roel A Ophoff; John Powell; Dan Rujescu; Jim Van Os; Muriel Walshe; Matthias Weisbrod; Durk Wiersma; Peter Donnelly; Ines Barroso; Jenefer M Blackwell; Elvira Bramon; Matthew A Brown; Juan P Casas; Aiden P Corvin; Panos Deloukas; Audrey Duncanson; Janusz Jankowski; Hugh S Markus; Christopher G Mathew; Colin N A Palmer; Robert Plomin; Anna Rautanen; Stephen J Sawcer; Richard C Trembath; Ananth C Viswanathan; Nicholas W Wood; Chris C A Spencer; Gavin Band; Céline Bellenguez; Colin Freeman; Garrett Hellenthal; Eleni Giannoulatou; Matti Pirinen; Richard D Pearson; Amy Strange; Zhan Su; Damjan Vukcevic; Peter Donnelly; Cordelia Langford; Sarah E Hunt; Sarah Edkins; Rhian Gwilliam; Hannah Blackburn; Suzannah J Bumpstead; Serge Dronov; Matthew Gillman; Emma Gray; Naomi Hammond; Alagurevathi Jayakumar; Owen T McCann; Jennifer Liddle; Simon C Potter; Radhi Ravindrarajah; Michelle Ricketts; Avazeh Tashakkori-Ghanbaria; Matthew J Waller; Paul Weston; Sara Widaa; Pamela Whittaker; Ines Barroso; Panos Deloukas; Christopher G Mathew; Jenefer M Blackwell; Matthew A Brown; Aiden P Corvin; Mark I McCarthy; Chris C A Spencer; Elvira Bramon; Aiden P Corvin; Michael C O'Donovan; Kari Stefansson; Edward Scolnick; Shaun Purcell; Steven A McCarroll; Pamela Sklar; Christina M Hultman; Patrick F Sullivan
Journal:  Nat Genet       Date:  2013-08-25       Impact factor: 38.330

10.  The UCSC Genome Browser database: 2014 update.

Authors:  Donna Karolchik; Galt P Barber; Jonathan Casper; Hiram Clawson; Melissa S Cline; Mark Diekhans; Timothy R Dreszer; Pauline A Fujita; Luvina Guruvadoo; Maximilian Haeussler; Rachel A Harte; Steve Heitner; Angie S Hinrichs; Katrina Learned; Brian T Lee; Chin H Li; Brian J Raney; Brooke Rhead; Kate R Rosenbloom; Cricket A Sloan; Matthew L Speir; Ann S Zweig; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2013-11-21       Impact factor: 16.971

View more
  14 in total

1.  LDassoc: an online tool for interactively exploring genome-wide association study results and prioritizing variants for functional investigation.

Authors:  Mitchell J Machiela; Stephen J Chanock
Journal:  Bioinformatics       Date:  2018-03-01       Impact factor: 6.937

2.  Genetic Determinants for Leisure-Time Physical Activity.

Authors:  Xiaochen Lin; Katie Kei-Hang Chan; Yen-Tsung Huang; X I Luo; Liming Liang; James Wilson; Adolfo Correa; Daniel Levy; Simin Liu
Journal:  Med Sci Sports Exerc       Date:  2018-08       Impact factor: 5.411

3.  Performing post-genome-wide association study analysis: overview, challenges and recommendations.

Authors:  Yagoub Adam; Chaimae Samtal; Jean-Tristan Brandenburg; Oluwadamilare Falola; Ezekiel Adebiyi
Journal:  F1000Res       Date:  2021-10-04

4.  A Phenomic Scan of the Norfolk Island Genetic Isolate Identifies a Major Pleiotropic Effect Locus Associated with Metabolic and Renal Disorder Markers.

Authors:  Miles C Benton; Rodney A Lea; Donia Macartney-Coxson; Michelle Hanna; David A Eccles; Melanie A Carless; Geoffrey K Chambers; Claire Bellis; Harald H Goring; Joanne E Curran; Jacquie L Harper; Gregory Gibson; John Blangero; Lyn R Griffiths
Journal:  PLoS Genet       Date:  2015-10-16       Impact factor: 5.917

5.  IVAG: An Integrative Visualization Application for Various Types of Genomic Data Based on R-Shiny and the Docker Platform.

Authors:  Tae-Rim Lee; Jin Mo Ahn; Gyuhee Kim; Sangsoo Kim
Journal:  Genomics Inform       Date:  2017-12-29

6.  Functional mapping and annotation of genetic associations with FUMA.

Authors:  Kyoko Watanabe; Erdogan Taskesen; Arjen van Bochoven; Danielle Posthuma
Journal:  Nat Commun       Date:  2017-11-28       Impact factor: 14.919

7.  X Chromosome Contribution to the Genetic Architecture of Primary Biliary Cholangitis.

Authors:  Rosanna Asselta; Elvezia M Paraboschi; Alessio Gerussi; Heather J Cordell; George F Mells; Richard N Sandford; David E Jones; Minoru Nakamura; Kazuko Ueno; Yuki Hitomi; Minae Kawashima; Nao Nishida; Katsushi Tokunaga; Masao Nagasaki; Atsushi Tanaka; Ruqi Tang; Zhiqiang Li; Yongyong Shi; Xiangdong Liu; Ma Xiong; Gideon Hirschfield; Katherine A Siminovitch; Marco Carbone; Giulia Cardamone; Stefano Duga; M Eric Gershwin; Michael F Seldin; Pietro Invernizzi
Journal:  Gastroenterology       Date:  2021-03-04       Impact factor: 33.883

8.  A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism.

Authors:  Anu Loukola; Jadwiga Buchwald; Richa Gupta; Teemu Palviainen; Jenni Hällfors; Emmi Tikkanen; Tellervo Korhonen; Miina Ollikainen; Antti-Pekka Sarin; Samuli Ripatti; Terho Lehtimäki; Olli Raitakari; Veikko Salomaa; Richard J Rose; Rachel F Tyndale; Jaakko Kaprio
Journal:  PLoS Genet       Date:  2015-09-25       Impact factor: 5.917

9.  Genome-wide analyses of aggressiveness in attention-deficit hyperactivity disorder.

Authors:  Erlend J Brevik; Marjolein M J van Donkelaar; Heike Weber; Cristina Sánchez-Mora; Christian Jacob; Olga Rivero; Sarah Kittel-Schneider; Iris Garcia-Martínez; Marcel Aebi; Kimm van Hulzen; Bru Cormand; Josep A Ramos-Quiroga; Klaus-Peter Lesch; Andreas Reif; Marta Ribasés; Barbara Franke; Maj-Britt Posserud; Stefan Johansson; Astri J Lundervold; Jan Haavik; Tetyana Zayats
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2016-03-29       Impact factor: 3.568

10.  Neuregulin signaling pathway in smoking behavior.

Authors:  R Gupta; B Qaiser; L He; T S Hiekkalinna; A B Zheutlin; S Therman; M Ollikainen; S Ripatti; M Perola; V Salomaa; L Milani; T D Cannon; P A F Madden; T Korhonen; J Kaprio; A Loukola
Journal:  Transl Psychiatry       Date:  2017-08-22       Impact factor: 6.222

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.