SNiPA: an interactive, genetic variant-centered annotation browser.

Matthias Arnold, Johannes Raffler, Arne Pfeufer, Karsten Suhre, Gabi Kastenmüller.

Abstract

MOTIVATION: Linking genes and functional information to genetic variants identified by association studies remains difficult. Resources containing extensive genomic annotations are available but often not fully utilized due to heterogeneous data formats. To enhance their accessibility, we integrated many annotation datasets into a user-friendly webserver.
AVAILABILITY AND IMPLEMENTATION: http://www.snipa.org/
CONTACT: g.kastenmueller@helmholtz-muenchen.de
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press.

Year:  2014        PMID: 25431330      PMCID: PMC4393511          DOI: 10.1093/bioinformatics/btu779

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

Genome-wide association studies (GWAS) and next-generation sequencing (NGS) are performed routinely to identify genetic variants and novel genes implicated in both common and rare human diseases. A key step in translating results from such studies into a better understanding of molecular disease mechanisms and, ultimately, into clinical applications is the prioritization of potentially functional variants that may be active in vivo. To this end, comprehensive collection and evaluation of existing functional annotation from genetic, informatics and experimental resources is essential (MacArthur ). This comprises the integration of data and knowledge across multiple levels, including the variant, the gene and the chromatin level. Several large resources (Ensembl, UCSC, NCBI, etc.) aim to provide genome-level annotation tracks drawn from an extensive set of sources. However, retrieving statistical and functional annotation relevant at the single-nucleotide level remains difficult. For instance, common genome browsers often display single nucleotide variants (SNVs) as thin bars that get lost among the wealth of other annotation tracks, and they are even less suited to displaying statistics such as linkage disequilibrium (LD) relationships between variants. This limits the visual distinction of relevant variants from those without relevant annotations and leaves the complex task of aggregating position-based data to the researcher. Variant-centered resources, on the other hand, typically concentrate on specific types of data such as amino acid changes (Adzhubei ; Kumar ), expression quantitative trait loci (eQTLs) (GTEx Consortium, 2013; Xia ), trait associations (Beck ; Hindorff ) or regulatory effect predictions (Boyle ). Moreover, these annotations are often presented in resource-specific data structures. For individual inspection of single variants, both resource types are extremely valuable.
However, for simultaneous processing of larger variant sets, collecting and examining annotations from different data sources quickly becomes cumbersome. This presents a major bottleneck in genome-wide scans of genetic influences on human traits, since collecting such evidence is key to understanding the effects of phenotype-linked genetic variants. Here we propose SNiPA, a web service offering variant-centered genome browsing and interactive visualization tools tailored for easy inspection of many variants in their locus context (Fig. 1).
Fig. 1. The SNiPA Variant Browser shows variants (top), genes (center) and regulatory regions (bottom). Top-level information is available in mouse-over tooltips for all plot elements, as shown here for the query SNP rs174583. The example highlights the value of variant-centered accumulation of annotations: rs174583 is associated with the concentration of a lipid metabolite as well as with the expression levels of two genes encoding enzymes involved in lipid metabolism (FADS1/2) and the gene coding for LDL receptor, a major regulator of cholesterol homeostasis. Furthermore, the variant was linked to the response to lipid lowering drugs (statins), which target HMG-CoA reductase regulated by the LDL receptor.

2 Data and features

SNiPA includes a wide range of genome-level datasets contained in the Ensembl database (Flicek ) as an established backbone of annotations for the human genome. We combine this backbone with numerous variant-specific annotations taken from published datasets. Thus, SNiPA covers information ranging from regulatory elements through gene annotations to variant annotations and associations (Table 1; Supplementary Text S1). SNiPA contains annotations for all bi-allelic variants in phase 3 version 5 of the 1000 Genomes Project (1000 Genomes Project Consortium ) and provides pre-calculated LD data (r² ≥ 0.1) for all super populations (African, American, East Asian, European and South Asian). We use the Ensembl VEP tool (McLaren ) for primary effect prediction of SNVs. Additional position-based data are included in the VEP prediction as custom annotation files. For other annotations, we wrote a Perl module to extend the output provided by VEP (Table 1; Supplementary Text S1).
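To make the r² ≥ 0.1 cutoff concrete, the following minimal sketch computes the standard LD measure r² = D² / (p_A(1 − p_A) p_B(1 − p_B)), with D = p_AB − p_A p_B, from phased haplotypes of two bi-allelic variants. The text does not describe how SNiPA computes its LD values, so the function name `ld_r2`, the constant `LD_THRESHOLD` and the toy haplotypes are illustrative assumptions, not the authors' implementation.

```python
import math
from typing import Sequence

# Cutoff quoted in the text for pre-calculated LD data.
LD_THRESHOLD = 0.1


def ld_r2(hap_a: Sequence[int], hap_b: Sequence[int]) -> float:
    """Return r^2 between two bi-allelic variants.

    Each sequence holds one allele per phased haplotype, coded 0 or 1
    (hypothetical input format chosen for this sketch).
    """
    if len(hap_a) != len(hap_b) or len(hap_a) == 0:
        raise ValueError("haplotype vectors must be non-empty and of equal length")
    n = len(hap_a)
    p_a = sum(hap_a) / n                                   # frequency of allele 1 at variant A
    p_b = sum(hap_b) / n                                   # frequency of allele 1 at variant B
    p_ab = sum(a & b for a, b in zip(hap_a, hap_b)) / n    # frequency of the 1-1 haplotype
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return math.nan                                    # r^2 is undefined for a monomorphic variant
    d = p_ab - p_a * p_b                                   # LD coefficient D
    return d * d / denom


# Toy data: eight made-up haplotypes for two variants.
hap_x = [1, 1, 0, 0, 1, 0, 1, 0]
hap_y = [1, 1, 0, 0, 1, 0, 0, 0]
r2 = ld_r2(hap_x, hap_y)
print(f"r2 = {r2:.2f}, kept = {r2 >= LD_THRESHOLD}")
```

In this toy case the pair would pass the 0.1 cutoff and so would be among the stored pairs; pairs below the cutoff would simply be absent from a pre-calculated table.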
Table 1. Annotation data compiled in SNiPA

Entity type          Data type                                 N entries (a)   N sources (b)
Variant              cis-eQTL associations                     919 860         8
                     trans-eQTL associations                   17 891          6
                     Trait associations                        245 333         9
                     Conservation and deleteriousness scores   genome-wide     4
Gene                 Trait annotations                         3 752           3
Regulatory elements  microRNA target sites                     606 408         5
                     Promoters                                 106 169         2
                     Enhancers                                 455 800         2
                     ENCODE feature clusters                   406 632         1

(a) Entries are unified with respect to the entities given in the first column, i.e. numbers listed are counts of annotated entities (e.g. variants).

(b) Details and references for all included datasets are described in Supplementary Text S1.

SNiPA provides user-friendly starting points for annotating individual SNVs as well as sets of SNVs, LD blocks or genetic regions of interest. We have implemented several entry points to access the data: (i) a variant-centered implementation of a genome browser (‘Variant Browser’); (ii) ‘Association Maps’ for browsing through GWAS results; (iii) an interface for batch retrieval of variant annotations via ID-list, gene ID or genomic coordinates (‘Variant Annotation’); (iv) a combined listing of annotations across a set of variants within LD blocks or chromosomal regions (‘Block Annotation’); (v) ‘Regional Association Plot’ and ‘Linkage Disequilibrium Plot’ (Diabetes Genetics Initiative of Broad Institute of Harvard ) that combine publication-ready plotting of association results and LD values, respectively, with the interactive interface of the ‘Variant Browser’; (vi) ‘Proxy Search’ and ‘Pairwise LD’ that allow querying precalculated LD values augmented with variant annotations. SNiPA enables the user to download condensed annotation data in tabular format for further off-line processing. Detailed descriptions of SNiPA modules are available in the online documentation and Supplementary Text S1.

The complex information contained in SNiPA is organized in a clear, comprehensive and informative structure extending effect categories contained in the Sequence Ontology (Eilbeck ) (Supplementary Text S1). For instance, variant annotations are presented as ‘SNiPAcards’ grouping information into semantic sections. All annotations are linked to their primary sources and to the Ensembl genome browser.
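The idea behind a proxy search over precalculated LD values augmented with annotations can be sketched as a lookup in a symmetric pair index followed by a join against an annotation table. This is only an illustration under stated assumptions: the names `build_ld_index` and `proxy_search`, the in-memory data layout, the default threshold of 0.8 and the variant IDs are all hypothetical and do not reflect SNiPA's actual interface or storage.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# One precalculated LD record: (variant_a, variant_b, r2). Hypothetical layout.
LdPair = Tuple[str, str, float]


def build_ld_index(pairs: Iterable[LdPair]) -> Dict[str, Dict[str, float]]:
    """Index LD pairs symmetrically so either variant can serve as the query."""
    index: Dict[str, Dict[str, float]] = defaultdict(dict)
    for var_a, var_b, r2 in pairs:
        index[var_a][var_b] = r2
        index[var_b][var_a] = r2
    return index


def proxy_search(
    query: str,
    index: Dict[str, Dict[str, float]],
    annotations: Dict[str, List[str]],
    min_r2: float = 0.8,  # assumed default; any threshold can be passed
) -> List[dict]:
    """Return variants in LD with `query` at r2 >= min_r2, with their annotations."""
    hits = [
        {"variant": variant, "r2": r2, "annotations": annotations.get(variant, [])}
        for variant, r2 in index.get(query, {}).items()
        if r2 >= min_r2
    ]
    # Strongest proxies first; ties broken by variant ID for stable output.
    return sorted(hits, key=lambda hit: (-hit["r2"], hit["variant"]))


# Toy data with made-up variant IDs and annotation labels.
ld_pairs = [("varA", "varB", 0.95), ("varA", "varC", 0.40), ("varB", "varD", 0.85)]
toy_annotations = {"varB": ["cis-eQTL"], "varD": ["trait association"]}

ld_index = build_ld_index(ld_pairs)
for hit in proxy_search("varA", ld_index, toy_annotations):
    print(hit["variant"], hit["r2"], hit["annotations"])
```

Because only pairs above a fixed cutoff are stored, a query reduces to a dictionary lookup rather than a fresh LD computation, which is what makes annotating many proxies at once cheap in a design like this.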

3 Conclusion

Mechanistic characterization of variants identified by genetic studies is the key to understanding molecular disease mechanisms. SNiPA combines a comprehensive set of genomic annotations with a genetic variant-based genome browser to simplify the task of variant annotation. SNiPA as well as all underlying data is freely available to the scientific community (commercial use may be limited by third-party constraints) and will be automatically updated following the Ensembl releases.

Funding

This work was supported by the Helmholtz Portfolio theme ‘Metabolic Dysfunction and common disease’ and by the research project Greifswald Approach to Individualized Medicine (GANI_MED) (BMBF: 03IS2061A). K.S. is supported by Biomedical Research Program funds at Weill Cornell Medical College in Qatar, a program funded by the Qatar Foundation.

Conflict of Interest: none declared.

References (13 in total)

1.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

2.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm.

Authors:  Prateek Kumar; Steven Henikoff; Pauline C Ng
Journal:  Nat Protoc       Date:  2009-06-25       Impact factor: 13.491

3.  A method and server for predicting damaging missense mutations.

Authors:  Ivan A Adzhubei; Steffen Schmidt; Leonid Peshkin; Vasily E Ramensky; Anna Gerasimova; Peer Bork; Alexey S Kondrashov; Shamil R Sunyaev
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

4.  The Genotype-Tissue Expression (GTEx) project.

Authors: 
Journal:  Nat Genet       Date:  2013-06       Impact factor: 38.330

5.  Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels.

Authors:  Richa Saxena; Benjamin F Voight; Valeriya Lyssenko; Noël P Burtt; Paul I W de Bakker; Hong Chen; Jeffrey J Roix; Sekar Kathiresan; Joel N Hirschhorn; Mark J Daly; Thomas E Hughes; Leif Groop; David Altshuler; Peter Almgren; Jose C Florez; Joanne Meyer; Kristin Ardlie; Kristina Bengtsson Boström; Bo Isomaa; Guillaume Lettre; Ulf Lindblad; Helen N Lyon; Olle Melander; Christopher Newton-Cheh; Peter Nilsson; Marju Orho-Melander; Lennart Råstam; Elizabeth K Speliotes; Marja-Riitta Taskinen; Tiinamaija Tuomi; Candace Guiducci; Anna Berglund; Joyce Carlson; Lauren Gianniny; Rachel Hackett; Liselotte Hall; Johan Holmkvist; Esa Laurila; Marketa Sjögren; Maria Sterner; Aarti Surti; Margareta Svensson; Malin Svensson; Ryan Tewhey; Brendan Blumenstiel; Melissa Parkin; Matthew Defelice; Rachel Barry; Wendy Brodeur; Jody Camarata; Nancy Chia; Mary Fava; John Gibbons; Bob Handsaker; Claire Healy; Kieu Nguyen; Casey Gates; Carrie Sougnez; Diane Gage; Marcia Nizzari; Stacey B Gabriel; Gung-Wei Chirn; Qicheng Ma; Hemang Parikh; Delwood Richardson; Darrell Ricke; Shaun Purcell
Journal:  Science       Date:  2007-04-26       Impact factor: 47.728

6.  Annotation of functional variation in personal genomes using RegulomeDB.

Authors:  Alan P Boyle; Eurie L Hong; Manoj Hariharan; Yong Cheng; Marc A Schaub; Maya Kasowski; Konrad J Karczewski; Julie Park; Benjamin C Hitz; Shuai Weng; J Michael Cherry; Michael Snyder
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

7.  Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor.

Authors:  William McLaren; Bethan Pritchard; Daniel Rios; Yuan Chen; Paul Flicek; Fiona Cunningham
Journal:  Bioinformatics       Date:  2010-06-18       Impact factor: 6.937

8.  The Sequence Ontology: a tool for the unification of genome annotations.

Authors:  Karen Eilbeck; Suzanna E Lewis; Christopher J Mungall; Mark Yandell; Lincoln Stein; Richard Durbin; Michael Ashburner
Journal:  Genome Biol       Date:  2005-04-29       Impact factor: 13.583

9.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

10.  GWAS Central: a comprehensive resource for the comparison and interrogation of genome-wide association studies.

Authors:  Tim Beck; Robert K Hastings; Sirisha Gollapudi; Robert C Free; Anthony J Brookes
Journal:  Eur J Hum Genet       Date:  2013-12-04       Impact factor: 4.246
