| Literature DB >> 33235230 |
Matthias Munz1,2,3,4,5, Inken Wohlers6, Eric Simon7, Tobias Reinberger8,9,10, Hauke Busch6, Arne S Schaefer11, Jeanette Erdmann12,13,14.
Abstract
Exploration of genetic variant-to-gene relationships by quantitative trait loci such as expression QTLs is a frequently used tool in genome-wide association studies. However, the wide range of public QTL databases and the lack of batch annotation features complicate a comprehensive annotation of GWAS results. In this work, we introduce the tool "Qtlizer" for annotating lists of variants in human with associated changes in gene expression and protein abundance using an integrated database of published QTLs. Features include incorporation of variants in linkage disequilibrium and reverse search by gene names. Analyzing the database for base pair distances between best significant eQTLs and their affected genes suggests that the commonly used cis-distance limit of 1,000,000 base pairs might be too restrictive, implicating a substantial amount of wrongly and yet undetected eQTLs. We also ranked genes with respect to the maximum number of tissue-specific eQTL studies in which a most significant eQTL signal was consistent. For the top 100 genes we observed the strongest enrichment with housekeeping genes (P = 2 × 10-6) and with the 10% highest expressed genes (P = 0.005) after grouping eQTLs by r2 > 0.95, underlining the relevance of LD information in eQTL analyses. Qtlizer can be accessed via https://genehopper.de/qtlizer or by using the respective Bioconductor R-package ( https://doi.org/10.18129/B9.bioc.Qtlizer ).Entities:
Mesh:
Year: 2020 PMID: 33235230 PMCID: PMC7687904 DOI: 10.1038/s41598-020-75770-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(A) Data from various publicly available resources was integrated (B) applying an Extraction, Transformation and Loading (ETL) process. (C) Qtlizer consists of a relational database and a web application (D) Qtlizer can be queried for QTL data via the web-based user interface or programmatically by inputting lists of genetic variants and genes using a REST API. (E) Annotation results are displayed in a table view.
Figure 2(A) Per chromosome distribution of the TAD sizes in base pairs (B) Distribution of the base pair distances between best eQTLs and affected genes in the GTEx dataset. (C) Distribution of the maximum number of tissue-specific eQTL studies per eQTL LD block for each gene.