| Literature DB >> 34273980 |
Theodore G Drivas1,2, Anastasia Lucas3, Marylyn D Ritchie3,4.
Abstract
BACKGROUND: Genomic studies increasingly integrate expression quantitative trait loci (eQTL) information into their analysis pipelines, but few tools exist for the visualization of colocalization between eQTL and GWAS results. Those tools that do exist are limited in their analysis options, and do not integrate eQTL and GWAS information into a single figure panel, making the visualization of colocalization difficult.Entities:
Keywords: Colocalization; GWAS; Visualization; eQTL
Year: 2021 PMID: 34273980 PMCID: PMC8285863 DOI: 10.1186/s13040-021-00267-6
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Description of required and optional input data frames for eQTpLot
| Required Input Data Frames | ||
| Integer | Chromosome for SNP (sex chromosomes coded numerically) | |
| Integer | Chromosomal position for each SNP, in base pairs | |
| Character | Variant ID (such as dbSNP ID “rs...”. ( | |
| Numeric | P-value for the SNP from GWAS analysis | |
| Numeric | Beta for SNP from GWAS analysis | |
| Character | Name of the phenotype for which the GWAS data refers. This column is optional and is useful if your | |
| Character | Variant ID (such as dbSNP ID “rs...”. ( | |
| Character | Gene symbol to which the eQTL expression data refers ( | |
| Numeric | P-value for the SNP from eQTL analysis | |
| Numeric | Normalized effect size for the SNP from eQTL analysis (Per GTEx, defined as the slope of the linear regression, and is computed as the effect of the alternative allele relative to the reference allele in the human genome reference. | |
| Character | Tissue type to which the eQTL pvalue/NES refer ( | |
| Numeric | The number of samples used to calculate the p-value and NES for the eQTL data. This value is used if performing a MultiTissue or PanTissue analysis with the option CollapseMethod set to “meta” for a simple sample size weighted meta-analysis. | |
| Character | Gene symbol/name ( | |
| Integer | Chromosome the gene is on ( | |
| Integer | Base pair coordinate of the beginning of the gene ( | |
| Integer | Base pair coordinate of the end of the gene ( | |
| The genome build (either hg19 or hg38) for the location data | ||
| Integer | Base pair position of the first variant in the LD pair | |
| Character | Variant ID of the first variant in the LD pair ( | |
| Integer | Base pair position of the second variant in the LD pair | |
| Character | Variant ID of the second variant in the LD pair ( | |
| Numeric | Squared correlation measure of linkage between the two variants | |
Description of required and optional arguments for eQTpLot
| A data frame of eQTL summary statistic data, as defined in Table | |
| A data frame of GWAS summary statistic data, as defined in Table | |
| The name/symbol of the gene to analyze, in quotes ( | |
| The name of the GWAS phenotype to analyze, in quotes. If all the data in | |
| A data frame of gene coordinates, as defined in Table | |
| A data frame of pairwise linkage data, as defined in Table | |
| the maximum and minimum limits in the format c (min,max), to display for the NES value in | |
| used to manually adjust the x axis maximum for the P-P plot, if needed | |
| used to manually adjust the y axis maximum in plot A, if needed | |
| used to manually adjust the y axis maximum for the P-P plot, if needed | |
Fig. 1Example eQTpLot for LDL cholesterol and the gene BBS1. eQTpLot was used to generate a series of plots illustrating the colocalization between eQTLs for the gene BBS1 and a GWAS signal for the LDL cholesterol trait on chromosome 11 using a PanTissue approach as described in example 1. Panel A shows the locus of interest, containing the BBS1 gene, with chromosomal space indicated along the horizontal axis. The position of each point on the vertical axis corresponding to the p-value of association for that variant with the LDL trait, while the color scale for each point corresponds to the magnitude of that variant’s p-value for association with BBS1 expression. The directionality of each triangle corresponds to the GWAS direction of effect, while the size of each triangle corresponds to the NES for the eQTL data. The default genome-wide p-value significance threshold for the GWAS analysis, 5e-8, is depicted with a horizontal red line. Panel B displays the genomic positions of all genes within the LOI. Panel C depicts the enrichment of BBS1 eQTLs among GWAS-significant variants, while panel D depicts the correlation between PGWAS and PeQTL for BBS1 and the LDL trait, with the computed Pearson correlation coefficient (r) and p-value (p) displayed on the plot
Fig. 2Example eQTpLot for LDL cholesterol and the gene ACTN3. eQTpLot was used to generate a series of plots illustrating the colocalization between eQTLs for the gene ACTN3 and a GWAS signal for the LDL cholesterol trait on chromosome 11 using a PanTissue approach as described in example 1. Panel A shows the locus of interest, containing the ACTN3 gene, with chromosomal space indicated along the horizontal axis. The position of each point on the vertical axis corresponding to the p-value of association for that variant with the LDL trait, while the color scale for each point corresponds to the magnitude of that variant’s p-value for association with ACTN3 expression. The directionality of each triangle corresponds to the GWAS direction of effect, while the size of each triangle corresponds to the NES for the eQTL data. The default genome-wide p-value significance threshold for the GWAS analysis, 5e-8, is depicted with a horizontal red line. Panel B displays the genomic positions of all genes within the LOI. Panel C depicts the enrichment of ACTN3 eQTLs among GWAS-significant variants, while panel D depicts the correlation between PGWAS and PeQTL for ACTN3 and the LDL trait, with the computed Pearson correlation coefficient (r) and p-value (p) displayed on the plot
Fig. 3Example eQTpLot for LDL cholesterol and the gene BBS1, incorporating LD data. eQTpLot was used to generate a series of plots illustrating the colocalization between eQTLs for the gene BBS1 and a GWAS signal for the LDL cholesterol trait on chromosome 11 as described in example 2, specifically within the tissue “Whole_Blood” and with the inclusion of LD data. Panels A, B, and D are generated identically to Figure panels 1A, 1B, and 1C respectively. Panel C depicts a heatmap of LD information of all BBS1 eQTL variants, displayed in the same chromosomal space as panels A and B for ease of reference. Panel E depicts the correlation between PGWAS and PeQTL for BBS1 and the LDL trait, similar to panel 1D, only here a lead variant, rs3741360, is identified (by default the upper-right-most variant on the P-P plot), with all other variants plotted using a color scale corresponding to their squared coefficient of linkage correlation with this lead variant. For reference, the same lead variant is also labelled in panel A
Fig. 4Example eQTpLot for LDL cholesterol and the gene BBS1, discriminating between congruous and incongruous variants. eQTpLot was used to generate a series of plots illustrating the colocalization between eQTLs for the gene BBS1 and a GWAS signal for the LDL cholesterol trait on chromosome 11 as described in example 3, with an analysis identical to that described for Fig. 3, but with the additional discrimination between variants with congruous and incongruous directions of effect. Panel A is generated identically to panel 1A and 3A, only instead of using a single color scale, variants with congruous effects are plotted using a blue color scale, while variants with incongruous effects are plotted using a red color scale. Panels B-D are identical to panels 3B-D. Panel E and F both represent P-P plots, generated similarly to the P-P plot in panel 3E. For panel E, however, the analysis is confined only to variants with congruous directions of effect, while for panel F the analysis includes only variants with incongruous directions of effect. A lead variant is indicated in both panels E anf F, and both are also labeled in panel A