| Literature DB >> 35810165 |
Kenneth E Westerman1,2,3, Timothy D Majarian4, Franco Giulianini5, Dong-Keun Jang4, Jenkai Miao6, Jose C Florez4,7,8, Han Chen9,10, Daniel I Chasman5,11,12,13, Miriam S Udler4,7,8, Alisa K Manning14,4,7, Joanne B Cole15,16,17.
Abstract
Gene-environment interactions represent the modification of genetic effects by environmental exposures and are critical for understanding disease and informing personalized medicine. These often induce differential phenotypic variance across genotypes; these variance-quantitative trait loci can be prioritized in a two-stage interaction detection strategy to greatly reduce the computational and statistical burden and enable testing of a broader range of exposures. We perform genome-wide variance-quantitative trait locus analysis for 20 serum cardiometabolic biomarkers by multi-ancestry meta-analysis of 350,016 unrelated participants in the UK Biobank, identifying 182 independent locus-biomarker pairs (p < 4.5×10-9). Most are concentrated in a small subset (4%) of loci with genome-wide significant main effects, and 44% replicate (p < 0.05) in the Women's Genome Health Study (N = 23,294). Next, we test each locus-biomarker pair for interaction across 2380 exposures, identifying 847 significant interactions (p < 2.4×10-7), of which 132 are independent (p < 0.05) after accounting for correlation between exposures. Specific examples demonstrate interaction of triglyceride-associated variants with distinct body mass- versus body fat-related exposures as well as genotype-specific associations between alcohol consumption and liver stress at the ADH1B gene. Our catalog of variance-quantitative trait loci and gene-environment interactions is publicly available in an online portal.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35810165 PMCID: PMC9271055 DOI: 10.1038/s41467-022-31625-5
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Analysis workflow.
Unrelated individuals without major disease from any of four ancestry groups in the UK Biobank were included in the analysis. Twenty metabolic biomarkers were pre-processed, including log transformation, adjustment for biological and technical covariates, and outlier removal. vQTL analysis was performed genome-wide, separately in each ancestry group followed by a multi-ancestry meta-analysis. Significant variant-biomarker pairs were taken forward for GEI analysis, along with 2380 exposures pre-processed using the PHESANT tool. Finally, GEI tests were performed for each combination of variant-biomarker-exposure triplet, again using ancestry-stratified analysis followed by meta-analysis.
Fig. 2vQTLs identified across 20 cardiometabolic serum biomarkers.
a −log10PvQTL from Levene’s test is shown for all significant index variants for each biomarker. Labels correspond to the closest gene (shown for variants with PvQTL < 10−20), highlighting some known GWAS loci. P values are truncated at 10−300 for visualization purposes. b Histogram displaying the number of biomarker associations for each vQTL locus. c The number of significant vQTL loci is shown for each biomarker (inset: analogous plot for main effects). Colors denote three categories: vQTL loci not shared with an ME locus (red), vQTL loci shared with an ME locus (purple), and ME loci not shared with a vQTL locus (blue).
Fig. 3Chord diagram displays GEI links between vQTL-biomarker pairs (top of circle) and exposures (bottom of circle).
Lines correspond to interactions that are Bonferroni significant (p < 2.38 × 10−7) for the associated variant, biomarker, and exposure based on two-sided p-values from linear regression. vQTL-biomarker pairs are colored according to biomarker and labeled with the nearest gene. Exposures are colored according to exposure categories.
Fig. 4Exploration of anthropometric interactions influencing triglycerides.
a Heatmap shows interaction z-scores from standard linear regression models between nine genetic variants (x axis) and 33 anthropometric exposures (y axis). Colored panels pass a nominal significance threshold (p < 0.05). Variants are annotated with the closest gene, as well as a second likely causal gene based on manual annotation where appropriate. b Heatmap shows interaction z scores for a single variant (rs139566989 in LIPC), with varying TG lipid subfraction outcomes from nuclear magnetic resonance (x axis) and three representative anthropometric exposures (y axis). c Three stratified plots showing means and 95% confidence intervals for inverse-normal transformed total TG or TG in very large HDL after stratification by rs139566989 and tertiles of the relevant exposure (labeled at the top of each plot; n = 90,644 total participants).
Fig. 5vQTL and GEI relationships for ADH1B, alcohol, and ALT.
a vQTL (red) and ME (blue) significance for rs1229984 is shown for each biomarker, based on Levene’s test and linear regression, respectively. The dashed line represents the study-wide Bonferroni vQTL significance threshold. Biomarkers with neither vQTL nor ME having p < 0.01 are not shown. b EWIS results for rs1229984 impacting ALT are shown, with GEI significance plotted (y axis) for each exposure having p < 0.05 (x axis). c Means and standard errors for ALT (n = 341,815 [CC], 17,539 [CT], 318 [TT] participants) are plotted as a function of genotype at rs1229984 (x axis) and self-reported alcohol intake (colors).