| Literature DB >> 34699229 |
Reyhan Sönmez Flitman1,2, Bita Khalili1,2, Zoltan Kutalik1,3,2, Rico Rueedi1,2, Anneke Brümmer1,2, Sven Bergmann1,2,4.
Abstract
Gene products can affect the concentrations of small molecules (aka "metabolites"), and conversely, some metabolites can modulate the concentrations of gene transcripts. While many specific instances of this interplay have been revealed, a global approach to systematically uncover human gene-metabolite interactions is still lacking. We performed a metabolome- and transcriptome-wide association study to identify genes influencing the human metabolome using untargeted metabolome features, extracted from 1H nuclear magnetic resonance spectroscopy (NMR) of urine samples, and gene expression levels, quantified from RNA-Seq of lymphoblastoid cell lines (LCL) from 555 healthy individuals. We identified 20 study-wide significant associations corresponding to 15 genes, of which 5 associations (with 2 genes) were confirmed with follow-up NMR data. Using metabomatching, we identified the metabolites corresponding to metabolome features associated with the genes, namely, N-acetylated compounds with ALMS1 and trimethylamine (TMA) with HPS1. Finally, Mendelian randomization analysis supported a potential causal link between the expression of genes in both the ALMS1- and HPS1-loci and their associated metabolite concentrations. In the case of HPS1, we additionally observed that TMA concentration likely exhibits a reverse causal effect on HPS1 expression levels, indicating a negative feedback loop. Our study highlights how the integration of metabolomics, gene expression, and genetic data can pinpoint causal genes modulating metabolite concentrations.Entities:
Keywords: ALMS1; HPS1; N-acetylated compounds; NAT8; PYROXD2; genome-wide association study; metabolomics; transcriptomics; trimethylamine
Mesh:
Year: 2021 PMID: 34699229 PMCID: PMC9286311 DOI: 10.1021/acs.jproteome.1c00585
Source DB: PubMed Journal: J Proteome Res ISSN: 1535-3893 Impact factor: 5.370
Figure 1QQ-plot of −log10 (p values) of metabolome- and transcriptome-wide association analysis. The highly significant associations (FDR < 0.05) with ALMS1 expression are ranked 1st and 2nd and with HPS1 expression 3rd and 4th.
20 Study-Wide Significant Associations from Metabolome- and Transcriptome-Wide Association Analysisa
| genes | metabolite | association | published as mGWAS | |||
|---|---|---|---|---|---|---|
| ensembl gene ID | Chr | gene symbol | feature(s) | effect size | body fluid | |
| ENSG00000116127 | 2 | ALMS1 | 2.0375, 2.0325, 2.0275, 2.0425 | 0.72, 0.69, 0.45, 0.41 | 1.1 × 10–20, 2.0 × 10–19, 7.8 × 10–09, 1.2 × 10–07 | serum,[ |
| ENSG00000107521 | 10 | HPS1 | 2.8575, 2.8725 | –0.38, −0.37 | 1.1 × 10–10, 5.6 × 10–10 | serum,[ |
| ENSG00000149089 | 11 | APIP | 2.7925 | –0.33 | 4.7 × 10–09 | serum,[ |
| ENSG00000256029 | 1 | RP11-190A12.7 | 3.0925 | 0.26 | 9.7 × 10–09 | serum[ |
| ENSG00000100603 | 14 | SNW1 | 8.1275 | –0.38 | 1.5 × 10–08 | serum[ |
| ENSG00000163016 | 2 | ALMS1P | 2.0325, 2.0375 | 0.27, 0.27 | 2.5 × 10–08, 2.7 × 10–08 | serum,[ |
| ENSG00000219257 | 6 | RP11-14I4.2 | 2.3275 | –0.25 | 5.7 × 10–08 | |
| ENSG00000259357 | 1 | RP11-316M1.12 | 7.7875 | 0.33 | 6.1 × 10–08 | |
| ENSG00000163520 | 3 | FBLN2 | 5.4375 | 0.29 | 9.5 × 10–08 | serum[ |
| ENSG00000226430 | 8 | USP17L7 | 2.7075 | –0.24 | 1.8 × 10–07 | |
| ENSG00000219355 | 12 | RPL31P52 | 2.8675 | –0.23 | 2.1 × 10–07 | |
| ENSG00000266795 | 17 | RP11-744K17.9 | 7.2725 | 0.26 | 2.2 × 10–07 | serum[ |
| ENSG00000150593 | 10 | PDCD4 | 5.4075 | 0.42 | 2.2 × 10–07 | serum,[ |
| ENSG00000266805 | 18 | RP11-61L19.1 | 5.3525 | 0.24 | 2.7 × 10–07 | |
| ENSG00000254396 | 9 | RP11-56F10.3 | 3.0925 | 0.23 | 3.6 × 10–07 | |
20 study-wide significant associations involving 15 unique genes and 17 unique features. Associations are grouped by genes and sorted by the lowest association p value for each gene.
Figure 2Metabomatching[22] results for pseudospectra derived from gene expression - metabolome feature associations for ALMS1 (A) and HPS1 (B). Upper panels show the features in each pseudospectrum, color-coded according to the direction of the effect (positive in blue and negative in orange). Lower panels show the highest ranking candidate metabolites with their reference NMR spectra (color coded to indicate their relative peak intensities). Leading features allowing metabolite identification are in (A) at 2.04 ppm, which matches well with the highest intensity peak of the NAA spectrum and in (B) at 2.87 ppm, which matches well with the TMA singlet.
Figure 3SNP - metabolome feature and SNP - gene expression associations in ALMS1/NAT8 locus. (A) LocusZoom plot for ALMS1/NAT8 locus, where the SNPs are associated with metabolome feature at 2.0375 ppm, LD colored with respect to lead mQTL. (B) Bar plot shows −log10 transformed p values from associating expression values of nine genes in the locus with the five NAA features.
Figure 4SNP - metabolome feature and SNP - gene expression associations in HPS1/PYROXD2 locus. (A) LocusZoom plot for HPS1/PYROXD2 locus, showing the association significance of SNP with the metabolome feature at 2.8725 ppm. Colors indicate the correlation (LD) to the lead QTL. (B) Bar plot shows −log10 transformed p values from associating expression values of seven genes in the locus with the same feature.
Previously Reported mGWAS Results for the ALMS1/NAT8 and HPS1/PYROXD2 Locia
| reference | platform | biofluid | locus | metabolite |
|---|---|---|---|---|
| Nicholson et al. 2011[ | MS + NMR | urine + plasma | ALMS1, NAT8 | N-acetylated compounds |
| Montoliu et al.
2013[ | NMR | urine | ALMS1 | N-acetylated compounds |
| Rueedi et al. 2014[ | NMR | urine | ALMS1 | 2.0375 (suggested as N-acetylated compounds) |
| Raffler et al.
2015[ | NMR | urine | NAT8 | 2.031 (suggested as |
| Suhre et al. 2011[ | MS | serum | NAT8 | |
| Yu et al. 2014[ | MS | serum | NAT8 | |
| Shin et al. 2014[ | MS | serum | NAT8 | |
| Nicholson et al. 2011[ | MS + NMR | urine + plasma | HPS1, PYROXD2 | trimethylamine (urine), dimethylamine (plasma), |
| Rueedi et al. 2014[ | NMR | urine | PYROXD2 | trimethylamine, unknown compound, 2.8575, 1.8025 |
| Raffler et al. 2015[ | NMR | urine | PYROXD2 | 2.854 (suggested as trimethylamine) |
| Raffler et al. 2013[ | NMR | plasma | PYROXD2 | 2.757 |
| Rhee et al. 2013[ | MS | plasma | HPS1 | asymmetric dimethylarginine |
| Krumsiek et al. 2012[ | MS | serum | HPS1, PYROXD2 | multiple compounds, unknown compounds |
| Hong et al. 2013[ | MS | serum | HPS1 | caprolactam |
| Shin et al. 2014[ | MS | serum | PYROXD2 | unknown compounds |
MS: mass spectrometry; numbers in metabolite section refer to NMR spectral shift positions in ppm. Reported genes are mostly based on proximity to the mQTL or based on gene function.
Figure 5Scatter plot of the mQTL effect of SNP (rs7566315) on NAC and its eQTL effect on ALMS1 gene expression. Each point represents a study sample. NAC concentration is approximated by the feature at 2.0375 ppm that is log10 transformed after feature- and sample-wise z-scoring (y axis). ALMS1 expression is z-scored after log2 transforming RPKM+1 values (x axis). Color code represents the genotype of rs7566315 (legend) that is an eQTL of ALMS1 and mQTL of NAA.
MR Results for Testing a Causal Link between ALMS1 Expression and Concentration of N-Acetylated Compoundsa
| method | causal effect size estimate | std. error | 95% CI | Cochran’s | |||
|---|---|---|---|---|---|---|---|
| A | ALMS1 → NAC | inverse variance weighted | 0.967 | 0.061 | 0.847–1.087 | <2 × 10–16 | 0.2323 |
| weighted median | 1.111 | 0.075 | 0.965–1.257 | <2 × 10–16 | NA | ||
| MR - Egger | 0.994 | 0.092 | 0.812–1.175 | <2 × 10–16 | 0.1776 | ||
| maximum-likelihood | 0.999 | 0.065 | 0.872–1.126 | <2 × 10–16 | 0.249 | ||
| B | NAC → ALMS1 | inverse variance weighted | –0.015 | 0.264 | –0.532–0.502 | 0.955 | 0.7443 |
| weighted median | 0.122 | 0.321 | –0.507–0.751 | 0.704 | NA | ||
| MR - Egger | 1.495 | 1.976 | –2.377–5.368 | 0.449 | 0.7256 | ||
| maximum-likelihood | –0.015 | 0.266 | –0.535–0.505 | 0.955 | 0.7443 | ||
| C | ALMS1 → NAC | inverse variance weighted | 0.796 | 0.183 | 0.437–1.155 | <2 × 10–16 | 0.1902 |
| (NAT8 related SNPs removed) | weighted median | 0.668 | 0.242 | 0.193–1.142 | 0.006 | NA | |
| MR - Egger | 1.912 | 0.704 | 0.532–3.291 | 0.007 | 0.4144 | ||
| maximum-likelihood | 0.805 | 0.185 | 0.444–1.167 | < 2 × 10–16 | 0.2199 |
MR results for testing (A) causal effect of ALMS1 gene expression levels on N-acetylated compounds (ALMS1 → NAC), (B) causal effect of N-acetylated compounds on ALMS1 gene expression levels (NAC → ALMS1), (C) causal effect of ALMS1 gene expression levels on N-acetylated compounds (ALMS1 → NAC) when NAT8-related SNPs were removed from the instrument set.
MR Results for Testing a Causal Link between HPS1 Expression and TMA Concentrationa
| method | causal effect size estimate | std. error | 95% CI | Cochran’s | |||
|---|---|---|---|---|---|---|---|
| A | HPS1 → TMA | inverse variance weighted | 0.266 | 0.094 | 0.082–0.450 | 0.005 | 0.0803 |
| weighted median | 0.311 | 0.072 | 0.170–0.453 | <2 × 10–16 | NA | ||
| MR - Egger | 0.37 | 0.126 | 0.123–0.617 | 0.003 | 0.0852 | ||
| maximum-likelihood | 0.267 | 0.094 | 0.083–0.452 | 0.004 | 0.0829 | ||
| B | TMA → HPS1 | inverse variance weighted | –0.089 | 0.012 | –0.113 to –0.065 | <2 × 10–16 | 0.0958 |
| weighted median | –0.09 | 0.011 | –0.111 to –0.068 | <2 × 10–16 | NA | ||
| MR - Egger | –0.086 | 0.013 | –0.111 to –0.061 | <2 × 10–16 | 0.0758 | ||
| maximum-likelihood | –0.09 | 0.012 | –0.114 to –0.066 | <2 × 10–16 | 0.1258 | ||
| C | HPS1 → TMA | inverse variance weighted | |||||
| (PYROXD2 related SNPs removed) | weighted median | 1.079 | 0.121 | 0.842–1.315 | <2 × 10–16 | NA | |
| MR - Egger | 1.705 | 0.305 | 1.107–2.303 | <2 × 10–16 | 0.5575 | ||
| maximum-likelihood |
MR results for testing (A) causal effect of HPS1 gene expression levels on TMA (HPS1 → TMA), (B) causal effect of TMA on HPS1 gene expression levels (TMA → HPS1), (C) causal effect of HPS1 gene expression levels on TMA (HPS1 → TMA) when PYROXD2-related SNPs were removed from the instrument set.