| Literature DB >> 35995766 |
Usman A Tahir1, Daniel H Katz1, Julian Avila-Pachecho2, Clary B Clish2, Pradeep Natarajan2,3, Robert E Gerszten4,5, Alexander G Bick2, Akhil Pampana2, Jeremy M Robbins1, Zhi Yu2, Zsu-Zsu Chen1, Mark D Benson1, Daniel E Cruz1, Debby Ngo1, Shuliang Deng1, Xu Shi1, Shuning Zheng1, Aaron S Eisman1, Laurie Farrell1, Michael E Hall6, Adolfo Correa6, Russell P Tracy7, Peter Durda7, Kent D Taylor8, Yongmei Liu9, W Craig Johnson10, Xiuqing Guo8, Jie Yao8, Yii-Der Ida Chen8, Ani W Manichaikul11,12, Frederick L Ruberg13, William S Blaner14, Deepti Jain15, Claude Bouchard16, Mark A Sarzynski17, Stephen S Rich11,12, Jerome I Rotter8, Thomas J Wang18, James G Wilson1.
Abstract
Integrating genetic information with metabolomics has provided new insights into genes affecting human metabolism. However, gene-metabolite integration has been primarily studied in individuals of European Ancestry, limiting the opportunity to leverage genomic diversity for discovery. In addition, these analyses have principally involved known metabolites, with the majority of the profiled peaks left unannotated. Here, we perform a whole genome association study of 2,291 metabolite peaks (known and unknown features) in 2,466 Black individuals from the Jackson Heart Study. We identify 519 locus-metabolite associations for 427 metabolite peaks and validate our findings in two multi-ethnic cohorts. A significant proportion of these associations are in ancestry specific alleles including findings in APOE, TTR and CD36. We leverage tandem mass spectrometry to annotate unknown metabolites, providing new insight into hereditary diseases including transthyretin amyloidosis and sickle cell disease. Our integrative omics approach leverages genomic diversity to provide novel insights into diverse cardiometabolic diseases.Entities:
Mesh:
Year: 2022 PMID: 35995766 PMCID: PMC9395431 DOI: 10.1038/s41467-022-32275-3
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 17.694
Fig. 1Whole Genome Association Study of known and unknown metabolites in the Jackson Heart Study.
Flow diagram detailing whole genome association study of the metabolome, main results, and subsequent bioinformatic pipeline for unknown metabolite identification. Rare minor allele frequency is defined as <1% in NFE using gnomAD. Confirmation of metabolite identities was limited to commercially available metabolite standards. WGAS whole genome association study, MS mass spectrometry; NFE non-Finish Europeans, GNPS global natural product social networking.
Fig. 2Phenogram of 519 locus-metabolite relationships in the Jackson Heart Study.
118 loci-metabolite associations are for known metabolites. 401 associations are for unknown metabolite features. The most common metabolite class includes amino acids, peptides, and analogs. Highlighted are sentinel genes with ≧4 locus-metabolite associations.
Fig. 3Genetic architecture of metabolite-WGAS associations.
A Number of metabolites associated with each locus; B Absolute distance from mQTL position to transcription start site; C Minor allele frequency and effect size; D Frequency of mQTL sentinel allele in non-Finnish European Individuals vs African individuals; E Location of mQTL. WGAS whole genome association study, mQTL metabolite quantitative loci.
Ancestry-specific mQTLs in the Jackson Heart Study
| rsID | Position | Metabolite | Gene | Position | Allele | Beta | P Value | AFR AF |
|---|---|---|---|---|---|---|---|---|
| rs141239670 | Chr1:171219209 | Succinic acid | Exon | T | 1.01 | 4.4E-11 | 0.01 | |
| rs56072071 | Chr2:215328065 | AICA-riboside** | Intron | A | 0.38 | 2.9E-17 | 0.11 | |
| rs754490766 | Chr3:51959148 | Intron | A | 1.40 | 3.0E-36 | 0.015 | ||
| rs754490766 | Chr3:51959148 | Intron | A | 1.06 | 1.2E-21 | 0.015 | ||
| rs73733867 | Chr6:44207081 | Intergenic | T | 1.29 | 1.9E-37 | 0.02 | ||
| rs3211938 | Chr7:80671133 | C38:6 PE plasmalogen | Exon | G | 0.38 | 1.6E-15 | 0.09 | |
| rs3211938 | Chr7:80671133 | C38:7 PC plasmalogen | Exon | G | 0.38 | 1.6E-15 | 0.09 | |
| rs115027210 | Chr7:76062732 | 2-Hydroxyglutaric Acid | Exon | C | 0.53 | 4.6E-16 | 0.05 | |
| rs28832309 | Chr7:80690622 | C40:7 PE plasmalogen | Intergenic | C | 0.35 | 2.3E-12 | 0.09 | |
| rs7079286 | Chr10:106656814 | Exon | T | −0.48 | 1.2E-23 | 0.11 | ||
| rs334 | Chr11:5227002 | LPC(OH-16:0)* | Exon | A | 0.48 | 4.19E-11 | 0.04 | |
| rs624307 | Chr11:65376604 | 3-Hydroxycarnitine | Exon | T | −0.42 | 7.6E-14 | 0.08 | |
| rs12322356 | Chr12:56378580 | UDP-GlcNAc | Intergenic | C | 0.29 | 6.0E-17 | 0.31 | |
| rs13333418 | Chr16:30975943 | Cholestanone** | Exon | C | −0.23 | 1.4E-13 | 0.30 | |
| rs28934585 | Chr17:7220519 | CAR 14:1 | Exon | T | −0.37 | 3.0E-15 | 0.11 | |
| rs28934585 | Chr17:7220519 | CAR 14:2 | Exon | T | −0.38 | 1.8E-14 | 0.11 | |
| rs28934585 | Chr17:7220519 | CAR 12:0 | Exon | T | −0.35 | 2.5E-13 | 0.11 | |
| rs76992529 | Chr18:31598655 | All-trans retinol** | Exon | A | −0.76 | 4.6E-14 | 0.02 | |
| rs12721054 | Chr19:44919330 | DG (36:4) | UTR | G | −0.31 | 1.3E-11 | 0.12 |
Novel, ancestry-specific mQTLs for known and unknown metabolites in the Jackson Heart Study (minor allele frequency for sentinel SNP less than 1% in non-Finish Europeans).
AFR African, AF allele frequency, UTR untranslated region.
*Unknown metabolite annotated using MS/MS data.
**Unknown metabolite annotated with MS/MS and confirmed with the chemical standard.
Fig. 4Ancestry-specific alleles reveal novel associations of TTR and APOE with retinol species.
A Association of V122I in TTR with an unknown metabolite (QI722; m/z 269.226); B Correlation between QI722 and retinol-binding protein; C Association of rs769455 missense variant in APOE with unknown metabolite (QI176; m/z 269.226); D TTR associated unknown metabolite matching spectra with trans-retinol; APOE associated unknown metabolite with identical molecular mass but earlier retention time indicating it’s a cis-isomer of retinol. Additional isomers tested without compound match include 9 and 11-cis retinol.
Fig. 5Unknown metabolite annotation pipeline using bioinformatic tools leveraging MS/MS spectra.
Unknown metabolite identification with initial clustering of features to elucidate adducts and fragments of primary features or major ions. Subsequent implementation of tools leveraging MS/MS data, including SIRIUS, GNPS, and CANOPUS. Metabolite ID validation is limited to commercially available standards.
Fig. 6GNPS molecular network identifies carotenoid metabolites linked with genomic loci.
A Molecular network of unknown features matching beta-carotene (m/z 536.4354; identified using MS/MS database) and carotene-related compounds using the Global Natural Products Social Molecular Networking. Nodes represent MS/MS spectra obtained at either discreet collision energies ranging from 10 to 50 V or stepped (SV) collision energies. The circular node shape illustrates whether features are representative ions (highest mean abundance) in clusters of co-eluting features with abundances correlating with Spearman coefficients >0.80. Conversely, square nodes correspond to features that based on correlation with co-eluting compounds, are potentially redundant fragments or adducts. Edges represent the cosine similarity among MS/MS spectra and formulas. Zeaxanthin is the predicted metabolite at m/z 568.427 based on m/z differences and association with BCO1, which catalyzes the conversion of carotenoids to retinal and ISX, which regulates the expression of BCO1. B Spectral comparison of plasma unknowns matching carotene and zeaxanthin MS/MS obtained using stepped and discrete collision energy, respectively, illustrating an edge cosine similarity score of 0.88. C Validation of compound identities for zeaxanthin and carotene confirming the retention time match of authentic standards with the unknown features in plasma (D) as well as their MS/MS spectrum match.