| Literature DB >> 35888728 |
Eva König1, Johannes Rainer1, Vinicius Verri Hernandes1, Giuseppe Paglia2, Fabiola Del Greco M1, Daniele Bottigliengo1, Xianyong Yin3, Lap Sum Chan3, Alexander Teumer4,5, Peter P Pramstaller1,6, Adam E Locke7, Christian Fuchsberger1.
Abstract
Metabolites are intermediates or end products of biochemical processes involved in both health and disease. Here, we take advantage of the well-characterized Cooperative Health Research in South Tyrol (CHRIS) study to perform an exome-wide association study (ExWAS) on absolute concentrations of 175 metabolites in 3294 individuals. To increase power, we imputed the identified variants into an additional 2211 genotyped individuals of CHRIS. In the resulting dataset of 5505 individuals, we identified 85 single-variant genetic associations, of which 39 have not been reported previously. Fifteen associations emerged at ten variants with >5-fold enrichment in CHRIS compared to non-Finnish Europeans reported in the gnomAD database. For example, the CHRIS-enriched ETFDH stop gain variant p.Trp286Ter (rs1235904433-hexanoylcarnitine) and the MCCC2 stop lost variant p.Ter564GlnextTer3 (rs751970792-carnitine) have been found in patients with glutaric acidemia type II and 3-methylcrotonylglycinuria, respectively, but the loci have not been associated with the respective metabolites in a genome-wide association study (GWAS) previously. We further identified three gene-trait associations, where multiple rare variants contribute to the signal. These results not only provide further evidence for previously described associations, but also describe novel genes and mechanisms for diseases and disease-related traits.Entities:
Keywords: ExWAS; GWAS; association study; imputation; metabolomics; whole-exome sequencing
Year: 2022 PMID: 35888728 PMCID: PMC9320943 DOI: 10.3390/metabo12070604
Source DB: PubMed Journal: Metabolites ISSN: 2218-1989
Figure 1Flowchart of the study methods and summary of results.
Figure 2Main results of the metabolite ExWAS. (a) Squared correlation of imputed dosages and sequenced genotypes in 181 validation samples split into bins based on MAC or MAF in the reference panel. Number of variants included in the bins from left to right: 2–4: 1851; 5–9: 4412; 10–0.005 = 24,868; 0.005–0.01 = 24,232; 0.01–0.05 = 53,488; 0.05–0.1 = 19,669; 0.1–0.5 = 57,563. (b) Concordance of imputed and sequenced hard calls in 181 validation samples split into bins based on MAC and MAF in the reference panel. (c) Manhattan plot of the single-variant associations of all 175 traits. The 85 significant associations listed in Table S4 are highlighted and colored by metabolite class. The dashed horizontal line indicates the significance threshold of 5.5 × 10−9. (d–f) −log10 p-value of the skato gene test, adding the variants constituting the gene test iteratively for tryptophan—TDO2 (d), sphingomyeline C18:0—CERS4 (e), and carnitine—SLC22A5 (f). In each step i on the x-axis, the gene test is computed using only the i variants with the smallest single variant p-value. Below each point, the minor allele count of the added variant is given.
Locus-trait associations conditionally significant at a 5.5 × 10−9 threshold in the WES combined dataset.
| Trait Code | Trait Name | Gene (LD Mapped Gene) | Variant (Rsid) | Effect | Beta (SE) | MAF | |
|---|---|---|---|---|---|---|---|
| Ser | Serine |
| 1:11856378_G/A (rs1801133) | missense | 3.4 × 10−8 (4.0 × 10−9) | −0.11 (0.02) | 0.405 |
| Asp | Aspartate |
| 4:111398208_A/G (rs10004516) | missense | 1.2 × 10−7 (1.8 × 10−11) | 0.16 (0.03) | 0.127 |
| C10 | Decanoylcarnitine |
| 4:159618737_G/A (rs1235904433) | stop gained | 1.1 × 10−12 (2.5 × 10−16) | 2.37 (0.33) | 0.001 |
| C6 (C4:1-DC) | Hexanoylcarnitine (Fumarylcarnitine) |
| 4:159618737_G/A (rs1235904433) | stop gained | 2.1 × 10−13 (8.0 × 10−15) | 2.44 (0.33) | 0.001 |
| C8 | Octanoylcarnitine |
| 4:159618737_G/A (rs1235904433) | stop gained | 3.4 × 10−14 (6.5 × 10−18) | 2.51 (0.33) | 0.001 |
| C10:1 | Decenoylcarnitine | 4:159631991_G/T (rs9410) | missense | 3.8 × 10−11 (2.7 × 10−11) | −0.14 (0.02) | 0.296 | |
| C12 | Dodecanoylcarnitine | 4:159631991_G/T (rs9410) | missense | 4.8 × 10−14 (5.2 × 10−14) | −0.17 (0.02) | 0.298 | |
| C5-OH (C3-DC-M) | Hydroxyvalerylcarnitine (Methylmalonylcarnitine) |
| 5:70952685_T/C (rs751970792) | stop lost | 1.9 × 10−12 (1.9 × 10−12) | 2.01 (0.28) | 0.001 |
| C16:1 | Hexadecenoylcarnitine | 5:131607402_T/C (rs10479000) | intron; intron | 2.3 × 10−10 (1.3 × 10−10) | −0.13 (0.02) | 0.489 | |
| C2 | Acetylcarnitine |
| 5:131714129_G/A (rs386134194) | synonymous | 2.4 × 10−13 (3.2 × 10−13) | −1.38 (0.19) | 0.003 |
| C4 | Butyrylcarnitine |
| 5:131714129_G/A (rs386134194) | synonymous | 1.4 × 10−10 (1.3 × 10−12) | −1.22 (0.19) | 0.003 |
| C0 | Carnitine |
| 5:131714129_G/A (rs386134194) | synonymous | 5.6 × 10−12 (6.5 × 10−12) | −1.3 (0.19) | 0.003 |
| Asp | Aspartate |
| 5:176836532_A/G (rs1801020) | 5UTR; intron | 2.5 × 10−9 (3.5 × 10−10) | 0.14 (0.02) | 0.235 |
| Taurine | Taurine |
| 5:176836532_A/G (rs1801020) | 5UTR; intron | 5.9 × 10−16 (5.9 × 10−16) | 0.19 (0.02) | 0.235 |
| Sarcosine | Sarcosine | 6:42946943_G/A (rs9462859) | 5UTR | 7.8 × 10−13 (7.8 × 10−13) | −0.16 (0.02) | 0.478 | |
| C3 | Propionylcarnitine |
| 6:160551204_G/C (rs683369) | missense | 7.7 × 10−12 (1.2 × 10−19) | 0.17 (0.03) | 0.196 |
| Serotonin | Serotonin |
| 6:160560880_CATG/C (rs72552763) | inframe insertion | 1.3 × 10−11 (1.2 × 10−11) | 0.19 (0.03) | 0.159 |
| Putrescine | Putrescine |
| 7:150553605_C/T (rs10156191) | missense | 2.3 × 10−1 (2.5 × 10−15) | 0.03 (0.02) | 0.236 |
| C10 | Decanoylcarnitine |
| 9:116931401_C/T (rs145560419) | synonymous | 8.8 × 10−9 (3.2 × 10−10) | −2.68 (0.47) | 0.001 |
| C8 | Octanoylcarnitine |
| 9:116931401_C/T (rs145560419) | synonymous | 4.7 × 10−9 (4.3 × 10−10) | −2.67 (0.46) | 0.001 |
| Sarcosine | Sarcosine |
| 9:136598926_C/G (rs10993780) | intron | 5.3 × 10−44 (5.3 × 10−44) | −0.39 (0.03) | 0.171 |
| PC aa C36:0 | Phosphatidylcholine diacyl C36:0 |
| 10:52603951_AT/A (-) | intron | 4.1 × 10−9 (5.4 × 10−9) | 2.23 (0.38) | 0.001 |
| Putrescine | Putrescine |
| 10:65225899_A/AGGCGGC (rs3841602) | upstream | 1.7 × 10−19 (5.2 × 10−20) | 0.19 (0.02) | 0.477 |
| C16-OH | Hydroxyhexadecanoylcarnitine |
| 10:100148308_T/G (rs2147895) | intron | 8.9 × 10−19 (8.9 × 10−19) | −0.18 (0.02) | 0.336 |
| His | Histidine |
| 11:47445720_G/A (rs186188306) | synonymous | 5.5 × 10−10 (4.5 × 10−9) | −2.34 (0.38) | 0.001 |
| lysoPC a C26:1 | lysoPhosphatidylcholine acyl C26:1 | 11:61560081_G/A (rs174538) | 5UTR | 3.3 × 10−10 (3.3 × 10−10) | −0.14 (0.02) | 0.264 | |
| Asn | Asparagine |
| 11:62105391_C/T (rs2513749) | 5UTR | 1.2 × 10−15 (7.8 × 10−19) | 0.24 (0.03) | 0.12 |
| Gln | Glutamine |
| 12:56866487_A/G (-) | missense | 8.2 × 10−13 (3.1 × 10−14) | −2.97 (0.41) | 0.001 |
| His | Histidine |
| 12:98929093_A/G (rs867372792) | 3UTR | 8.2 × 10−15 (5.6 × 1013) | 1.72 (0.22) | 0.002 |
| His | Histidine | 12:100492127_T/C (-) | splice region variant | 1.3 × 10−10 (2.6 × 10−9) | 1.32 (0.21) | 0.002 | |
| Phe | Phenylalanine |
| 12:102591269_G/T (rs200627654) | intron | 8.2 × 10−17 (1.4 × 10−14) | 1.3 (0.16) | 0.004 |
| His | Histidine |
| 12:110385016_A/AG (-) | intron; intron | 4.2 × 10−12 (1.2 × 10−11) | 2.76 (0.4) | 0.001 |
| Asn | Asparagine |
| 14:104576448_G/A (rs34362765) | intron | 8.8 × 10−105 (5.8 × 10−25) | −0.46 (0.02) | 0.358 |
| C10 | Decanoylcarnitine |
| 16:16139714_T/C (rs35587) | synonymous | 4.5 × 10−8 (5.0 × 10−9) | −0.12 (0.02) | 0.326 |
| C12:1 | Dodecanoylcarnitine |
| 16:16139714_T/C (rs35587) | synonymous | 7.2 × 10−10 (6.5 × 10−10) | −0.13 (0.02) | 0.326 |
| C12 | Dodecenoylcarnitine |
| 16:16139714_T/C (rs35587) | synonymous | 1.2 × 10−9 (1.2 × 10−9) | −0.13 (0.02) | 0.326 |
| lysoPC a C20:3 | lysoPhosphatidylcholine acyl C20:3 |
| 19:19379549_C/T (rs58542926) | missense | 9.3 × 10−9 (5.0 × 10−9) | −0.25 (0.04) | 0.054 |
| PC aa C34:4 | Phosphatidylcholine diacyl C34:4 | 19:19379549_C/T (rs58542926) | missense | 2.0 × 10−11 (4.1 × 10−12) | −0.29 (0.04) | 0.054 | |
| Pro | Proline |
| 22:18910479_C/T (rs13058335) | intron | 4.8 × 10−55 (4.5 × 10−31) | 0.68 (0.04) | 0.063 |
Figure 3Variant enrichment in CHRIS compared to gnomAD. (a) Minor allele frequency (MAF) of matching variants in CHRIS and gnomAD for variants with a minor allele count greater zero in both cohorts are plotted in grey. For gnomAD the non-Finnish European (NFE) MAF is plotted using both exome and genome sequencing data. Variants with conditionally significant annotations in CHRIS are plotted in color, with variants enriched at least five-fold in CHRIS colored in green and labeled with gene—trait(s). (b) CHRIS MAF versus the absolute value of beta is plotted as grey points for all significant single variants. The conditionally significant associations listed in Table 1 are colored. For variants that existed in gnomAD, the gnomAD NFE MAF is plotted as triangles in green (>5 times enriched) or purple (<5 times or not enriched) and connected for visibility. For variants that did not exist in gnomAD or had an allele count of zero, only the CHRIS MAF is plotted in orange. Associations from Table 1 with an absolute beta value greater than one are annotated with gene—trait(s).
Summary of genes identified in this ExWAS.
| Gene | Associated Metabolite(s) | Lead, LD, or Gene Variant 1 | Level | Description |
|---|---|---|---|---|
|
| PC aa C36:0 | Lead | 2 | Apolipoprotein B (apo B) is a major component of low-density lipoproteins and in mammals exist in two isoforms: apoB-100 and apoB-48. The two isoforms are encoded by a single mRNA transcript. |
|
| C10, C12:1, C12 | Lead | 2 | This gene encodes for an ABC proteins that transport various molecules across extra-and intra-cellular membranes. |
|
| His | LD | 3 | Actin Related Protein 6. The role of this gene is not fully understood as well as its association with histidine. |
|
| Putrescine | Lead | 1 | Amine oxidase copper containing 1 ( |
|
| Asn | Lead | 1 | Predicted to have lysophospholipase activity and mainly responsible to catalyze the conversion of asparagine to aspartate. |
|
| Asn | Lead | 1 | Encodes the l-asparaginase enzyme responsible for the catalysis of asparagine catabolism to aspartate. |
|
| SM C18:0 | Gene | 1 | This gene encodes for the protein Ceramide synthase 4, which catalyzes the formation of ceramides via sphinganine and acyl-CoA substrates, with high selectivity on long-chains. |
|
| C10, C8 | Lead | 3 | The gene encodes a member of the fibrillar collagen family, involved in the cartilage calcification process and the transition of cartilage to bone. Mutations on this gene are known to cause Steel Syndrome. |
|
| Asp | Lead | 1 | |
|
| C10, C6 (C4:1-DC), C8, C10:1, C12 | Lead/LD | 2 | This gene encodes for the Electron transfer flavoprotein (ETF) present in the mitochondria, which acts in the electron transfer for at least 9 flavins. Mutations on this gene (and other ETF genes such as |
|
| PC ae C38:3, lysoPC a C26:1 | LD | 2 | Fatty acid desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds into the fatty acyl chain. |
|
| PC ae C38:3, lysoPC a C26:1 | LD | 2 | Fatty acid desaturase enzymes regulate unsaturation of fatty acids through the introduction of double bonds into the fatty acyl chain. |
|
| Asp, Taurine | Lead | 3 | The human coagulation factor XII ( |
|
| Gln | Lead | 1 | The gene is responsible for encoding the glutaminase 2, an enzyme that catalyzes the conversion of glutamine to glutamate and ammonia, promoting mitochondrial respiration and ATP generation. |
|
| Sarcosine | LD | 1 | Acts on the conversion of S-adenosyl-L-methionine (SAMe) and glycine to S-adenosyl-L-homocysteine and sarcosine. Defects in this gene are a cause of hypermethioninemia. |
|
| Putrescine | Lead | 3 | Plays a central role in histone code and lysine demethylation. |
|
| His | LD | 3 | This gene encodes an enzyme used in the final step of the biosynthesis of leukotriene B4, a proinflammatory mediator. It is known to degrade proline-glycine-proline, biomarker for chronic obstructive pulmonary disease. |
|
| C5-OH (C3-DC-M) | Lead | 2 | Catalyzes the conversion of 3-methylcrotonyl-CoA to 3-methylglutaconyl-CoA, playing an important role in the catabolism of leucine and isovaleric acid. Mutations in this gene are associated with 3-methylcrotonylglycinuria. |
|
| Ser | Lead | 2 | Responsible for the catalysis of 5,10-methylenetetrahydrofolate to 5-methyltetrahydrofolate, involved in the remethylation of homocysteine to produce methionine and tetrahydrofolate, a substrate for serine production. |
|
| PC ae C38:3, lysoPC a C26:1 | Lead, LD | 3 | Encodes an essential transcript factor that acts on the central nervous system myelination process. |
|
| C16:1 | Lead | 3 | |
|
| Sarcosine | Lead | 3 | Encodes a member of the AAA family of ATPases, which plays a direct role in peroxisomal protein import and PTS1 (peroxisomal targeting signal 1, a C-terminal tripeptide of the sequence Ser-Lys-Leu) receptor activity. |
|
| Phe | Lead | 3 | Responsible for the generation of multiple protein products including melanin-concentrating hormone (MCH), neuropeptide-glutamic acid-isoleucine (NEI), and neuropeptide-glycine-glutamic acid (NGE). Acts on behaviors such as hunger and arousal. |
|
| C10:1, C12 | Lead | 3 | Index variant associated with different carnitines and colocalized with decreased gene expression. |
|
| Pro | Lead | 1 | This protein catalysis the intermediate reaction of proline catabolism to glutamic acid and mutations on this gene are associated with hyperprolinemia type 1. |
|
| His | Lead | 3 | Proteasome 26S Subunit, ATPase 3 ( |
|
| C16-OH | Lead | 3 | Predicted oxidoreductase that may play in mitochondrial organization. |
|
| Sarcosine | Lead | 1 | This gene encodes for the sarcosine dehydrogenase enzyme that acts on the conversion of sarcosine to glycine. Mutations in this gene are the cause for sarcosinemia. |
|
| C3, Serotonin | Lead | 1 | An organic cation transporter with polyspecificity, such as for histamine, epinephrine, adrenaline, noradrenaline, dopamine, spermine and spermidine, among others. |
|
| C0, C2, C4, C16:1 | Lead/LD/gene | 1 | An organic cation transporter with high affinity for carnitine. Mutations in this gene are the cause of systemic primary carnitine deficiency. |
|
| PC aa C34:4 | LD | 3 | Acts in pre-mRNA splicing. |
|
| His | Lead | 3 | Trichoplein keratin filament binding ( |
|
| Trp | Gene | 1 | This enzyme catalyzes the first and rate-limiting step in the conversion of tryptophan into kynurenine. |
|
| lysoPC a C20:3, PC aa C34:4 | Lead | 2 | Regulator of liver fat metabolism this gene influences triglyceride secretion and hepatic lipid droplet content. It is associated with fatty liver disease and non-alcoholic fatty liver disease. |
|
| lysoPC a C26:1, PC ae C38:3 | Lead | 3 | Transmembrane Protein 258 ( |
|
| His | Lead | 3 | This gene encodes several proteins containing a LEM domain through an alternative splicing mechanism. These proteins are involved in gene expression, chromatin organization, replication and cell cycle control. |
|
| His | Lead | 3 | UHRF1 Binding Protein 1 Like ( |
1 Method by which this gene has been associated with the metabolite. Lead: index variant was located in the gene; LD: genome-wide significant variant (p < 5 × 10−8) in the gene was in high LD (r2 > 0.8) with index variant; Gene: significant association with the gene in the gene-level test.
Gene-trait associations significant (unconditioned) at a 3.55 × 10−8 threshold in the WES combined dataset, with at least two variants needed to reach significance.
| Trait ID | Trait Name | Gene | Mask 1 | Number of Variants | Cumulative Allele Count | Number of Variants Needed to Reach Significance | |
|---|---|---|---|---|---|---|---|
| Trp | Tryptophan |
| HMI | 8.9 × 10−9 (1.7 × 10−8) | 6 | 45 | 3 |
| SM C18:0 | Sphingomyeline C18:0 |
| HMI | 6.1 × 10−10 (2.7 × 10−8) | 10 | 82 | 3 |
| C0 | Carnitine |
| HMI | 2.9 × 10−10 (4.0 × 10−10) | 6 | 37 | 2 |
1 HMI = high-moderate impact.
Figure 4Results of the colocalization analysis. For each index variant with at least one gene colocalized at a posterior probability (PP) ≥ 0.8 in at least one of the three tissues whole blood (Blood), kidney cortex (Kidney), or liver (Liver), colocalization data of all protein coding genes and traits with PP ≥ 0.3 are displayed. The colors represent the change in gene expression relative to an increase in the colocalized metabolite level.