Literature DB >> 31960908

Genome-wide scan identifies novel genetic loci regulating salivary metabolite levels.

Abhishek Nag1,2, Yuko Kurushima1, Ruth C E Bowyer1, Philippa M Wells1, Stefan Weiss3, Maik Pietzner4, Thomas Kocher5, Johannes Raffler6, Uwe Völker3, Massimo Mangino1, Timothy D Spector1, Michael V Milburn7, Gabi Kastenmüller6, Robert P Mohney7, Karsten Suhre8, Cristina Menni1, Claire J Steves1.   

Abstract

Saliva, as a biofluid, is inexpensive and non-invasive to obtain, and provides a vital tool to investigate oral health and its interaction with systemic health conditions. There is growing interest in salivary biomarkers for systemic diseases, notably cardiovascular disease. Whereas hundreds of genetic loci have been shown to be involved in the regulation of blood metabolites, leading to significant insights into the pathogenesis of complex human diseases, little is known about the impact of host genetics on salivary metabolites. Here we report the first genome-wide association study exploring 476 salivary metabolites in 1419 subjects from the TwinsUK cohort (discovery phase), followed by replication in the Study of Health in Pomerania (SHIP-2) cohort. A total of 14 distinct locus-metabolite associations were identified in the discovery phase, most of which were replicated in SHIP-2. While only a limited number of the loci that are known to regulate blood metabolites were also associated with salivary metabolites in our study, we identified several novel saliva-specific locus-metabolite associations, including associations for the AGMAT (with the metabolites 4-guanidinobutanoate and beta-guanidinopropanoate), ATP13A5 (with the metabolite creatinine) and DPYS (with the metabolites 3-ureidopropionate and 3-ureidoisobutyrate) loci. Our study suggests that there may be regulatory pathways of particular relevance to the salivary metabolome. In addition, some of our findings may have clinical significance, such as the utility of the pyrimidine (uracil) degradation metabolites in predicting 5-fluorouracil toxicity and the role of the agmatine pathway metabolites as biomarkers of oral health.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 31960908      PMCID: PMC7104674          DOI: 10.1093/hmg/ddz308

Source DB:  PubMed          Journal:  Hum Mol Genet        ISSN: 0964-6906            Impact factor:   6.150


Introduction

Metabolic reactions pervade every aspect of human physiology, abnormalities in which underlie a plethora of human diseases (1). Investigating the genetic underpinnings of population-wide variation of metabolites can offer novel insights into human metabolism and diseases, in addition to providing potential therapeutic targets to modulate metabolite levels. Large-scale genetic association studies have so far identified hundreds of loci that regulate the levels of metabolites in blood (2–6) and to a lesser extent in other biospecimens as well (7–9). Previous studies have shown that genetic variants on average explain a greater proportion of trait variance for metabolites compared to what is generally observed for complex traits (3,4), highlighting the utility of metabolites as intermediate traits for dissecting the genetics of complex diseases. Saliva is an abundantly produced biofluid, and it can be obtained in an inexpensive and non-invasive manner, without the need for healthcare professionals. It is mainly composed of water (>99%) and several other minor constituents such as mucous, digestive enzymes, cytokines, immunoglobulins, antibacterial peptides, and low molecular weight metabolites (10). Recent advances in metabolomic profiling allow quantification of hundreds of metabolites belonging to diverse biochemical pathways in large population samples (4,11). In 2015, the Human Metabolome Database (HMDB) incorporated data on the ‘salivary metabolome’ which included 853 salivary metabolites that were systematically characterized using a multiplatform approach (12). Since saliva is separated from the systemic circulation by just a thin layer of cells, which allows passive and active exchange of substances (13), it provides a reflection of not just oral health but the functioning of other organ systems as well (14). Indeed, a number of studies have previously reported associations between oral health and systemic conditions such as cardiovascular diseases, diabetes, autoimmune diseases, mental health disorders, and dementia, amongst others (15–19). Therefore, investigation of salivary metabolites could not only provide novel biomarkers but also further our understanding of biological pathways underlying oral as well as general health conditions. Here we report a genome-wide association analysis (based on 1000 Genomes imputed data) for 476 salivary metabolites in the population-based TwinsUK study, followed by replication in the population-based Study of Health in Pomerania (SHIP-2) cohort.

Results

Identification of novel genetic loci regulating salivary metabolite levels

Primary genome-wide discovery analysis in TwinsUK identified 13 metabolites that were significantly associated with genetic loci after correcting for multiple testing (P < 10−10). Furthermore, when we narrowed our analysis to just the loci that were identified in the primary stage, one additional metabolite was found associated (P < 10−6). Consequently, a total of 14 distinct locus-metabolite associations (hereafter, referred to as ‘mQTLs’) were identified in the discovery phase of our study. The set of significantly associated variants mapped to 11 distinct genetic loci, which have been referred to by the name(s) of the overlapping or the nearest gene(s) (Fig. 1). Three of those loci (AGMAT, SLC2A9 and DPYS) were associated with two metabolites each (Table 1). In all three instances, the two metabolites regulated by the same locus were correlated (Pearson’s r2 for the metabolite pairs ranged between 0.42 and 0.84) (Supplementary Material, Fig. S1). On the other hand, none of the 14 metabolites that were associated in our study had more than one significant locus. Quantile-quantile (QQ) plots for the significantly associated metabolites are provided in Supplementary Material, Figure S2.
Figure 1

Manhattan plot illustrating the findings of the discovery phase (TwinsUK) of the genome-wide association study for salivary metabolites. The red horizontal line demarcates the study-wide significance threshold of P = 10−10. Eleven loci surpassed the study-wide significance threshold. The loci are referred to by the name (s) of the overlapping or the nearest gene(s).

Table 1

Summary of genetic loci that were significantly associated with salivary metabolite(s) in the discovery phase (TwinsUK)

LocusIndex variantChrPosition (build 37 bp)Biochemical (metabolite)EAEAFBetaSE P-value
AGMATrs10927806115 909 2394-GuanidinobutanoateC0.450.2520.0374.4 × 10−11
rs6690813115 861 073Beta-guanidinopropanoateC0.450.7080.0344.6 × 10−66
ATP13A5rs559183343193 086 314CreatinineG0.530.3050.0393.9 × 10−14
SLC2A9rs1312969749 926 967UrateT0.720.3370.0421.3 × 10−14
rs7675964*49 941 434AllantoinC0.730.2390.0434.1 × 10−8
DMGDHrs248386578 330 227DimethylglycineC0.81−0.3490.0481.1 × 10−12
DPYSrs802743008105 470 8823-UreidopropionateC0.81−0.7640.0438.1 × 10−54
rs802743008105 470 8823-UreidoisobutyrateC0.81−0.4820.0485.8 × 10−22
ABOrs2012989799136 145 424 N-acetylglucosamine/N-acetylgalactosamineAC0.77−0.3140.0462.6 × 10−11
UGCGrs109810989114 591 623Glycosyl-N-stearoyl-sphinganineT0.79−0.3080.0464.6 × 10−11
FADS2rs1745641161 588 3051-(1-Enyl-palmitoyl)-2-arachidonoyl-GPCA0.670.3240.0391.9 × 10−15
ACADSrs3467375112121 171 891EthylmalonateG0.74−0.5390.0415.1 × 10−34
MGPrs57966141215 053 094Gamma-carboxyglutamateG0.60−0.2900.0381.9 × 10−13
TYMS/ENOSF1rs279018673 086RibonateA0.810.3250.0495.8 × 10−11

Chr = Chromosome; EA = effect allele; EAF = effect allele frequency; SE = standard error.

For the primary stage of association analysis in the discovery phase (TwinsUK), locus-metabolite associations (mQTLs) were identified using a genome-wide and metabolome-wide significance threshold of P < 10−10.

*This mQTL was identified when the analysis was restricted to only the loci that were identified in the primary stage of association testing (significance threshold of P < 10−6).

Manhattan plot illustrating the findings of the discovery phase (TwinsUK) of the genome-wide association study for salivary metabolites. The red horizontal line demarcates the study-wide significance threshold of P = 10−10. Eleven loci surpassed the study-wide significance threshold. The loci are referred to by the name (s) of the overlapping or the nearest gene(s). Summary of genetic loci that were significantly associated with salivary metabolite(s) in the discovery phase (TwinsUK) Chr = Chromosome; EA = effect allele; EAF = effect allele frequency; SE = standard error. For the primary stage of association analysis in the discovery phase (TwinsUK), locus-metabolite associations (mQTLs) were identified using a genome-wide and metabolome-wide significance threshold of P < 10−10. *This mQTL was identified when the analysis was restricted to only the loci that were identified in the primary stage of association testing (significance threshold of P < 10−6). Of the 11 genetic loci that were identified in the discovery phase, four of them (SLC2A9, DMGDH, FADS2 and ACADS) have previously been reported in association with blood metabolites (2,4,5); the remaining seven loci were novel, i.e. they had no previously known associations with metabolites in blood or any other biospecimens. For the genetic loci that were associated with a given metabolite in both saliva and blood, co-localization analysis showed that a common underlying causal variant at the locus possibly regulates the metabolite level in both saliva and blood (P < 0.05 for each of the three (out of four) mQTLs for which summary level data were available). The AGMAT locus, one of the novel loci identified, was associated with the metabolites 4-guanidinobutanoate and beta-guanidinopropanoate. These metabolites are generated as intermediate products in the polyamine synthesis pathway, the main site of action for the enzyme agmatinase (encoded by the AGMAT gene) (20). The most significantly associated variants for the two metabolites, i.e. rs10927806 and rs6690813, respectively, are in high LD with one another (r2 = 0.99), suggesting a shared underlying genetic regulation for the two metabolites by the AGMAT locus. Similarly, the association between the ATP13A5 locus and creatinine (a widely used measure of renal function), observed in our study, has also not been reported previously. Another interesting novel association that we identified pertained to the metabolism of the pyrimidine uracil—the DPYS locus (encodes for the enzyme dihydropyrimidinase, involved in uracil degradation) was associated with the metabolites 3-ureidopropionate and 3-ureidoisobutyrate (breakdown products of uracil metabolism). The association between the TYMS/ENOSF1 locus and ribonate is also intriguing, since it has previously been shown that ribonate is one of the substrates for the catalytic activity of reverse thymidylate synthase (rTS), the protein product of ENOSF1 (21). Therefore, it appears that ENOSF1, which is the source of anti-sense RNA of TYMS, is probably the functional gene mediating the observed association of the TYMS/ENOSF1 locus with salivary ribonate. The associations of the SLC2A9 locus with allantoin, the ABO locus with N-acetylglucosamine/N-acetylgalactosamine, the UGCG locus with glycosyl-N-stearoyl-sphinganine, and the MGP locus with gamma-carboxyglutamate were the remaining mQTLs observed in our study that have not been reported previously. Of the 14 mQTLs that we identified, it was possible to test the most significant variant-metabolite pair for nine mQTLs, each in serum and faecal metabolite data in TwinsUK (metabolites corresponding to the remaining five mQTLs were not present in the serum and faecal metabolite datasets). Of them, associations for the ATP13A5 and DPYS loci did not replicate in serum (P > 0.05), while none of the associations, barring the one for the ACADS locus, replicated in the faecal data (P > 0.05) (Supplementary Material, Table S1). Thus, a comparison across all three biospecimens (for the significantly associated salivary metabolites which were also measured in serum and faecal samples in TwinsUK) demonstrates that, in the TwinsUK dataset, the effects of the ATP13A5 and DPYS loci appear to be specific to saliva (Supplementary Material, Fig. S3). For none of the mQTLs did we find any additional independent signals at the associated locus after conditioning for the most significant variant (conditional P > 10−5 for all variants tested at each locus), a finding which was verified by the regional association plots (Supplementary Material, Fig. S4). The observation that a single genetic association signal underlies each of the significantly associated metabolites might partly be due to our lack of power to detect secondary signals at these loci. The strength of association for the most significant variant-metabolite pair did not change much on adjusting for periodontal disease (PD) status, for any of the mQTLs (Supplementary Material, Table S2). Hence, it does not appear that the condition of oral health, which was ascertained using PD status, has a significant effect on the associations observed in our study. Similarly, the most significant variant-metabolite pair for each of the mQTLs remained significant (P > 0.05) on adjusting for either the smoking status, amount of alcohol intake or BMI (Supplementary Material, Table S3). For 8 of the 11 associated loci, it was observed that the most significant variant demonstrated an eQTL effect on at least one transcript in one of the tissues in the GTEx database (no significant eQTL effects were observed for the ATP13A5, SLCA2A9 and ABO loci) (Supplementary Material, Table S4). In case of seven of those eight loci (except DPYS), the significant eQTL effect was observed for the overlapping or the nearest gene transcript, and for five of those seven loci (AGMAT, FADS2, DMGDH, TYMS/ENOSF1 and UGCG), the eQTL effect was observed in one of the gut-related tissues. While eQTL data for transcripts assayed in the minor salivary glands were available for a small number of donors (N = 97) in the GTEx database, we did not observe significant eQTL effects in the salivary tissue for any of the associated loci.

Replication of the discovery phase findings

In SHIP-2, we could attempt replication for 9 of the 14 mQTLs that were identified in the discovery phase (metabolites corresponding to the remaining five mQTLs were not measured in SHIP-2). In the initial replication analysis, which was performed using salivary metabolite data that were not normalized for sample osmolality, eight of the nine discovery phase associations were replicated (P < 0.05), with the direction of effect consistent with that observed in TwinsUK (Table 2). The association between the ABO locus and N-acetylglucosamine/N-acetylgalactosamine was the only finding from the discovery phase that did not replicate in SHIP-2. When the replication analysis in SHIP-2 was repeated with metabolite data that were normalized for the sample osmolality, the strength of all the associations was comparatively reduced (Supplementary Material, Table S5).
Table 2

Summary of the discovery phase associations that were tested in the replication study (SHIP-2)

LocusIndex variantChrBiochemical (metabolite)EABetaSE P-value
AGMATrs1092780614-GuanidinobutanoateC0.1150.0448.9 × 10−3
ATP13A5rs559183343CreatinineG0.2390.0402.8 × 10−9
SLC2A9rs131296974UrateT0.5760.0457.7 × 10−35
rs76759644AllantoinC0.2090.0492.3 × 10−5
DMGDHrs2483865DimethylglycineC−0.3360.0551.4 × 10−9
DPYSrs8027430083-UreidopropionateC−1.0350.0493.1 × 10−82
ABOars94113789 N-acetylglucosamine/N-acetylgalactosamineA0.0320.0480.502
ACADSrs3467375112EthylmalonateG−0.1320.0497.3 × 10−3
TYMS/ENOSF1rs279018RibonateA0.1370.0550.013

aSince information for the index variant (rs201298979) at the ABO locus was not available in the replication study, a proxy variant (rs9411378) that was in high LD (r2 = 0.95) with the index variant was used for the replication analysis.

Replication in SHIP-2 could be attempted for 9 of the 14 mQTLs that were identified in the discovery phase (metabolites corresponding to the remaining five mQTLs were not measured in SHIP-2). For each of these nine mQTLs, the most significant variant-metabolite pair identified in the discovery phase was tested in SHIP-2, using salivary metabolite data that were not normalized for osmolality (since SHIP-2 saliva samples represented stimulated saliva).

Summary of the discovery phase associations that were tested in the replication study (SHIP-2) aSince information for the index variant (rs201298979) at the ABO locus was not available in the replication study, a proxy variant (rs9411378) that was in high LD (r2 = 0.95) with the index variant was used for the replication analysis. Replication in SHIP-2 could be attempted for 9 of the 14 mQTLs that were identified in the discovery phase (metabolites corresponding to the remaining five mQTLs were not measured in SHIP-2). For each of these nine mQTLs, the most significant variant-metabolite pair identified in the discovery phase was tested in SHIP-2, using salivary metabolite data that were not normalized for osmolality (since SHIP-2 saliva samples represented stimulated saliva).

Phenotypic associations for salivary metabolites

The majority of the loci associated with salivary metabolites that were identified in our study have been reported in relation with GWAS traits, inborn errors of metabolism and/or clinically relevant biochemical pathways (Table 3). We further tested the salivary metabolites associated with the DPYS, AGMAT and ATP13A5 loci in relation with specific phenotypes using information available in the TwinsUK database (Table 4).
Table 3

Annotation of the genetic loci and the metabolites that were significantly associated in the discovery phase

LocusBiochemical (metabolite)Metabolite super-pathwayaMetabolite sub-pathwayaKnown GWAS disease associations for the locusbKnown OMIM phenotype for the locuscClinical significance of the locus, the metabolite or the biochemical pathway involved
AGMAT4-GuanidinobutanoateAmino acidGuanidino and acetamido metabolismGlomerular filtration rate, alcoholic chronic pancreatitisPutrescine, a compound that is generated in the pathway which involves the AGMAT gene, i.e. polyamine synthesis pathway, has been implicated in bad breath/oral health (22,23)
Beta-guanidinopropanoateXenobioticsPlant (food) component
ATP13A5CreatinineAmino acidCreatine metabolismKufor–Rakeb syndromeThe ATP13A5 gene encodes the family of proteins that regulate the activity of HMG-CoA reductase (28), the rate-limiting enzyme in cholesterol synthesis
SLC2A9UrateNucleotideHypo(xanthine)/inosine (purine) metabolismGoutdRenal hypouricemia type 2The SLC2A9 gene encodes a carrier protein that is involved in urate transport in the proximal convoluted tubules of kidneys
AllantoinNucleotideHypo(xanthine)/inosine (purine) metabolism
DMGDHDimethylglycineAmino acidGlycine, serine and threonine metabolismDimethylglycine dehydrogenase deficiencyThe DMGDH gene encodes an enzyme (dimethylglycine dehydrogenase), which catalyses the conversion of dimethylglycine to sarcosine
DPYS3-UreidopropionateNucleotideUracil (pyrimidine) metabolismPaget’s diseaseDihydropyrimidinuriaVariants in the DPYS gene, which encodes an enzyme in the pyrimidine degradation pathway, have been associated with 5-fluorouracil toxicity (30)
3-UreidoisobutyrateNucleotideUracil (pyrimidine) metabolism
ABO N-acetylglucosamine/N-acetylgalactosamineCarbohydrateAmino sugar metabolismGastric carcinoma, stroke, venous thromboembolism,d ovarian cancer, malaria, blood cholesterol level, type 2 diabetesThe ABO gene encodes a protein with glycosyltransferase activity, which forms the basis of the ABO blood group system
UGCGGlycosyl-N-stearoyl-sphinganineLipidCeramide metabolismThe UGCG gene is involved in glycosphingolipid synthesis, defects in which are known to cause Gaucher’s disease (51), a lysosomal storage disorder
FADS21-(1-Enyl-palmitoyl)-2-arachidonoyl-GPCLipidPlasmalogenADHD, blood cholesterol levelThe FADS2 gene encodes an enzyme (fatty acid desaturase), which catalyses the rate-limiting step in the desaturation of polyunsaturated fatty acids (PUFA)
ACADSEthylmalonateAmino acidLeucine, isoleucine and valine metabolismEthylmalonic aciduriaThe ACADS gene encodes the acyl-CoA dehydrogenase enzyme, which catalyses the initial step in the fatty acid of oxidation pathway
MGPGamma-carboxyglutamateAmino acidGlutamate metabolismKeutel syndromeThe MGP gene, a regulator of physiologic tissue calcification, is associated with calcification of vasculature in patients with cardiovascular disease (40). The MGP gene has also been associated with natural tooth loss in elderly women (52)
TYMS/ENOSF1RibonateCarbohydratePentose metabolismHypertensionThe TYMS gene encodes the enzyme thymidylate synthase, the main site of action of 5-fluorouracil. The ENOSF1 gene encodes the antisense RNA of TYMS

aObtained from the KEGG database.

bAccessed from the NHGRI GWAS catalogue.

cAccessed from the OMIM database.

The genetic loci that were identified in the discovery phase were annotated for known disease associations using the NHGRI GWAS and OMIM databases. Similarly, the metabolites identified in the discovery phase were annotated for the associated biochemical pathway.

dStrong evidence for co-localization of the association signal for the trait with that for the metabolite which was associated with the same locus.

Table 4

Summary of the phenotypes that were tested in relation with specific salivary metabolites

LocusChrBiochemical (metabolite)Phenotypic association(s) tested
AGMAT14-Guanidinobutanoate1. Periodontal disease
Beta-guanidinopropanoate2. Clinical depression or anxiety disorder
ATP13A53Creatinine1. eGFR (measure of renal function)
2. Grip strength (measure of muscle strength)
3. Statin usage
DPYS83-Ureidopropionate1. Irritable bowel syndrome (based on the ROME-III criteria)
3-Ureidoisobutyrate

eGFR = effective glomerular filtration rate.

The metabolites that were uniquely associated in saliva, i.e. the ones for which a genetic association had not been previously reported in blood, were tested with phenotypes (diseases/traits/adverse drug effects) relating to the metabolite or its associated biochemical pathway, using information from the TwinsUK database.

Annotation of the genetic loci and the metabolites that were significantly associated in the discovery phase aObtained from the KEGG database. bAccessed from the NHGRI GWAS catalogue. cAccessed from the OMIM database. The genetic loci that were identified in the discovery phase were annotated for known disease associations using the NHGRI GWAS and OMIM databases. Similarly, the metabolites identified in the discovery phase were annotated for the associated biochemical pathway. dStrong evidence for co-localization of the association signal for the trait with that for the metabolite which was associated with the same locus. Summary of the phenotypes that were tested in relation with specific salivary metabolites eGFR = effective glomerular filtration rate. The metabolites that were uniquely associated in saliva, i.e. the ones for which a genetic association had not been previously reported in blood, were tested with phenotypes (diseases/traits/adverse drug effects) relating to the metabolite or its associated biochemical pathway, using information from the TwinsUK database. The AGMAT-associated metabolites (4-guanidinobutanoate and beta-guanidinopropanoate) are generated in the polyamine synthesis pathway (https://www.genome.jp/kegg-bin/show_pathway?hsa00330). This pathway also produces the compound putrescine, which has been implicated in poor oral health and foul breath (22,23). Consequently, we evaluated the significance of the AGMAT-associated metabolites in oral health by testing them with PD status—the levels of both 4-guanidinobutanoate and beta-guanidinopropanoate were significantly higher in PD cases compared to controls (P = 0.0003 and P = 0.0006, respectively). Since eGFR (estimated glomerular filtration rate) is calculated on the basis of serum creatinine, and these two commonly used measures of renal function are negatively correlated, we wanted to investigate the association between salivary creatinine and eGFR. We observed a similar strong negative relationship between salivary creatinine and eGFR (P = 1.6 × 10−11), which is indicative of a homeostasis between creatinine concentrations in serum and saliva. Furthermore, creatinine is also known to be a marker of muscle mass and strength (24). We, therefore, tested salivary creatinine in relation with grip strength (a measure of muscle strength), which suggested a positive correlation between them (P = 0.01). Apart from these findings, the remaining phenotypic associations that we tested were largely negative, as follows: The enzyme agmatinase (encoded by the AGMAT gene), which acts on the substrate agmatine, has been implicated in the pathophysiology of mood disorders (25). Moreover, studies have also proposed agmatine as a novel neuromodulator (26). We, therefore, tested the AGMAT-associated metabolites in relation with a diagnosis of clinical depression or anxiety disorder, and responses (on a ordinal scale) to questions in the Hospital Anxiety and Depression Scale (HADS) questionnaire (27). But, neither analysis showed any significant associations (Supplementary Material, Table S6). ATP13A5, the locus that was associated with salivary creatinine, belongs to the family of ATPases that regulate the activity of HMG-CoA reductase (28), the main site of action of the cholesterol-lowering class of drugs called statins. Statins are known to cause muscle dysfunction (myopathy) in a small fraction of patients (29). Given that ATP13A5 and statins both act in the same biochemical pathway, we assessed whether salivary creatinine is also associated with statin usage, and could therefore be used as a biomarker for statin-induced myopathy. There was, however, no association between salivary creatinine and statin usage (P = 0.22). 5-Fluorouracil (5-FU) is a pyrimidine analogue that is a commonly used anticancer drug. It is eliminated from the body via the pyrimidine degradation pathway, and hence, variants in genes coding for the pyrimidine degradation enzymes (for instance, DPYS) are known to be associated with the development of 5-FU toxicity (30), which mainly manifests as gastrointestinal side effects. Since we could not directly assess the DPYS-associated metabolites (3-ureidopropionate and 3-ureidoisobutyrate) in relation to the gastrointestinal side effects of 5-FU toxicity, we instead used a commonly observed phenotype of gastrointestinal dysfunction, irritable bowel syndrome or IBS (ascertained using the ROME-III questionnaire (31)). For both metabolites, we observed that the levels were not significantly different in IBS ‘cases’ compared to ‘controls’ (P > 0.05, for both metabolites). However, this negative finding does not negate the possibility that these metabolites could be of clinical use in predicting 5-FU toxicity.

Discussion

Here we report a genome-wide association analysis of 476 metabolites measured in saliva samples of healthy population-based studies of European descent. We identified a total of 11 distinct genetic loci that regulate the level of 14 salivary metabolites, of which three loci were associated with more than one metabolite each. The fact that saliva is reflective of the concentration of biochemicals in blood forms the basis for certain clinical applications of saliva that others have proposed previously such as therapeutic monitoring of drugs (32), cortisol measurement (33) and renal function monitoring (34). Using salivary metabolite data, we replicated associations for certain well-established genetic loci that are known to regulate the level of blood metabolites. Thus, our findings add further credence to the notion that, as biofluids, a certain degree of homeostasis exists between blood and saliva. Additionally, we identified some novel associations in our study, which have expanded our knowledge of genetic influences on human metabolites. In particular, the association between the DPYS locus and pyrimidine metabolites is intriguing because of its clinical relevance. Mutations in the pyrimidine catabolism genes such as DPYD (encodes dihydropyrimidine dehydrogenase) and DPYS (encodes dihydropyrimidinase) have been linked to inborn errors of metabolism (35,36) as well as development of severe toxicity to the chemotherapeutic agent 5-FU (30,37,38). Studies have previously demonstrated the applicability of salivary measurement of certain pyrimidine pathway metabolites (uracil and dihydrouracil) for evaluating 5-FU toxicity due to deficient DPYD activity (39). In our study, variants in the DPYS gene correlated with the levels of specific pyrimidine metabolites (3-ureidopropionate and 3-ureidoisobutyrate). Therefore, studies to explore the utility of salivary measurements of these metabolites as non-invasive tools for predicting 5-FU toxicity resulting from mutations that affect DPYS activity are warranted. While we could not test 3-ureidopropionate and 3-ureidoisobutyrate in relation to the gastrointestinal side effects of 5-FU toxicity, we did not find any association between these metabolites and a phenotype relevant to gut dysfunction (IBS phenotype). The other novel finding of note was the association between the AGMAT locus and the metabolites 4-guanidinobutanoate and beta-guanidinopropanoate. The AGMAT-associated metabolites are produced in the polyamine synthesis pathway, which also generates the compound putrescine that has been implicated in oral health. Moreover, we found that the AGMAT-associated metabolites were correlated with PD status, a disease related to poor oral health. Together, these findings suggest that the AGMAT-associated metabolites might serve as potential biomarkers for oral health. On the other hand, though the agmatine pathway has been previously implicated in mood disorders, we did not find any association between the AGMAT-associated metabolites and either symptoms of or a prior diagnosis of clinical depression or anxiety disorder. Thus, evidence based on the AGMAT-associated metabolites does not lend support to the hypothesis regarding a potential link between pathways involved in maintaining oral health and regulation of mood (15,18). The association between the ATP13A5 locus and salivary creatinine was also noteworthy. While we observed a degree of homeostasis between salivary and serum creatinine (a commonly used measure of renal function), we did not find any evidence to support our hypothesis of salivary creatinine as a marker for statin-induced myopathy. There were, however, a few other novel associations of interest in our study, such as the one between the MGP locus (known to cause abnormal vascular calcification in patients with cardiovascular disease (40)) and gamma-carboxyglutamate, for which we could not assess the clinical significance since specific phenotypic information was not available in sufficient numbers in those with salivary metabolite data. While we cannot be certain that the novel genetic associations which we identified for salivary metabolites relate to the salivary metabolome alone, we were intrigued to find that these genetic loci had not been reported in association with blood metabolites. This suggests that there may be regulatory pathways of particular relevance to the salivary metabolome. In most cases, the gene transcript nearest to or overlapping our novel genetic loci was expressed in one or more gut-related tissues, including salivary glands. However, for these loci, we did not find much evidence for cis-eQTL effects specific to salivary or other gut tissues, whereby we could attribute their association with the respective salivary metabolite(s) to transcriptional regulation of overlapping/neighbouring genes. Interestingly, there is growing evidence to suggest that the human metabolome is a reflection of an interaction between the host and the gut microbiome (7,41,42). In the case of the salivary metabolome, this can be explored by testing the association of the both the compositional and functional attributes of the salivary metagenome with salivary metabolites. Studies investigating the gut microbiome have shown a relatively low overall influence of host genetics on microbiome composition, but some key taxa have significant heritability (43). If this is recapitulated in the salivary microbiome, it is possible that the observed associations between genetic loci and salivary metabolites could be mediated by the microbiome. Thus, in future studies with both salivary microbial and metabolite data, it will be worth investigating whether the salivary metabolites that were associated in our study correlate with the composition of the salivary microbiome. In summary, our study has provided a map of the genetic loci that influence the salivary metabolome, thus offering insights into hitherto unknown biological pathways involved in the regulation of salivary metabolites. Based on what has been observed for other complex human traits, future studies with larger sample sizes are expected to uncover additional genetic loci with much smaller effects on salivary metabolites. While oral health has been implicated in systemic conditions such as cardiovascular diseases and mental health disorders, the exact mechanisms underlying these associations are far from understood. Our study identified the potential clinical relevance for a few salivary metabolites of interest, such as the utility of the pyrimidine (uracil) degradation metabolites in predicting 5-fluorouracil toxicity and the role of the agmatine pathway metabolites as biomarkers of oral health. However, realizing the potential application of salivary metabolites as biomarkers of systemic health conditions would require conducting a more comprehensive analysis of the salivary metabolome with a wider range of phenotypic domains.

Materials and Methods

Discovery phase

Study population. The discovery phase of the study was conducted in the TwinsUK cohort, an adult twin registry comprising healthy volunteers, based at St. Thomas’ Hospital in London (44). Twins gave fully informed consent under a protocol reviewed by the St. Thomas’ Hospital Local Research Ethics Committee. Subjects of European ancestry with available genotype data and for whom salivary metabolite profiling was done on a fasting state sample were included in our study (N = 1419; mean age = 62.2 years; % females = 92.7). Genotyping, imputation and QC. Subjects were genotyped in two different batches of approximately the same size, using two genotyping platforms from Illumina: 300K Duo and HumanHap610-Quad arrays. Whole genome imputation of the genotypes was performed using the 1000 genomes reference haplotypes (45), further details of which are provided in Moayyeri et al. (44). Stringent QC measures, including minimum genotyping success rate (>95%), Hardy–Weinberg equilibrium (P > 10−6), minimum MAF (>0.5%) and imputation quality score (INFO > 0.5), retained ~9.6 million variants for genome-wide analysis. Saliva sample collection. Saliva samples were obtained by asking the fasted volunteer to spit as much saliva as possible into an empty sterile pot over a period of 10 min. The saliva samples were immediately refrigerated and then frozen at −80°C (usually within 4 h of sample collection) before further processing. Following that, the samples were shipped on dry ice for metabolite profiling at Metabolon Inc., Durham, USA (see Supplementary methods (I) for further details on sample processing). Metabolic profiling of saliva samples. Metabolite concentrations in the saliva samples were estimated using Ultrahigh Performance Liquid Chromatography-Tandem Mass Spectroscopy (UPLC-MS) i.e. chromatographic separation, followed by full-scan mass spectroscopy, to record all detectable ions in the samples (see Supplementary methods (II) for further details). Based on their unique ion signatures (chromatographic and mass spectral), 997 distinct metabolites were identified, of which 823 had known chemical identity at the time of analysis. The 823 known metabolites were broadly classified into eight metabolic groups (amino acids, peptides, carbohydrates, energy, lipids, nucleotides, cofactors and vitamins, and xenobiotics) as described in the KEGG (kyoto encyclopedia of genes and genomes) database (46). The eight metabolic groups were further subdivided into 99 distinct biochemical pathways. Raw metabolite values were normalized for the volume and osmolality measurement of the saliva samples. The normalized metabolite values were then log-transformed, and scaled to uniform mean 0 and standard deviation 1. Of the 823 known metabolites, 476 were retained for analysis based on the presence of measurement in more than 80% samples. For these metabolites, we imputed missing data using the run day minimum value for the metabolite, based on the rationale that missing values represented metabolite concentrations that were too low to be detected. The resulting imputed dataset of the 476 metabolites was used for further analysis. Genome-wide association analysis of salivary metabolites: (i) Primary genome-wide association analysis For each of the 476 metabolites, a linear mixed-model was fitted to test the association between the metabolite (dependent variable) and genome-wide variants (independent variable). Age, sex and time of saliva sample collection were included as covariates in the model. The score test implemented in GEMMA (47), which utilizes a sample kinship matrix (estimated using a subset of ~500 000 variants) to account for the twin structure or relatedness in the TwinsUK data, was used to assess significance of the associations. A genome-wide and metabolome-wide significance cut-off of P < 10−10 (corresponding to the conventional genome-wide significance threshold of 5 × 10−8, corrected for 476 metabolites) was used to identify significant variant-metabolite associations. For each locus that was significantly associated with a metabolite, we reported the variant with the lowest association P-value. (ii) Testing loci identified in the primary analysis for additional metabolite associations Next, we focused just on the loci that were identified in the primary stage of association testing to look for additional variant-metabolite associations for those loci. For each locus that was identified in the primary analysis, we clumped all variants located within a 100 Mb block and with LD (r2) > 0.2, to check for additional metabolite associations at a significance threshold of P < 10−6 (corresponding to P = 0.05, corrected for 476 metabolites and a prior assumption of about 100 associated loci). (iii) Testing the significantly associated loci using metabolite data from other biospecimens For each associated locus, we further assessed the most significant variant-metabolite pair by using measurements for the respective metabolite in serum (4) and faecal samples (7) of the TwinsUK subjects (provided the metabolite was measured in that biospecimen). We tested only those serum and faecal samples that overlapped with the ones in saliva and were collected within 5 years of the saliva samples (in order to ensure that, for a given individual, samples from the different biospecimens being tested were obtained within a certain period of one another). Association testing for serum and faecal metabolites was done using an identical model to that described for the analysis of salivary metabolites. In addition, for the locus-metabolite associations that have been previously reported for blood metabolites, we performed co-localization analysis using the GSMR/HEIDI method (implemented in GCTA) (48) in order to test whether the associations in saliva and blood were mediated by a common underlying signal (pleiotropic effect) or distinct signals at the locus. We used the metabolomic GWAS summary level data made available by Shin et al. (4) and Long et al. (5) for the co-localization analysis. (iv) Conditional analysis for the significantly associated loci (a) Detection of secondary association signals We used approximate conditional analysis, as implemented in GCTA (49), to test whether any of the associated loci had multiple distinct, i.e. secondary association signals, at a ‘locus-wide’ significance threshold of P < 10−5. For each associated locus, all variants that surpassed the study-wide significance threshold (P < 10−10) were conditioned on the most significantly associated variant at that locus (using the association summary statistics). For the conditional analysis, we used genotype data from the complete TwinsUK dataset (N = 5654) to model LD patterns between variants. (b) Adjustment for factors known to affect salivary metabolite levels Since the condition of oral health is known to affect salivary metabolite levels (11), we adjusted the most significant variant-metabolite pair for each locus-metabolite association for a measure of oral health that was available in the TwinsUK dataset, i.e. periodontal disease (PD) status. Self-reported gingival bleeding and a history of gum disease or tooth mobility were used as indicators of PD in TwinsUK (50) (270 PD cases; 1083 controls). Similarly, we also tested the effect of other factors that are known to affect salivary metabolite levels, such as smoking, alcohol consumption and BMI on the significant locus-metabolite associations. Expression quantitative trait locus (eQTL) analysis. We used the version 7 data release of the Genotype-Tissue Expression (GTEx) project (accessed 15 April 2019), which was based on RNA-Seq data obtained from 48 non-diseased tissue sites across ~1000 individuals, to test whether the most significant variant at each associated locus had an eQTL effect on transcripts located within a 1 Mb window of the variant. Annotation of associations using reference databases. We searched the NHGRI GWAS catalogue (accessed 15 April 2019) for previous disease associations for the significantly associated loci that were identified in our study. For the loci that were previously associated with other GWAS traits, we performed co-localization analysis using the GSMR/HEIDI method (implemented in GCTA) (48) in order to test whether the associations were mediated by a common underlying signal (pleiotropic effect) or distinct signals at the locus. We used publicly available GWAS summary level data for the co-localization analysis. We also searched the OMIM database (accessed 15 April 2019) to check the candidate genes at the associated loci for a causal link with inborn errors of metabolism. Moreover, we also queried the HMDB (12) and KEGG (46) databases to identify biochemical pathways and known disease associations for the associated metabolites.

Replication phase

The replication phase was performed in the Study of Health in Pomerania (SHIP-2), a population-based study comprising European ancestry subjects, conducted in the northeastern area of Germany. Further details of SHIP-2, including cohort details, genotyping and imputation, and saliva sample collection are provided in Supplementary methods (III). Metabolic profiling for SHIP-2 saliva samples (N = 1000) was performed using an identical process to that described for TwinsUK. Since the method of saliva sample collection in SHIP-2 (chewing on a piece of cotton) meant that the sample thus obtained represented stimulated saliva, normalization of the metabolite measurements for sample osmolality was not considered necessary. The fact that the salivary osmolality values in SHIP-2 had a much narrower distribution compared to that in TwinsUK verified our rationale (Supplementary Material, Fig. S5). For each locus-metabolite association that was identified in the discovery phase, we tested the most significantly associated variant using a linear regression model that was fitted on R (version 3.5.2). Covariates used in the association model were similar to those used for the discovery phase.

Testing the significantly associated salivary metabolites with phenotypes of interest

We wanted to test how salivary metabolites that were regulated by genetic loci related to relevant phenotypes. For that, we selected the metabolites that were uniquely associated in saliva, i.e. the ones for which a genetic association had not been previously reported in blood, and tested them with phenotypes (diseases/traits/adverse drug effects) relating to the metabolite or its associated biochemical pathway. We obtained the relevant phenotype information from the TwinsUK database, selecting one twin per pair (N = 1426). The phenotype association analysis was performed on R (version 3.5.2) by fitting a linear regression model to test the association between the salivary metabolite and the disease/trait/adverse drug effect (adjusted for age and sex).

Web resources

1000 Genomes project: http://www.internationalgenome.org/ Metabolon: https://www.metabolon.com/ GEMMA: http://www.xzlab.org/software.html KEGG: https://www.genome.jp/kegg/ GCTA: http://cnsgenomics.com/software/gcta/ GTEx: https://gtexportal.org/home/ NHGRI GWAS catalogue: https://www.ebi.ac.uk/gwas/ OMIM: http://omim.org/ HMDB: http://www.hmdb.ca/ LocusZoom: http://locuszoom.org/ UK Biobank GWAS summary data: http://www.nealelab.is/uk-biobank Serum metabolomic GWAS summary data: http://metabolomics.helmholtzmuenchen.de/gwas/; http://www.hli-opendata.com/Metabolome/ Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  52 in total

Review 1.  Metabolome analysis for investigating host-gut microbiota interactions.

Authors:  Michael X Chen; San-Yuan Wang; Ching-Hua Kuo; I-Lin Tsai
Journal:  J Formos Med Assoc       Date:  2018-09-27       Impact factor: 3.282

2.  The association between poor dental health and depression: findings from a large-scale, population-based study (the NHANES study).

Authors:  Adrienne O'Neil; Michael Berk; Kamalesh Venugopal; Sung-Wan Kim; Lana J Williams; Felice N Jacka
Journal:  Gen Hosp Psychiatry       Date:  2014-01-31       Impact factor: 3.238

3.  Agmatinase, an inactivator of the putative endogenous antidepressant agmatine, is strongly upregulated in hippocampal interneurons of subjects with mood disorders.

Authors:  Hans-Gert Bernstein; Claudia Stich; Kristin Jäger; Henrik Dobrowolny; Martin Wick; Johann Steiner; Rüdiger Veh; Bernhard Bogerts; Gregor Laube
Journal:  Neuropharmacology       Date:  2011-07-22       Impact factor: 5.250

4.  Analysis of volatile organic compounds in human saliva by a static sorptive extraction method and gas chromatography-mass spectrometry.

Authors:  Helena A Soini; Iveta Klouckova; Donald Wiesler; Elisabeth Oberzaucher; Karl Grammer; Sarah J Dixon; Yun Xu; Richard G Brereton; Dustin J Penn; Milos V Novotny
Journal:  J Chem Ecol       Date:  2010-08-31       Impact factor: 2.626

5.  Novel single nucleotide polymorphisms of the dihydropyrimidinase gene (DPYS) in Japanese individuals.

Authors:  Fumika Akai; Hiroki Hosono; Noriyasu Hirasawa; Masahiro Hiratsuka
Journal:  Drug Metab Pharmacokinet       Date:  2014-09-28       Impact factor: 3.614

6.  Cadaverine as a putative component of oral malodor.

Authors:  S Goldberg; A Kozlovsky; D Gordon; I Gelernter; A Sintov; M Rosenberg
Journal:  J Dent Res       Date:  1994-06       Impact factor: 6.116

7.  Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits.

Authors:  Jian Yang; Teresa Ferreira; Andrew P Morris; Sarah E Medland; Pamela A F Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael N Weedon; Ruth J Loos; Timothy M Frayling; Mark I McCarthy; Joel N Hirschhorn; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2012-03-18       Impact factor: 38.330

8.  Serum creatinine as a marker of muscle mass in chronic kidney disease: results of a cross-sectional study and review of literature.

Authors:  Sapna S Patel; Miklos Z Molnar; John A Tayek; Joachim H Ix; Nazanin Noori; Deborah Benner; Steven Heymsfield; Joel D Kopple; Csaba P Kovesdy; Kamyar Kalantar-Zadeh
Journal:  J Cachexia Sarcopenia Muscle       Date:  2012-07-10       Impact factor: 12.910

9.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

10.  HMDB 4.0: the human metabolome database for 2018.

Authors:  David S Wishart; Yannick Djoumbou Feunang; Ana Marcu; An Chi Guo; Kevin Liang; Rosa Vázquez-Fresno; Tanvir Sajed; Daniel Johnson; Carin Li; Naama Karu; Zinat Sayeeda; Elvis Lo; Nazanin Assempour; Mark Berjanskii; Sandeep Singhal; David Arndt; Yonjie Liang; Hasan Badran; Jason Grant; Arnau Serra-Cayuela; Yifeng Liu; Rupa Mandal; Vanessa Neveu; Allison Pon; Craig Knox; Michael Wilson; Claudine Manach; Augustin Scalbert
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

View more
  7 in total

1.  A Genome-wide Association Study Discovers 46 Loci of the Human Metabolome in the Hispanic Community Health Study/Study of Latinos.

Authors:  Elena V Feofanova; Han Chen; Yulin Dai; Peilin Jia; Megan L Grove; Alanna C Morrison; Qibin Qi; Martha Daviglus; Jianwen Cai; Kari E North; Cathy C Laurie; Robert C Kaplan; Eric Boerwinkle; Bing Yu
Journal:  Am J Hum Genet       Date:  2020-10-07       Impact factor: 11.025

2.  Association of Physical Activity With Bioactive Lipids and Cardiovascular Events.

Authors:  Rosangela A Hoshi; Yanyan Liu; Mohit Jain; Daniel I Chasman; Olga V Demler; Samia Mora; Heike Luttmann-Gibson; Saumya Tiwari; Franco Giulianini; Allen M Andres; Jeramie D Watrous; Nancy R Cook; Karen H Costenbader; Olivia I Okereke; Paul M Ridker; JoAnn E Manson; I-Min Lee; Manickavasagar Vinayagamoorthy; Susan Cheng; Trisha Copeland
Journal:  Circ Res       Date:  2022-07-19       Impact factor: 23.213

3.  Multi-Omic Approaches to Identify Genetic Factors in Metabolic Syndrome.

Authors:  Karen C Clark; Anne E Kwitek
Journal:  Compr Physiol       Date:  2021-12-29       Impact factor: 8.915

4.  Cerebrospinal fluid metabolomics identifies 19 brain-related phenotype associations.

Authors:  Daniel J Panyard; Kyeong Mo Kim; Burcu F Darst; Yuetiva K Deming; Xiaoyuan Zhong; Yuchang Wu; Hyunseung Kang; Cynthia M Carlsson; Sterling C Johnson; Sanjay Asthana; Corinne D Engelman; Qiongshi Lu
Journal:  Commun Biol       Date:  2021-01-12

5.  Whole Exome Sequencing Enhanced Imputation Identifies 85 Metabolite Associations in the Alpine CHRIS Cohort.

Authors:  Eva König; Johannes Rainer; Vinicius Verri Hernandes; Giuseppe Paglia; Fabiola Del Greco M; Daniele Bottigliengo; Xianyong Yin; Lap Sum Chan; Alexander Teumer; Peter P Pramstaller; Adam E Locke; Christian Fuchsberger
Journal:  Metabolites       Date:  2022-06-29

6.  Salivary metabolites associated with a 5-year tooth loss identified in a population-based setting.

Authors:  Maik Pietzner; Thomas Kocher; Leonie Andörfer; Birte Holtfreter; Stefan Weiss; Rutger Matthes; Vinay Pitchika; Carsten Oliver Schmidt; Stefanie Samietz; Gabi Kastenmüller; Matthias Nauck; Uwe Völker; Henry Völzke; Laszlo N Csonka; Karsten Suhre
Journal:  BMC Med       Date:  2021-07-14       Impact factor: 8.775

7.  Metabolome Genome-Wide Association Study Identifies 74 Novel Genomic Regions Influencing Plasma Metabolites Levels.

Authors:  Pirro G Hysi; Massimo Mangino; Paraskevi Christofidou; Mario Falchi; Edward D Karoly; Robert P Mohney; Ana M Valdes; Tim D Spector; Cristina Menni
Journal:  Metabolites       Date:  2022-01-11
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.