| Literature DB >> 19043545 |
Christian Gieger1, Ludwig Geistlinger, Elisabeth Altmaier, Martin Hrabé de Angelis, Florian Kronenberg, Thomas Meitinger, Hans-Werner Mewes, H-Erich Wichmann, Klaus M Weinberger, Jerzy Adamski, Thomas Illig, Karsten Suhre.
Abstract
The rapidly evolving field of metabolomics aims at a comprehensive measurement of ideally all endogenous metabolites in a cell or body fluid. It thereby provides a functional readout of the physiological state of the human body. Genetic variants that associate with changes in the homeostasis of key lipids, carbohydrates, or amino acids are not only expected to display much larger effect sizes due to their direct involvement in metabolite conversion modification, but should also provide access to the biochemical context of such variations, in particular when enzyme coding genes are concerned. To test this hypothesis, we conducted what is, to the best of our knowledge, the first GWA study with metabolomics based on the quantitative measurement of 363 metabolites in serum of 284 male participants of the KORA study. We found associations of frequent single nucleotide polymorphisms (SNPs) with considerable differences in the metabolic homeostasis of the human body, explaining up to 12% of the observed variance. Using ratios of certain metabolite concentrations as a proxy for enzymatic activity, up to 28% of the variance can be explained (p-values 10(-16) to 10(-21)). We identified four genetic variants in genes coding for enzymes (FADS1, LIPC, SCAD, MCAD) where the corresponding metabolic phenotype (metabotype) clearly matches the biochemical pathways in which these enzymes are active. Our results suggest that common genetic polymorphisms induce major differentiations in the metabolic make-up of the human population. This may lead to a novel approach to personalized health care based on a combination of genotyping and metabolic characterization. These genetically determined metabotypes may subscribe the risk for a certain medical phenotype, the response to a given drug treatment, or the reaction to a nutritional intervention or environmental challenge.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19043545 PMCID: PMC2581785 DOI: 10.1371/journal.pgen.1000282
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Schematic illustration of the role of intermediate phenotypes (IPs), such as metabolic traits, demonstrated at the examples of two genes that code for major enzymes of the long-chain fatty acid metabolism (FADS1 and LIPC).
We show that new information on the functional basis of the observed associations can be inferred from the biochemical properties of the affected metabolites. Moreover, both genes were previously reported to be associated with common clinical phenotypes, FADS1 in an extent which would not attract immediate attention for follow-up in a genome-wide context. Since several genes and pathways are involved in the development of a clinical endpoint, the IP focuses on one pathway (e.g., cholesterol or a given metabotype) which is already known to be involved in the clinical endpoint (e.g. coronary artery disease (CAD)). It is much easier to identify the genes which are associated with the IP since the associations of genetic variation with the IP is much stronger than with the clinical endpoint. Environmental factors interact at different levels with the IPs and thereby add to the variability in the system. The closer the IP is related to the genetic polymorphism, the stronger the association is expected to be. In our case the association reflects enzymatic activity of FADS1 and LIPC which results in very strong effect sizes of the genetically determined metabotype.
Figure 2P-values of association assuming an additive genetic model, superposing the results obtained from all genome-wide tested metabolic traits.
Chromosomal location is indicated by different colors on the x-axis, negative logarithmic p-values are reported on the y-axis. The top ranking SNPs together with the closest gene and the most significant associating metabolite(s) are indicated. A complete list of all associations with p<10−6 is provided in Table S1, together with significant associations from previous GWA studies with medical phenotypes. Metabolite abbreviations are explained in the material and methods section and a full list of all measured metabolites is provided as supplementary data.
Genetically determined metabotypes with the strongest signal of association.
| Gene |
|
|
|
|
|
| Position relative to gene | 21 kb upstream | intron | 3′UTR | intron | 49 kb upstream |
| rs number | rs9309413 | rs992037 | rs1148259 (rs1200826) | rs174548 | rs4775041 |
| Chromosome | 2 | 6 | 10 | 11 | 15 |
| Chromosomal position | 68,482,423 | 161,971,847 | 37,548,456 | 61,327,924 | 56,461,987 |
| Minor allele frequency | 45.2% | 34.7% | 42.2% | 27.5% | 28.0% |
| Best metabolic trait | Sphingomyelin SM C14:0 | Lysine | Sphingomyelin SM(OH,COOH) C18:2 | Phosphatidylcholine PC aa C36:4 | Phosphatidylethanolamine PE aa C38:6 |
| P-value of best metabolic trait | 1.95×10−9 | 1.20×10−7 | 3.04×10−9 | 4.52×10−8 | 9.66×10−8 |
| Explained variance | 12.0% | 9.5% | 11.7% | 10.1% | 9.7% |
|
| |||||
| HDL cholesterola | 0.035 | - | - | 1.89×10−4 | 2.80×10−9 |
| LDL cholesterola | - | - | - | - | - |
| Triglyceridesa | - | - | - | 0.0014 | 7.30×10−5 |
| 2 h glucoseb | - | - | - | - | - |
| 2 h insulinb | - | - | - | - | - |
| Apolipoprotein-I, APOA-1b | - | - | 2.44×10−4 | 0.032 | 2.75×10−4 |
| Apolipoprotein-II, APOA-2b | - | - | 0.033 | 0.0055 | 0.0032 |
| Apolipoprotein B, APOBb | - | - | - | - | - |
| Total cholesterolb | - | - | 0.043 | 1.48×10−4 | 0.055 |
| Fasting glucoseb | - | - | - | - | - |
| Fasting insulinb | - | - | - | - | - |
| HDL cholesterolb | - | - | - | 0.037 | 0.0049 |
| fasting insulin, HOMAb | - | - | - | - | - |
| Insulinogenic indexb | - | - | - | - | 0.016 |
| LDL cholesterolb | - | - | 0.058 | 6.07×10−5 | - |
| Triglycerides/HDLb | 0.010 | - | - | 0.051 | 0.025 |
| Triglyceridesb | - | - | - | 0.028 | 0.0071 |
| Bipolar disorderc | - | - | - | 0.048 | 0.046 |
| Coronary artery diseasec | - | - | - | 0.021 | - |
| Crohn's diseasec | - | - | - | 0.027 | - |
| Hypertensionc | - | - | - | - | - |
| Rheumatoid arthritisc | 0.031 | - | - | - | 0.059 |
| Type 1 diabetes mellitusc | - | - | - | - | - |
| Type 2 diabetes mellitusc | - | - | - | - | 0.061 |
Reported are the SNP identifier (rs number), chromosome, chromosomal position, the minor allele frequency (MAF), the metabolic trait with the lowest p-value of association (test against the null-hypothesis of no association), and percentage of the variance explained by the additive genetic model. Association results for metabolic traits with p<0.05 are provided in Tables 2, S2, S3, S4, and S5. Data for all 363 metabolic traits are available as supporting online data (Datasets S1 and S2). P-values of association from previous GWA studies for the same SNP (neighboring SNP rs1200826 for ANKRD30A) are reported for the following traits: (a) HDL cholesterol, LDL cholesterol, triglycerides are from the publication of Willer et al. [6]; (b) 2 h glucose, 2 h insulin, apolipoproteins A-I, A-II, B, total cholesterol, fasting glucose, fasting insulin, HOMA insulin resistance, insulinogenic index are from the Diabetes Genetics Initiative (DGI) study [5]; (c) bipolar disorder, coronary artery disease, Crohn's disease, hypertension, rheumatoid arthritis, type 1 and type 2 diabetes mellitus are from the WTCCC study [7]. Associations with p-values larger than 0.1 are indicated by a ‘-’.
Figure 3Boxplots of the metabolite concentrations of five top ranking associations as a function of genotype.
They show the differentiation of the population that is induced by these genetically determined metabotypes (0 = major allele homozygote, 1 = heterozygote, 2 = minor allele homozygote). Boxes extend from 1st quartile (Q1) to 3rd quartile (Q3); median is indicated as a horizontal line; whiskers are drawn to the observation that is closest to, but not more than, a distance of 1.5(Q3-Q1) from the end of the box. Observations that are more distant than this are shown individually on the plot. The number of individuals in each group is given below the boxes. P-values for these associations are given in Table 1.
Associations of rs174548 (FADS1) with metabolic traits.
| metabolite | mean | ncases | p-value | estimate | explained variance |
| PC aa C36:4 | 399.41 | 284 | 4.52E-08 | −0.318 | 10.11% |
| PC a C20:4* | 5.09 | 284 | 5.30E-07 | −0.293 | 8.58% |
| PC aa C38:4 | 209.05 | 284 | 4.91E-06 | −0.268 | 7.17% |
| PC ae C36:5* | 19.14 | 284 | 1.46E-05 | −0.255 | 6.48% |
| SM C22:2 | 4.94 | 284 | 5.93E-05 | −0.236 | 5.59% |
| PC ae C38:4 | 30.12 | 284 | 1.42E-04 | −0.224 | 5.03% |
| PE aa C34:2 | 2.22 | 284 | 1.54E-04 | 0.223 | 4.98% |
| PC ae C38:5 | 32.72 | 284 | 1.80E-04 | −0.221 | 4.88% |
| PC aa C38:5 | 128.89 | 284 | 2.01E-04 | −0.219 | 4.81% |
| PE e (COOH) C16:3* | 5.05 | 284 | 1.49E-03 | 0.188 | 3.53% |
| PC ae C36:4 | 35.16 | 284 | 1.68E-03 | −0.186 | 3.46% |
| PE a C10:0 | 4.16 | 284 | 2.34E-03 | −0.180 | 3.25% |
| PC aa C34:2 | 810.00 | 284 | 2.68E-03 | 0.178 | 3.16% |
| SM (COOH) C18:3 | 7.30 | 284 | 3.08E-03 | −0.175 | 3.07% |
| PC aa C34:4 | 3.25 | 284 | 3.25E-03 | −0.174 | 3.04% |
| PC aa C36:5 | 47.53 | 284 | 4.65E-03 | −0.168 | 2.82% |
| PC ae C36:2 | 25.33 | 284 | 5.87E-03 | 0.163 | 2.67% |
| PC aa C40:5 | 27.52 | 284 | 6.21E-03 | −0.162 | 2.63% |
| Arachidonic acid | 4.33 | 283 | 9.04E-03 | −0.155 | 2.41% |
| PC ae C40:5 | 6.79 | 284 | 1.05E-02 | −0.152 | 2.31% |
| PC aa C40:4 | 9.53 | 284 | 1.07E-02 | −0.151 | 2.29% |
| SM (OH) C26:1 | 12.75 | 63 | 1.15E-02 | −0.317 | 10.03% |
| PI aa C36:2* | 7.37 | 284 | 1.15E-02 | 0.150 | 2.25% |
| SM C24:2 | 16.82 | 221 | 1.20E-02 | −0.169 | 2.86% |
| PI aa C38:4* | 27.03 | 284 | 1.21E-02 | −0.149 | 2.22% |
| PC aa (OH, COOH) C30:4 | 342.95 | 284 | 1.29E-02 | 0.148 | 2.18% |
| PC ae C34:2 | 23.44 | 284 | 2.28E-02 | 0.135 | 1.83% |
| SM (OH) C24:0 | 11.80 | 208 | 2.94E-02 | −0.151 | 2.29% |
| LYS | 215.17 | 284 | 3.05E-02 | 0.129 | 1.65% |
| PA aa C20:7 | 197.36 | 284 | 3.20E-02 | −0.128 | 1.63% |
| PE aa C36:2 | 4.42 | 284 | 3.52E-02 | 0.125 | 1.57% |
| PC aa (COOH) C30:3* | 10.38 | 215 | 4.00E-02 | 0.141 | 1.98% |
| PC ae C38:6 | 11.67 | 284 | 4.40E-02 | −0.120 | 1.44% |
| PC aa C38:6 | 146.59 | 284 | 4.42E-02 | −0.120 | 1.43% |
| C5-DC | 0.11 | 284 | 4.43E-02 | −0.120 | 1.43% |
| SM (OH,COOH) C6:0 | 4.79 | 63 | 4.64E-02 | 0.252 | 6.35% |
| SM C28:4 | 5.51 | 284 | 4.73E-02 | −0.118 | 1.39% |
| PI a (OH, COOH) C18:2* | 3.74 | 63 | 4.84E-02 | 0.250 | 6.24% |
| PC aa C36:2 | 412.59 | 284 | 4.92E-02 | 0.117 | 1.37% |
Metabolites associated (p<0.05) with genotype rs174548 (FADS1) in the additive genetic model; in cases where alternative assignments of the metabolites are possible, these are indicated by a ‘*’. Full annotations can be found in the supporting online data files. Reported are the mean concentrations (µM), standard deviation, the number of cases for which metabolite concentrations were obtained (ncases), the p-value of the association, the regression coefficient using an additive genetic model (estimate), and the measure of the observed variance that can be explained by the additive genetic model.
Associations of rs174548 (FADS1) with concentrations and ratios between the concentrations of matching pairs of glycerophospholipid species.
| enumerator | denominator | mean | ncases | p-value | estimate | explained variance |
|
| ||||||
| PC a C20:4* | 1 | 5.094 | 284 | 5.3×10−7 | −0.293 | 8.58% |
| PC aa C34:4 | 1 | 3.249 | 284 | 3.3×10−3 | −0.174 | 3.04% |
| PC aa C36:4 | 1 | 399.407 | 284 | 4.5×10−8 | −0.318 | 10.11% |
| PC aa C38:4 | 1 | 209.050 | 284 | 4.9×10−6 | −0.268 | 7.17% |
| PC ae C36:4 | 1 | 35.160 | 284 | 1.7×10−3 | −0.186 | 3.46% |
| PC ae C38:4 | 1 | 30.117 | 284 | 1.4×10−4 | −0.224 | 5.03% |
| PE aa C38:4 | 1 | 5.357 | 284 | 0.13 | −0.090 | 0.81% |
| PI aa C38:4* | 1 | 27.025 | 284 | 0.012 | −0.149 | 2.22% |
|
| ||||||
| PC a C20:3* | 1 | 2.461 | 208 | 0.86 | −0.013 | 0.02% |
| PC aa C34:3 | 1 | 30.751 | 284 | 0.21 | 0.075 | 0.56% |
| PC aa C36:3 | 1 | 250.496 | 284 | 0.56 | 0.035 | 0.12% |
| PC aa C38:3 | 1 | 123.002 | 284 | 0.66 | −0.027 | 0.07% |
| PC ae C36:3 | 1 | 19.697 | 284 | 0.17 | 0.081 | 0.66% |
| PC ae C38:3 | 1 | 10.641 | 284 | 0.74 | 0.020 | 0.04% |
| PE aa C38:3 | 1 | 1.623 | 132 | 0.92 | −0.009 | 0.01% |
| PI aa C38:3* | 1 | 7.791 | 221 | 0.077 | 0.120 | 1.43% |
|
| ||||||
| PC a C20:4* | PC a C20:3* | 2.224 | 208 | 2.9×10−8 | −0.374 | 13.98% |
| PC aa C34:4 | PC aa C34:3 | 0.107 | 284 | 4.2×10−7 | −0.295 | 8.72% |
| PC aa C36:4 | PC aa C36:3 | 1.613 | 284 | 2.4×10−22 | −0.535 | 28.62% |
| PC aa C38:4 | PC aa C38:3 | 1.708 | 284 | 2.1×10−17 | −0.476 | 22.66% |
| PC ae C36:4 | PC ae C36:3 | 1.832 | 284 | 7.3×10−8 | −0.313 | 9.81% |
| PC ae C38:4 | PC ae C38:3 | 2.888 | 284 | 9.7×10−9 | −0.333 | 11.07% |
| PE aa C38:4 | PE aa C38:3 | 3.693 | 132 | 0.013 | −0.216 | 4.64% |
| PI aa C38:4* | PI aa C38:3* | 3.582 | 221 | 1.5×10−8 | −0.370 | 13.69% |
Association of SNP rs174548 (FADS1) with concentrations and ratios between the concentrations of matching pairs of glycerophospholipid species with three- (denominator) and four-fold (enumerator) unsaturated carbon bonds in their fatty acid side chains; in cases where alternative assignments of the metabolites are possible, these are indicated by a ‘*’; reported are the mean (µM), the number of cases for which metabolite concentrations were obtained (ncases), the p-value of the association, the regression coefficient using an additive genetic model (estimate), and the proportion of the observed variance that can be explained by including the genetic polymorphism in the additive genetic model.
Figure 4Boxplots of the strongest associations of metabolite concentration ratios with polymorphisms in the FADS1 (A; p = 2.4×10−22), SCAD (B; p = 9.3×10−17), and MCAD (C; p = 7.6×10−17) genes (see legend to Figure 3 for details).
The metabolic efficiencies of the reactions that are catalyzed by these three enzymes differ considerably between individuals of different genotype.