| Literature DB >> 23414815 |
Johannes Raffler1, Werner Römisch-Margl2, Ann-Kristin Petersen3, Philipp Pagel4, Florian Blöchl4, Christian Hengstenberg5, Thomas Illig6, Christa Meisinger7, Klaus Stark8, H-Erich Wichmann9, Jerzy Adamski10, Christian Gieger3, Gabi Kastenmüller2, Karsten Suhre11.
Abstract
Nuclear magnetic resonance spectroscopy (NMR) provides robust readouts of many metabolic parameters in one experiment. However, identification of clinically relevant markers in (1)H NMR spectra is a major challenge. Association of NMR-derived quantities with genetic variants can uncover biologically relevant metabolic traits. Using NMR data of plasma samples from 1,757 individuals from the KORA study together with 655,658 genetic variants, we show that ratios between NMR intensities at two chemical shift positions can provide informative and robust biomarkers. We report seven loci of genetic association with NMR-derived traits (APOA1, CETP, CPS1, GCKR, FADS1, LIPC, PYROXD2) and characterize these traits biochemically using mass spectrometry. These ratios may now be used in clinical studies.Entities:
Year: 2013 PMID: 23414815 PMCID: PMC3706909 DOI: 10.1186/gm417
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Genetic associations with NMR intensities and ratios between NMR intensities.
| GCKR | rs780094 | 2 | 27,594,741 | T | C | 41.4% | 1.370 | 1,688 | 0.0413 | 1.2x10-10 | 3.286/1.370 | 1,667 | -0.0407 | 2.8x10-15 | 4.3x104 |
| CPS1 | rs2216405 | 2 | 211,325,139 | C | T | 19.3% | 3.599 | 1,719 | 0.0456 | 4.5x10-14 | 3.599/2.475 | 1,718 | 0.0459 | 1.8x10-19 | 2.5x105 |
| PYROXD2 | rs4488133 | 10 | 100,149,126 | A | T | 34.3% | 2.757 | 1,735 | -0.0190 | 7.3x10-12 | 2.757/2.755 | 1,734 | -0.0123 | 2.9x10-94 | 2.5x1082 |
| FADS1 | rs174547 | 11 | 61,327,359 | C | T | 30.0% | 2.801 | 1,733 | -0.0408 | 4.0x10-35 | 2.801/2.017 | 1,724 | -0.0449 | 1.1x10-94 | 3.8x1059 |
| APOA1 | rs3741298 | 11 | 116,162,771 | C | T | 21.4% | 2.038 | 1,724 | 0.0363 | 8.4x10-11 | 4.162/4.082 | 1,715 | -0.0138 | 1.8x10-14 | 4.6x103 |
| LIPC | rs4775041 | 15 | 56,461,987 | C | G | 28.5% | 1.283 | 1,671 | 0.0320 | 1.4x10-10 | 1.068/1.029 | 1,664 | -0.0056 | 3.6x10-21 | 4.0x1010 |
| CETP | rs247617 | 16 | 55,548,217 | A | C | 31.3% | 3.259 | 1,705 | 0.0529 | 7.6x10-15 | 2.211/2.011 | 1,695 | -0.0124 | 1.1x10-18 | 6.8x103 |
The locus name indicates the gene that most likely hosts the causative SNP.
A/B: minor/major allele; MAF: minor allele frequency; n: number of samples where genetic and NMR data are jointly available; ppm: parts per million.
aChromosomal location (CHR, POS), minor (effect, A) and major (B) alleles are reported with respect to the positive strand of the human genome (NCBI build 36.1).
bBeta': relative effect size, defined as beta' = (10beta- 1), where beta is the slope in the linear model (using log10-scaled traits) and genotype is coded as 0-1-2 (major-hetero-minor genotype). Beta' represents the relative difference per copy of the minor allele for the unscaled metabolic trait (NMR bin or NMR ratio) compared to the estimated mean of the major allele homozygote group (intercept of the linear model).
cThe P gain is defined as min(p(S1)/p(S1/S2), p(S2)/p(S1/S2)), where p(S1), p(S2), and p(S1/S2) denote the P values of association of two log-scaled NMR intensities at chemical shift S1 and S2, and of their ratio S1/S2, respectively.
Comparison with association data from previous studies on the same specimen.
| GCKR | Trait | Pc ae C34:2/Pc aa C32:2 | Glucose/Mannose | L10 | 3.286 ppm/1.370 ppm |
| SNP | rs1260326, LD r2 = 0.93 | rs780094 | rs1260326, LD r2 = 0.93 | rs780094 | |
| CPS1 | Trait | Glycine/PC aa C38:2 | Asparagine/Glycine | 3.599 ppm/2.475 ppm | |
| SNP | rs2216405 | rs2216405 | (locus not reported) | rs2216405 | |
| PYROXD2 | Trait | Saccharin/Threonine | 2.757 ppm/2.755 ppm | ||
| SNP | (locus not reported) | rs4488133 | (locus not reported) | rs4488133 | |
| FADS1 | Trait | PC aa C36:3/PC aa C36:4 | 1-arachidonoylglycero-phosphoethanolamine/1-linoleoylglycero-phosphoethanolamine | L4 | 2.801 ppm/2.017 ppm |
| SNP | rs174547 | rs174547 | rs174546, LD r2 = 1.0 | rs174547 | |
| APOA1 | Trait | PC aa C36:2/PC aa C38:1 | 1-oleoglycerol/Oleamide | L8 | 4.162 ppm/4.082 ppm |
| SNP | rs964184, LD r2 = 0.61 | rs3741298 | rs964184, LD r2 = 0.61 | rs3741298 | |
| LIPC | Trait | PCaa C38:6 | 1-palmitoylglycero-phosphoethanolamine | L5 | 1.068 ppm/1.029 ppm |
| SNP | rs4775041 | rs4775041 | rs1532085, LD r2 = 0.55 | rs4775041 | |
| CETP | Trait | Guanosine/Phenylacetylglutamine | L3 | 2.211 ppm/2.011 ppm | |
| SNP | (locus not reported) | rs247617 | rs3764261, LD r2 = 1.0 | rs247617 | |
Association data from previous GWAS for the loci reported in this study are reported either on the same SNP, or, if a different SNP was reported, the correlation with the SNP in linkage disequilibrium (r2 LD) is given (based on HapMap, release #27, phases I, II, III, CEU population).
Figure 1Association at the CPS1 locus. Top: The red line indicates the strength of association (-log10(P value)) of SNP rs2216405to the chemical shifts. The strongest association can be observed with the NMR signals at δ = 3.599 ppm (P <4.5 × 10-14). Middle: The blue line gives the squared Spearman correlation coefficient (rs2) for MS determined glycine and the NMR signals. The highest correlation (rs2 = 0.21) can be observed as a sharp peak at δ = 3.599 ppm. Bottom: The plot shows the NMR spectra of two samples, where one was spiked with glycine (green spectral line). The inset shows a magnification of the spectral region around δ = 3.6 ppm. This experiment confirms that the signal intensities at δ = 3.599 ppm are indeed driven by glycine.
Figure 2Association at the FADS1 locus. SNPrs174547 associates most strongly with the NMR signal at chemical shift δ = 2.801 ppm (P = 3.7 × 10-35) (top). When ratios between NMR intensities are tested, the strength of association increases by 59 orders of magnitude for the ratio between NMR intensities at 2.801 ppm and 2.017 ppm (P = 1.1×10-94) (Table 1). A similar increase in the strength of association has been observed in previous GWAS for this locus with ratios between phospholipids containing C20:3 and C20:4 fatty acids, such as PC aa C36:3 and PC aa C36:4 [3]. The FADS1 codes for a delta-5 fatty acid desaturase; C20:3 and C20:4 fatty acids are their substrate-product pair. The correlation plot between these lipid species and the NMR intensities (bottom) indicates that the region around δ = 2.801 ppm correlates more strongly with C20:4 fatty acid-containing lipids, while the region around δ = 2.017 ppm more with C20:3 lipids (bottom insets). The ratio between intensities at δ = 2.801 ppm and δ = 2.017 ppm is therefore a likely proxy for the ratio between three- and four-fold desaturated log chain fatty acids.
Figure 3Association at the GCKR locus. SNP rs780094 (and SNPs in LD) have been associated with different diabetes-relevant traits, including fasting glucose and triglyceride levels (TG). We observe a genome-wide significant association (P = 2.8x10-15) at this locus with the ratio of intensities at δ = 3.286 ppm and δ = 1.370 ppm (dashed lines) (Table 1). The correlation spectra for glucose and TG (both parameters were determined by clinical biochemistry) show that the association signal at δ = 3.286 ppm is mainly driven by glucose and that at δ = 1.370 ppm by TG. The ratio of these two NMR intensities is thus a likely proxy for a diabetes-relevant composite readout on the GCKR pathway.
Spearmancorrelations between NMR intensities and metabolic traits determined by MS and clinical biochemistry.
| GCKR | rs780094 | 1.370 | Triglyceride | 0.62 | 3.286/1.370 | Glucose/Triglyceride | 0.42 |
| CPS1 | rs2216405 | 3.599 | Glycine | 0.22 | 3.599/2.475 | Glycine/Tryptophan | 0.29 |
| PYROXD2 | rs4488133 | 2.757 | Total cholesterol | 0.64 | 2.757/2.755 | Total cholesterol/Triglyceride | 0.26 |
| FADS1 | rs174547 | 2.801 | PC aa C38:5 | 0.48 | 2.801/2.017 | PC aa C38:5/ | 0.45 |
| APOA1 | rs3741298 | 2.038 | Triglyceride | 0.73 | 4.162/4.082 | Lactate/Triglyceride | 0.60 |
| LIPC | rs4775041 | 1.283 | PC aa C36:1 | 0.42 | 1.068/1.029 | 4-methyl-2-oxopentanoate/Total cholesterol | 0.21 |
| CETP | rs247617 | 3.259 | HDL cholesterol | 0.55 | 2.211/2.011 | HDL cholesterol | 0.68 |
The 'Metabolite' and 'Metabolite ratios' columns list the MS and biochemically determined metabolites (or ratios thereof) that correlate best at the chemical shift positions (or ratios) identified in the GWAS.
ppm: parts per million.
Figure 4Association at the PYROXD2 locus. SNP rs4488133 associates with the ratio between two neighboring chemical shifts (δ = 2.757 ppm and δ = 2.755 ppm; P = 2.9x10-94), displaying an exceptionally strong increase in the strength of association when using ratios of over 82 orders of magnitude. This figure presents the median NMR signal intensities, differentiated by the genotype of SNP rs4488133 (color-coded). A peak at δ = 2.757 ppm is only present in major allele homozygotes (inset). This peak is located on the shoulder of a much larger underlying NMR signal. We argue that in this case the ratio between signals at two neighboring chemical shifts, one of them in a localized peak, is equivalent to applying a local baseline correction, effectively elevating the peak out of the background noise.
Figure 5Strength of association with metabolic traits obtained using different methods and technologies. Samples from identical blood draws were analyzed using different methods. Identical genotype data were used and the number of samples is the same (except for minor differences due to missing data); height of the colored bars represents the strength of association for the different methods as reported in Table 2 (grey: targeted MS [3,4], green: non-targeted MS [5], blue: NMR lipid classes [13], yellow: NMR chemical shifts (bins), and red: NMR ratios (this work)).