| Literature DB >> 30689875 |
Tuulia Tynkkynen1,2, Qin Wang2,3,4,5, Jussi Ekholm2,3,4, Olga Anufrieva2,3,4, Pauli Ohukainen2,3,4, Jouko Vepsäläinen1, Minna Männikkö3,4,6, Sirkka Keinänen-Kiukaanniemi3,7,8, Michael V Holmes9,10,11,12, Matthew Goodwin12,13, Susan Ring12,13, John C Chambers14,15,16,17, Jaspal Kooner15,16,18, Marjo-Riitta Järvelin3,4,7,14, Johannes Kettunen2,3,4,19, Michael Hill9,10, George Davey Smith12,13, Mika Ala-Korpela1,2,3,4,5,12,13,20.
Abstract
BACKGROUND: Quantitative molecular data from urine are rare in epidemiology and genetics. NMR spectroscopy could provide these data in high throughput, and it has already been applied in epidemiological settings to analyse urine samples. However, quantitative protocols for large-scale applications are not available.Entities:
Keywords: Metabolomics; genome-wide analyses; multicentre; open access; serum; urine
Mesh:
Year: 2019 PMID: 30689875 PMCID: PMC6659374 DOI: 10.1093/ije/dyy287
Source DB: PubMed Journal: Int J Epidemiol ISSN: 0300-5771 Impact factor: 7.196
Intra-assay variation as well as intra-individual and inter-individual variation of quantified urine metabolites
| Metabolite | Intra-assay CV (%) | Intra-individual CV (%) | Inter-individual CV (%) |
|---|---|---|---|
| Amino acids | |||
| Alanine | 1.16 | 28.69 | 49.88 |
| Glycine | 2.21 | 34.71 | 73.98 |
| Histidine | 1.10 | 30.25 | 48.81 |
| Threonine | 4.57 | 38.58 | 75.44 |
| Branched-chain amino acids | |||
| Isoleucine | 6.68 | 23.27 | 54.06 |
| Valine | 4.72 | 20.28 | 39.50 |
| Aromatic amino acids | |||
| Tryptophan | 3.34 | 33.76 | 51.05 |
| Tyrosine | 3.35 | 32.09 | 45.76 |
| Glycolysis-related metabolites | |||
| Glucose | 2.91 | 13.76 | 1654.62 |
| Lactate | 4.28 | 44.26 | 476.52 |
| Citrate cycle-related metabolites | |||
| Cis-aconitate | 0.85 | 22.28 | 39.68 |
| Citrate | 1.51 | 27.92 | 53.35 |
| Urea cycle | |||
| Urea | 1.46 | 32.98 | 39.44 |
| Phenylalanine metabolism | |||
| 4-Hydroxyphenylacetate | 2.24 | 28.73 | 52.12 |
| Hippurate | 1.15 | 58.45 | 69.26 |
| Histidine metabolism | |||
| 3-Methylhistidine | 1.56 | 95.44 | 117.16 |
| Glycine, serine and threonine metabolism | |||
| Creatine | 4.12 | 126.19 | 239.55 |
| Microbial metabolism | |||
| 4-Hydroxyhippurate | 3.43 | 34.85 | 72.12 |
| Acetate | 14.17 | 62.87 | 394.31 |
| Dimethylamine | 0.74 | 9.79 | 30.48 |
| Formate | 8.71 | 41.32 | 584.66 |
| Methylamine | 3.17 | 32.07 | 51.20 |
| p-Cresol sulphate | 1.53 | 35.65 | 71.22 |
| Trimethylamine N-oxide (TMAO) | 1.63 | 80.89 | 127.14 |
| Nicotinate and nicotinamide metabolism | |||
| N1-Methyl-2-pyridone-5-carboxamide (2PY) | 2.14 | 35.29 | 60.72 |
| N1-Methylnicotinamide | 1.32 | 28.24 | 52.21 |
| Trigonelline | 0.79 | 68.71 | 74.64 |
| Purine metabolism | |||
| Hypoxanthine | 3.53 | 38.80 | 338.41 |
| Pyrimidine metabolism | |||
| Pseudouridine | 2.15 | 6.32 | 14.28 |
| Uracil | 4.29 | 37.71 | 148.13 |
| Pentose and glucuronate interconversion | |||
| Arabinose | 3.58 | 35.50 | 59.51 |
| Glucuronate | 4.07 | 18.31 | 50.07 |
| Xylose | 3.38 | 99.60 | 111.96 |
| Galactose metabolism | |||
| Sucrose | 4.45 | 194.15 | 459.31 |
| Miscellaneous | |||
| 2-Furoylglycine | 5.46 | 225.45 | 212.50 |
| 2-Hydroxyisobutyrate | 1.15 | 16.25 | 35.39 |
| 3-(3-Hydroxyphenyl)-3-hydroxypropanoate (HPHPA) | 4.30 | 67.68 | 73.64 |
| 3-Hydroxyhippurate | 2.56 | 51.81 | 99.98 |
| 3-Hydroxyisobutyrate | 2.67 | 34.18 | 61.60 |
| 3-Hydroxyisovalerate | 4.84 | 66.55 | 46.16 |
| Indoxyl sulphate | 1.59 | 32.24 | 46.56 |
| Sumiki’s acid | 2.36 | 35.23 | 133.73 |
| Trans-aconitate | 4.42 | 50.71 | 59.60 |
Concentrations are scaled to the concentration of creatinine; CV% = (standard deviation / average) * 100%.
One urine sample prepared and analysed as 10 replicates; reflects the entire quantitative process, i.e. including all the sample preparation steps, NMR experimentation and mathematical quantification protocols.
A 30-day consecutive urine collection; CV%s first calculated for each individual and then averaged over three different people.
A total of 1004 different individuals from the Northern Finland Birth Cohort 1966.
The intra-individual CV% is slightly higher than the inter-individual CV%. Very few samples for the three people contributing to the intra-individual variation contained 2-furoylglycine (two people had it in seven and one person in four out of 30 samples). For two people contributing to the intra-individual variation, the average concentration of 3-hydroxyisovalerate was lower than the average concentration in the NFBC samples.
Figure 1.Characteristic 1H NMR spectra of human urine from six subjects, and illustration of the sophisticated line shape fitting analyses. Alignment of spectra from six subjects is shown. Heavily overlapping signal structures in multiple areas are typical for these spectra. The insets marked from A to F illustrate how line shape fitting analyses, incorporating prior knowledge on the individual molecular attributes, can robustly solve the overlap and lead to reliable quantification of the metabolites., Black lines represent the observed spectra and the coloured lines represent the fitted signals. Grey lines indicate currently unidentified signals. The green line at the bottom illustrates the difference between the observed spectrum and the fitted signals. The coupling trees above the spectra demonstrate the multiplet structures directly linked to the molecular attributes and used as constraints in the line shape fitting analyses.,
Figure 2.The automated quantification of urinary creatinine and glucose from the NMR spectra. On the left: building and assessment of the final automated regression models for the absolute signal areas for creatinine and glucose in the NMR spectra (n = 999). Training and independent testing results are shown in Supplementary Figure 1, available as Supplementary data at IJE online. In the Bland-Altman plots, the solid line in the middle represents the mean bias (between the automated regression and the line shape fitting analyses results for the absolute signal area) and the two others the mean ± 1.96 SD. The dashed red line represents the regression line for the bias. The equations for the regression lines are + for creatinine and for glucose. Bias as a function of creatinine: with R2 = 0.0006 and bias as a function of glucose: with R2 = 0.000001. Both automated regression models show excellent quantitative performance and robustness with negligible bias. On the right: the distribution of absolute urinary concentration (in µm/mM creatinine) in 4548 urine samples in NFBC66. The absolute signal areas for the urinary creatinine and glucose used to calculate the distribution are based on fully automated NMR spectral analyses using the final models illustrated on the left. The urinary glucose distribution is positively skewed (88 glucose concentration values >80 µm/mM creatinine are not drawn for clarity). This is expected, due to individuals with prediabetes and diabetes in NFBC66.
Figure 3.The intra-fluid metabolic associations in serum. The intra-fluid metabolic correlations in serum, i.e. in circulating metabolism, are strong due to multiple key metabolic pathways under heavy systemic control. For example, the metabolism of apoB-containing lipoprotein particles is a continuum and reflected by strong correlations between adjacent lipoprotein subclass particle concentrations. Strong links exist also, e.g. between triglyceride-rich very low-density lipoprotein (VLDL) particles and large cholesterol-rich high-density lipoprotein (HDL) particles as well as between multiple amino acids. The colour coding refers to partial correlations adjusted for sex: n = 995 individuals from NFBC66. The heat map is organized manually on the basis of the key metabolic groups and pathways represented by the measures., In all, 27 principal components were needed to explain >99% of variation in the metabolic information of these 61 serum measures (leading to Bonferroni-corrected significance P-value of 0.002, i.e. 0.05/27; marked with * in the map). IDL, intermediate-density lipoprotein; XXL refers to the largest and XS to the smallest lipoprotein particles in each lipoprotein fraction; P, particle (concentration); C, cholesterol; TG, triglyceride; PUFA, polyunsaturated fatty acids; MUFA, monounsaturated fatty acids; GlycA, glycoprotein acetyls.
Figure 4.The intra-fluid metabolic associations in urine. The intra-fluid metabolic correlations in urine are generally rather weak and only a few stronger metabolic correlation blocks are noticeable, namely positive correlations among amino acids, glycolysis- and citrate cycle-related metabolites, 3-hydroxyisobutyrate and 3-hydroxyisovalerate result in clear association clusters. These association characteristics are likely to partly reflect the large intra-individual variation in urinary metabolites, but they are also likely a fundamental sign of metabolic waste under only limited systemic control. However, the concentrations of the amino acids are rather strongly correlated, as would be expected for these apparently healthy individuals with healthy kidneys. The amino acid concentrations also correlate with 3-hydroxyisobutyrate and 3-hydroxyisovalerate, both degradation products of branched-chain amino acids, as well as with glucose and lactate, related energy metabolites in gluconeogenesis. Several metabolites related to microbial metabolism are quantified, and an interesting correlation cluster is seen between methylamine, p-cresol sulphate and TMAO. The colour coding refers to partial correlations adjusted for sex; n = 995 individuals from NFBC66. The heat map is organized manually on the basis of the key metabolic groups and pathways represented by the measures (Table 1). Forty principal components were needed to explain >99% of variation in the metabolic information of these 43 urine metabolites (leading to Bonferroni-corrected significance P-value of 0.001, i.e. 0.05/40; marked with * in the map). Thus, the urine metabolites are generally highly uncorrelated and provide independent metabolic information. 2PY, N1-methyl-2-pyridone-5-carboxamide; TMAO, trimethylamine N-oxide; HPHPA, 3–(3-hydroxyphenyl)-3-hydroxypropanoate.
Figure 5.The inter-fluid metabolic associations between urine and serum. The inter-fluid metabolic correlations between urine and serum are rather weak. However, several clearly detectable associations are present. The amino acid concentrations in serum and in urine are strongly positively associated, except for histidine for which the correlation appears very weak. There is an intriguing positive association between urinary TMAO and serum polyunsaturated omega-3 fatty acids. Notably, circulating TMAO has been linked to the pathogenesis of cardiovascular disease. However, we do not yet have data to associate urinary TMAO concentrations with cardiometabolic outcomes, and its concentration in serum is too low to be quantified by serum NMR metabolomics. In addition, serum polyunsaturated omega-6 fatty acids associate negatively with multiple urinary metabolites in relation to amino acid, energy and microbial metabolism, for example, 2-hydroxyisobutyrate, cis-aconitate, and pseudouridine. Multiple urinary metabolites, e.g. 3-hydroxyisobutyrate, lactate, pseudouridine and cis-aconitate associate with circulating amino acids, glucose and creatinine. For example for cis-aconitate, a key component in the citric acid cycle, these associations are not unexpected. Cis-aconitate also associates with serum triglycerides. On the other hand, urinary uracil (a naturally occurring pyrimidine found in RNA and, for example, related to carbohydrate metabolism and sugar transport) is positively associated with serum high-density lipoprotein (HDL) cholesterol. The rationale for this association is not evident, though it could be due to uracil’s involvement in energy metabolism and the inverse association between serum triglycerides and HDL cholesterol. The colour coding refers to partial correlations adjusted for sex: n = 995 individuals from NFBC66. The heat map is organized via two-dimensional hierarchical clustering. In all, 66 principal components were needed to explain >99% of variation in the metabolic information of these 104 metabolic measures combining the quantitative information from urine and serum (leading to Bonferroni-corrected significance P-value of 0.0008 i.e. 0.05/66; marked with * in the map). Combining quantitative urine metabolite data with serum metabolomics would thus evidently increase the independent metabolic information content of the dataset. Abbreviations are as detailed in the captions for Figures 3 and 4.
Figure 6.Associations of metabolites quantified in both urine and serum with body mass index. Multiple associations are notable between urinary metabolites and BMI. For example, BMI associates negatively with urinary p-cresol sulphate and hippurate, and positively with 2-hydroxyisobutyrate and branched-chain amino acids isoleucine and valine and aromatic amino acids tryptophan and tyrosine. For all the amino acids that are quantified from both urine and serum, the association direction with BMI is the same in serum and in urine; the association strengths, however, tend to be weaker in urine. Abbreviations are as detailed in the caption for Figure 4.
Figure 7.Manhattan plots of the GWAS of formate and 2-hydroxyisobutyrate.The SNP associations across the whole genome are presented. For plotting purposes, the associations with P-value larger than 1*10–3 are not shown. Each dot is a –log10 of P-value of the association between the genetic variant and the metabolite using an additive model. The dots are ordered using the chromosome number and base pair position of the variant in the chromosome. The top signals in these two plots were significant after correcting the genome-wide significance threshold for 40 independent tests (P <1.25*10–9; red line). All metabolite concentrations were first adjusted for sex, and 10 first principal components from genomic data and the resulting residuals were transformed to normal distribution by inverse rank-based normal transformation. NFBC66 was genotyped using Illumina HumanHap 370k array. The genotypes were imputed using the Haplotype Reference Consortium pipeline. The results were filtered using minor allele frequency cut-off of 5% or greater and imputation info 0.8 or greater. The analysis software was SNPTEST 2.5.1 using additive model for association testing.