| Literature DB >> 35883459 |
Tianqi Li1,2,3, Tuulia Tynkkynen1,4, Andrei Ihanus1,2,4, Siyu Zhao1,2,3, Ville-Petteri Mäkinen1,2,5,6, Mika Ala-Korpela1,2,3,4.
Abstract
A systematic comparison is presented for the effects of seven different normalization schemes in quantitative urinary metabolomics. Morning spot urine samples were analyzed with nuclear magnetic resonance (NMR) spectroscopy from a population-based group of 994 individuals. Forty-four metabolites were quantified and the metabolite-metabolite associations and the associations of metabolite concentrations with two representative clinical measures, body mass index and mean arterial pressure, were analyzed. Distinct differences were observed when comparing the effects of normalization for the intra-urine metabolite associations with those for the clinical associations. The metabolite-metabolite associations show quite complex patterns of similarities and dissimilarities between the different normalization methods, while the epidemiological association patterns are consistent, leading to the same overall biological interpretations. The results indicate that, in general, the normalization method appears to have only minor influences on standard epidemiological regression analyses with clinical/physiological measures. Multimetabolite normalization schemes showed consistent results with the customary creatinine reference. Nevertheless, interpretations of intra-urine metabolite associations and nuanced understanding of the epidemiological associations call for comparisons with different normalizations and accounting for the physiology, metabolism and kidney function related to the normalization schemes.Entities:
Keywords: NMR; biomarkers; disease risk; epidemiology; kidney function; metabolomics; normalization; urine
Mesh:
Substances:
Year: 2022 PMID: 35883459 PMCID: PMC9313036 DOI: 10.3390/biom12070903
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
The characteristics of employed normalization methods.
| Method | Abbr. | Description | Pros for Epidemiology | Cons for Epidemiology | Ref. |
|---|---|---|---|---|---|
|
|
| Using the original quantified concentrations of the urinary metabolites from the 1H NMR spectra, i.e., no normalization method applied. | Original data are preserved; the concentration values are straightforward to interpret. | Urinary volume, and thus absolute metabolite concentrations, varies greatly day-to-day and person-to-person. Random spot urine samples are thus likely to be too confounded to use without normalization. | [ |
|
|
| The concentration of each metabolite is divided by the concentration of an internal standard. Here, creatinine, glucose, urea, and pseudouridine were used. | Creatinine comes from non-enzymatic breakdown of creatine phosphate in muscles, it is typically produced at a constant rate, and it is stable and inert in plasma. | The stable excretion of creatinine may not hold in the presence of external stimuli or pathophysiological conditions. | [ |
|
| Glucose is freely filtered by the kidneys and mostly reabsorbed. The mechanisms of glucose reabsoption are well-understood. | At high plasma glucose concentrations (>10 mmol/L), the tubular reabsorption saturates, triggering a pronounced part of filtered glucose to be excreted into the urine. | [ | ||
|
| Serum urea is applied as a marker of renal function for routine clinical analysis. | Plasma urea concentrations vary widely depending on protein intake, changes in tissue catabolism, and in various pathological conditions. | [ | ||
|
| Pseudouridine is a low-molecular-mass, water-soluble compound with no significant protein binding in serum. | Pseudouridine concentrations appear related to kidney function with potential associated bias. | [ | ||
|
|
| The concentration of each metabolite is divided by the total sum of all metabolite concentrations. | A generic algorithm that can be applied to any metabolomics platform without requirements for a specific set of metabolites. | Abundance of urinary metabolites resembles the Pareto distribution; a few abundant molecules (e.g., urea) typically dominate the concentration sum. The distribution of a metabolite is usually fat-tailed, thus extreme values may reduce normalization accuracy. Both issues mean that the benefit of averaging across many metabolites may be lost in real data. | [ |
|
|
| A robust version of the CS principle that addresses the Pareto issue of CS (by standardized abundances) and the outlier issue (by the median estimator). | A generic algorithm that can be applied to any metabolomics platform. | All normalized concentrations are interdependent (i.e., if some metabolites go up, then others must go down to maintain balance). This means that undesired correlated variation/confounding across many metabolites may cause normalization artefacts. Since abundances are standardized, including low-abundance metabolites near the detection limit may amplify the impact of measurement noise. | [ |
|
|
| The DESeq2 is a variant of PQN developed for RNA-seq data. Uses geometric mean instead of the median to standardize abundances. | Same benefits as PQN. Geometric mean is better suited for concentrations with limited numerical precision. | Same downsides as in PQN. | [ |
Figure 1The urinary metabolite–metabolite associations as indicated by Spearman’s rank correlations adjusted for sex. The down-left triangle shows results for the absolute urinary metabolite concentrations (i.e., no normalization applied) and the top-right triangle for the creatinine normalization (IS-CREA). Two-dimensional hierarchical clustering was applied to organize the IS-CREA heat map to make detailed comparisons of the metabolite–metabolite associations feasible between the different normalization schemes. All heat maps are presented in the same order of metabolites with creatinine added to the last row. The reference metabolite correlations are left blank in their corresponding heat maps. As a by-product of the hierarchical clustering, eight urinary metabolite clusters were identified as numbered on the left and detailed in the Results section. Abbreviations: TMAO, trimethylamine N-oxide; HPHPA, 3-(3-hydroxyphenyl)-3-hydroxypropanoate; 2-PY, N1-Methyl-2-pyridone-5-carboxamide.
Figure 2The urinary metabolite–metabolite associations as indicated by Spearman’s rank correlations adjusted for sex. The down-left triangle shows results for the glucose normalization (IS-GLUC) and the top-right triangle for the urea normalization (IS-UREA). The order of metabolites (with creatinine added to the last row) is the same as in Figure 1 and based on the two-dimensional hierarchical clustering of the IS-CREA heat map. The reference metabolite correlations are left blank in their corresponding heat maps. Abbreviations: TMAO, trimethylamine N-oxide; HPHPA, 3-(3-hydroxyphenyl)-3-hydroxypropanoate; 2-PY, N1-Methyl-2-pyridone-5-carboxamide.
Figure 3The urinary metabolite–metabolite associations as indicated by Spearman’s rank correlations adjusted for sex. The down-left triangle shows results for the pseudouridine normalization (IS-PSEURID) and the top-right triangle for the constant sum normalization (CS). The order of metabolites (with creatinine added to the last row) is the same as in Figure 1 and based on the two-dimensional hierarchical clustering of the IS-CREA heat map. The reference metabolite correlations are left blank in their corresponding heat maps. Abbreviations: TMAO, trimethylamine N-oxide; HPHPA, 3-(3-hydroxyphenyl)-3-hydroxypropanoate; 2-PY, N1-Methyl-2-pyridone-5-carboxamide.
Figure 4The urinary metabolite–metabolite associations as indicated by Spearman’s rank correlations adjusted for sex. The down-left triangle shows results for the probabilistic quotient normalization (PQN) and the top-right triangle for the DESeq2 normalization. The order of metabolites (with creatinine added to the last row) is the same as in Figure 1 and based on the two-dimensional hierarchical clustering of the IS-CREA heat map. Abbreviations: TMAO, trimethylamine N-oxide; HPHPA, 3-(3-hydroxyphenyl)-3-hydroxypropanoate; 2-PY, N1-Methyl-2-pyridone-5-carboxamide.
Figure 5The robust associations (p < 0.001) of the urinary metabolite concentrations with BMI for the various normalization schemes. See Table 1 for the explanation and basis of the normalization methods. Abbreviations: 2-PY, N1-Methyl-2-pyridone-5-carboxamide; 4-HPA, 4-Hydroxyphenylacetate.
Figure 6The robust associations (p < 0.001) of the urinary metabolite concentrations with MAP for the various normalization schemes. See Table 1 for the explanation and basis of the normalization methods.