| Literature DB >> 25511422 |
Tommy Öman1, May-Britt Tessem2,3, Tone F Bathen4,5, Helena Bertilsson6,7, Anders Angelsen8,9, Mattias Hedenström10, Trygve Andreassen11.
Abstract
BACKGROUND: Identification of individual components in complex mixtures is an important and sometimes daunting task in several research areas like metabolomics and natural product studies. NMR spectroscopy is an excellent technique for analysis of mixtures of organic compounds and gives a detailed chemical fingerprint of most individual components above the detection limit. For the identification of individual metabolites in metabolomics, correlation or covariance between peaks in (1)H NMR spectra has previously been successfully employed. Similar correlation of 2D (1)H-(13)C Heteronuclear Single Quantum Correlation spectra was recently applied to investigate the structure of heparine. In this paper, we demonstrate how a similar approach can be used to identify metabolites in human biofluids (post-prostatic palpation urine).Entities:
Mesh:
Substances:
Year: 2014 PMID: 25511422 PMCID: PMC4274720 DOI: 10.1186/s12859-014-0413-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Procedure for generating correlation plots. Each spectrum is transformed to a row vector where the chemical shifts for both 1H and 13C are encoded, forming a matrix with dimensions n x K (step 1). By plotting one of these vectors, real signals are easily discerned from noise and an appropriate noise threshold may be selected. Data points are removed from the matrix only when all values in the column (from all HSQC spectra) are lower than the selected threshold. This noise exclusion step results in a final matrix X of a more manageable size that still contains all relevant information (step 2). Any of the rows in X can be transformed to a matrix of the original format and plotted as a noise-free HSQC-spectrum. From this plot, a cross-peak (coordinate) of interest may be selected, corresponding to the column vector v peak (step 3) in X. At this point, X (and v peak) is auto-scaled and a correlation vector c peak is calculated according to equation 2. This vector will contain values between −1 and 1, i.e. correlation coefficients, and can be visualized as a 2D spectrum after re-introducing zeros to the data points omitted in the noise exclusion step, followed by transformation to a matrix with the same dimensions as the original data (step 4). A cutoff for the correlation is then chosen for the visualization, for example 0.9, to only show peaks highly correlated (>0.9) with the chosen peak.
Figure 2Real and constructed H- C HSQC spectra from post-prostatic palpation urine. To the left (a) is one of the 50 recorded 1H-13C HSQC spectra from post-prostatic palpation urine. To the right (b) is a constructed 1H-13C HSQC spectrum, prepared by merging correlation plots from 23 metabolites. Peaks from 7 metabolites with only one 1H-13C HSQC cross-peak are also included. TMAO appears broad due to phase distortion.
Identified metabolites from post-prostatic palpation urine
|
|
|
|
|
|---|---|---|---|
| Trigonelline | 9.12 / 148.4 | 0.9 | 5 / 5 |
| Hippuric acid | 7.82 / 129.6 | 0.9 | 4 / 4 |
| Indoxyl sulphate | 7.69 / 119.9 | 0.8 | 4 / 5 |
| Phenylacetylglutamine* | 7.41 / 131.5 | 0.8 | 8 / 8 |
|
| 6.58 / 133.5 | 0.74 | 3 / 2 |
| Levoglucosan | 5.45 / 104.0 | 0.883 | 9 / 7 |
| Carnitine** | 4.56 / 66.8 | 0.7 | 4 / 4 |
| Creatine | 3.92 / 56.5 | 0.9 | 2 / 2 |
| Mannitol | 3.80 / 72.0 | 0.9 | 4 / 4 |
| Erythritol | 3.69 / 74.9 | 0.867 | 3 / 3 |
| Galactitol | 3.66 / 72.6 | 0.65 | 3 / 3 |
| Glycine | 3.56 / 44.2 | 0.8 | 1 / 1 |
| Taurine | 3.44 / 38.0 | 0.9 | 2 / 2 |
| 4-Hydroxyphenylacetic acid | 3.44 / 46.1 | 0.9 | 3 / 3 |
| Methanol | 3.36 / 51.6 | 0.8 | 1 / 1 |
| 1-Methyluric acid | 3.28 / 30.3 | 0.8 | 1 / 1 |
| Betaine | 3.26 / 55.8 | 0.8 | 2 / 2 |
| TMAO | 3.26 / 62.0 | 0.8 | 1 / 1 |
| Ethanolamine | 3.14 / 44.2 | 0.85 | 2 / 2 |
| Isocitric acid | 2.98 / 51.6 | 0.8476 | 5 / 4 |
| Dimethylamine | 2.72 / 37.4 | 0.8 | 1 / 1 |
| Citric acid | 2.54 / 48.1 | 0.899 | 2 / 2 |
| Succinic acid | 2.40 / 36.7 | 0.8 | 1 / 1 |
| Glutamine | 2.14 / 29.0 | 0.8 | 3 / 3 |
| Acetic acid | 1.92 / 26.1 | 0.8 | 1 / 1 |
| Spermine | 1.81 / 25.4 | 0.9 | 5 / 5 |
| Lysine | 1.70 / 29.0 | 0.7 | 5 / 6 |
| Adipic acid | 1.54 / 28.3 | 0.83 | 2 / 2 |
| 3-Hydroxyisovaleric acid | 1.26 / 30.6 | 0.8 | 2 / 2 |
| 3-Aminoisobutanoic acid | 1.20 / 17.6 | 0.8 | 4 / 4 |
*NMR data not reported in HMDB. Correlating 1H signals compatible with literature values [19].
**HSQC data not available in HMDB, compared to data from YMDB [20].
Figure 3HSQC correlation plot of phenylacetylglutamine. Correlation plot showing all data points correlating strongly with 7.41 / 131.5 ppm (1H / 13C) (correlation coefficient higher than 0.8). Correlation to long-range cross-peak is circled. Two cross-peaks are marked with “7” due to diastereotopic protons in this position.