| Literature DB >> 30089873 |
Daniel Cañueto1, Reza M Salek2, Xavier Correig3,4, Nicolau Cañellas5,6.
Abstract
NMR spectroscopy is a technology that is widely used in metabolomic studies. The information that these studies most commonly use from NMR spectra is the metabolite concentration. However, as well as concentration, pH and ionic strength information are also made available by the chemical shift of metabolite signals. This information is typically not used even though it can enhance sample discrimination, since many conditions show pH or ionic imbalance. Here, we demonstrate how chemical shift information can be used to improve the quality of the discrimination between case and control samples in three public datasets of different human matrices. In two of these datasets, chemical shift information helped to provide an AUROC value higher than 0.9 during sample classification. In the other dataset, the chemical shift also showed discriminant potential (AUROC 0.831). These results are consistent with the pH imbalance characteristic of the condition studied in the datasets. In addition, we show that this signal misalignment dependent on sample class can alter the results of fingerprinting approaches in the three datasets. Our results show that it is possible to use chemical shift information to enhance the diagnostic and predictive properties of NMR.Entities:
Mesh:
Year: 2018 PMID: 30089873 PMCID: PMC6082897 DOI: 10.1038/s41598-018-30351-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Exploratory PCA analysis shows the potential of the chemical shift data in the classificaton models. The first PCs of the PCA using chemical shifts (right) show better separation than the ones using concentrations (left). Plots also suggest no batch effects necessary to monitor.
Chemical shift information shows discriminative potential in the MTBLS1 dataset.
| Both sets of information | Concentration information | Chemical shift information | |
|---|---|---|---|
| Accuracy | 0.929 | 0.933 | 0.795 |
| kappa | 0.840 | 0.849 | 0.559 |
| AUROC | 0.980 | 0.979 | 0.831 |
However, it cannot enhance the excellent results given by concentration information during random forest classification.
Adding chemical shift information to concentration information improved the classification between the five different kinds of sample in the MTBLS237 dataset.
| Both sets of information | Concentration information | Chemical shift information | |
|---|---|---|---|
|
| |||
| Accuracy | 0.863 | 0.826 | 0.876 |
| kappa | 0.635 | 0.555 | 0.698 |
| AUROC | 0.870 | 0.811 | 0.917 |
|
| |||
| Accuracy | 0.801 | 0.808 | 0.721 |
| kappa | 0.505 | 0.526 | 0.331 |
| AUROC | 0.768 | 0.777 | 0.661 |
|
| |||
| Accuracy | 0.730 | 0.717 | 0.668 |
| kappa | 0.462 | 0.438 | 0.339 |
| AUROC | 0.768 | 0.743 | 0.682 |
|
| |||
| Accuracy | 0.808 | 0.771 | 0.797 |
| kappa | 0.617 | 0.545 | 0.594 |
| AUROC | 0.870 | 0.810 | 0.841 |
|
| |||
| Accuracy | 0.890 | 0.860 | 0.882 |
| kappa | 0.773 | 0.714 | 0.762 |
| AUROC | 0.948 | 0.914 | 0.926 |
|
| |||
| Accuracy | 0.867 | 0.861 | 0.790 |
| kappa | 0.719 | 0.707 | 0.556 |
| AUROC | 0.921 | 0.916 | 0.839 |
|
| |||
| Accuracy | 0.882 | 0.804 | 0.892 |
| kappa | 0.753 | 0.596 | 0.775 |
| AUROC | 0.926 | 0.823 | 0.943 |
|
| |||
| Accuracy | 0.806 | 0.787 | 0.782 |
| kappa | 0.589 | 0.550 | 0.551 |
| AUROC | 0.854 | 0.825 | 0.81 |
Several quality indicators of the models generated with only concentration information, only chemical shift information and both sources of information are shown for the eight comparisons between the five subclasses (control, active UC, inactive UC, active CD, inactive CD).
Adding chemical shift information to concentration information provides the best classification of samples in the MTBLS374 dataset.
| Both sets of information | Concentration information | Chemical shift information | |
|---|---|---|---|
| Accuracy | 0.899 | 0.806 | 0.883 |
| kappa | 0.797 | 0.614 | 0.766 |
| AUROC | 0.950 | 0.856 | 0.937 |
Several quality indicators of the models generated only with concentration information, only with chemical shift information and with both sources of information are shown.
Figure 2Signals can be misaligned in some sample classes. Low pH mediated by the condition studied increases the chemical shift of the signals. The resulting class-dependent signal misalignment can distort the results of the analysis of fingerprint data: features can show significant differences caused by differences in chemical shift (mediated by pH or ionic strength) rather than by differences in metabolite concentration.