| Literature DB >> 29556190 |
Daniel Stoessel1,2,3, Claudia Schulte4, Marcia C Teixeira Dos Santos5, Dieter Scheller6, Irene Rebollo-Mesa7, Christian Deuschle4, Dirk Walther2,3, Nicolas Schauer1, Daniela Berg4,8, Andre Nogueira da Costa5, Walter Maetzler4,8.
Abstract
Parkinson's disease (PD) shows high heterogeneity with regard to the underlying molecular pathogenesis involving multiple pathways and mechanisms. Diagnosis is still challenging and rests entirely on clinical features. Thus, there is an urgent need for robust diagnostic biofluid markers. Untargeted metabolomics allows establishing low-molecular compound biomarkers in a wide range of complex diseases by the measurement of various molecular classes in biofluids such as blood plasma, serum, and cerebrospinal fluid (CSF). Here, we applied untargeted high-resolution mass spectrometry to determine plasma and CSF metabolite profiles. We semiquantitatively determined small-molecule levels (≤1.5 kDa) in the plasma and CSF from early PD patients (disease duration 0-4 years; n = 80 and 40, respectively), and sex- and age-matched controls (n = 76 and 38, respectively). We performed statistical analyses utilizing partial least square and random forest analysis with a 70/30 training and testing split approach, leading to the identification of 20 promising plasma and 14 CSF metabolites. These metabolites differentiated the test set with an AUC of 0.8 (plasma) and 0.9 (CSF). Characteristics of the metabolites indicate perturbations in the glycerophospholipid, sphingolipid, and amino acid metabolism in PD, which underscores the high power of metabolomic approaches. Further studies will enable to develop a potential metabolite-based biomarker panel specific for PD.Entities:
Keywords: CSF; biomarker; machine learning; neurodegeneration; plasma; untargeted metabolomics
Year: 2018 PMID: 29556190 PMCID: PMC5844983 DOI: 10.3389/fnagi.2018.00051
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
Demographic and clinical features of patients with Parkinson's disease and controls.
| Males/Females [N] | 54/26 | 48/28 | 24/16 | 25/12 |
| Age [y], median (IQR) | 66 (12) | 65 (17) | 67 (14) | 66 (14) |
| Disease duration [y], median | 3 (2) | / | 3 (1) | / |
| LEDD, median (IQR) | 208 (317) | / | 160 (353) | / |
| HY, median (range) | 2 (1–4) | / | 2 (1–4) | / |
| MMSE, median (IQR) | 29 (2) | 30 (1) | 29 (3) | 30 (1) |
| MoCA, median (IQR) | 27 (4) | 28 (3) | 27 (4) | 28 (3) |
| UPDRS (3), median (IQR) | 21 (13) | 0 (2) | 23 (15) | 0 (1) |
| BDI, median (IQR) | 8 (9) | 2 (4) | 7 (5) | 2 (4) |
PD, Parkinson's disease; IQR, interquartile range (Q3–Q1); LEDD, L-Dopa equivalent daily dose; HY, Hoehn and Yahr scale; MMSE, Mini Mental State Examination; MoCA, Montreal Cognitive Assessment; UPDRS, Unified Parkinson's Disease Rating Scale; BDI, Beck Depression Inventory.
Figure 1General workflow for investigating metabolic profiles of PD patients and controls and respective distribution of metabolites in plasma and CSF. (A) General workflow of data analysis for samples from controls and PD patients. (B) Venn diagram of metabolites detected in respective plasma and CSF sets. (C) Bar chart illustrating the relative count of putatively identified metabolites from each metabolite class, classified according to KEGG, Lipidmaps, and HMDB. In total, 334 metabolites were analyzed in plasma and 302 metabolites analyzed in CSF (for a full list of extracted metabolites, see Supplementary Table 1). The identified metabolite classes (relative count > 5%) included (di-, tri-, tetra-) peptides (18% plasma, 25% CSF), glycerophospholipids (plasma 17%, CSF 12%), sphingolipids (plasma 14%, CSF 7%), amino acids and derivatives (plasma 9%, CSF 10%), fatty acyls (plasma 6%, CSF 10%), and unknowns (8% plasma, 9% CSF; no match to our metabolite database).
Figure 2Partial least square (PLS) component plots and the associated separation of Parkinson's disease (PD) and control status based on plasma and cerebrospinal fluid (CSF) samples. (A) Components and their corresponding area under the curve (AUC; gray: plasma, light blue: CSF). Dashed red line represents the maximum of two components used for model training in both cohorts. (B) Boxplots represent the training AUC of 200 randomly split samples (70% of the entire dataset) using PLS and random forest (RF) analyses. Red dot: Performance of samples selected for model training. (C) PLS score plot of all 334 metabolites identified in the plasma training set. (D) PLS score plot of all 302 metabolites identified in the CSF training cohort.
Figure 3Log2 fold differences for top-ranked plasma and cerebrospinal (CSF) metabolites to differentiate early Parkinson's disease (PD) from controls, determined by the partial least square (PLS) model. Log2 fold differences (PD vs. controls) between values of metabolites retrieved from the plasma (A) and the CSF PLS model (B). Red columns indicate higher values in PD. *P < 0.05 and ** < 0.01 according to univariate Welch's t- or Wilcoxon test (p < 0.05) after false discovery rate (FDR) correction by Benjamini and Hochberg (BH). Error bars indicate the standard error of the metabolite measure intensities.
Detailed information on significantly changed metabolites between controls and PD patients in plasma retrieved from the PLS model.
| 73.08939 @ 7.5 | 3.56E+03 | 3.33E+03 | 18.22 | −0.10 | 0.045 | 12 | 56 | No | 0.62 | 0.53–0.71 | / |
| Ethanolamine | 1.26E+04 | 1.15E+04 | 20.22 | −0.13 | 0.019 | 10 | 35 | No | 0.63 | 0.54–0.73 | 1 |
| 475.22627 @ 6.76 | 3.27E+03 | 3.44E+03 | 15.23 | 0.07 | 0.079 | 13 | 66 | No | 0.58 | 0.48–0.68 | / |
| N-Lauroylglycine | 2.59E+03 | 2.45E+03 | 9.55 | −0.08 | 0.012 | 3 | 47 | No | 0.66 | 0.56–0.75 | 3 |
| Alpha-N-Phenylacetyl-L-glutamine | 2.11E+04 | 2.98E+04 | 85.53 | 0.50 | 0.014 | 14 | 33 | No | 0.65 | 0.56–0.73 | 3 |
| PC(35:6) | 1.98E+04 | 1.52E+04 | 57.06 | −0.38 | 0.012 | 20 | 118 | No | 0.64 | 0.55–0.74 | 2 |
| Sarcosine | 8.01E+05 | 8.36E+05 | 17.06 | 0.06 | 0.174 | 16 | 121 | No | 0.55 | 0.45–0.64 | 1 |
| SM(d30:1) | 6.69E+03 | 6.40E+03 | 28.79 | −0.0624 | 0.397 | 17 | 40 | No | 0.56 | 0.46–0.65 | 2 |
| SM(d32:1) | 1.94E+05 | 1.86E+05 | 22.44 | −0.0574 | 0.29 | 18 | 27 | No | 0.56 | 0.47–0.65 | 2 |
| SM(d39:1) | 2.25E+04 | 2.09E+04 | 29.20 | −0.1101 | 0.128 | 11 | 25 | No | 0.59 | 0.48–0.68 | 2 |
| Glu-Ile | 5.45E+03 | 5.07E+03 | 20.73 | −0.1038 | 0.059 | 7 | 1 | Yes | 0.6 | 0.51–0.69 | 2 |
| 535.24187 @ 5.4 | 4.13E+03 | 3.89E+03 | 12.67 | −0.0853 | 0.014 | 5 | 8 | Yes | 0.64 | 0.55–0.74 | / |
| 186.11894 @ 0.95 | 2.38E+04 | 1.98E+04 | 66.10 | −0.2669 | 0.104 | 15 | 18 | Yes | 0.59 | 0.49–0.69 | / |
| 1,3-Dimethyluracil | 4.08E+03 | 5.42E+03 | 62.62 | 0.40917 | 0.014 | 6 | 9 | Yes | 0.63 | 0.52–0.72 | 3 |
| PC(44:5) | 1.15E+04 | 1.35E+04 | 36.99 | 0.23385 | 0.014 | 8 | 11 | Yes | 0.64 | 0.54–0.73 | 3 |
| PC(44:6) | 7.41E+03 | 8.84E+03 | 33.55 | 0.25355 | 0.012 | 1 | 6 | Yes | 0.66 | 0.58–0.74 | 2 |
| PE(34:1) | 3.52E+03 | 2.82E+03 | 41.11 | −0.3214 | 0.014 | 2 | 10 | Yes | 0.64 | 0.54–0.72 | 3 |
| Arg-Ala | 1.00E+04 | 1.29E+04 | 69.02 | 0.35912 | 0.045 | 4 | 2 | Yes | 0.62 | 0.54–0.71 | 2 |
| Lyso-PAF C-16 | 2.53E+04 | 2.79E+04 | 19.11 | 0.14038 | 0.012 | 9 | 3 | Yes | 0.65 | 0.57–0.73 | 2 |
| 354.92649 @ 0.11 | 4.37E+04 | 4.24E+04 | 6.58 | −0.0435 | 0.04 | 19 | 4 | Yes | 0.63 | 0.53–0.72 | / |
Significant ions detected between controls and Parkinson's disease (PD) patients. Proposed metabolite: proposed metabolite for each ion. If no formula could be calculated the proton corrected masses are reported with their corresponding retention time separated by a “@”. Log2 Fold difference (FD): relative abundance of mean of corresponding ion in PD compared to the mean of controls patients (Co). P value: value for unpaired Welch's t-test or Wilcoxon test FDR adjusted. AUC, Area under the curve; CI, 95% confidence interval; RSD, relative standard deviation.
Detailed information on significantly changed metabolites between controls and PD patients in CSF retrieved from the PLS model.
| Prolyl-Tyrosine | 6.99E+03 | 9.46E+03 | 53.98 | 0.44 | 0.045 | 9 | 30 | No | 0.66 | 0.55–0.79 | 3 |
| Sarcosine | 2.14E+04 | 1.96E+04 | 19.72 | −0.13 | 0.111 | 12 | 51 | No | 0.62 | 0.49–0.75 | 1 |
| Ser-Glu | 7.46E+03 | 8.97E+03 | 39.14 | 0.26 | 0.062 | 10 | 26 | Yes | 0.62 | 0.49–0.74 | 2 |
| 432.31975 @ 6.13 | 6.45E+03 | 5.95E+03 | 16.59 | −0.12 | 0.059 | 4 | 18 | No | 0.65 | 0.52–0.77 | / |
| Leu-Trp-Trp | 6.16E+03 | 5.51E+03 | 14.66 | −0.16 | 0.026 | 1 | 2 | Yes | 0.7 | 0.59–0.82 | 2 |
| Alpha-N-Phenylacetyl-L-glutamine | 2.56E+04 | 3.55E+04 | 56.61 | 0.47 | 0.010 | 3 | 3 | Yes | 0.74 | 0.61–0.85 | 2 |
| Betaine | 4.95E+05 | 4.54E+05 | 15.93 | −0.13 | 0.045 | 8 | 7 | Yes | 0.67 | 0.55–0.79 | 1 |
| 517.24582 @ 6.8 | 7.03E+03 | 7.42E+03 | 10.32 | 0.08 | 0.062 | 5 | 17 | No | 0.65 | 0.52–0.77 | / |
| S-(2-Methylpropionyl)-dihydrolipoamide-E | 7.35E+03 | 5.98E+03 | 37.10 | −0.30 | 0.045 | 7 | 4 | Yes | 0.68 | 0.54–0.80 | 3 |
| 3-ketosphingosine | 6.40E+03 | 6.87E+03 | 17.37 | 0.10 | 0.111 | 11 | 12 | No | 0.62 | 0.48–0.76 | 2 |
| 972.90985 @ 12.31 | 5.21E+03 | 4.91E+03 | 12.33 | −0.09 | 0.075 | 2 | 5 | No | 0.63 | 0.50–0.76 | / |
| (+)-gamma-Hydroxy-L-homoarginine | 6.30E+04 | 6.49E+04 | 5.43 | 0.04 | 0.045 | 6 | 11 | No | 0.67 | 0.54–0.80 | 2 |
| O-Adipoylcarnitine | 5.96E+03 | 5.46E+03 | 25.71 | −0.12 | 0.059 | 13 | 9 | No | 0.65 | 0.53–0.76 | 3 |
| Dimethylglycine | 8.06E+04 | 7.29E+04 | 17.91 | −0.15 | 0.034 | 14 | 16 | No | 0.67 | 0.54–0.78 | 1 |
Significant ions detected between controls and Parkinson's disease (PD) patients. Proposed metabolite: proposed metabolite for each ion. If no formula could be calculated the proton corrected masses are reported with their corresponding retention time separated by a “@”. Log2 Fold difference (FD): relative abundance of mean of corresponding ion in PD compared to the mean of controls patients (Co). P value: value for unpaired Welch's t-test or Wilcoxon test FDR adjusted. AUC, Area under the curve; CI, 95% confidence interval; RSD, relative standard deviation.
Figure 4Representative areas under the curve (AUC) for differentiation of Parkinson's disease from controls by use of plasma and cerebrospinal fluid (CSF), and the partial least square (PLS) and random forest (RF) statistical models The PLS model shows superiority over the RF model for the differentiation of states in both compartments.
Figure 5Pathway analysis of altered metabolites in the plasma and cerebrospinal fluid (CSF) of Parkinson's disease (PD) using MetaboAnalyst. The significantly dysregulated metabolites in PD, identified in plasma samples (N metabolites = 20, A) and CSF (N metabolites = 14, B) were subjected to MetaboAnalyst (http://www.metaboanalyst.ca/) (Xia et al., 2015), to assess associations of respective metabolites to defined pathways. *P < 0.05 after FDR correction. Circle extent (larger) correlates to p-value (lower).