| Literature DB >> 35974117 |
Kelley M Swanberg1,2, Abhinav V Kurada3, Hetty Prinsen4, Christoph Juchem3,4,5,6.
Abstract
Multiple sclerosis (MS) is a heterogeneous autoimmune disease for which diagnosis continues to rely on subjective clinical judgment over a battery of tests. Proton magnetic resonance spectroscopy (1H MRS) enables the noninvasive in vivo detection of multiple small-molecule metabolites and is therefore in principle a promising means of gathering information sufficient for multiple sclerosis diagnosis and subtype classification. Here we show that supervised classification using 1H-MRS-visible normal-appearing frontal cortex small-molecule metabolites alone can indeed differentiate individuals with progressive MS from control (held-out validation sensitivity 79% and specificity 68%), as well as between relapsing and progressive MS phenotypes (held-out validation sensitivity 84% and specificity 74%). Post hoc assessment demonstrated the disproportionate contributions of glutamate and glutamine to identifying MS status and phenotype, respectively. Our finding establishes 1H MRS as a viable means of characterizing progressive multiple sclerosis disease status and paves the way for continued refinement of this method as an auxiliary or mainstay of multiple sclerosis diagnostics.Entities:
Mesh:
Year: 2022 PMID: 35974117 PMCID: PMC9381573 DOI: 10.1038/s41598-022-17741-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Frontal cortex metabolite profiling by proton magnetic resonance spectroscopy in individuals with relapsing–remitting, progressive, and no multiple sclerosis. Seven metabolites were measured in a single 27-cc cubic voxel in the prefrontal cortex of individuals with relapsing–remitting (N = 25), progressive (N = 19), or no (N = 24) multiple sclerosis. Individual spectra shown in color; group averaged spectra shown in black. Signals from five compounds, including total N-acetyl aspartate (tNAA: N-acetyl aspartate plus N-acetyl aspartylglutamate), total choline (tCho: choline, phosphocholine, and glycerophosphocholine), myoinositol (mIns), glutamate (Glu), and glutamine (Gln), were obtained via macromolecule-suppressed STimulated Echo Acquisition Mode (STEAM; echo time T 10 ms, repetition time T 3 s, mixing time T 50 ms, inversion time T 300 ms); glutathione (GSH) and GABA were estimated by J-difference editing (JDE) on a semi-Localization by Adiabatic SElective Refocusing sequence (MEGA-sLASER; T 72 ms, T 3 s). Total creatine (tCr) served as a concentration reference for these seven metabolite features, which were calculated in millimolar (mM) assuming 10 mM tCr; additional metabolites listed in italics, including aspartate (Asp), ascorbate (Asc), phosphorylethanolamine (PE), glycine (Gly), taurine (Taur), scylloinositol (sIns), GSH and GABA from STEAM, and glutamate, glutamine, and/or NAA in J-difference-edited spectra, were employed in spectral data quantification for one or more experiments but not included in the machine-learning feature set. RR-MS relapsing–remitting multiple sclerosis, P-MS progressive multiple sclerosis, HC healthy control, ppm parts per million, N number of cases, N number of features.
Model hyperparameters optimized against average training cross-validation accuracy.
aScaling optimized over default model-specific hyperparameters.
σ feature variance, N number of features, RBF radial basis function, QDA quadratic discriminant analysis, KNN K-nearest neighbors, SVM support vector machines.
Figure 2Training, optimization, and validation pipeline for multivariate classifiers of multiple sclerosis state and type by frontal cortex metabolite profiles. Classification pipelines constituted two binary decisions: The first (A) established the presence of multiple sclerosis, while the second (D) characterized the phenotype. Models distinguishing from control relapsing–remitting multiple sclerosis (B) and progressive multiple sclerosis (C) were also implemented for context in interpreting performance differences between (A) and (D). One data set at a time was held out for use as the validation set, yielding a total of N1 × 2 runs for data with smaller group size N1. Undersampling proceeded similarly for an inner cross-validation loop used for hyperparameter optimization and feature selection, in that each of the training cases was removed once for use as a cross-validation set for that training run, with a total of N2 × 2 cross-validations for data with the smaller training group size N2 to ensure class balancing in the population of cross-validation sets used for each training run. Class sizes were then balanced using the Synthetic Minority Over-Sampling Technique (SMOTE). Each training set was scaled using a matrix calculated over that training set only, after which the same scaling was transferred to the held-out validation case. Dimensionality reduction was not performed in order to avoid degrading model interpretability. Classification algorithms tested included support vector machines, K-nearest neighbors, quadratic discriminant analysis, random forests, gradient-boosted decision trees, extremely randomized trees, and logistic regression; only the first three were selected for reporting based on performance separating progressive and relapsing–remitting multiple sclerosis. LOOCV leave-one-out cross-validation.
Figure 3Confusion matrices for top-performing pipeline in each binary classification. Quadratic discriminant analysis (QDA) proved to be the top-performing classification algorithm by area under the receiver operating characteristic for all four questions, with higher validation sensitivity and specificity for classification of progressive patients than relapsing–remitting or all multiple sclerosis patients taken together, despite a smaller group size (progressive N = 19 versus relapsing–remitting N = 25 or control N = 24). LOOCV leave-one-out cross-validation.
Figure 4Receiver operating characteristic (ROC) curves for classification cross-validation and validation performance by question. Quadratic discriminant analysis (QDA) proved to be the top-performing classification algorithm by area under the receiver operating characteristic for all four questions. While it is visually clear that metabolite-based classifications between controls and multiple sclerosis patients, or between controls and relapsing–remitting multiple sclerosis patients, exhibited performance only marginally better than chance, ROC curves exhibited substantially increased convexity for questions involving the classification of progressive multiple sclerosis patients specifically, also reflected in higher sensitivity and specificities as shown in Fig. 3. Multiple sclerosis groups defined as positives in all plots; for the relapsing–remitting versus progressive question a positive is defined as relapsing–remitting multiple sclerosis. LOOCV: leave-one-out cross-validation.
Binary classifier performance by feature-selected algorithm and participant groups.
All statistics reported as means ± standard deviations.
Models with held-out validation set AUC > 0.75 are in bold.
aNo features eliminated for relapsing versus progressive.
CV cross-validation, Sn sensitivity, Sp specificity, R recall, P precision, Acc. Accuracy, AUC area under receiver operating characteristic (ROC) or AUROC, QDA quadratic discriminant analysis, KNN K-nearest neighbors, SVM support vector machines.
Effect of metabolite selection on binary classifier leave-one-out cross-validation (LOOCV) performance.
Grey indicates no feature selection implemented due to resultant reductions in model cross validation accuracy.
aFeature elimination performed in this algorithm despite no substantial improvement in CV accuracy due to appreciable increase in AUROC.
PI permutation importance, S.D. standard deviation, Sn sensitivity, Sp specificity, Acc. Accuracy, AUC area under receiver operating characteristic (ROC) or AUROC, QDA quadratic discriminant analysis, KNN K-nearest neighbors, SVM support vector machines, mIns myoinositol, Glu glutamate, tCho total choline, Gln glutamine, tNAA total N-acetyl aspartate, GSH glutathione.