| Literature DB >> 35269934 |
Youssef El Khoury1, Marie Gebelin1, Jérôme de Sèze2,3, Christine Patte-Mensah2, Gilles Marcou4, Alexandre Varnek4, Ayikoé-Guy Mensah-Nyagan2, Petra Hellwig1, Nicolas Collongues2,3,5.
Abstract
Neuromyelitis optica spectrum disorder (NMOSD) and multiple sclerosis (MS) are both autoimmune inflammatory and demyelinating diseases of the central nervous system. NMOSD is a highly disabling disease and rapid introduction of the appropriate treatment at the acute phase is crucial to prevent sequelae. Specific criteria were established in 2015 and provide keys to distinguish NMOSD and MS. One of the most reliable criteria for NMOSD diagnosis is detection in patient's serum of an antibody that attacks the water channel aquaporin-4 (AQP-4). Another target in NMOSD is myelin oligodendrocyte glycoprotein (MOG), delineating a new spectrum of diseases called MOG-associated diseases. Lastly, patients with NMOSD can be negative for both AQP-4 and MOG antibodies. At disease onset, NMOSD symptoms are very similar to MS symptoms from a clinical and radiological perspective. Thus, at first episode, given the urgency of starting the anti-inflammatory treatment, there is an unmet need to differentiate NMOSD subtypes from MS. Here, we used Fourier transform infrared spectroscopy in combination with a machine learning algorithm with the aim of distinguishing the infrared signatures of sera of a first episode of NMOSD from those of a first episode of relapsing-remitting MS, as well as from those of healthy subjects and patients with chronic inflammatory demyelinating polyneuropathy. Our results showed that NMOSD patients were distinguished from MS patients and healthy subjects with a sensitivity of 100% and a specificity of 100%. We also discuss the distinction between the different NMOSD serostatuses. The coupling of infrared spectroscopy of sera to machine learning is a promising cost-effective, rapid and reliable differential diagnosis tool capable of helping to gain valuable time in patients' treatment.Entities:
Keywords: diagnosis; infrared spectroscopy; machine learning; multiple sclerosis; neuromyelitis optica spectrum disorder
Mesh:
Substances:
Year: 2022 PMID: 35269934 PMCID: PMC8911153 DOI: 10.3390/ijms23052791
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Infrared data of the 235 serum samples included in this study. (A) Normalized Fourier-transform infrared (FTIR) spectra of all serum samples in the 3000–2800 and 1800–700 cm−1 spectral ranges. (B) Second derivatives of the spectra of panel A. Healthy controls (black), neuromyelitis optica spectrum disorder (NMOSD) (red), relapsing-remitting multiple sclerosis (RRMS) (blue) and peripheral neuropathies (NEUR) (green). The groups of spectra and derivatives are off-set for clarity.
Performances (confusion matrix, receiver operating characteristic curve (ROC AUC), sensitivity, specificity, and precision) of the random forest model based on second derivatives of the FTIR spectra of serum samples discriminating HC, NMOSD, RRMS and NEUR. The first row concerns two-fold cross-validation on the training set (70 HC, 54 NMOSD, 54 RRMS and 30 NEUR). The values in parentheses correspond to a ten-iteration internal validation of the model (see Materials and Methods for details). The grey rows record the performances on the test set (10 HC, 6 NMOSD, 6 RRMS and 5 NEUR). True positives are in bold and false negatives/false positives are in italics.
| Pathology | Classified as | ROC AUC | Sensitivity | Specificity | Precision | ||||
|---|---|---|---|---|---|---|---|---|---|
| HC | NMOSD | RRMS | NEUR | ||||||
| 2-fold cross-validation | HC |
|
| 0 | 0 | 99.8 | 97.1 | 98.6 | 97.1 |
| NMOSD |
|
| 0 | 0 | 99.6 | 98.1 | 98.7 | 96.4 | |
| RRMS | 0 | 0 |
| 0 | 100 | 100 | 100 | 100 | |
| NEUR |
| 0 | 0 |
| 100 | 100 | 100 | 100 | |
| Validation set | HC |
| 0 | 0 | 0 | 100 | 100 | 100 | 100 |
| NMOSD | 0 |
| 0 | 0 | 100 | 100 | 100 | 100 | |
| RRMS | 0 | 0 |
| 0 | 100 | 100 | 100 | 100 | |
| NEUR | 0 | 0 | 0 |
| 100 | 100 | 100 | 100 | |
Figure 2Scatterplot of the top 20 nodes with the highest recurrence in the random forest model of Table 1 and their corresponding percentage of separation rate.
Figure 3Infrared data of NMOSD sera samples. (A) Normalized FTIR spectra of the sera samples of all NMOSD patients in the 3000–2800 and 1800–700 cm−1 spectral ranges. (B) Second derivatives of the spectra of panel A. Double negative (DN) (dark red), AQP-4-Ab-positive (AQP-4) (violet), and Myelin oligodendrocyte glycoprotein-Ab-positive (MOG) (light red). The groups of spectra and derivatives are off-set for clarity.
Performances (confusion matrix, ROC AUC, sensitivity, specificity, and precision) of the random forest model based on second derivatives of FTIR spectra of the NMOSD serum samples discriminating DN, AQP-4 and MOG. The first row concerns two-fold cross-validation on the training set (18 DN, 18 AQP-4 and 18 MOG). The values in parentheses correspond to a ten-iteration internal validation of the model (see Materials and Methods for details). The grey rows record the performances on the test set (2 DN, 2 AQP-4 and 2 MOG). True positives are in bold and false negatives/false positives are in italics.
| Pathology | Classified as | ROC AUC | Sensitivity | Specificity | Precision | |||
|---|---|---|---|---|---|---|---|---|
| DN | MOG | AQP-4 | ||||||
| 2-fold cross-validation | DN |
|
|
| 61.4 | 55.6 | 66.7 | 45.5 |
| MOG |
|
|
| 58.6 | 38.9 | 72.2 | 41.2 | |
| AQP-4 |
|
|
| 57.9 | 27.8 | 72.2 | 33.3 | |
| Validation set | DN |
|
| 0 | 75 | 50 | 75 | 50 |
| MOG | 0 |
| 0 | 100 | 100 | 5 | 66.7 | |
| AQP-4 |
| 0 | 1 | 87.5 | 50 | 0 | 100 | |
Figure 4Scatterplot of the top 20 nodes with the highest recurrence in the random forest model of Table 2 and their corresponding percentage of separation rate.
Performances (confusion matrix, ROC AUC, sensitivity, specificity, and precision) of random forest model based on second derivatives of the FTIR spectra of the RRMS serum samples vs. DN NMOSD alone, vs. MOG NMOSD alone and vs. AQP-4 alone. The first row concerns two-fold cross-validation on the training set (18 DN, 18 AQP-4 and 18 MOG). The values in parentheses correspond to a ten-iteration internal validation of the model (see Materials and Methods for details). The grey rows record the performances on the test set (6 RRMS, 2 DN, 2 MOG and 2 AQP-4). True positives are in bold and false negatives/false positives are in italics.
| Pathology | Classified as | ROC AUC | Sensitivity | Specificity | Precision | ||
|---|---|---|---|---|---|---|---|
| RRMS | DN | ||||||
| 2-fold cross-validation | RRMS |
| 0 | 100 | 100 | 100 | 100 |
| DN | 0 |
| 100 | 100 | 100 | 100 | |
| Validation set | RRMS |
| 0 | 100 | 100 | 100 | 100 |
| DN | 0 |
| 100 | 100 | 100 | 100 | |
|
|
|
|
|
|
| ||
|
|
| ||||||
| 2-fold cross-validation | RRMS |
| 0 | 100 | 100 | 99.4 | 98.2 |
| MOG |
|
| 100 | 99.4 | 100 | 100 | |
| Validation set | RRMS |
| 0 | 100 | 100 | 100 | 100 |
| MOG | 0 |
| 100 | 100 | 100 | 100 | |
|
|
|
|
|
|
| ||
|
|
| ||||||
| 2-fold cross-validation | RRMS |
|
| 99.4 | 100 | 83.3 | 94.7 |
| AQP-4 |
|
| 99.4 | 83.3 | 100 | 100 | |
| Validation set | RRMS |
| 0 | 100 | 100 | 100 | 100 |
| AQP-4 | 0 |
| 100 | 100 | 100 | 100 | |
Figure 5Workflow diagram showing the successive steps towards the distinction of the sera samples of the patients.