| Literature DB >> 33263261 |
Ryan Ramanujam1, Feng Zhu2, Katharina Fink3, Virginija Danylaitė Karrenbauer4, Johannes Lorscheider5, Pascal Benkert6, Elaine Kingwell2, Helen Tremlett2, Jan Hillert4, Ali Manouchehrinia7.
Abstract
BACKGROUND: The absence of reliable imaging or biological markers of phenotype transition in multiple sclerosis (MS) makes assignment of current phenotype status difficult.Entities:
Keywords: Multiple sclerosis; classification; decision tree; secondary progressive
Year: 2020 PMID: 33263261 PMCID: PMC8227440 DOI: 10.1177/1352458520975323
Source DB: PubMed Journal: Mult Scler ISSN: 1352-4585 Impact factor: 6.312
Accuracy of classifiers of SPMS, including sensitivity, specificity, positive predictive value (PPV ) and negative predictive value (NPV).
| Classifier (and cohort) |
| Accuracy (%) | Sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) |
|---|---|---|---|---|---|---|
| Decision trees (Swedish cohort) | 14,387 | 89.3 | 93.7 | 79.9 | 90.1 | 85.4 |
| Decision trees (Canadian cohort validation) | 5,431 | 82.0 | 89.8 | 71.4 | 81.2 | 83.5 |
| MSBase algorithm (Swedish cohort) | 14,387 | 77.8 | 76.6 | 85.5 | 97.2 | 35.9 |
| Logistic regression (Swedish cohort) | 14,387 | 89.3 | 94.0 | 79.2 | 90.1 | 86.0 |
| Random forest (Swedish cohort) | 14,387 | 89.3 | 93.6 | 80.1 | 91.0 | 85.4 |
| Support vector machine (Swedish cohort) | 14,387 | 88.6 | 93.6 | 77.7 | 90.0 | 85.0 |
| Neurologists (averaged, Swedish cohort)
| 100 | 84.3 | 92.8 | 53.2 | 88.0 | 66.7 |
RR assigned as positive class.
Average accuracy of three neurologists examining full records of 100 patients. The decision tree model classified 85 of these 100 correctly by comparison.
Characteristics of the Swedish and Canadian cohorts used to build and externally validate the decision tree classifier.
| Swedish cohort | Canadian cohort | |||
|---|---|---|---|---|
| Remained in the relapsing–remitting phase at
the most recent clinic visit ( | Reached secondary progressive phase at the most
recent clinic visit ( | All ( | All ( | |
| Age at the most recent clinic visit mean (SD) (years) | 44.0 (11.5) | 58.4 (9.7) | 48.6 (12.9) | 47.0 (11.3) |
| Sex (female%) | 7056 (71.8%) | 3211 (70.5%) | 10,267 (71.4%) | 4432 (74.0%) |
| Multiple sclerosis symptom onset age mean (SD) (years) | 32.1 (10.0) | 33.0 (10.5) | 32.4 (10.2) | 31.6 (9.6) |
| Disease duration at the most recent clinic visit (years) (median [IQR]) | 10.0 [5.0–17.0] | 25.0 [17.0–33.0] | 14.0 [7.0–23.0] | 14.0 [7.0–22.0] |
| Most recent EDSS score (median [IQR]) | 1.5 [1.0–2.5] | 6.5 [4.5–7.5] | 2.5 [1.0–5.0] | 3.5 [1.5–5.5] |
EDSS: expanded disability status scale, IQR: interquartile range, SD: standard deviation.
Full decision tree model and corresponding terminal node probabilities of SPMS.
| SPMS probability | Classification | EDSS | Age (years) |
|---|---|---|---|
| 0.04 | RR | <3 | Any |
| 0.18 | RR | 3 or 3.5 or 4 | <56 |
| 0.38 | RR | 4.5 or 5 or 5.5 or 6 | <45 |
| 0.39 | RR | 3 or 3.5 | 56–64 |
| 0.48 | RR | 3 | ⩾64 |
| 0.53 | SP | 4 | 56–64 |
| 0.61 | SP | 3.5 or 4 | ⩾64 |
| 0.76 | SP | 4.5 or 5 or 5.5 or 6 | ⩾45 |
| 0.93 | SP | >6 | Any |
EDSS: expanded disability status scale, SP: secondary progressive, SPMS: secondary progressive multiple sclerosis. RR: relapsing–remitting.
Figure 1.Pruned decision tree classifier based on a MS patient’s age and EDSS score. Terminal nodes indicate the number of individuals and the bar length indicates the probability of SPMS.
Figure 2.Decision boundaries of the decision tree relative to the EDSS score and the age at the latest assessment.
Figure 3.Variable importance plot generated in the decision tree indicating the relative importance of the two predictor variables.
Characteristics of patients misclassified by the decision tree classifier and MSBase algorithm.
| Clinically assigned RR in the Swedish cohort | Clinically assigned SP in the Swedish cohort | |||||
|---|---|---|---|---|---|---|
| Clinically assigned RR (reference
phenotype) ( | Misclassified to SP | Clinically assigned SP (reference
phenotype) ( | Misclassified to RR | |||
| Decision tree classifier ( | MSBase algorithm ( | Decision tree classifier ( | MSBase algorithm ( | |||
|
| 44.0 (11.5) | 55.7 (9.7) | 50.1 (11.3) | 58.4 (9.6) | 53.5 (10.4) | 58.0 (9.8) |
|
| 7056 (71.8%) | 457 (73.5%) | 196 (70.5%) | 3211 (70.5%) | 660 (72.1%) | 2069 (70.8%) |
|
| 32.1 (10.0) | 37.4 (11.5) | 33.8 (11.1) | 33.0 (10.5) | 33.8 (10.6) | 33.4 (10.6) |
|
| 10.0 [5.0–17.0] | 17.0 [10.0, 25.0] | 14.0 [9.0, 21.0] | 25.0 [17.0–33.0] | 18.0 [12.0, 25.5] | 23.0 [16.0, 32.0] |
|
| 1.5 [1.0–2.5] | 5.5 [4.5, 6.5] | 5.0 [4.0, 6.0] | 6.5 [4.5–7.5] | 3.0 [2.0, 3.5] | 6.0 [3.5, 7.0] |
EDSS: expanded disability status scale, IQR: interquartile range, SD: standard deviation. RR: relapsing–remitting. SP: secondary progressive.
Figure 4.Kaplan–Meier estimate and 95% confidence intervals (colored bands) of the various models based on age (left) and time from MS onset (right) in years at transition to SPMS (total RR-onset population n = 13,712).