| Literature DB >> 34822120 |
Christoph M Kanzler1,2, Ilse Lamers3,4, Peter Feys3, Roger Gassert5, Olivier Lambercy5.
Abstract
Predicting upper limb neurorehabilitation outcomes in persons with multiple sclerosis (pwMS) is essential to optimize therapy allocation. Previous research identified population-level predictors through linear models and clinical data. This work explores the feasibility of predicting individual neurorehabilitation outcomes using machine learning, clinical data, and digital health metrics. Machine learning models were trained on clinical data and digital health metrics recorded pre-intervention in 11 pwMS. The dependent variables indicated whether pwMS considerably improved across the intervention, as defined by the Action Research Arm Test (ARAT), Box and Block Test (BBT), or Nine Hole Peg Test (NHPT). Improvements in ARAT or BBT could be accurately predicted (88% and 83% accuracy) using only patient master data. Improvements in NHPT could be predicted with moderate accuracy (73%) and required knowledge about sensorimotor impairments. Assessing these with digital health metrics over clinical scales increased accuracy by 10%. Non-linear models improved accuracy for the BBT (+ 9%), but not for the ARAT (-1%) and NHPT (-2%). This work demonstrates the feasibility of predicting upper limb neurorehabilitation outcomes in pwMS, which justifies the development of more representative prediction models in the future. Digital health metrics improved the prediction of changes in hand control, thereby underlining their advanced sensitivity. Graphical Abstract This work explores the feasibility of predicting individual neurorehabilitation outcomes using machine learning, clinical data, and digital health metrics.Entities:
Keywords: Assessment; Digital biomarkers; Neurorehabilitation; Prognostic factors; Upper limb
Mesh:
Year: 2021 PMID: 34822120 PMCID: PMC8724183 DOI: 10.1007/s11517-021-02467-y
Source DB: PubMed Journal: Med Biol Eng Comput ISSN: 0140-0118 Impact factor: 2.602
Fig. 1Approach for prediction of neurorehabilitation outcomes in persons with multiple sclerosis. Eleven persons with multiple sclerosis were assessed before and after eight weeks of neurorehabilitation. Multiple linear and non-linear machine learning models were trained on different feature sets with data collected before the intervention. This included information from conventional clinical assessments about activity limitations and impairments, clinical routine data, and digital health metrics collected with the Virtual Peg Insertion Test (VPIT). The dependent variable of the models defined whether a considerable improvement in activity limitations occurred across the intervention or not. The quality and generalizability of the models were evaluated in a leave-one-subject-out cross-validation
Clinical information on persons with multiple sclerosis
| ID | Time | Side | Age | Sex | MS type | Chronicity | Interv. | EDSS | ARAT | BBT | NHPT |
|---|---|---|---|---|---|---|---|---|---|---|---|
| yrs | yrs | 0–10 | 0–57 | 1/min | s | ||||||
| 01 | Pre | Left | 52 | F | RR | 29 | 1 | 7 | 37 | 22 | 45.25 |
| 01 | Post | Left | 52 | F | RR | 29 | 1 | – | 49 | 33 | 41.14 |
| 01 | Pre | Right | 52 | F | RR | 29 | 1 | 7 | 47 | 31 | 24.75 |
| 01 | Post | Right | 52 | F | RR | 29 | 1 | – | 56 | 47 | 23.43 |
| 02 | Pre | Right | 69 | M | PP | 19 | 1 | 7.5 | 44 | 20 | 140.27 |
| 02 | Post | Right | 69 | M | PP | 19 | 1 | – | 41 | 26 | 216.4 |
| 03 | Pre | Left | 25 | F | RR | 6 | 2 | 6 | 52 | 45 | 29.35 |
| 03 | Post | Left | 25 | F | RR | 6 | 2 | – | 57 | 57 | 23.78 |
| 03 | Pre | Right | 25 | F | RR | 6 | 2 | 6 | 53 | 43 | 29.62 |
| 03 | Post | Right | 25 | F | RR | 6 | 2 | – | 55 | 52 | 23.81 |
| 04 | Pre | Left | 42 | F | RR | 1 | 1 | 4 | 56 | 39 | 27.81 |
| 04 | Post | Left | 42 | F | RR | 1 | 1 | – | 56 | 57 | 23.52 |
| 04 | Pre | Right | 42 | F | RR | 1 | 1 | 4 | 54 | 40 | 20.48 |
| 04 | Post | Right | 42 | F | RR | 1 | 1 | – | 55 | 65 | 20.92 |
| 05 | Pre | Left | 56 | F | SP | 10 | 2 | 7 | 49 | 38 | 33.72 |
| 05 | Post | Left | 56 | F | SP | 10 | 2 | – | 54 | 44 | 35.3 |
| 05 | Pre | Right | 56 | F | SP | 10 | 2 | 7 | 29 | 25 | 89.79 |
| 05 | Post | Right | 56 | F | SP | 10 | 2 | – | 40 | 28 | 70.24 |
| 06 | Pre | Left | 65 | M | SP | 19 | 2 | 8 | 52 | 34 | 39.9 |
| 06 | Post | Left | 65 | M | SP | 19 | 2 | – | 54 | 34 | 44.13 |
| 07 | Pre | Left | 63 | F | RR | 8 | 2 | 4.5 | 57 | 60 | 20.84 |
| 07 | Post | Left | 63 | F | RR | 8 | 2 | – | 57 | 49 | 22.28 |
| 07 | Pre | Right | 63 | F | RR | 8 | 2 | 4.5 | 54 | 47 | 35.04 |
| 07 | Post | Right | 63 | F | RR | 8 | 2 | – | 55 | 45 | 27.3 |
| 08 | Pre | Left | 76 | F | RR | 38 | 1 | 5 | 43 | 42 | 27.01 |
| 08 | Post | Left | 76 | F | RR | 38 | 1 | – | 54 | 56 | 24.61 |
| 08 | Pre | Right | 76 | F | RR | 38 | 1 | 5 | 34 | 43 | 34.46 |
| 08 | Post | Right | 76 | F | RR | 38 | 1 | – | 54 | 49 | 23.46 |
| 09 | Pre | Left | 60 | M | PP | 21 | 1 | 7 | 52 | 44 | 31.48 |
| 09 | Post | Left | 60 | M | PP | 21 | 1 | – | 54 | 49 | 35.66 |
| 09 | Pre | Right | 60 | M | PP | 21 | 1 | 7 | 53 | 51 | 25.29 |
| 09 | Post | Right | 60 | M | PP | 21 | 1 | – | 55 | 48 | 41.93 |
| 10 | Pre | Left | 46 | M | PP | 11 | 2 | 5.5 | 55 | 32 | 30.58 |
| 10 | Post | Left | 46 | M | PP | 11 | 2 | – | 56 | 43 | 39.08 |
| 10 | Pre | Right | 46 | M | PP | 11 | 2 | 5.5 | 56 | 35 | 23.23 |
| 10 | Post | Right | 46 | M | PP | 11 | 2 | – | 56 | 53 | 20.83 |
| 11 | Pre | Left | 70 | F | RR | 37 | 3 | 6 | 53 | 45 | 29.86 |
| 11 | Post | Left | 70 | F | RR | 37 | 3 | – | 55 | 39 | 35.28 |
| 11 | Pre | Right | 70 | F | RR | 37 | 3 | 6 | 45 | 42 | 53.21 |
| 11 | Post | Right | 70 | F | RR | 37 | 3 | – | 52 | 41 | 46.19 |
Subject 2 was defined as a unexpected non-responder, as he had the strongest activity limitations at admission, but did not respond positively to neurorehabilitation. ID: participant identifier. F: female. M: male. Intervention (interv.) group: task-oriented high intensity (1), task-oriented low intensity (2), control (3). RR: relapse remitting. PP: primary progressive. SP: secondary progressive. EDSS: Expanded Disability Status Scale. NHPT: Nine Hole Peg Test. BBT: Box and Block Test. ARAT: Action Research Arm Test. VPIT: Virtual Peg Insertion Test
Predicting intervention outcomes using data collected pre-intervention and a k-nearest neighbor model
| Machine learning: k-nearest neighbor model | ||||||
|---|---|---|---|---|---|---|
| Feature sets | All participants | Unexpected non-responder | ||||
| Outcome prediction for | Outcome prediction for | |||||
| ARAT | BBT | NHPT | ARAT | BBT | NHPT | |
| Balanced accuracy (%) | Correct (yes/no) | |||||
| 1 | 55 | 63 | 43 | y | y | y |
| 2 | 49 | 28 | 54 | n | n | y |
| 3 | 52 | 37 | 43 | y | y | y |
| 4 | 43 | 53 | 40 | n | n | y |
| 5 | 77 | 66 | y | y | y | |
| 6 | 80 | 63 | 64 | n | y | n |
| 1, 2 | 43 | y | y | y | ||
| 1, 3 | 60 | 25 | 50 | y | y | y |
| 1, 4 | 55 | 46 | 63 | y | y | y |
| 1, 5 | 69 | 55 | 64 | y | y | y |
| 1, 6 | 93 | 59 | 33 | n | y | n |
| 1, 2, 3 | 71 | 35 | 50 | y | y | y |
| 1, 4, 6 | 68 | 42 | 48 | n | y | n |
| 1, 5, 6 | 85 | 60 | 68 | n | y | y |
| 1, 4, 5, 6 | 64 | 69 | 61 | n | y | n |
| 1, 2, 3, 4, 5, 6 | 68 | 59 | 64 | n | y | y |
Multiple machine learning models were trained using different feature sets (independent variables, 1–6). The training label indicated whether a considerable change across intervention was observed in a specific conventional score (dependent variable; ARAT, BBT, or NHPT). The models were evaluated in a leave-one-out cross-validation and specifically tested for one individual with strong activity limitations who did not show improvements across neurorehabilitation (referred to as unexpected non-responder). Feature set nomenclature: (1) patient master data (ms type, chronicity, age, sex); (2) intervention group; (3) disability (EDSS, disability group); (4) conventional scales of body functions (motricity index, static fatigue index, monofilament index, symbol digit modality test, Fahn’s tremor rating scale); (5) digital health metrics of sensorimotor impairments (ten VPIT metrics); (6) Conv. scale of activity (ARAT, NHPT, BBT). The best performing (accuracy and unexpected non-responder) models relying on the least amount of features are highlighted in bold for each conventional scale. ARAT: Action Research Arm Test. BBT: Box and Block Test. NHPT: Nine Hole Peg Test. VPIT: Virtual Peg Insertion Test
Predicting intervention outcomes in ARAT, BBT, and NHPT using data collected pre-intervention — detailed performance of best performing models
| Best performing models and feature sets | ||||||
|---|---|---|---|---|---|---|
| Model | Feature set | Balanced accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | |
| ARAT | Linear regression | 1, 6 | 89 | 100 | 79 | 67 |
| BBT | Decision tree | 1 | 83 | 67 | 100 | 100 |
| NHPT | Linear regression | 5 | 73 | 67 | 79 | 57 |
Best performing models were selected according to the balanced accuracy and their ability to correctly identify the unexpected non-responder. Feature set nomenclature: (1) patient master data (ms type, chronicity, age, sex); (2) intervention group; (3) disability (EDSS, disability group); (4) conventional scales of body functions (motricity index, static fatigue index, monofilament index, symbol digit modality test, Fahn’s tremor rating scale); (5) digital health metrics of sensorimotor impairments (ten VPIT metrics); (6) conventional scale of activity (ARAT, NHPT, BBT)