Victor C Espinoza Bernal1, Shivayogi V Hiremath1,2, Bethany Wolf3, Brooke Riley3, Rochelle J Mendonca4, Michelle J Johnson5,6. 1. Department of Health and Rehabilitation Sciences, Temple University, Philadelphia, PA, USA. 2. Director of Personal Health Informatics & Rehabilitation Engineering Laboratory, Temple University, Philadelphia, PA, USA. 3. Friends of the Redeemer United, St Elizabeth, Jamaica. 4. Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA. 5. Department of Physical Medicine and Rehabilitation, University of Pennsylvania, Philadelphia, PA, USA. 6. Director of Rehabilitation Robotics Lab, University of Pennsylvania, Philadelphia, PA, USA.
Abstract
INTRODUCTION: Stroke is the leading cause of disability worldwide. It has been well-documented that rehabilitation (rehab) therapy can aid in regaining health and function for individuals with stroke. Yet, tracking in-home rehab continues to be a challenge because of a lack of resources and population-scale demands. In order to address this gap, we implemented a methodology to classify and track rehab interventions in individuals with stroke. METHODS: We developed personalized classification algorithms, including neural network-based algorithms, to classify four rehab exercises performed by two individuals with stroke who were part of a week-long therapy camp in Jamaica, a low- and middle-income country. Accelerometry-based wearable sensors were placed on each upper and lower limb to collect movement data during therapy. RESULTS: The classification accuracy for traditional and neural network-based algorithms utilizing feature data (e.g., number of peaks) from the sensors ranged from 64 to 94%, respectively. In addition, the study proposes a new method to assess change in bilateral mobility over the camp duration. CONCLUSION: The results of this pilot study indicate that personalized supervised learning algorithms can be used to classify and track rehab activities and functional outcomes in resource limited settings such as LMICs.
INTRODUCTION: Stroke is the leading cause of disability worldwide. It has been well-documented that rehabilitation (rehab) therapy can aid in regaining health and function for individuals with stroke. Yet, tracking in-home rehab continues to be a challenge because of a lack of resources and population-scale demands. In order to address this gap, we implemented a methodology to classify and track rehab interventions in individuals with stroke. METHODS: We developed personalized classification algorithms, including neural network-based algorithms, to classify four rehab exercises performed by two individuals with stroke who were part of a week-long therapy camp in Jamaica, a low- and middle-income country. Accelerometry-based wearable sensors were placed on each upper and lower limb to collect movement data during therapy. RESULTS: The classification accuracy for traditional and neural network-based algorithms utilizing feature data (e.g., number of peaks) from the sensors ranged from 64 to 94%, respectively. In addition, the study proposes a new method to assess change in bilateral mobility over the camp duration. CONCLUSION: The results of this pilot study indicate that personalized supervised learning algorithms can be used to classify and track rehab activities and functional outcomes in resource limited settings such as LMICs.
Stroke is a leading cause of long-term disability for survivors.
Functional limitations, such as mobility impairments,[2,3] language deficits,[4,5] and paralysis, greatly reduce the quality of life.[1,6] Inpatient and outpatient rehabilitation (rehab) improves the health and function of individuals with stroke. However, availability of rehab services for individuals with stroke in low- and middle-income countries (LMICs) may be limited due to lack of resources, healthcare facilities being present mainly in larger cities, transportation barriers, and large population-scale demand.
One of the approaches to address this need may be to use cost-effective wearable technology, which has the potential to assist clinicians in tracking and providing rehab interventions in the community.[8-10] However, a major limitation that needs to be addressed is the limited functional improvement at home due to low and varying patient adherence (20–35% overall) with prescribed exercises.[11-14] Research indicates that a technology coach, which supports patients around-the-clock, may assist patients with adherence.[15-17]The applications of machine-learning algorithms are not new to stroke-related research.[18-21] Prior research by Manini et al. used a support vector machine (SVM) classifier algorithm to distinguish between gait in elderly patients and patients with stroke and Huntington’s disease.
The SVM classier algorithm used a combination of time and frequency domain features with group-specific hidden Markov models (HMMs) to classify 90.5% of patients to the right group in a laboratory setting. Furthermore, the research suggested that investigating left-right symmetry through HMM can yield more accurate results given that gait is cyclical in nature. Xu et al. used a combined approach of using Dynamic Time Warping (DTW) algorithm and Naïve Bayes (NB) classifier to detect and classify walking compared to other activities.
According to their results, the combined method outperformed each technique individually in terms of accuracy. While the hybrid approach achieved a higher accuracy (97.3%), the DTW and NB methods achieved 93.4% and 93.7%, respectively. In another study, Roy et al. used a combination of neural networks and a neuro-fuzzy inference technique to classify activities of daily living in people with stroke.
The classification algorithms used data from surface electromyography and accelerometer sensors to classify 11 activities with an average accuracy of 95.0%. Roy et al. highlight that multi-layered neural networks were more efficient than single-layered ones.
Leveraging prior research, we are proposing the development of personalized machine-learning algorithms for individuals with stroke. Personalized machine-learning algorithms are defined as algorithms developed for each individual to better assess their limb movement in the presence of biomechanical variations due to physical impairments. These new techniques have the potential to improve classification of rehab interventions in individuals with stroke.The primary objective of this pilot study was to develop and evaluate personalized machine-learning algorithms to detect and track rehab exercises for people with stroke. The personalized machine-learning algorithms developed included SVM, random forests (RFs), artificial neural networks (ANN) using stochastic gradient descent (SGD), and ANN using Adaptive moment parameter optimization. A secondary objective was to use the detected activities to assess change in bilateral mobility for upper and lower limbs during rehab exercises over time. We assessed bilateral upper or lower limb mobility of an individual with stroke by developing a new method called SymMetric, which provides a personalized value that captures change in mobility of paretic (affected) side as compared to non-paretic (non-affected) side of a person.
Materials and methods
Protocol
This study was conducted in Jamaica, an LMIC in the Caribbean, with a population of approximately 2.7 million people and a stroke prevalence 74.3 per 100,000.
Participants signed a written consent form prior to participating in the study. The study was considered a case study and approved by the Jamaican Government’s Institutional Review Board. This pilot study included two cases studies, which comprised of data collected from two participants with stroke. Both participants participated in an intense and structured stroke rehab program involving upper and lower extremity functional activities for 7 h/day for a week in Jamaica.Group mat activities included therapeutic exercises that focused on preventing contractures, improving postural control, building capacity in large muscle groups, and neuromuscular re-education. Furthermore, the rehab exercises were provided at four treatment stations that focused on a distinct aspect of motor function. The gait station included a custom body weight–supported treadmill training and over-ground gait training. Over-ground gait training was conducted both indoors and outdoors on a variety of surfaces with emphasis on normal gait kinematics. In the balance station, lower level patients worked on static and dynamic sitting balance, transfer training and static standing with manual assistance or a harness. Higher level patients worked on postural control during dynamic standing balance activities. In the upper extremity station, patients with lower level arm function focused on neuromuscular re-education with neuromuscular electrical stimulation and upper extremity exercises. Higher level patients worked on capacity building active exercises, purposeful movement with and without neuromuscular electrical stimulation, and gross and fine motor activities. In the free station, patients and therapists were able to focus on a deficit or functional activity that required more attention and repetition. In addition to the intervention stations, patients also participated in group activities that consisted of relay races, group volleyball, musical chairs, and dance contests. These activities made participation in rehabilitation fun and engaging.Figure 1 shows an individual with stroke participating in a gait training exercise during the rehab program. The analysis in this study was based on data collected by wearable sensors, placed on participant’s triceps and ankles bilaterally (Figure 1), for four of the 7 days when the participants performed rehab exercises in a semi-structured environment. Table 1 shows the four rehab exercises that participants performed, and their corresponding classification labels assigned. Over-ground gait training was conducted both indoors and outdoors on a variety of surfaces with emphasis on normal gait kinematics. On the first day, a clinician annotated the start and end times of each activity trial during data collection, which was used as the reference for developing and testing of the classification algorithms. This led to 180 min of annotated data for each participant (45 min per rehab exercise) for the first day. Due to the approximately equal duration of the rehab interventions, we obtained a balanced dataset for each activity. Additionally, the duration of data collected for the other 3 days was 600 and 690 min, respectively, for participants 1 and 2.
Figure 1.
A participant with stroke participating in a gait exercise. Sensors are highlighted by rounded rectangles on the upper arm and ankles.
Table 1.
Exercises performed during the rehab program and their corresponding labels for the study.
Activity
Label
Gait training
0
Balance training
1
Upper extremity therapeutic exercise
2
Overall function task training
3
A participant with stroke participating in a gait exercise. Sensors are highlighted by rounded rectangles on the upper arm and ankles.Exercises performed during the rehab program and their corresponding labels for the study.
Instrumentation
The wearable device consists of a three axis accelerometer, a galvanic skin response sensor, a heat flux sensor, and a skin temperature sensor. For the purposes of this study, we only analyzed feature data extracted from the three axis accelerometer. In this study, we chose a tri-axial accelerometer-based sensor as it is embedded in a wearable device which is easy to use, and captures high-fidelity movement data of the person wearing it.[24,25] Additionally, gyroscope-based sensors consume several orders of magnitude more power than accelerometers.
Some of the disadvantages of using a wearable sensor include wearing the sensor and recharging the device on a regular basis.
The wearable device used developed a model, based on static and dynamic calibration tests, to measure three axis acceleration due to gravity and body movements. Static calibration tests provide a reference measurement of acceleration values for the tri-axial accelerometer.
The static test was conducted by the manufacturer through a precision inspection table using gravity as the reference. Dynamic calibrations tests provide the performance of the sensor through a range of frequencies and gravitational loads.
The dynamic calibration tests were conducted by the manufacturer through a vibration test system (i.e., a “shake table”). The device then removed the acceleration component due to gravity to store acceleration due to body movements. The three axis accelerometer sampled movement data at 32 Hz with a range of ±39.2 m/s2. Furthermore, the device used a low-pass moving average filter of 32 samples to obtain smoothed acceleration data in three axes, which were used to estimate feature data every minute. SenseWear software, developed by Bodymedia Inc (Pittsburgh, PA, USA), was used to retrieve the average acceleration and feature data recorded every minute from the wearable sensors for the duration of the study (4 days) (Table 2). The feature data comprised of the average (representing overall limb motion), the mean absolute difference (representing variability of limb motion), and the number of peaks (representing turning points of limb motion) in transverse, forward, and longitudinal directions. Overall, we obtained 36 features from four sensors (three features per axis for three axes accelerometer) that were normalized based on the maximum value for each feature. Principal component analysis was used to better visualize the 36 features. The visualization indicated that the rehab activities were not easily separable on three key principal components obtained from the PCA. Therefore, we chose to use all 36 features to develop and evaluate the classification algorithms.
Table 2.
Feature data retrieved from wearable sensors for transverse, forward, and longitudinal directions.
Feature
Description
Average
Average acceleration (m/s2)
MAD
Mean absolute deviation for acceleration (m/s2)
Peaks
Number of peaks per minute for acceleration (peaks/min)
Feature data retrieved from wearable sensors for transverse, forward, and longitudinal directions.
Data processing
Each participant’s annotated data were divided into two sets: training (80%) and validation (20%). While the training data were used to develop classification models, the validation data were used to gauge the model’s performance on data not used for model development. In addition, a five-fold cross-validation procedure was used during the model development on the training data. The algorithms were developed on a computer with Windows 10 Professional Operating System (System properties: Intel(R) Core(TM) i7-8850 central processing unit at 1.8 Giga Hertz; Random Access Memory: 16 gigabytes; 64-bit x 64-based processor). All classification algorithms’ performance were evaluated in terms of classification accuracy. Additionally, the ANN classification algorithm’s performance was evaluated in terms of accuracy and cross-entropy loss. The performance of the best classifier algorithm was based on a Kruskal–Wallis test, due to the non-normal distribution of accuracy, for the four classification algorithms over 100 iterations. Statistical analyses were performed using IBM SPSS Statistics software (ver. 25.0, Armonk, NY), with a statistical significance value set at an alpha level of 0.05. The following sections highlight the classification algorithms developed and the evaluation of change in bilateral mobility for upper and lower limbs during rehab over time.
Classification
We used TensorFlow
to develop two types of ANN-based classification algorithms including ANN with SGD optimization and ANN with Adaptive moment parameter optimization to classify the rehab exercises. Additionally, we used Scikit-learn
to develop SVMs and RF classification algorithms. The SVM classification algorithm used a linear Kernel, with a penalty parameter of 80, to transform the feature data into a higher dimension. The transformed data was then classified into rehab exercises via hyperplanes. On similar lines, an RF algorithm comprising of multiple decision trees was developed. Within each decision tree, there are binary choices where the tree splits into branches. The quality of each split was quantified by the Gini index, a generalization of binomial variance. The parameters we used for the RF algorithm included nine nodes with a minimum of two splits by internal node (maximum depth), 90 decision trees, and the Gini Index as the chosen impurity function. The final classification was a result of the majority of votes amongst all decision trees.Our ANN model development included two sub processes. First was forward propagation, which made a prediction based on the input data and the model’s current weight values. Second was backpropagation, which entailed updating the weight values within the model towards an error minimum. The ANN model’s architecture had one input layer (normalized feature data), multiple hidden layers, and an output layer. The output layer consisted of four nodes corresponding to the four possible outcomes in our categorical classification problem. We used one-hot encoded classification, which mapped our targets and model’s prediction to a binary vector containing four binary elements in which the only element set to one corresponds to one of the possible outcomes. Equation (1) shows the mathematical operation of a neuron, where
is an element of an input vector x,
is the ith weight value of a layer containing i neurons and b is a constant or bias. Equation (2) shows the function
which was the rectified linear unit or activation function. Once the input traveled through all the hidden layers, the output nodes were activated by the softmax function (Equation (3)) in order to produce a distribution amongst the possible outcomes.
is the dimensionality of input vector
. The weight values were adjusted using two optimization techniques independently, SGD and Adaptive moment parameter optimization (Adam). Figure 2 illustrates the process we used to develop an ANN algorithm with Adam for this study.
In more detail, SGD is an optimization technique used to reduce the model’s classification error based on the network’s gradient, which is the derivative of the model’s prediction error with respect to its current weights. In our work, the ANN with SGD model’s error was quantified by binary cross-entropy loss function; a function which uses the model’s prediction (
) and actual (
) correct values (Equation (4)). Once the gradient,
, was calculated (Equation (5)), the weight values in the model were iteratively optimized to minimize the error while using a constant hyper-parameter
(Equation (6)). Each iteration of the forward and backward propagation comprised an epoch.
In addition to SGD, which is highly dependent on robust weight initialization and slow to reach loss convergence,
we have developed and evaluated ANN with Adam optimization. Adam is an augmented version of SGD which utilizes the mean (first moment) and variance (second moment) of the gradient in order to update the weights (Equation (7)). The
and
are the estimated moments which were initialized to zero (at t = 0) and computed over several iterations (Equations (8) and (9)). The
and
are coefficients to compensate for the zero initialization of the momentum and velocity. Adam optimizes faster than SGD by adapting the learning rate and using sparse features to obtain a faster convergence rate. In order to compensate for the bias towards zero, we computed bias as corrected estimates following Kingma and Ba’s model.
Equations (10) and (11) represent this adjustment. The ANN algorithms were trained for 100 epochs (iterations of forward and backward propagation). Lastly, we recorded the algorithm’s performance with all possible sensor permutations for each participant.
Figure 2.
Activity Classification using ANN with Adam optimization.
Activity Classification using ANN with Adam optimization.
The SymMetric method
Post-classification, we designed a new approach called SymMetric method which can quantify the relative limb functionality for the affected side as compared to the non-affected side in an individual with stroke. Prior research by Lang et al. compared the overall bilateral mobility of upper extremities with the ratio of mobility from affected and non-affected sides.
The resulting two-dimensional plot shows an individual’s upper limb mobility,
but its clinical application is challenging due to personal interpretation. On the other hand, we combined both the frequency and magnitude of movement information from the affected and non-affected sides of an individual to a single value, which can be used to compare within-participant performance over days and activities.The process we followed included using magnitude ratio and bilateral magnitude, two variables which were previously proposed.
Magnitude ratio is calculated as the natural logarithm (ln) of the vector magnitude of the affected limb divided by the magnitude of the non-affected limb vector magnitude (Equation (12)). Bilateral magnitude is calculated as the sum of the magnitude vectors of both limbs (Equation (13)). Figure 3 shows the bilateral magnitude and magnitude ratio for affected and non-affected sides of participant 1 with respect to the center for upper extremity therapeutic exercises during day 1.
Figure 3.
Affected and non-affected halves of a bivariate histogram. Bilateral magnitude (y-axis) versus magnitude ratio (x-axis) for affected (left side: 0 to −1) and non-affected (right side: 0 to 1) sides of participant 1 for upper extremity therapeutic exercises during Day 1.
Affected and non-affected halves of a bivariate histogram. Bilateral magnitude (y-axis) versus magnitude ratio (x-axis) for affected (left side: 0 to −1) and non-affected (right side: 0 to 1) sides of participant 1 for upper extremity therapeutic exercises during Day 1.First, we calculated the pair-wise difference of non-affected (A) and affected side (B) of mobility. Second, we calculated the average value by dividing the sum of difference with the number of pairs that had either one of both values for non-affected or affected sides, which provided us with the amount of symmetry present in an individual’s bilateral upper or lower extremity movement. We then applied a hyperbolic tangent function to the average difference, which provided us with a single SymMetric value ranging from −1 to +1 (Equation (14)). This step provided us with three points of reference to interpret the symmetrical limb movement (Figure 4). The points of reference included: 1 for non-affected limb movement, 0 for symmetrical limb movement, and −1 for affected limb movement.
Figure 4.
The SymMetric method shows a single value (y-axis) for average difference in frequencies of limb mobility for non-affected (A) and affected (B) sides (x-axis).
The SymMetric method shows a single value (y-axis) for average difference in frequencies of limb mobility for non-affected (A) and affected (B) sides (x-axis).In an individual with hemiparesis, we expect to see more activity on the non-affected side, which indicates that the SymMetric value will be positive rather than a negative number. Possible limitation of this approach is that the hyperbolic tangent function converges if the absolute value of the weighted average becomes greater than three.
This multi-angular analysis allows us to assess the mobility and change in mobility of the participant during a given activity over time.
Results
This pilot study included two male participants with stroke. Participant 1 had a stroke 31 months before the stroke camp, was 47 years old, and had left hemiplegia, which resulted in a severe upper limb impairment and moderate/mild lower limb impairment. Participant 2 had a stroke 3 months prior to the stroke camp, was 64 years old, and had right hemiplegia, which resulted in moderate upper limb impairment, and severe/moderate lower limb impairment. Table 3 shows the clinical assessments at the beginning and end of the rehab program. Although Participant 1 had lower scores for most clinical assessments, he was faster at the Box and Block test which led to a higher score on the right side. On the other hand, Participant 2 was slower at the Box and Block test but was able to perform the test with both of his upper extremities.
Table 3.
Clinical assessments of participants at the beginning (Pre) and end (Post) of the rehab program.
Test
Participant 1
Participant 2
Pre
Post
Pre
Post
Box and blocks test (R – Right; L - left)
R: 59; L: 0
R: 65; L: 0
R: 24; L: 34
R: 24; L: 36
Fugl-Meyer (upper extremity motor; out of 66)
11
13
28
38
Berg balance test (out of 56)
26
27
51
53
6 minute walk test (feet)
300
400
790
1205
Timed up and go (sec)
32.3
29.2
24.6
11.4
10 meter walk test (sec)
23.8
21.4
16.4
8.6
Mobility aid
Quad cane
Single point cane
Clinical assessments of participants at the beginning (Pre) and end (Post) of the rehab program.The average (SD) time it took to train SVM, RF, ANN with Adam, and ANN with SGD algorithms for 100 iterations (20*5-fold cross-validation) were 0.01 s (s) (0.00 s), 0.22 s (0.20 s), 21.80 s (20.64 s), and 22.60 s (20.10 s). These training times indicated that ANN-based algorithms were computationally more intensive than SVM and RF algorithms. Figures 5 and 6 shows the classification performance for four different personalized algorithms over 20 iterations of five-fold cross-validation for both participants. The highest median accuracy as well as the best performing interquartile range was achieved by the ANN with Adam optimization. Based on the median performance and statistical comparison of the classification algorithms (Table 4), we chose to develop and evaluate personalized ANN algorithms with Adam optimization (here onwards ANN with Adam) for a varying number of sensors used for classifying four rehab activities. Both cross-entropy loss and accuracy metrics were averaged for all possible permutations when using less than four sensors. These results are displayed in Table 5. The results show that an increase in the number of sensors led to an increase in the average accuracy and a decrease in average cross-entropy loss for training and validation data.
Figure 5.
Performance of four classification algorithms for participant 1. The median (interquartile range) accuracy values obtained from 100 iterations (20*5-fold cross-validation) indicate that the ANN with Adam algorithm performed consistently well.
Figure 6.
Performance of four classification algorithms for participant 2. The median (interquartile range) accuracy values obtained from 100 iterations (20*5-fold cross-validation) indicate that the ANN with Adam algorithm performed consistently well.
Table 4.
Classification performance and comparison between each of the four personalized algorithms for participants 1 and 2.
Participant 1
Participant 2
Classification algorithms accuracy (median (IQR))
SVM
0.74 (0.09)
0.71 (0.09)
RF
0.78 (0.12)
0.75 (0.10)
ANN with Adam
0.78 (0.09)
0.80 (0.07)
ANN with SGD
0.75 (0.11)
0.75 (0.11)
Algorithms compared (p-value)
SVM–RF
p < 0.001a
p < 0.001
SVM–ANN with Adam
p < 0.001a
p < 0.001a
SVM – ANN with SGD
p = 0.241
p = 0.016a
RF–ANN with Adam
p = 0.202
p < 0.001a
RF–ANN with SGD
p = 0.018a
p = 0.240
ANN with Adam–ANN with SGD
p < 0.001a
p < 0.001a
aindicates significantly different
Table 5.
Average classification performance of ANN with Adam algorithm for participants 1 and 2.
Average loss
Average accuracy
# of sensors used
Train
Validation
Train
Validation
Participant 1
4
0.20
0.59
0.94
0.82
3
0.30
0.79
0.89
0.68
2
0.71
0.92
0.73
0.63
1
0.85
1.08
0.64
0.45
Participant 2
4
0.26
0.53
0.88
0.83
3
0.31
0.72
0.89
0.76
2
0.41
1.04
0.85
0.68
1
0.77
0.92
0.70
0.63
Performance of four classification algorithms for participant 1. The median (interquartile range) accuracy values obtained from 100 iterations (20*5-fold cross-validation) indicate that the ANN with Adam algorithm performed consistently well.Performance of four classification algorithms for participant 2. The median (interquartile range) accuracy values obtained from 100 iterations (20*5-fold cross-validation) indicate that the ANN with Adam algorithm performed consistently well.Classification performance and comparison between each of the four personalized algorithms for participants 1 and 2.aindicates significantly differentAverage classification performance of ANN with Adam algorithm for participants 1 and 2.The ANN with Adam algorithm was then used to predict activity classification for three remaining days of rehab intervention. Figures 7 (participant 1) and 8 (participant 2) shows training data, which was collected during day 1, and the predicted rehab exercises for the testing data based on personalized models for the remaining 3 days of the intervention.
Figure 7.
Rehab exercise predictions for participant 1. The x and y axes are time duration in minutes and rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. While Day 1’s predictions are based on the annotated data (highlighted with a dashed rounded rectangle box in orange), the rehab exercises predicted for Days 2, 3, and 4 are based on the personalized classification models developed from Day 1.
Figure 8.
Rehab exercise predictions for participant 2. The x and y axes are time duration in minutes and rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. While Day 1’s predictions are based on the annotated data (highlighted with a dashed rounded rectangle box in orange), the rehab exercises predicted for Days 2, 3, and 4 are based on the personalized classification models developed from Day 1.
Rehab exercise predictions for participant 1. The x and y axes are time duration in minutes and rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. While Day 1’s predictions are based on the annotated data (highlighted with a dashed rounded rectangle box in orange), the rehab exercises predicted for Days 2, 3, and 4 are based on the personalized classification models developed from Day 1.Rehab exercise predictions for participant 2. The x and y axes are time duration in minutes and rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. While Day 1’s predictions are based on the annotated data (highlighted with a dashed rounded rectangle box in orange), the rehab exercises predicted for Days 2, 3, and 4 are based on the personalized classification models developed from Day 1.In order to visualize mobility differences across types of rehab exercises and time, we plotted the values computed by our new SymMetric method for upper extremity movement. Figures 9 and 10 show the SymMetric plots for Days 1–4 for both participants. The personalized model developed from Day 1 predicted the rehab activities for the remaining days (Days 2, 3, and 4). The closer the SymMetric value to zero, the more symmetric the movement of the participant’s limbs. Overall, the plots show that rehab exercises helped: (1) improve bilateral mobility over time, (2) initially improve and then revert to the original value, and (3) initially worsen and then improve bilateral mobility.
Figure 9.
Bilateral mobility for participant 1 during the rehab program. The x and y axes are SymMetric bilateral movement and days of rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. The plot shows that the change in bilateral upper limb movement over days for each participant varies based on their functional abilities. The closer the SymMetric value to zero, the more symmetric the movement of the participant’s limbs. For example, in this participant we observed that: (1) upper extremity therapeutic exercises led to a symmetric use of bilateral upper limbs over time (Day 2 to Day 4); (2) balance training improved from Day 2 to Day 3, but went back to a value similar to Day 2; (3) overall function task showed a decrease in bilateral mobility from Day 2 to Day 3, and improved on Day 4; and (4) gait training showed a decrease in bilateral mobility from Day 2 to Day 3, with no prediction or value for Day 4 as the rehab therapy probably focused more on the other rehab activities.
Figure 10.
Bilateral mobility for participant 2 during the rehab program. The x and y axes are SymMetric bilateral movement and days of rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. The plot shows that the change in bilateral upper limb movement over days for each participant varies based on their functional abilities.
Bilateral mobility for participant 1 during the rehab program. The x and y axes are SymMetric bilateral movement and days of rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. The plot shows that the change in bilateral upper limb movement over days for each participant varies based on their functional abilities. The closer the SymMetric value to zero, the more symmetric the movement of the participant’s limbs. For example, in this participant we observed that: (1) upper extremity therapeutic exercises led to a symmetric use of bilateral upper limbs over time (Day 2 to Day 4); (2) balance training improved from Day 2 to Day 3, but went back to a value similar to Day 2; (3) overall function task showed a decrease in bilateral mobility from Day 2 to Day 3, and improved on Day 4; and (4) gait training showed a decrease in bilateral mobility from Day 2 to Day 3, with no prediction or value for Day 4 as the rehab therapy probably focused more on the other rehab activities.Bilateral mobility for participant 2 during the rehab program. The x and y axes are SymMetric bilateral movement and days of rehab exercises, respectively. Classes 0, 1, 2, and 3 represent gait training, balance training, upper extremity therapeutic exercises, and overall function task training, respectively. The plot shows that the change in bilateral upper limb movement over days for each participant varies based on their functional abilities.
Discussion
In contrast to in-clinic rehab, recommendations for in-home exercises or therapy indicate that functional improvements experienced in a clinic do not continue at home, and in some cases, there is even a functional decline. To assist with tracking participants’ rehab exercises in the community, we have (1) developed and evaluated personalized machine-learning algorithms to detect and track rehab exercises for people with stroke, and (2) developed a new methodology that tracks personalized rehab activities and estimates the change in bilateral mobility for upper and lower limbs during rehab exercises over time.
Personalized machine-learning algorithms
The results of this pilot study indicate that ANN models with Adam optimization achieved greater than 80% classification accuracy with low cross-entropy loss (0.6) for both participants (Table 5). In addition, the interquartile range from the 100 iterations (20 times of a 5-fold cross-validation, Figures 5 and 6) indicated other models achieved a similar range of classification accuracy. While the SVM classification algorithm achieved the lowest accuracy for both participants, the ANN with Adam achieved the highest accuracy. Furthermore, the classification accuracy increased with the number of sensors. In contrast, the model’s loss rate increased as the number of sensors decreased.The personalized algorithms were then used to assess which activities were performed by the individuals on the remaining days. Figures 7 and 8 show the rehab exercises that were predicted throughout the 4-day intervention of the rehab therapy program for both participants. The model performs well on the first-day data since it mainly consists of labeled data, while some patterns of the 45-min intervals can be observed on the second and third day. The predictions on last day data show small amount of gait-related rehab exercises as the therapy exercises focused more on a combination of balance training and upper extremity therapeutic exercise.
Change in bilateral mobility
Furthermore, the results of this pilot study indicate that the SymMetric method proposed by our group has potential to track changes in bilateral mobility over time and across various types of rehab exercises. Previous work by Lang et al.
suggested that a person without a hemiplegia will have a symmetric heat map indicating bilateral mobility. On the other hand, an individual with hemiplegia will have a pattern that is skewed to the non-affected side. While our current study was limited to two participants with stroke, the results suggested that SymMetric method can be used to quantify an individual with stroke’s upper or lower extremity movement into a single value. In addition, the method focused on tracking change of affected limb’s mobility compared to the non-affected side within rehab exercises and the intense rehab therapy program over time. Clinicians can use this quantitative measure to identify rehab activities that lead to symmetric limb movement for each individual leading to a personalized treatment plan.The bilateral mobility scores for specific rehab activities and days indicates the need for personalized algorithms. This is further supported by the clinical measure differences (Fugl-Meyer scores) between participants. The clinical measures suggest that participant 2 would need less assistance than participant 1 in performing the rehab exercises and would likely demonstrate more independence with functional mobility in the community as well. Future research should evaluate how bilateral mobility during rehab exercises promoted improved daily use of the extremities in the home and the community. Furthermore, while both participants had upper extremity motor function improvement (Fugl-Meyer scores), only participant 2 showed an increase in mobility for the affected arm (Box and Blocks test). These results suggest that the clinical measures used in this study were not sensitive to small bilateral changes. Future studies should use clinical measures such as the Upper Extremity Motor Activity Log that capture both the amount and quality of arm use in individuals with stroke.
In addition, there is a need for developing clinical measures that capture change in bilateral mobility over time and relate it to kinematic information obtained from wearable devices.
Limitations
A limitation of this pilot study includes developing personalized algorithms and evaluating SymMetric method in just two individuals with stroke. Based on the results of this work, we will conduct a human subject research study in the community with a much larger sample size, as determined by power analysis. Furthermore, evaluating these models in a larger sample of participants will allow researchers to study variation in models’ parameters for different subjects. While the current study indicated the performance of the personalized algorithms by presenting accuracy and loss entropy (ANN algorithm), future studies should assess specificity and sensitivity of the methodology in a large sample of participants. Another limitation of the study was that the clinicians were able to annotate the rehab activities and sensor data for the first day only due to lack of time and lack of staff resources during the rehab camp. Future studies should annotate the rehab activities and sensors data for multiple days, which would allow researchers to develop and train classification models for some of these days (training data) and gauge the model’s performance on the other days not used for training (validation data).
Conclusions
Our work represents an attempt to automate the detection and quantification of change in limb mobility during rehab therapy for individuals with stroke. Personalized machine-learning algorithms combined with the SymMetric method for data obtained from wearable sensors has the potential to track rehab exercises in individuals with stroke in a semi-structured environment. Functional differences between participants with stroke can be captured through personalized algorithms, which may lead to tailored rehab exercises that promote bilateral and symmetric mobility. The work confirms that it is applicable to resource limited settings in high-income countries and to LMICs.
Authors: Pamela W Duncan; Richard Zorowitz; Barbara Bates; John Y Choi; Jonathan J Glasberg; Glenn D Graham; Richard C Katz; Kerri Lamberty; Dean Reker Journal: Stroke Date: 2005-09 Impact factor: 7.914