| Literature DB >> 34276971 |
Xinyi Wang1,2, Saurabh Garg1, Son N Tran1, Quan Bai1, Jane Alty2.
Abstract
With the increasing prevalence of neurodegenerative diseases, including Parkinson's disease, hand tremor detection has become a popular research topic because it helps with the diagnosis and tracking of disease progression. Conventional hand tremor detection algorithms involved wearable sensors. A non-invasive hand tremor detection algorithm using videos as input is desirable but the existing video-based algorithms are sensitive to environmental conditions. An algorithm, with the capability of detecting hand tremor from videos with a cluttered background, would allow the videos recorded in a non-research environment to be used. Clinicians and researchers could use videos collected from patients and participants in their own home environment or standard clinical settings. Neural network based machine learning architectures provide high accuracy classification results in related fields including hand gesture recognition and body movement detection systems. We thus investigated the accuracy of advanced neural network architectures to automatically detect hand tremor in videos with a cluttered background. We examined configurations with different sets of features and neural network based classification models. We compared the performance of different combinations of features and classification models and then selected the combination which provided the highest accuracy of hand tremor detection. We used cross validation to test the accuracy of the trained model predictions. The highest classification accuracy for automatically detecting tremor (vs non tremor) was 80.6% and this was obtained using Convolutional Neural Network-Long Short-Term Memory and features based on measures of frequency and amplitude change.Entities:
Keywords: Advanced neural network; Hand tremor detection; Machine learning; Videos with cluttered background
Year: 2021 PMID: 34276971 PMCID: PMC8273850 DOI: 10.1007/s13755-021-00159-3
Source DB: PubMed Journal: Health Inf Sci Syst ISSN: 2047-2501
Fig. 1Three steps included in the system of hand tremor detection
Fig. 221 landmarks on left hand produced automatically by Mediapipe (Figure adapted from [25])
Fig. 3CNN-LSTM Architecture for hand tremor detection. Conv1D layer stands for one-dimension Convolution Neural Network layer, LSTM layer stands for Long Short-Term Memory layer and MaxPooling1D layer stands for one-dimension max-pooling layer
Fig. 4LSTM Architecture for hand tremor detection
Fig. 5a 21 tasks are performed by subjects. b Short explanation for each task (Figure taken from [28])
Fig. 6Accuracy results of the CNN-LSTM model with increasing epoch sizes
Fig. 7Accuracy results of the CNN-LSTM model with different batch sizes
Fig. 8Accuracy results of the CNN-LSTM model with different activation functions
10-fold cross-validation results for configurations combining different sets of features with SVM model
| Configurations combining different sets of features with SVM model | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| DIST with SVM model | 0.53 | 0.51 | 0.63 | 0.55 |
| MDC with SVM model | 0.55 | 0.53 | 0.65 | 0.58 |
| DIST + MDC with SVM model | 0.56 | 0.53 | 0.67 | 0.59 |
10-fold cross-validation results for configurations combining different sets of features with LSTM model
| Configurations combining different sets of features with LSTM model | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| DIST with LSTM model | 0.63 | 0.61 | 0.61 | 0.60 |
| MDC with LSTM model | 0.78 | 0.77 | 0.79 | 0.77 |
| DIST + MDC with LSTM model | 0.80 | 0.79 | 0.79 | 0.79 |
10-fold cross-validation results for configurations combining different sets of features with CNN-LSTM model
| Configurations combining different sets of features with CNN-LSTM model | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| DIST with CNN-LSTM model | 0.72 | 0.72 | 0.67 | 0.68 |
| MDC with CNN-LSTM model | 0.79 | 0.79 | 0.78 | 0.78 |
| DIST + MDC with CNN-LSTM model | 0.81 | 0.81 | 0.79 | 0.80 |
10-fold cross-validation results for hand tremor detection algorithms with different classification models
| Hand tremor detection algorithms with different classification models | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| MDC with SVM model (benchmark) | 0.55 | 0.53 | 0.65 | 0.58 |
| DIST + MDC with SVM model | 0.56 | 0.53 | 0.67 | 0.59 |
| DIST + MDC with LSTM model | 0.80 | 0.79 | 0.79 | 0.79 |
| DIST + MDC with CNN-LSTM model | 0.81 | 0.81 | 0.79 | 0.80 |
10-fold cross-validation results LSTM model with different epoch sizes
| LSTM model with different epoch sizes | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| DIST + MDC with LSTM model with epoch of 100 | 0.61 | 0.60 | 0.71 | 0.63 |
| DIST + MDC with LSTM model with epoch of 150 | 0.63 | 0.62 | 0.71 | 0.64 |
| DIST + MDC with LSTM model with epoch of 200 | 0.63 | 0.66 | 0.67 | 0.60 |
10-fold cross-validation results LSTM model with different batch sizes
| LSTM model with different batch sizes | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| DIST + MDC with LSTM model with batch size of 20 | 0.57 | 0.52 | 0.56 | 0.54 |
| DIST + MDC with LSTM model with batch size of 18 | 0.60 | 0.59 | 0.69 | 0.62 |
| DIST + MDC with LSTM model with batch size of 16 | 0.63 | 0.66 | 0.67 | 0.60 |
10-fold cross-validation results LSTM model with different activation functions
| LSTM model with different activation functions | Average accuracy | Average precision | Average recall | Average F1 score |
|---|---|---|---|---|
| DIST + MDC with LSTM model with Sigmoid activation function | 0.74 | 0.73 | 0.75 | 0.73 |
| DIST + MDC with LSTM model with Tanh activation function | 0.80 | 0.79 | 0.79 | 0.79 |
| DIST + MDC with LSTM model with ReLU | 0.63 | 0.66 | 0.67 | 0.60 |