| Literature DB >> 33981195 |
Wei Li1, Ping Shi1, Hongliu Yu1.
Abstract
Amputation of the upper limb brings heavy burden to amputees, reduces their quality of life, and limits their performance in activities of daily life. The realization of natural control for prosthetic hands is crucial to improving the quality of life of amputees. Surface electromyography (sEMG) signal is one of the most widely used biological signals for the prediction of upper limb motor intention, which is an essential element of the control systems of prosthetic hands. The conversion of sEMG signals into effective control signals often requires a lot of computational power and complex process. Existing commercial prosthetic hands can only provide natural control for very few active degrees of freedom. Deep learning (DL) has performed surprisingly well in the development of intelligent systems in recent years. The significant improvement of hardware equipment and the continuous emergence of large data sets of sEMG have also boosted the DL research in sEMG signal processing. DL can effectively improve the accuracy of sEMG pattern recognition and reduce the influence of interference factors. This paper analyzes the applicability and efficiency of DL in sEMG-based gesture recognition and reviews the key techniques of DL-based sEMG pattern recognition for the prosthetic hand, including signal acquisition, signal preprocessing, feature extraction, classification of patterns, post-processing, and performance evaluation. Finally, the current challenges and future prospects in clinical application of these techniques are outlined and discussed.Entities:
Keywords: convolutional neural network; deep learning; hand gesture recognition; pattern recognition; prosthesis hand; recurrent neural network; surface electromyography
Year: 2021 PMID: 33981195 PMCID: PMC8107289 DOI: 10.3389/fnins.2021.621885
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1The surface electromyography (sEMG) data of a gesture.
Figure 2The six stages of the standard structure of the model.
Figure 3The curves of surface electromyography (sEMG) and the motion detection result (Zhang et al., 2020).
Features according to the domain.
| Time domain | WL | Comprehensive information about frequency, duration and amplitude of signal | |
| MAV | Indication of muscle contraction levels | ||
| iEMG | An onset detection index | ||
| RMS | A feature modeled as amplitude modulated Gaussian random process | ||
| ZC | A feature that provides an approximate estimation of frequency domain properties | ||
| V | A non-linear detector for implicit estimation of muscle contraction force | ||
| SSC | A method that represent the frequency information of signal | ||
| VAR | A power index of the sEMG signal | ||
| WAMP | Indicatorof the level of muscle contraction | ||
| SSI | An energy index of the sEMG signal | ||
| Frequency domain | TP | An aggregate of the sEMG power spectrum | |
| MP | An average power of the EMG power spectrum | ||
| MNF | Average frequency | ||
| MDF | A frequency that divides the sEMG power spectrum into two equal amplitude regions |
Notations: N is the length of sampling points, x.
Figure 4Three surface electromyography (sEMG) image representation methods based on raw signal (Hu et al., 2018).
Standard structure used by the selected models.
| Park and Lee ( | NinaPro DB1 | 27/1 | 6/10 | 2,000 ms | RMS | CNN | N.A./94 | No |
| Atzori et al. ( | NinaPro DB1, DB2, DB3 | 78/1 | 52/10 | 150 ms | mDWT, HIST, WL, RMS | CNN | N.A./66.59, 60.27, 38.09 | No |
| Tsinganos et al. ( | NinaPro DB1 | 27/1 | 53/8 | 200 ms | RMS | CNN | 70.5/N.A. | No |
| Chen L. et al. ( | Private | 17/1 | 5/8 | 260 ms | CWT | CNN | 98.81/N.A. | No |
| Asif et al. ( | Private | 18/1 | 10/6 | 150 ms | Raw sEMG | CNN | 92/N.A. | No |
| Triwiyanto et al. ( | Private | 10/1 | 10/2 | 200 ms | Raw sEMG | CNN | 93/N.A. | No |
| Du et al. ( | CapgMyo DBb | 8/2 | 8/128 | 1 sample | Instantaneous sEMG | CNN | 98.6/63.3 | Yes |
| Chen J. et al. ( | CapgMyo DBa | 23/1 | 8/128 | 1 sample | Instantaneous sEMG | CNN | 98.9/N.A. | No |
| Wei et al. ( | CapgMyo DBa | 23/1 | 8/128 | 1 sample | Instantaneous sEMG | CNN | 99.8/N.A. | No |
| He et al. ( | NinaPro DB1 | 27/1 | 52/10 | 400 ms | Raw sEMG | LSTM | 75.45/N.A. | No |
| Simao et al. ( | NinaPro DB5 | 8/16 | 500 ms | Standard deviation | RNN | 92.07 | Yes | |
| Nasri et al. ( | Private | 35/1 | 6/8 | 37.6 ms (4 ms) | Raw sEMG | GRU | 80/N.A. | No |
| Hu et al. ( | NinaPro DB1 | 27/1 | 53/8 | 200 ms | RMS | CNN + RNN | 87.0/N.A | No |
| Wu et al. ( | Private | 4/1 | 5/8 | 260 ms (25 ms) | Raw sEMG | LSTM + CNN | 98.14/N.A. | No |
| Tong et al. ( | Private | 8/1 | 5/18 | SampEn, ZC, MAV | Dual-Flow Network | 78.31/N.A. | No | |
| Betthauser et al. ( | Private | 9/1 | 27/8 | 1675 ms | 20 ms-MAV | TCN | 69.5/N.A. | No |
| Tsinganos et al. ( | NinaPro DB1 | 27/1 | 52/10 | 300 ms, 2,500 ms | RMS | TCN | 89.76/N.A. | No |
| Zanghieri et al. ( | Private | 3/20 | 9/8 | 150 ms | Raw sEMG | TCN | 97.1/93.7 | Yes |
| Betthauser et al. ( | Private | 15/1 | 3/8 | 200 ms (25 ms) | MAV, WL, VA, SSC, ZC | ED-TCN | 72.1/N.A. | No |
| Zanghieri et al. ( | NinaPro DB6 | 10/10 | 7/14 | 150 ms | RMS | TCN | 71.3/65.0 | No |
The commonly used offline performance metrics.
| Accuracy | It is the proportion of the corresponding gesture recognized by the model in all the data. | |
| Recall | It is the proportion of correctly recognized data in a gesture class. | |
| Precision | It is the proportion of a class of gestures correctly recognized among the gestures recognized by the model. | |
| Standard deviation of the accuracy per user ( | It is the dispersion of each subject's recognition accuracy. | |
| Standard deviation of the accuracy per class ( | It is the amount of dispersion of the recalls of a particular model. | |
| Mean square error (MSE) | ||
| Root mean square error (RMSE) | It can be used to evaluate the numerical error of amplitude. | |
| Normalized root mean square error (NRMSE) | It is the standardization of RMSE. | |
| Correlation coefficient (R) | It can measure the similarity between signal shapes. |
TP, true positive; FP, false positive; TN, true negative; FN, false negative. k represents the index of different classes. i represents the set of subjects. n represents the total number of subjects. k represents the set of actual classes. g represents the total number of actual classes. y represents the actual value. ŷ represents the estimated value. ȳ represents the mean of the actual value. x estimated value. .
The commonly used real-time performance metrics.
| Throughput (TP) | Quantify availability by the ratio of speed to accuracy, and defined the ratio of difficulty index of each target task to completion time. | |
| Completion rate (CR) | Describes overall success; a ratio of the completed targets to the total number of tasks attempted. | |
| Path efficiency (PE) | Describes the control quality; the ratio between the shortest distance and the actual distance. | |
| Overshoot (O) | Describes the ability to stop on a target; the average number of times data appears on the target domain and then disappears in each test. | |
| Stopping distance (SD) | Describes the ability to hold no motion the total distance traveled during the 1-s dwell time. | |
| Average speed (AS) | Illustrates the subject's gross ability to control the target; a ratio of time spent successfully completing a task to the total completion time. |
T.
Figure 5Causes of variability of data distribution (Ketyko et al., 2019).