| Literature DB >> 35632069 |
Xinchen Fan1, Lancheng Zou1, Ziwu Liu1, Yanru He1, Lian Zou1, Ruan Chi2.
Abstract
Gesture recognition through surface electromyography (sEMG) provides a new method for the control algorithm of bionic limbs, which is a promising technology in the field of human-computer interaction. However, subject specificity of sEMG along with the offset of the electrode makes it challenging to develop a model that can quickly adapt to new subjects. In view of this, we introduce a new deep neural network called CSAC-Net. Firstly, we extract the time-frequency feature from the raw signal, which contains rich information. Secondly, we design a convolutional neural network supplemented by an attention mechanism for further feature extraction. Additionally, we propose to utilize model-agnostic meta-learning to adapt to new subjects and this learning strategy achieves better results than the state-of-the-art methods. By the basic experiment on CapgMyo and three ablation studies, we demonstrate the advancement of CSAC-Net.Entities:
Keywords: attention convolution network; gesture recognition; meta-learning; short time Fourier transform; surface electromyography
Mesh:
Year: 2022 PMID: 35632069 PMCID: PMC9144628 DOI: 10.3390/s22103661
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1An overview of the method in a block diagram representation.
Figure 2Two processed surface EMG samples and their STFT spectrogram (Hanning window, window length = 64).
Figure 3The detailed architecture of the proposed CSAC-Net.
Figure 4The Channel-Spatial Attention Module.
Figure 5The structure of Channel Attention Module.
Figure 6The structure of Spatial Attention Module.
Figure 7The structure of the CSAC-Cell.
Figure 8Specific block diagram of MAML framework in our work. The colorful lines represent the sEMG signals used for training and verification after windowing. The yellow star represents the model for new subjects.
Traditional supervised learning classification accuracy (%) of different input forms on CapgMyo (18 subjects and 8 gestures for DB-a, 10 subjects and 8 gestures for DB-b, 10 subjects and 12 gestures for DB-c). The boldface numbers represent the best classification accuracies for each dataset.
| Input | DB-a | DB-b | DB-c |
|---|---|---|---|
| Raw sEMG | 46.18 | 52.81 | 42.59 |
| FFT Spectrum | 67.71 | 73.13 | 67.13 |
| STFT Spectrogram |
|
|
|
Comparison between previous methods and the proposed CSAC-Net in terms of the accuracy (%) of traditional supervised learning classification on DB-a, DB-b, and DB-c (10 times average accuracy). The boldface number represents the best classification results.
| Methods | DB-a | DB-b | DB-c |
|---|---|---|---|
|
|
|
|
|
| STFT Spectrogram + Resnet18 [ | 91.32 | 87.81 | 89.81 |
| MAV + ED-TCN [ | 93.75 | 91.88 | 90.28 |
| MAV + KNN (k = 3) [ | 88.54 | 88.75 | 91.67 |
| MAV + SVM ( kernal: Gaussian ) [ | 81.94 | 86.88 | 85.69 |
| MAV + Tree [ | 68.06 | 60.63 | 77.70 |
5-way 1-shot and 5-way 5-shot Classification top 1 and top 5 accuracy (%) for new subject on CapgMyo with MAML. The meta batch size is set to 8. The boldface numbers represent the best classification accuracies for each dataset compared with Table 4.
| Dataset | 1Shot-Top1 | 1Shot-Top5 | 5Shot-Top1 | 5Shot-Top5 |
|---|---|---|---|---|
| DB-a |
|
| 77.50 | 76.49 |
| DB-b |
|
|
|
|
| DB-c |
|
|
|
|
5-way 1-shot and 5-way 5-shot Classification top 1 and top 5 accuracy (%) for new subject on CapgMyo with MAML. The meta batch size is set to 64. The boldface numbers represent the best classification accuracies for each dataset compared with Table 3.
| Dataset | 1Shot-Top1 | 1Shot-Top5 | 5Shot-Top1 | 5Shot-Top5 |
|---|---|---|---|---|
| DB-a | 61.56 | 59.06 |
|
|
| DB-b | 43.12 | 40.63 | 73.12 | 71.25 |
| DB-c | 68.75 | 65.31 | 82.00 | 80.81 |
Comparison between previous methods and the proposed CSAC-Net in terms of the accuracy (%) of inter-subject classification on DB-a, DB-b, and DB-c (10 times average accuracy). The boldface number represents the best classification results.
| Methods | DB-a | DB-b | DB-c |
|---|---|---|---|
|
|
|
|
|
| MLSVD + DL [ | – | 75.40 | 68.30 |
| STFT Spectrogram + Resnet18 [ | 56.70 | 57.70 | 57.90 |
| MAV + ED-TCN [ | 28.40 | 30.15 | 29.30 |
| raw data + extended AdaBN [ | – | 55.30 | 35.10 |
Figure 9For different methods and datasets, the accuracy of 10 repetitions.
5-way 1-shot and 5-way 5-shot classification top 1 and top 5 accuracy (%) of different input forms for new subject on CapgMyo with MAML. The meta batch size is set to 8. The boldface numbers represent the best classification accuracies for each input and dataset compared with Table 7.
| input | Dataset | 1Shot-Top1 | 1Shot-Top5 | 5Shot-Top1 | 5Shot-Top5 |
|---|---|---|---|---|---|
| Raw sEMG | DB-a | 32.50 | 30.00 |
|
|
| DB-b |
| 27.50 |
|
| |
| DB-c | 37.50 | 35.00 |
|
| |
| FFT | DB-a | 42.50 | 32.50 | 46.00 | 43.00 |
| Specturm | DB-b |
|
|
|
|
| DB-c |
|
| 38.50 | 38.00 |
5-way 1-shot and 5-way 5-shot classification top 1 and top 5 accuracy (%) of different input forms for new subject on CapgMyo with MAML. The meta batch size is set to 64. The boldface numbers represent the best classification accuracies for each input and dataset compared with Table 6.
| input | Dataset | 1Shot-Top1 | 1Shot-Top5 | 5Shot-Top1 | 5Shot-Top5 |
|---|---|---|---|---|---|
| Raw sEMG | DB-a |
|
| 41.00 | 40.56 |
| DB-b |
|
| 36.25 | 35.81 | |
| DB-c |
|
| 41.25 | 40.26 | |
| FFT | DB-a |
|
|
|
|
| Specturm | DB-b | 32.81 | 31.56 | 40.31 | 39.94 |
| DB-c | 36.25 | 34.38 |
|
|
Classification validation and test accuracy (%) and time for training (ms) on new subject on CapgMyo with pretrained model by supervised learning.
| Dataset | Validation Accuracy | Test Accuracy | Time for Training |
|---|---|---|---|
| DB-a | 94.85 | 23.75 | 253112 |
| DB-b | 95.18 | 57.50 | 316119 |
| DB-c | 95.83 | 35.83 | 189864 |
5-way 1-shot and 5-way 5-shot classification training time (ms) with different meta batch size on new subject on CapgMyo by CSAC-Net with MAML.
| Meta Batch Size | Dataset | 1Shot | 5Shot |
|---|---|---|---|
| 8 | DB-a | 4143 | 5316 |
| DB-b | 4605 | 5515 | |
| DB-c | 3885 | 5144 | |
| 64 | DB-a | 8234 | 17305 |
| DB-b | 8547 | 17757 | |
| DB-c | 8462 | 17295 |
5-way 1-shot and 5-way 5-shot classification with different models on new subject on CapgMyo with MAML. The meta batch size is set to 8. The boldface numbers represent the best classification accuracies for each model and dataset compared with Table 11.
| Model | Dataset | 1Shot-Top1 | 1Shot-Top5 | 5Shot-Top1 | 5Shot-Top5 |
|---|---|---|---|---|---|
| SAC-Net | DB-a |
|
| 71.50 | 67.00 |
| DB-b |
|
|
|
| |
| DB-c |
|
|
|
| |
| CAC-Net | DB-a |
|
| 63.50 | 62.50 |
| DB-b |
|
| 63.00 | 61.50 | |
| DB-c |
| 37.50 |
| 61.00 | |
| CNN | DB-a | 40.00 | 32.50 | 67.00 | 65.50 |
| DB-b |
| 42.50 |
|
| |
| DB-c |
| 40.00 | 61.50 | 60.00 |
5-way 1-shot and 5-way 5-shot classification with different models on new subject on CapgMyo with MAML. The meta batch size is set to 64. The boldface numbers represent the best classification accuracies for each model and dataset compared with Table 10.
| Model | Dataset | 1Shot-Top1 | 1Shot-Top5 | 5Shot-Top1 | 5Shot-Top5 |
|---|---|---|---|---|---|
| SAC-Net | DB-a | 41.25 | 38.44 |
|
|
| DB-b | 42.50 | 42.19 | 69.19 | 67.94 | |
| DB-c | 49.69 | 47.81 | 75.19 | 73.87 | |
| CAC-Net | DB-a | 38.44 | 37.80 |
|
|
| DB-b | 38.75 | 34.69 |
|
| |
| DB-c | 39.69 |
| 62.62 |
| |
| CNN | DB-a |
|
|
|
|
| DB-b | 44.06 |
| 69.25 | 68.75 | |
| DB-c | 43.13 |
|
|
|