| Literature DB >> 32195238 |
Ulysse Côté-Allard1, Evan Campbell2, Angkoon Phinyomark2, François Laviolette3, Benoit Gosselin1, Erik Scheme2.
Abstract
Existing research on myoelectric control systems primarily focuses on extracting discriminative characteristics of the electromyographic (EMG) signal by designing handcrafted features. Recently, however, deep learning techniques have been applied to the challenging task of EMG-based gesture recognition. The adoption of these techniques slowly shifts the focus from feature engineering to feature learning. Nevertheless, the black-box nature of deep learning makes it hard to understand the type of information learned by the network and how it relates to handcrafted features. Additionally, due to the high variability in EMG recordings between participants, deep features tend to generalize poorly across subjects using standard training methods. Consequently, this work introduces a new multi-domain learning algorithm, named ADANN (Adaptive Domain Adversarial Neural Network), which significantly enhances (p = 0.00004) inter-subject classification accuracy by an average of 19.40% compared to standard training. Using ADANN-generated features, this work provides the first topological data analysis of EMG-based gesture recognition for the characterization of the information encoded within a deep network, using handcrafted features as landmarks. This analysis reveals that handcrafted features and the learned features (in the earlier layers) both try to discriminate between all gestures, but do not encode the same information to do so. In the later layers, the learned features are inclined to instead adopt a one-vs.-all strategy for a given class. Furthermore, by using convolutional network visualization techniques, it is revealed that learned features actually tend to ignore the most activated channel during contraction, which is in stark contrast with the prevalence of handcrafted features designed to capture amplitude information. Overall, this work paves the way for hybrid feature sets by providing a clear guideline of complementary information encoded within learned and handcrafted features.Entities:
Keywords: CNN; ConvNet; EMG; Grad-CAM; MAPPER; deep learning; feature extraction; gesture recognition
Year: 2020 PMID: 32195238 PMCID: PMC7063031 DOI: 10.3389/fbioe.2020.00158
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Figure 1Diagram of the workflow of this work. The 3DC Dataset is first preprocessed before being used to train the network using standard training and the proposed ADANN training procedure. The handcrafted features are directly calculated from the preprocessed dataset, while the deep features are extracted from the ConvNet trained with ADANN. In the diagram, the blue rectangles represent experiments and the arrows show which methods/algorithms are required to perform them.
Figure 2The eleven hand/wrist gestures recorded in the 3DC Dataset (image re-used from Côté-Allard et al., 2019b).
Handcrafted features extracted for topological landmarks sorted by functional group.
| Phinyomark et al. ( | Amplitude of the first burst | AFB | SAP |
| Kim et al. ( | Difference absolute mean value | DAMV | SAP |
| Kim et al. ( | Difference absolute standard deviation value | DASDV | SAP |
| Zardoshti-Kermani et al. ( | Difference log detector | DLD | SAP |
| Phinyomark et al. ( | Difference temporal moment | DTM | SAP |
| Zardoshti-Kermani et al. ( | Difference variance value | DVARV | SAP |
| Zardoshti-Kermani et al. ( | Difference v-order | DV | SAP |
| Park and Lee ( | Integral of electromyogram | IEMG | SAP |
| Zardoshti-Kermani et al. ( | Log detector | LD | SAP |
| Al-Timemy et al. ( | Second-order moment | M2 | SAP |
| Oskoei and Hu ( | Modified mean absolute value 1 | MMAV1 | SAP |
| Oskoei and Hu ( | Modified mean absolute value 2 | MMAV2 | SAP |
| Saponas et al. ( | Mean absolute value | MAV | SAP |
| Phinyomark et al. ( | Maximum | MAX | SAP |
| Du and Vuskovic ( | Multiple hamming windows | MHW | SAP |
| Du and Vuskovic ( | Mean power | MNP | SAP |
| Du and Vuskovic ( | Multiple trapezoidal windows | MTW | SAP |
| Saponas et al. ( | Root mean squared | RMS | SAP |
| Du and Vuskovic ( | Spectral moment | SM | SAP |
| Du and Vuskovic ( | Sum of squared integral | SSI | SAP |
| Phinyomark et al. ( | Temporal moment | TM | SAP |
| Du and Vuskovic ( | Total power | TTP | SAP |
| Zardoshti-Kermani et al. ( | Variance | VAR | SAP |
| Zardoshti-Kermani et al. ( | v-Order | V | SAP |
| Phinyomark et al. ( | Waveform length | WL | SAP |
| Oskoei and Hu ( | Frequency ratio | FR | FI |
| Thongpanja et al. ( | Median frequency | MDF | FI |
| Thongpanja et al. ( | Mean frequency | MNF | FI |
| Phinyomark et al. ( | Slope sign change | SSC | FI |
| Zardoshti-Kermani et al. ( | Zero crossings | ZC | FI |
| Phinyomark et al. ( | Sample entropy | SAMPEN | NLC |
| Phinyomark et al. ( | Approximate entropy | APEN | NLC |
| Zardoshti-Kermani et al. ( | Willison's amplitude | WAMP | NLC |
| Gitter and Czerniecki ( | Box-counting fractal dimension | BC | NLC |
| Gupta et al. ( | Katz fractal dimension | KATZ | NLC |
| Arjunan and Kumar ( | Maximum fractal length | MFL | NLC |
| Park and Lee ( | Autoregressive coefficients | AR | TSM |
| Park and Lee ( | Cepstral coefficients | CC | TSM |
| Park and Lee ( | Difference autoregressive coefficient | DAR | TSM |
| Park and Lee ( | Difference cepstral coefficients | DCC | TSM |
| Phinyomark et al. ( | Detrend fluctuation analysis | DFA | TSM |
| Qingju and Zhizeng ( | Power spectrum ratio | PSR | TSM |
| Sinderby et al. ( | Signal to noise ratio | SNR | TSM |
| Phinyomark et al. ( | Critical exponent | CE | UNI |
| Sinderby et al. ( | Maximum to minimum drop in power density ratio | DPR | UNI |
| Phinyomark et al. ( | Histogram | HIST | UNI |
| Thongpanja et al. ( | Kurtosis | KURT | UNI |
| Phinyomark et al. ( | Mean absolute value slope | MAVS | UNI |
| Sinderby et al. ( | Power spectrum deformation | OHM | UNI |
| Phinyomark et al. ( | Peak frequency | PKF | UNI |
| Talebinejad et al. ( | Power spectrum density fractal dimension | PSDFD | UNI |
| Thongpanja et al. ( | Skewness | SKEW | UNI |
| Sinderby et al. ( | Signal to motion artifact ratio | SMR | UNI |
| Al-Timemy et al. ( | Time domain power spectral descriptors | TSPSD | UNI |
| Phinyomark et al. ( | Variance of central frequency | VCF | UNI |
| Phinyomark et al. ( | Variance fractal dimension | VFD | UNI |
Figure 3The ConvNet's architecture, employing 543,629 learnable parameters. In this figure, Bi refers to the ith feature extraction block (i∈{1,2,3,4,5,6}). Conv refers to Convolutional layer. As shown, the feature extraction is performed after the non-linearity (leaky ReLU).
Figure 4Overview of the training steps of ADANN (identical to DANN) for one labeled batch from the source ({x, y}, blue lines) and one unlabeled batch from the target ({x}, red dashed lines). The purple dotted lines correspond to the backpropagated gradient. The gradient reversal operation is represented by the purple diamond.
Figure 5An example of step 3 of the Mapper algorithm with W = 2. The purple dots represent the elements of W. In (A), the red square corresponds to ℭ. In (B), ℭ is subdivided using k2 squares of length H (with k = 2 in this case). The orange diamonds, in both (B,C), represent the elements of V. Finally, the square of length D is shown on the upper left corner of (C), overlapping other squares centered on other elements of V (dotted lines).
Figure 6Topological network generated exclusively for the handcrafted features, where nodes are colored to indicate percent composition of: (A) signal amplitude and power features (SAP), (B) non-linear complexity (NLC), (C) frequency information features (FI), (D) time series modeling features (TSM), and (E) unique features (UNI). Dashed boxes highlight dense groupings of the specified functional group in each of the networks.
Figure 7Classification results of deep learning architectures. (A) Per-participant test set accuracy comparison when training the network with and without ADANN, (B) Confusion matrices on the test set for cross-subject training with and without ADANN.
Figure 8Output of Guided Grad-CAM when asked to highlight specific gestures in an example. For all graphs, the y-axis of each channel are scaled to the same range of value (indicated on the first channel of each graph). Warmer colors indicate a higher “importance” of a feature in the input space for the requested gesture. The coloring use a logarithmic scale. For visualization purposes, only features that are within three order of magnitudes to the most contributing feature are colored. (A) The examples shown are real examples and correspond to the same gestures that Guided Grad-CAM is asked to highlight. (B) A single example, generated using Gaussian noise of mean 0 and standard deviation 450, is shown three times. While the visualization algorithm does highlight features in the input space (when the requested gesture is not truly present in the input), the magnitude of these contributions is substantially smaller (half or less) than when the requested gesture is present in the input.
Figure 9Topological network generated for exclusively the learned features, where nodes are colored to indicate percent composition of: (A) Block 1's features, (B) Block 2's features, (C) Block 3's features, (D) Block 4's features, (E) Block 5's features, and (F) Block 6's features. Dashed boxes highlight dense groupings of the specified block features in each of the networks.
Figure 10Topological network generated for all features, where nodes were colored to indicate percent composition of learned features. The dashed boxes highlight dense grouping of handcrafted features with their associated type.
Members of nodes labeled in Figure 6. LeFX refers to a Learned Feature from block X.
| 1 | TSM+LeF5 | AR2 AR4 DAR2 DAR4 CC1 CC4 DCC1 |
| DCC3 SNR 8xLeF1 1xLeF2 4xLeF4 10xLeF5 13xLe5 | ||
| 2 | TSM+UNI+LeF6 | APEN AR2 AR4 DAR2 DAR4 CC1 CC4 DCC1 DCC3 DCC4 CE DFA DPR HIST123 |
| SKEW MAVS OHM PSDFD PSR SMR SNR VCF VFD 1xLeF1 3xLeF2 3xLeF5 21xLeF6 | ||
| 3 | TSM+UNI+LeF6 | APEN AR2 AR4 DAR2 DAR4 CC1 CC4 DCC1 DCC3 DCC4 CE DFA DPR HIST12 |
| SKEW MAVS OHM PSDFD PSR SMR SNR VCF VFD 1xLeF1 1xLeF2 1xLeF5 27xLeF6 | ||
| 4 | UNI+LeF6 | APEN DCC4 CE DFA DPR HIST123 |
| SKEW MAVS OHM PSDFD PSR SMR VCF VFD 2xLeF2 2xLeF5 21xLeF6 | ||
| 2 | TSM+UNI+LeF6 | APEN CC1 CC4 DCC4 CE DFA DPR HIST123 |
| SKEW MAVS OHM PSDFD PSR SMR SNR VCF VFD 37xLeF6 | ||
| 6 | TSM+UNI+LeF6 | CC1 CC4 DCC4 CE DPR HIST123 SKEW MAVS PSDFD SMR |
| SNR VCF VFD 5xLeF2 5xLeF4 1xLeF5 37xLeF6 | ||
| 7 | UNI+LeF6 | DCC4 CE DPR HIST123 SKEW MAVS |
| PSDFD SMR VCF VFD 2xLeF2 15xLeF6 | ||
| 8 | UNI+LeF6 | DCC4 CE DPR HIST123 SKEW MAVS PSDFD SMR |
| VCF VFD 5xLeF2 5xLeF4 1xLeF5 37xLeF6 | ||
| 9 | UNI+LeF6 | APEN DCC4 CE DFA DPR HIST2 SKEW MAVS |
| OHM PSDFD PSR SMR VCF VFD 15xLeF2 36xLeF6 | ||
| 10 | All Handcrafted+LeF6 | APEN CC14 DCC4 CE DFA DPR HIST123 KURT SKEW M2 MAVS MAX MHW23 |
| MTW123 MNP TTP OHM PSDFD PSR SM SMR SNR SSI TM DTM VAR DVARV VCF VFD 11xLeF6 | ||
| 11 | NLC+LeF6 | APEN SAMPEN BC |
| KATZ 1xLeF6 |
Accuracy obtained on the test set using the handcrafted features and the learned features from their respective block.
| SAP | 26.80 | 7.0 | 41.61 |
| FI | 19.95 | 2.87 | 34.80 |
| NLC | 22.32 | 7.15 | 31.49 |
| TSM | 22.24 | 3.33 | 37.18 |
| UNI | 15.32 | 5.11 | 48.37 |
| Block 1 | 74.59 | ||
| Block 2 | 28.28 | 4.66 | 78.26 |
| Block 3 | 28.90 | 5.06 | 79.19 |
| Block 4 | 29.21 | 5.15 | 78.77 |
| Block 5 | 28.18 | 5.48 | 79.23 |
| Block 6 | 26.62 | 6.19 | |
The Single Feature accuracies are given as the average accuracy over all the features of their respective block/category.
Figure 11Confusion matrices using the handcrafted features and the learned features from the first, penultimate and last block as input and a LDA as the classifier. The first column, denoted as All features, shows the confusion matrices when using all 64 learned features of Block 1, 5, and 6, respectively (from top to bottom) and the set of UNI handcrafted features. The next five columns, denoted as Single Feature, show the confusions matrices for handcrafted feature examplars and from the same network's blocks but when training the LDA on a single feature. The subset of learned features was selected as representative of the typical confusion matrices found at each block. The examplars of the handcrafted features were selected from each handcrafted features' category (in order: SAP, FI, NLC, TSM, and UNI).
Figure 12Mean squared error of the regressions from learned features to handcrafted features, with respect to the number of blocks employed for the regression. The features are grouped with their respective functional groups.