| Literature DB >> 30347776 |
Teak-Wei Chong1, Boon-Giin Lee2.
Abstract
Sign language is intentionally designed to allow deaf and dumb communities to convey messages and to connect with society. Unfortunately, learning and practicing sign language is not common among society; hence, this study developed a sign language recognition prototype using the Leap Motion Controller (LMC). Many existing studies have proposed methods for incomplete sign language recognition, whereas this study aimed for full American Sign Language (ASL) recognition, which consists of 26 letters and 10 digits. Most of the ASL letters are static (no movement), but certain ASL letters are dynamic (they require certain movements). Thus, this study also aimed to extract features from finger and hand motions to differentiate between the static and dynamic gestures. The experimental results revealed that the sign language recognition rates for the 26 letters using a support vector machine (SVM) and a deep neural network (DNN) are 80.30% and 93.81%, respectively. Meanwhile, the recognition rates for a combination of 26 letters and 10 digits are slightly lower, approximately 72.79% for the SVM and 88.79% for the DNN. As a result, the sign language recognition system has great potential for reducing the gap between deaf and dumb communities and others. The proposed prototype could also serve as an interpreter for the deaf and dumb in everyday life in service sectors, such as at the bank or post office.Entities:
Keywords: American Sign Language; Leap Motion Controller; deep neural network; human-computer interaction; machine learning; multi-class classification; sign language recognition; support vector machine
Mesh:
Year: 2018 PMID: 30347776 PMCID: PMC6210690 DOI: 10.3390/s18103554
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The 26 letters and 10 digits of American Sign Language (ASL).
Figure 2Proposed system flow used in the study.
Figure 3Experimental setup with a real-time hand gesture in 3D graphic display.
Figure 4The position of (a) palm center and fingertips illustrated in (b) sphere radius mode.
Feature group organization.
| Group | Feature | # of Feature |
|---|---|---|
| S | Standard deviation of palm position | 3 |
| R | Hand palm curvature radius | 1 |
| D | Distance between palm center and each fingertips | 5 |
| A | Angle between two adjacent fingertips | 4 |
| L | Distance between one fingertip and the consecutive fingertip | 10 |
Overall accuracy of 26 classes and 36 classes.
| Accuracy (%) | |||||
|---|---|---|---|---|---|
| Combination | 26 Classes | 36 Classes | - | ||
| SVM | DNN | SVM | DNN | Average | |
| C1 | 74.26 | 87.84 | 67.85 | 83.29 | 74.31 |
| C2 | 74.57 | 92.77 | 67.08 | 87.38 | 80.45 |
| C3 | 75.35 | 88.61 | 67.94 | 83.89 | 78.95 |
| C4 | 80.30 | 93.29 | 72.04 | 87.35 | 83.25 |
| C5 | 68.81 | 87.15 | 57.53 | 83.18 | 74.17 |
| C6 | 79.73 | 93.81 | 72.79 | 88.79 | 83.78 |
| Average | 75.50 | 90.58 | 67.54 | 85.65 | - |
Figure 5Confusion matrices of 26 classes. (a) Support vector machine. (b) Deep neural network.
Figure 6Confusion matrices of 36 classes. (a) Support vector machine. (b) Deep neural network.
Sensitivity (Se) and specificity (Sp) of the ASL letters for 26 classes and 36 classes for both classifiers.
| 36 Classes | 26 Classes | |||||||
|---|---|---|---|---|---|---|---|---|
| Letters | SVM | DNN | SVM | DNN | ||||
| Se (%) | Sp (%) | Se (%) | Sp (%) | Se (%) | Sp (%) | Se (%) | Sp (%) | |
| A | 99.50 | 98.81 | 86.31 | 99.59 | 94.17 | 99.14 | 90.82 | 99.41 |
| B | 83.33 | 99.76 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| C | 100.00 | 99.55 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
| D | 16.17 | 99.48 | 62.33 | 99.00 | 74.33 | 99.32 | 89.67 | 99.81 |
| E | 63.50 | 99.03 | 98.67 | 99.93 | 83.83 | 99.58 | 99.00 | 99.99 |
| F | 75.00 | 99.49 | 79.33 | 99.80 | 100.00 | 99.99 | 100.00 | 100.00 |
| G | 91.67 | 98.71 | 89.17 | 99.85 | 91.67 | 99.58 | 90.00 | 99.89 |
| H | 52.67 | 99.00 | 69.67 | 99.22 | 41.67 | 99.59 | 72.17 | 99.77 |
| I | 100.00 | 99.75 | 100.00 | 100.00 | 100.00 | 99.91 | 100.00 | 99.99 |
| J | 94.83 | 99.99 | 99.67 | 99.90 | 94.33 | 99.99 | 99.67 | 10.00 |
| K | 75.00 | 99.05 | 93.17 | 99.84 | 91.67 | 99.55 | 99.50 | 99.97 |
| L | 100.00 | 99.75 | 100.00 | 100.00 | 100.00 | 99.90 | 99.83 | 99.99 |
| M | 91.67 | 98.33 | 96.67 | 99.79 | 100.00 | 99.50 | 95.17 | 99.90 |
| N | 58.33 | 99.64 | 96.33 | 99.71 | 66.67 | 99.80 | 93.67 | 99.91 |
| O | 50.00 | 98.12 | 64.50 | 99.20 | 83.33 | 99.90 | 99.33 | 99.99 |
| P | 70.50 | 98.66 | 81.17 | 99.62 | 58.33 | 99.58 | 86.67 | 99.90 |
| Q | 79.17 | 99.50 | 96.33 | 99.65 | 70.83 | 99.79 | 92.17 | 99.91 |
| R | 87.33 | 98.95 | 96.67 | 99.82 | 75.00 | 99.37 | 100.00 | 99.85 |
| S | 33.33 | 99.28 | 64.33 | 99.55 | 33.33 | 99.67 | 65.83 | 99.83 |
| T | 91.67 | 99.52 | 98.67 | 99.72 | 91.67 | 99.67 | 94.83 | 99.90 |
| U | 8.33 | 99.62 | 71.00 | 99.04 | 8.50 | 99.72 | 70.00 | 99.72 |
| V | 58.33 | 99.28 | 74.17 | 99.24 | 80.67 | 99.34 | 96.33 | 99.95 |
| W | 58.33 | 98.57 | 89.50 | 99.48 | 100.00 | 99.99 | 100.00 | 100.00 |
| X | 41.67 | 99.61 | 93.00 | 99.95 | 50.00 | 99.99 | 98.17 | 99.96 |
| Y | 100.00 | 99.99 | 100.00 | 100.00 | 100.00 | 99.99 | 100.00 | 100.00 |
| Z | 97.83 | 99.97 | 96.17 | 99.99 | 97.83 | 99.99 | 99.67 | 99.99 |
| 0 | 17.50 | 99.02 | 71.33 | 99.00 | - | - | - | - |
| 1 | 61.83 | 98.08 | 76.50 | 99.05 | - | - | - | - |
| 2 | 83.33 | 98.22 | 77.33 | 99.39 | - | - | - | - |
| 3 | 74.67 | 99.21 | 96.83 | 99.80 | - | - | - | - |
| 4 | 78.00 | 99.66 | 97.67 | 99.99 | - | - | - | - |
| 5 | 100.00 | 99.61 | 100.00 | 100.00 | - | - | - | - |
| 6 | 57.17 | 99.05 | 81.83 | 99.70 | - | - | - | - |
| 7 | 100.00 | 99.92 | 100.00 | 100.00 | - | - | - | - |
| 8 | 95.50 | 99.73 | 100.00 | 100.00 | - | - | - | - |
| 9 | 74.33 | 99.10 | 92.83 | 99.41 | - | - | - | - |
Figure 7Class-to-class similarity rate matrix for both classifiers (%). (a) Support Vector Machine. (b) Deep Neural Network.
Comparison of sign language recognition systems using the Leap Motion Controller.
| Author | Gesture | Dataset (M people × N repetitions per letter per person) | Cross-Validation | Features | Classifier | Accuracy (%) |
|---|---|---|---|---|---|---|
| Khelil et al. [ | 10 ArSL digits gesture | 10 people × 10 sets | - | Angle between two fingertips, angle between fingertips and hand’s normal, distance between the hand center to each fingertips | SVM | 91.3 |
| Du et al. [ | 10 digits gesture | 13 people × 20 sets | 80% training set, and 20% testing set, experiment performed 50 times | Fingertips angle (A), fingertips distance (D), fingertips elevation(E), fingertips tip distance (T) | SVM | 83.36 |
| A+D+E+T+ HOG | SVM | 99.42 | ||||
| Funasaka et al. [ | 24 ASL gestures (static) | - | - | Palm normal vector, fingertips position, arm direction and fingertips direction | Decision Tree | 82.7 |
| Marin et al. [ | 10 ASL gestures | 14 people × 10 sets | Leave-one-subject-out cross-validation | Fingertips angle fingertips distance, and fingertips elevation | SVM | 80.86 |
| Chuan et al. [ | 26 ASL gestures | 2 people × 2 sets | 4 fold cross-validation | Pinch strength, grab strength, average distance, average spread, average tri-spread, extended distance, dip-tip projection, OrderX, and angle | k-Nearest Neighbors | 72.78 |
| Mapari et al. [ | 32 ASL gestures (J, Z, 2 and 6 are excluded) | 146 people × 1 set | 90% training set, and 10% testing set cross-validation | Finger positions, palm position, distance between positions, angles between positions | MLP | 90 |
| Proposed work | 36 ASL | 12 people × 1 set | Leave-one-subject-out cross-validation | Standard deviation of palm positions, hand palm curvature radius, distance between palm center and each fingertips, angle between two adjacent fingertips, distance between fingertips and each consecutive fingertips | DNN | 93.81 (26 classes) |