| Literature DB >> 27104534 |
Shengjing Wei1, Xiang Chen2, Xidong Yang3, Shuai Cao4, Xu Zhang5.
Abstract
Sign language recognition (SLR) can provide a helpful tool for the communication between the deaf and the external world. This paper proposed a component-based vocabulary extensible SLR framework using data from surface electromyographic (sEMG) sensors, accelerometers (ACC), and gyroscopes (GYRO). In this framework, a sign word was considered to be a combination of five common sign components, including hand shape, axis, orientation, rotation, and trajectory, and sign classification was implemented based on the recognition of five components. Especially, the proposed SLR framework consisted of two major parts. The first part was to obtain the component-based form of sign gestures and establish the code table of target sign gesture set using data from a reference subject. In the second part, which was designed for new users, component classifiers were trained using a training set suggested by the reference subject and the classification of unknown gestures was performed with a code matching method. Five subjects participated in this study and recognition experiments under different size of training sets were implemented on a target gesture set consisting of 110 frequently-used Chinese Sign Language (CSL) sign words. The experimental results demonstrated that the proposed framework can realize large-scale gesture set recognition with a small-scale training set. With the smallest training sets (containing about one-third gestures of the target gesture set) suggested by two reference subjects, (82.6 ± 13.2)% and (79.7 ± 13.4)% average recognition accuracy were obtained for 110 words respectively, and the average recognition accuracy climbed up to (88 ± 13.7)% and (86.3 ± 13.7)% when the training set included 50~60 gestures (about half of the target gesture set). The proposed framework can significantly reduce the user's training burden in large-scale gesture recognition, which will facilitate the implementation of a practical SLR system.Entities:
Keywords: accelerometer; gyroscope; sign language recognition; surface electromyography
Mesh:
Year: 2016 PMID: 27104534 PMCID: PMC4851070 DOI: 10.3390/s16040556
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The flow diagram of the proposed SLR framework.
Figure 2The location of sEMG, ACC, and GYRO sensors on the forearms [16].
Figure 3Handshape change during the execution of CSL sign word “object”: (a) the beginning stage; (b) the middle stage; and (c) the end stage.
The component-based representation of a sign language gesture.
| Hand Shape | Orientation | Axis | Rotation | Trajectory | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
Figure 4The extraction process of the subclasses of component.
An example of gesture encoding.
| Subclass No. | 4 | 4 | 5 | 5 | 1 | 4 | 2 | 2 | 2 | 3 | 3 | 3 | 12 |
| Component code | 00010000000 | 00010000000 | 00001000000 | 00001 | 10000 | 00010 | 010 | 010 | 010 | 001 | 001 | 001 | 0000000000010 |
| Sign gesture code | 00010000000 | 00010000000 | 00001000000 | 00001 | 10000 | 00010 | 010 | 010 | 010 | 001 | 001 | 001 | 0000000000010 |
Figure 5Illustration of gesture segmentation.
Eleven typical handshape subclasses.
| Index | Subclass | Picture | Index | Subclass | Picture | Index | Subclass | Picture |
|---|---|---|---|---|---|---|---|---|
| 1 | CSL alphabet “Y” | 5 | U1 | 9 | G1 | |||
| 2 | CSL alphabet “A” | 6 | U2 | 10 | CSL alphabet “R” | |||
| 3 | CSL alphabet “G” | 7 | CSL alphabet “D” | 11 | CSL alphabet “V” | |||
| 4 | CSL alphabet “U” | 8 | “Claw” type |
The fifth hand shape is a palm extension with wrist flexion and the sixth hand shape is a palm extension with wrist extension. The ninth hand shape is an index finger extension with wrist extension.
Thirteen typical trajectory subclasses.
| Along x+ | Along x | Around x | Motionless | |
| Along x− | Along y | Around y | ||
| Along y+ | Along z | Around z | ||
| Along y− | ||||
| Along z+ | ||||
| Along z− |
Figure 6Five typical orientation subclasses: (a) palm inward; (b) hand upward; (c) hand downward; (d) palm up; and (e) palm down.
Figure 7Three typical axis subclasses.
Figure 8Three typical rotation subclasses.
Recognition accuracies (%) of 110 CSL sign words at different sizes of training set (Sub3 as reference subject).
| Θ | 1 | 2 | 3 | 4 | 6 | 8 | 10 | 12 | 15 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30~40 | 40~50 | 50~60 | 60~70 | 70~80 | 80~90 | 90~100 | 100~110 | 110 | ||||||||||
| Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | |
| Sub1 | 79.5 | 16.8 | 85.0 | 14.6 | 86.2 | 15.1 | 87.2 | 17.7 | 87.4 | 17.8 | 88.3 | 18.1 | 88.5 | 17.7 | 88.7 | 18.2 | 88.7 | 19.6 |
| Sub2 | 85.6 | 9.8 | 86.1 | 11.5 | 88.0 | 11.8 | 90.1 | 12.7 | 90.6 | 14.5 | 90.9 | 15.3 | 91.0 | 15.1 | 91.6 | 15.2 | 92.1 | 17.2 |
| Sub3 | 85.7 | 12.5 | 89.4 | 11.4 | 91.8 | 11.6 | 91.2 | 11.7 | 91.4 | 14.2 | 93.0 | 15.2 | 92.9 | 15.0 | 93.2 | 15.7 | 93.9 | 16.5 |
| Sub4 | 78.4 | 14.2 | 82.6 | 16.0 | 85.3 | 14.7 | 87.0 | 18.0 | 87.6 | 14.5 | 88.3 | 16.9 | 87.8 | 17.7 | 87.7 | 17.0 | 88.2 | 20.2 |
| Sub5 | 83.7 | 12.9 | 87.3 | 12.7 | 89.2 | 15.2 | 90.2 | 14.7 | 93.1 | 14.6 | 93.0 | 14.5 | 93.2 | 14.5 | 93.5 | 14.5 | 93.5 | 14.5 |
| Overall | 82.6 | 13.2 | 86.0 | 13.2 | 88.0 | 13.7 | 89.1 | 12.5 | 89.8 | 15.1 | 90.7 | 16 | 90.6 | 16 | 90.9 | 16.1 | 91.3 | 17.6 |
| 0.013 | 0.001 | 0.087 | 0.160 | 0.077 | 0.887 | 0.081 | 0.077 | |||||||||||
Recognition accuracies (%) of 110 CSL sign words at different sizes of training set (Sub5 as reference subject).
| Θ | 1 | 2 | 3 | 4 | 6 | 8 | 10 | 12 | 15 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30~40 | 40~50 | 50~60 | 60~70 | 70~80 | 80~90 | 90~100 | 100~110 | 110 | ||||||||||
| Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | Mean | Std | |
| Sub1 | 76.2 | 14.6 | 78.9 | 12.7 | 84.3 | 13.2 | 86.3 | 15.9 | 85.2 | 16.4 | 87.4 | 18.6 | 87.8 | 18.4 | 88.1 | 18.1 | 87.9 | 18.2 |
| Sub2 | 84.2 | 10.4 | 85.9 | 10.1 | 88.0 | 12.3 | 88.9 | 13.9 | 88.0 | 12.5 | 88.5 | 14.8 | 88.8 | 15.1 | 88.4 | 14.3 | 88.9 | 15.6 |
| Sub3 | 80.9 | 12.7 | 83.9 | 14.0 | 86.5 | 13.4 | 86.8 | 14.0 | 86.7 | 12.8 | 87.0 | 12.4 | 89.3 | 12.3 | 91.1 | 13.9 | 92.0 | 14.2 |
| Sub4 | 74.2 | 14.5 | 77.3 | 17.5 | 83.1 | 13.4 | 83.2 | 15.3 | 82.6 | 17.4 | 84.9 | 16.5 | 85.4 | 16.5 | 86.9 | 17.0 | 86.9 | 17.7 |
| Sub5 | 83.3 | 15.0 | 87.1 | 15.5 | 89.8 | 16.4 | 90.1 | 13.8 | 90.6 | 15.2 | 90.8 | 16.5 | 91.4 | 17.3 | 91.0 | 16.9 | 92.6 | 16.8 |
| Overall | 79.7 | 13.4 | 82.6 | 13.9 | 86.3 | 13.7 | 87.0 | 14.5 | 86.6 | 14.8 | 87.7 | 15.7 | 88.5 | 15.9 | 89.1 | 16.0 | 89.6 | 16.5 |
| 0.001 | 0.009 | 0.107 | 0.203 | 0.080 | 0.093 | 0.295 | 0.158 | |||||||||||
The component level recognition result at = 3(%) (Sub3 as reference subject).
| Conditions | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sub1 | 86.7 | 91.9 | 89.1 | 96.8 | 98.8 | 96.8 | 99.9 | 96.2 | 99.8 | 99.8 | 97.8 | 99.7 | 98.3 | 96.2 | 4.3 |
| Sub2 | 93.6 | 94.7 | 95.8 | 99.5 | 96.7 | 96.3 | 99.9 | 98.7 | 99.5 | 99.6 | 93.0 | 99.6 | 97.3 | 97.2 | 2.4 |
| Sub3 | 93.9 | 90.7 | 94.0 | 99.6 | 95.3 | 92.3 | 99.4 | 91.4 | 92.7 | 99.5 | 99.9 | 97.6 | 92.8 | 95.3 | 3.4 |
| Sub4 | 88.4 | 89.9 | 88.4 | 97.7 | 95.8 | 95.4 | 99.7 | 99.2 | 99.6 | 99.8 | 97.1 | 99.7 | 94.3 | 95.7 | 4.3 |
| Sub5 | 86.8 | 87.9 | 87.4 | 99.2 | 97.2 | 90.7 | 99.8 | 97.5 | 99.8 | 99.6 | 98.9 | 99.7 | 95.4 | 95.3 | 5.2 |
| 89.8 | 91.0 | 90.9 | 98.5 | 96.7 | 94.3 | 99.7 | 96.6 | 98.2 | 99.6 | 97.3 | 99.2 | 95.6 | 95.9 | 3.4 | |
| 3.5 | 2.5 | 3.7 | 1.2 | 1.3 | 2.6 | 0.2 | 3.1 | 3.1 | 0.1 | 2.6 | 0.9 | 2.2 | 0.7 |
The component level recognition result at = 3(%) (Sub5 as reference subject).
| Conditions | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sub1 | 84.9 | 85.0 | 91.4 | 98.4 | 96.8 | 92.3 | 98.7 | 92.5 | 97.7 | 99.6 | 98.9 | 99.7 | 95.4 | 94.7 | 4.9 |
| Sub2 | 91.0 | 93.0 | 93.3 | 99.2 | 94.1 | 98.9 | 99.5 | 96.1 | 99.7 | 99.7 | 93.0 | 99.7 | 97.4 | 96.4 | 3.0 |
| Sub3 | 92.0 | 91.8 | 92.8 | 99.6 | 94.8 | 94.1 | 98.6 | 96.1 | 99.7 | 99.5 | 99.9 | 97.6 | 92.8 | 96.1 | 3.0 |
| Sub4 | 87.3 | 88.1 | 89.8 | 99.2 | 97.1 | 93.9 | 99.7 | 99.7 | 99.7 | 99.8 | 97.1 | 99.7 | 94.3 | 95.8 | 4.5 |
| Sub5 | 92.8 | 88.7 | 91.4 | 98.4 | 96.8 | 92.3 | 99.7 | 92.5 | 99.7 | 99.6 | 98.9 | 99.7 | 95.4 | 95.8 | 3.7 |
| 89.6 | 89.3 | 91.7 | 98.9 | 95.9 | 94.3 | 99.2 | 95.3 | 99.3 | 99.6 | 97.5 | 99.2 | 95.0 | 95.7 | 3.8 | |
| 3.0 | 2.8 | 1.2 | 0.4 | 1.2 | 2.4 | 0.4 | 2.6 | 0.8 | 0.1 | 2.4 | 0.8 | 1.4 | 1.5 |
Figure 9The recognition results under three different testing sets (Sub3 as the reference subject).
Figure 10The recognition results under three different testing sets (Sub5 as the reference subject).