| Literature DB >> 25609039 |
Abstract
Sign language is a visual language used by deaf people. One difficulty of sign language recognition is that sign instances of vary in both motion and shape in three-dimensional (3D) space. In this research, we use 3D depth information from hand motions, generated from Microsoft's Kinect sensor and apply a hierarchical conditional random field (CRF) that recognizes hand signs from the hand motions. The proposed method uses a hierarchical CRF to detect candidate segments of signs using hand motions, and then a BoostMap embedding method to verify the hand shapes of the segmented signs. Experiments demonstrated that the proposed method could recognize signs from signed sentence data at a rate of 90.4%.Entities:
Mesh:
Year: 2014 PMID: 25609039 PMCID: PMC4327011 DOI: 10.3390/s150100135
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1.Overview of the proposed method for recognizing a sign.
Figure 2.Skeleton model: 10 upper body components.
Figure 3.Hand detection: (a) color image (b) depth image and feature positions (c) and detected regions with black wristband.
Seven features for recognizing the signer's hand.
| Position of the left hand with respect to the signer's face | |
| Position of the right hand with respect to the signer's face | |
| Position of the left hand with respect to the previous left hand | |
| Position of the right hand with respect to the previous right hand | |
| Position of the left hand with respect to the shoulder center | |
| Position of the right hand with respect to the shoulder center | |
| Occlusion of two hands |
Examples of hand shapes for sign language recognition: Categories of hand shapes are described in [1,3].
| Car (T) | S | S |
| Past (O) | Open B > Bent B | D.C. |
| Out (O) | Flat C > Flat O | D.C. |
O stands for one-handed sign; T stands for two-handed sign; D.C. means don't care; > Means that the hand shapes of start and end frames of the sign are changed.
Figure 4.Examples hand shape used for training the BoostMap embeddings.
24 ASL signs used in the vocabulary.
| One-handed signs | And, Know, Man, Out, Past, Tell, Yesterday |
| Two-handed signs | Arrive, Big, Born, Car, Decide, Different, Finish, Here, Many, Maybe, Now, Rain, Read, Take-off, Together, What, Wow |
Figure 5.Two examples of ASL signs; B and E indicate means beginning and end, respectively.
ASL recognition results.
| CRF2D | 185 | 34 | 25 | 21 | 33.3 | 77.0 |
| 197 | 27 | 24 | 16 | 27.9 | 82.0 | |
| H-CRF2D [ | 202 | 23 | 15 | 15 | 22.0 | 84.1 |
| H-CRF3D | 217 | 12 | 9 | 11 | 13.3 | 90.4 |
N is 240; 3D means using features extracted in 3D space; 2D means using features extracted in 2D space.
Figure 6.Sign language recognition result for H-CRF using a signed sentence.
Figure 7.Sign language recognition result for H-CRF using a signed sentence.
Figure 8.Sign language recognition result for H-CRF using a signed sentence that includes the sign “Different”.
Figure 9.Hand shape recognition results with the signed sentence of Figure 8.