| Literature DB >> 30082649 |
Tomasz Kapuscinski1, Patryk Organisciak2.
Abstract
In this paper, a method of handshapes recognition based on skeletal data is described. A new feature vector is proposed. It encodes the relative differences between vectors associated with the pointing directions of the particular fingers and the palm normal. Different classifiers are tested on the demanding dataset, containing 48 handshapes performed 500 times by five users. Two different sensor configurations and significant variation in the hand rotation are considered. The late fusion at the decision level of individual models, as well as a comparative study carried out on a publicly available dataset, are also included.Entities:
Keywords: finger alphabet; handshape recognition; sign language; skeletal data
Year: 2018 PMID: 30082649 PMCID: PMC6111288 DOI: 10.3390/s18082577
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Recent works on handshape recognition using skeletal data.
| Work | Sign | Sign | Users | Device | Method | Accuracy | Data |
|---|---|---|---|---|---|---|---|
| Type | Vocabulary | [%] | Available | ||||
| [ | static | 10 letters ASL | 14 | LMC | SVM | 80.86 | Yes |
| LMC + Kinect | 91.28 | ||||||
| [ | static | 26 letters ASL | 2 | LMC | kNN | 72.78 | No |
| SVM | 79.83 | ||||||
| [ | static | 28 letters ArSL | 1 | LMC | NB | 98.3 | No |
| MP | 99.1 | ||||||
| [ | dynamic | 50 sign ArSL | 2 | LMC | MP | 88 | No |
| [ | static | 10 digits ASL | 8 | 2 LMs | HMM | 93.14 | No |
| [ | static | 24 letters ASL | 1 | LMC | DT + GA | 82.71 | No |
| [ | static | 5 simple shapes | ? | LMC | BSR | ? | No |
| [ | static | 10 digits ISL | 4 | LMC | MP | 100 | No |
| [ | static | 28 letters ArSL | 20 | LMC + Kinect | SVM | 86 | No |
| [ | dynamic | 25 sign ISL | 10 | LMC + Kinect | CHMM | 90.80 | Yes |
| [ | static | 28 letters ArSL | 4 | LMC + Kinect | kNN | 100 | No |
| [ | static | 26 letters SIBI | 1 | LMC | RB-BGANN + GA | 93.8 | No |
| [ | static | 20 letters ASL | 50 | LMC, RealSense | SVM | 60–100 | No |
| [ | static | 26 letters ASL | 10 | LMC | kNN | 70–100 | No |
| 10 digits ASL | 95–100 | ||||||
| [ | static | 44 letters ThSL | ? | LMC | DT | 72.83 | No |
| [ | static | 28 letters ArSL | 1 | 2 LMs | LDA | 97.7 | No |
| [ | static | 10 hand shapes | 13 | LMC | SVM | 99.42 | No |
| [ | dynamic | 28 signs and | 10 | LMC | SVM + BLSTM | 63.57 | No |
| 28 words ISL | |||||||
| [ | static | 8 hand shapes | 20 | LMC | kNN | 95 | Yes |
| [ | static | 10 hand shapes | 14 | LMC + Kinect | 97 | No |
Figure 1Hand skeletal model.
Figure 2Feature vector construction.
Tested classifiers and their parameters.
| Classifier | Parameter | Value |
|---|---|---|
| DT | Maximum number of splits | 100 |
| Split criterion | Gini’s diversity index | |
| LD | Covariance structure | Full |
| QD | Covariance structure | Full |
| SVM Lin/Quad/Cub/Gauss | Kernel function | Linear/Quadratic/Cubic/Gaussian |
| Box constraint level | 1 | |
| Multiclass method | One-vs-one | |
| 1 NN/10 NN/100 NN | Number of neighbors | 1 /10/100 |
| Distance metric | Euclidean | |
| 10 NN Cos | Number of neighbors | 10 |
| Distance metric | Cosine | |
| 10 NN W | Number of neighbors | 10 |
| Distance metric | Euclidean | |
| Distance weight | Squared inverse | |
| Ens Boost/Bag/RUS | Ensemble method | Boosted/bagged/random subspace trees |
| Learner type | Decision tree | |
| Number of learners | 30 | |
| Ens Sub D/Sub kNN | Ensemble method | Subspace |
| Learner type | Discriminant/1 NN | |
| Number of learners | 30 | |
| Subspace dimension | 8 | |
| FLANN | Number of neighbors | 1 |
| Number of trees | 8 | |
| Number of times the trees | 128 | |
| should be recursively traversed |
Figure 3Static handshapes, occurring in Polish Finger Alphabet and Polish Sign Language.
10-fold cross-validation results for dataset 1, configuration (i), variant (a).
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Recognition rate [%] | 81.2 | 72.8 | 99.7 | 99.5 | 100.0 | 100.0 | 100.0 | 100.0 | 99.9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Recognition rate [%] | 99.2 | 99.9 | 100.0 | 64.2 | 100.0 | 69.9 | 100.0 | 39.9 | 100.0 |
10-fold cross-validation results for dataset 1, configuration (i), variant (b).
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Recognition rate [%] | 83.1 | 78.7 | 97.2 | 96.7 | 99.4 | 99.7 | 99.1 | 99.8 | 98.5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Recognition rate [%] | 88.9 | 98.5 | 99.5 | 67.1 | 99.7 | 77.3 | 99.8 | 35.8 | 99.7 |
10-fold cross-validation results for dataset 1, configuration (ii).
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Recognition rate [%] | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Recognition rate [%] | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 43.8 | 100.0 |
Leave-one-subject-out cross-validation results for dataset 1, configuration (i), variant (1).
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Recognition rate [%] | 41.4 | 50.9 | 42.5 | 52.3 | 49.1 | 46.9 | 14.0 | 48.6 | 48.6 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Recognition rate [%] | 48.9 | 47.9 | 48.7 | 43.3 | 50.8 | 46.8 | 49.9 | 28.7 | 47.6 |
Leave-one-subject-out cross-validation results for dataset 1, configuration (i), variant (1).
| Training | Testing | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | |||||||
| B, C, D, E | A | 37.4 | 51.3 | 32.1 | 46.1 | 41.0 | 38.8 | 4.0 | 43.1 | 43.3 |
| A, C, D, E | B | 31.2 | 37.6 | 29.6 | 33.0 | 32.9 | 31.0 | 5.6 | 32.3 | 31.7 |
| A, B, D, E | C | 47.9 | 54.7 | 41.6 | 57.8 | 57.5 | 55.4 | 21.8 | 60.6 | 61.2 |
| A, B, C, E | D | 41.6 | 53.8 | 51.1 | 58.6 | 54.0 | 51.7 | 18.7 | 52.3 | 51.3 |
| A, B, C, D | E | 48.9 | 57.1 | 58.1 | 66.1 | 60.1 | 57.4 | 19.8 | 54.5 | 55.5 |
| Avg | 41.4 | 50.9 | 42.5 | 52.3 | 49.1 | 46.9 | 14.0 | 48.6 | 48.6 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
| B, C, D, E | A | 45.0 | 44.3 | 43.2 | 38.7 | 39.6 | 50.4 | 42.2 | 27.6 | 40.9 |
| A, C, D, E | B | 32.6 | 32.7 | 31.8 | 37.6 | 40.5 | 31.3 | 35.9 | 16.4 | 33.8 |
| A, B, D, E | C | 60.4 | 55.7 | 61.6 | 46.6 | 59.0 | 51.0 | 60.4 | 35.3 | 56.5 |
| A, B, C, E | D | 48.9 | 51.7 | 51.3 | 45.2 | 56.1 | 44.8 | 52.0 | 30.1 | 50.8 |
| A, B, C, D | E | 57.7 | 55.3 | 55.4 | 48.5 | 58.7 | 56.5 | 59.0 | 33.9 | 56.1 |
| Avg | 48.9 | 47.9 | 48.7 | 43.3 | 50.8 | 46.8 | 49.9 | 28.7 | 47.6 | |
10-fold cross-validation results for dataset 2.
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Recognition rate [%] | 87.6 | 84.5 | 86.4 | 87.6 | 86.2 | 84.1 | 88.4 | 88.6 | 88.0 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Recognition rate [%] | 82.6 | 87.9 | 89.1 | 87.8 | 88.9 | 85.1 | 88.5 | 86.9 | 85.9 |
Leave-one-subject-out cross-validation results for the dataset 2.
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Recognition rate [%] | 86.2 | 84.2 | 87.5 | 87.6 | 86.4 | 82.3 | 86.7 | 89.2 | 85.5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Recognition rate [%] | 82.4 | 85.6 | 89.6 | 87.0 | 87.7 | 84.1 | 89.3 | 86.4 | 85.4 |
Leave-one-subject-out cross-validation results for the dataset 2.
| Training | Testing | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | |||||||
| 2–14 | 1 | 85.0 | 76.0 | 77.0 | 77.0 | 77.0 | 71.0 | 79.0 | 95.0 | 74.0 |
| 1, 3–14 | 2 | 89.0 | 87.0 | 91.0 | 90.0 | 90.0 | 91.0 | 91.0 | 89.0 | 90.0 |
| 1–2, 4–14 | 3 | 81.0 | 88.0 | 91.0 | 91.0 | 84.0 | 77.0 | 83.0 | 82.0 | 84.0 |
| 1–3, 5–14 | 4 | 89.0 | 92.0 | 97.0 | 96.0 | 95.0 | 87.0 | 97.0 | 94.0 | 94.0 |
| 1–4, 6–14 | 5 | 90.0 | 90.0 | 92.0 | 91.0 | 91.0 | 87.0 | 92.0 | 86.0 | 92.0 |
| 1–5, 7–14 | 6 | 91.0 | 92.0 | 93.0 | 92.0 | 92.0 | 89.0 | 93.0 | 93.0 | 92.0 |
| 1–6, 8–14 | 7 | 87.0 | 85.0 | 88.0 | 88.0 | 86.0 | 80.0 | 87.0 | 84.0 | 89.0 |
| 1–7, 9–14 | 8 | 86.0 | 90.0 | 92.0 | 92.0 | 87.0 | 90.0 | 93.0 | 91.0 | 92.0 |
| 1–8, 10–14 | 9 | 86.0 | 84.0 | 87.0 | 87.0 | 84.0 | 78.0 | 87.0 | 88.0 | 86.0 |
| 1–9, 11–14 | 10 | 84.0 | 77.0 | 89.0 | 89.0 | 89.0 | 86.0 | 81.0 | 86.0 | 85.0 |
| 1–10, 12–14 | 11 | 79.0 | 72.0 | 74.0 | 80.0 | 80.0 | 75.0 | 73.0 | 76.0 | 74.0 |
| 1–11, 13–14 | 12 | 90.0 | 94.0 | 100.0 | 100.0 | 100.0 | 96.0 | 100.0 | 95.0 | 97.0 |
| 1–12, 14 | 13 | 85.0 | 76.0 | 77.0 | 77.0 | 77.0 | 73.0 | 79.0 | 95.0 | 74.0 |
| 1–13 | 14 | 85.0 | 76.0 | 77.0 | 77.0 | 77.0 | 72.0 | 79.0 | 95.0 | 74.0 |
| Avg | 86.2 | 84.2 | 87.5 | 87.6 | 86.4 | 82.3 | 86.7 | 89.2 | 85.5 | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
| 2–14 | 1 | 73.0 | 76.0 | 95.0 | 77.0 | 80.0 | 76.0 | 95.0 | 74.0 | 95.0 |
| 1, 3–14 | 2 | 89.0 | 90.0 | 91.0 | 91.0 | 89.0 | 87.0 | 90.0 | 89.0 | 89.0 |
| 1–2, 4–14 | 3 | 88.0 | 84.0 | 82.0 | 84.0 | 83.0 | 88.0 | 82.0 | 90.0 | 82.0 |
| 1–3, 5–14 | 4 | 89.0 | 94.0 | 95.0 | 97.0 | 97.0 | 92.0 | 94.0 | 92.0 | 94.0 |
| 1–4, 6–14 | 5 | 90.0 | 91.0 | 89.0 | 92.0 | 92.0 | 91.0 | 86.0 | 92.0 | 80.0 |
| 1–5, 7–14 | 6 | 88.0 | 92.0 | 93.0 | 93.0 | 93.0 | 92.0 | 93.0 | 93.0 | 87.0 |
| 1–6, 8–14 | 7 | 82.0 | 89.0 | 87.0 | 88.0 | 86.0 | 88.0 | 83.0 | 88.0 | 78.0 |
| 1–7, 9–14 | 8 | 88.0 | 91.0 | 90.0 | 92.0 | 90.0 | 91.0 | 91.0 | 91.0 | 91.0 |
| 1–8, 10–14 | 9 | 86.0 | 85.0 | 86.0 | 87.0 | 88.0 | 84.0 | 88.0 | 87.0 | 88.0 |
| 1–9, 11–14 | 10 | 75.0 | 84.0 | 84.0 | 85.0 | 89.0 | 76.0 | 86.0 | 89.0 | 86.0 |
| 1-10, 12-14 | 11 | 74.0 | 74.0 | 78.0 | 78.0 | 79.0 | 72.0 | 75.0 | 79.0 | 69.0 |
| 1–11, 13–14 | 12 | 85.0 | 96.0 | 94.0 | 100.0 | 100.0 | 89.0 | 97.0 | 98.0 | 95.0 |
| 1–12, 14 | 13 | 73.0 | 76.0 | 95.0 | 77.0 | 82.0 | 76.0 | 95.0 | 74.0 | 70.0 |
| 1–13 | 14 | 73.0 | 76.0 | 95.0 | 77.0 | 80.0 | 76.0 | 95.0 | 74.0 | 91.0 |
| Avg | 82.4 | 85.6 | 89.6 | 87.0 | 87.7 | 84.1 | 89.3 | 86.4 | 85.4 | |
Support Vector Machines classifier with linear kernel function performance when the multiclass method was changed from one-vs-one to one-vs-all.
| Training | Testing | SVM Lin |
|---|---|---|
| B, C, D, E | A | 36.8 |
| A, C, D, E | B | 21.1 |
| A, B, D, E | C | 47.8 |
| A, B, C, E | D | 52.0 |
| A, B, C, D | E | 46.6 |
| Avg | 40.8 | |
Ens Bag performance for a different number of learners.
| Training | Testing | 10 | 20 | 30 | 40 | 50 | 100 | 200 | 400 | 800 | 1000 | 2000 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B, C, D, E | A | 44.6 | 47.4 | 39.6 | 46.8 | 45.3 | 46.8 | 46.9 | 46.1 | 45.4 | 45.9 | 46.3 |
| A, C, D, E | B | 40.1 | 40.3 | 40.5 | 38.0 | 38.6 | 37.0 | 38.8 | 39.2 | 40.0 | 41.4 | 38.7 |
| A, B, D, E | C | 53.2 | 56.6 | 59.0 | 56.1 | 55.8 | 53.9 | 58.6 | 58.3 | 57.4 | 57.7 | 58.0 |
| A, B, C, E | D | 54.8 | 54.6 | 56.1 | 54.8 | 58.5 | 57.9 | 57.0 | 57.4 | 57.9 | 58.1 | 57.9 |
| A, B, C, D | E | 58.7 | 60.0 | 58.7 | 60.6 | 58.6 | 63.4 | 60.5 | 61.2 | 62.5 | 63.0 | 61.7 |
| Avg | 50.4 | 51.8 | 50.8 | 51.3 | 51.3 | 51.8 | 52.4 | 52.4 | 52.6 | 53.2 | 52.5 | |
Figure 4Response time for Ens Bag classifier for different number of learners.
1 NN vs. FLANN with a different number of trees (given in parenthesis).
| Training | Testing | 1 NN | FLANN (1) | FLANN (2) | FLANN (4) | FLANN (8) | FLANN (16) | FLANN (32) |
|---|---|---|---|---|---|---|---|---|
| B, C, D, E | A | 43.1 | 38.0 | 41.2 | 39.5 | 40.4 | 41.1 | 40.4 |
| A, C, D, E | B | 32.3 | 31.1 | 30.9 | 33.6 | 33.6 | 33.0 | 33.6 |
| A, B, D, E | C | 60.6 | 55.8 | 56.5 | 57.4 | 56.1 | 57.0 | 56.4 |
| A, B, C, E | D | 52.3 | 52.1 | 51.3 | 51.3 | 51.1 | 51.5 | 51.2 |
| A, B, C, D | E | 54.5 | 55.1 | 56.6 | 56.5 | 55.7 | 55.5 | 55.6 |
| Avg | 48.6 | 46.4 | 47.3 | 47.6 | 47.4 | 47.6 | 47.4 | |
Average response times of the individual classifiers.
| Classifier | DT | LD | QD | SVM | SVM | SVM | SVM | 1 NN | 10 NN |
|---|---|---|---|---|---|---|---|---|---|
| Lin | Quad | Cub | Gauss | ||||||
| Response time [ms] | 0.07 | 0.46 | 0.42 | 26.73 | 31.98 | 30.12 | 64.87 | 24.15 | 26.64 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |||
| Response time [ms] | 29.24 | 22.50 | 23.14 | 3.22 | 2.95 | 9.3 | 47.65 | 4.14 | 0.06 |
10-fold cross validation results of different methods obtained for the Dataset 2.
| Lp | Reference | Features | Method | Recognition Rate |
|---|---|---|---|---|
| 1 | [ | Fingertips distances, angles and elevations | Multiclass SVM | 80.9% |
| 2 | [ | Fingertips Tip distance | Multiclass SVM | 81.1% |
| 3 | This paper | As described in | SVM Lin | 87.6% |
| 4 | This paper | As described in | 10NN W | 89.1% |