| Literature DB >> 32823956 |
Chanhwi Lee1, Jaehan Kim2, Seoungbae Cho2, Jinwoong Kim2, Jisang Yoo1, Soonchul Kwon3.
Abstract
The use of human gesturing to interact with devices such as computers or smartphones has presented several problems. This form of interaction relies on gesture interaction technology such as Leap Motion from Leap Motion, Inc, which enables humans to use hand gestures to interact with a computer. The technology has excellent hand detection performance, and even allows simple games to be played using gestures. Another example is the contactless use of a smartphone to take a photograph by simply folding and opening the palm. Research on interaction with other devices via hand gestures is in progress. Similarly, studies on the creation of a hologram display from objects that actually exist are also underway. We propose a hand gesture recognition system that can control the Tabletop holographic display based on an actual object. The depth image obtained using the latest Time-of-Flight based depth camera Azure Kinect is used to obtain information about the hand and hand joints by using the deep-learning model CrossInfoNet. Using this information, we developed a real time system that defines and recognizes gestures indicating left, right, up, and down basic rotation, and zoom in, zoom out, and continuous rotation to the left and right.Entities:
Keywords: azure kinect; deep-learning; gesture interaction; hand detection; hologram display
Mesh:
Year: 2020 PMID: 32823956 PMCID: PMC7471984 DOI: 10.3390/s20164566
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Overview of the gesture recognition for the tabletop holographic display.
Figure 2Azure Kinect. (a) Azure Kinect configuration; (b) Azure Kinect view field. (source: www.docs.microsoft.com)
Figure 3CrossInfoNet structure.
Figure 4Block diagram of real-time interactive holographic display by hand gestures.
Figure 5Framework of the hand gesture recognition system based on depth frames.
Figure 6Background subtraction.
Figure 78 hand gestures. (a) Left; (b) Right; (c) Up; (d) Down; (e) Zoom-out; (f) Zoom-in; (g) Continuous rotation right; (h) Continuous rotation left.
Figure 8Experiment environment. (a) Tabletop holographic environment; (b) Gesture interaction environment.
Figure 9The experimental result before (a) and after (b) of applying the background subtraction.
Figure 10Result of Region of Interest (ROI) and Bounding box. (a) Applied ROI; (b) Hand detection with 14 joint points and Bounding box.
Figure 11Result of basic rotation gestures. (a) Default state; (b) Up; (c) Down; (d) Right; (e) Left.
Figure 12Result of zoom in and out gestures. (a) Zoom in; (b) Default state; (c) Zoom out.
Experiment results of precision error, recall error, and F1 score about each gesture of 10 subject at 35–50 cm.
| Gesture | Left | Right | Up | Down | Zoom-In | Zoom-Out | Continue Left | Continue Right |
|---|---|---|---|---|---|---|---|---|
| Total Attempts | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| True Positive | 100 | 100 | 98 | 100 | 99 | 99 | 100 | 93 |
| False Positive | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| False Negative | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 7 |
| Precision | 100 | 100 | 100 | 100 | 99 | 99 | 100 | 100 |
| Recall | 100 | 100 | 98 | 100 | 100 | 100 | 100 | 93 |
| F1 score | 100 | 100 | 98.98 | 100 | 99.49 | 99.49 | 100 | 96.37 |
Experiment results of precision error, recall error, and F1 score about each gesture of 10 subjects at 50–60 cm.
| Gesture | Left | Right | Up | Down | Zoom-in | Zoom-out | Continue Left | Continue Right |
|---|---|---|---|---|---|---|---|---|
| Total Attempts | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| True Positive | 94 | 91 | 100 | 98 | 99 | 99 | 93 | 88 |
| False Positive | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| False Negative | 6 | 9 | 0 | 2 | 0 | 0 | 7 | 12 |
| Precision | 100 | 100 | 100 | 100 | 99 | 99 | 100 | 100 |
| Recall | 94 | 91 | 100 | 98 | 100 | 100 | 93 | 88 |
| F1 score | 96.90 | 95.28 | 100 | 98.98 | 99.49 | 99.49 | 96.37 | 93.61 |
Experiment results of precision error, recall error, and F1 score about each gesture of 10 subjects at 60–70 cm.
| Gesture | Left | Right | Up | Down | Zoom-In | Zoom-Out | Continue Left | Continue Right |
|---|---|---|---|---|---|---|---|---|
| Total Attempts | 100 | 100 | 100 | 100 | 100 | 100 | 100 | 100 |
| True Positive | 90 | 90 | 91 | 94 | 99 | 99 | 83 | 79 |
| False Positive | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| False Negative | 10 | 10 | 9 | 6 | 0 | 0 | 17 | 21 |
| Precision | 100 | 100 | 100 | 100 | 99 | 99 | 100 | 100 |
| Recall | 90 | 90 | 91 | 94 | 100 | 100 | 83 | 79 |
| F1 score | 94.73 | 94.73 | 95.28 | 96.90 | 99.49 | 99.49 | 90.71 | 88.26 |
Experiment results of total precision error, total recall error, and total F1 score.
| Distance | 35–50 cm | 50–60 cm | 60–70 cm |
|---|---|---|---|
| Precision | 0.99747 | 0.99747 | 0.99747 |
| Recall | 0.98872 | 0.95500 | 0.90875 |
| F1 score | 0.99307 | 0.97577 | 0.95104 |