| Literature DB >> 33171709 |
Fatemeh Mohammadi Amin1, Maryam Rezayati1, Hans Wernher van de Venn1, Hossein Karimpour2.
Abstract
Digital-enabled manufacturing systems require a high level of automation for fast and low-cost production but should also present flexibility and adaptiveness to varying and dynamic conditions in their environment, including the presence of human beings; however, this presence of workers in the shared workspace with robots decreases the productivity, as the robot is not aware about the human position and intention, which leads to concerns about human safety. This issue is addressed in this work by designing a reliable safety monitoring system for collaborative robots (cobots). The main idea here is to significantly enhance safety using a combination of recognition of human actions using visual perception and at the same time interpreting physical human-robot contact by tactile perception. Two datasets containing contact and vision data are collected by using different volunteers. The action recognition system classifies human actions using the skeleton representation of the latter when entering the shared workspace and the contact detection system distinguishes between intentional and incidental interactions if physical contact between human and cobot takes place. Two different deep learning networks are used for human action recognition and contact detection, which in combination, are expected to lead to the enhancement of human safety and an increase in the level of cobot perception about human intentions. The results show a promising path for future AI-driven solutions in safe and productive human-robot collaboration (HRC) in industrial automation.Entities:
Keywords: artificial intelligence; collision detection; human action recognition; industrial automation; safe physical human–robot collaboration
Mesh:
Year: 2020 PMID: 33171709 PMCID: PMC7664417 DOI: 10.3390/s20216347
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Three-dimensional convolutional neural networks (CNN) for human action recognition.
Figure 2Contact detection network diagram.
Figure 3Real-time interface of complex system.
Figure 4Type of human actions: (a) Passing: operator is just passing by, without paying attention to the robot. (b) Fail: blind spots or occlusion of the visual field may happen for a camera, in this situation the second camera takes over detection. (c) Observation: operator enters the working zone, without any interaction intention and stands next to the robot. (d) Dangerous Observation: operator proximity is too close, especially his head is at danger of collision with the robot. (e) Interaction: operator enters the working zone and prepares to work with the robot.
Precision and recall of two trained networks for human action recognition.
| Network | 2D | 3D | ||
|---|---|---|---|---|
| Precision | Recall | Precision | Recall | |
| Observation | 0.99 | 0.99 | 1.00 | 1.00 |
| Interaction | 1.00 | 1.00 | 1.00 | 1.00 |
| Passing | 1.00 | 1.00 | 1.00 | 1.00 |
| Fail | 1.00 | 1.00 | 1.00 | 1.00 |
| Dangerous Observation | 0.98 | 0.96 | 0.98 | 0.99 |
| Accuracy | 0.9956 | 0.9972 | ||
Confusion Matrix for different classes in HRC.
| Network | 2D | 3D | |||||||||
| Observation | Interaction | Passing | Fail | Dangerous Observation | Observation | Interaction | Passing | Fail | Dangerous Observation | ||
| True Labels | Observation | 3696 | 7 | 2 | 0 | 5 | 3751 | 6 | 2 | 1 | 7 |
| Interaction | 13 | 4130 | 0 | 0 | 1 | 8 | 4030 | 0 | 0 | 0 | |
| Passing | 2 | 0 | 1145 | 0 | 0 | 1 | 0 | 1160 | 0 | 0 | |
| Fail | 0 | 0 | 0 | 593 | 0 | 0 | 0 | 0 | 588 | 0 | |
| Dangerous Observation | 12 | 1 | 0 | 0 | 313 | 2 | 0 | 0 | 0 | 359 | |
Precision and recall of trained networks for contact detection with different window size.
| w | 100 | 200 | 300 | 100 | 200 | 300 |
|---|---|---|---|---|---|---|
| Precision | Recall | |||||
| No-Contact | 0.94 | 0.99 | 0.98 | 0.94 | 1.00 | 1.00 |
| Intentional_Link5 | 0.74 | 0.91 | 0.89 | 0.84 | 0.91 | 0.84 |
| Intentional_Link6 | 0.68 | 0.97 | 0.91 | 0.64 | 0.90 | 0.91 |
| Incidental_Link5 | 0.61 | 0.89 | 0.83 | 0.61 | 0.93 | 0.89 |
| Incidental_Link6 | 0.69 | 0.96 | 0.96 | 0.57 | 0.96 | 0.93 |
| Accuracy | 0.78 | 0.96 | 0.93 | |||
Confusion matrix of trained networks for contact detection with different window size.
| Window Size | 100 | 200 | 300 | |||||||||||||
| No-Contact | Intentional_Link5 | Intentional_Link6 | Incidental_Link5 | Incidental_Link6on | No-Contact | Intentional_Link5 | Intentional_Link6 | Incidental_Link5 | Incidental_Link6 | No-Contact | Intentional_Link5 | Intentional_Link6 | Incidental_Link5 | Incidental_Link6 | ||
| True Labels | No-Contact | 166 | 0 | 9 | 0 | 1 | 242 | 0 | 3 | 0 | 1 | 167 | 0 | 3 | 0 | 0 |
| Intentional_Link5 | 0 | 86 | 12 | 19 | 0 | 0 | 93 | 4 | 4 | 1 | 0 | 86 | 5 | 5 | 1 | |
| Intentional_Link6 | 8 | 1 | 59 | 2 | 17 | 0 | 3 | 83 | 0 | 0 | 0 | 5 | 84 | 0 | 3 | |
| Incidental_Link5 | 0 | 15 | 1 | 33 | 5 | 0 | 6 | 0 | 50 | 0 | 0 | 10 | 0 | 48 | 0 | |
| Incidental_Link6 | 3 | 0 | 11 | 0 | 31 | 0 | 0 | 2 | 0 | 52 | 0 | 1 | 0 | 1 | 50 | |
Figure 5Safety perception spectrum in (a) visual perception, (b) contact perception, (c) mixed perception safety systems.