| Literature DB >> 33495660 |
Baofu Fang1,2,3, Gaofei Mei1, Xiaohui Yuan4, Le Wang1, Zaijun Wang5, Junyang Wang1.
Abstract
The COVID-19 pandemic has affected many countries, posing a threat to human health and safety, and putting tremendous pressure on the medical system. This paper proposes a novel SLAM technology using RGB and depth images to improve hospital operation efficiency, reduce the risk of doctor-patient cross-infection, and curb the spread of the COVID-19. Most current visual SLAM researches assume that the environment is stationary, which makes handling real-world scenarios such as hospitals a challenge. This paper proposes a method that effectively deals with SLAM problems for scenarios with dynamic objects, e.g., people and movable objects, based on the semantic descriptor extracted from images with help of a knowledge graph. Specifically, our method leverages a knowledge graph to construct a priori movement relationship between entities and establishes high-level semantic information. Built upon this knowledge graph, a semantic descriptor is constructed to describe the semantic information around key points, which is rotation-invariant and robust to illumination. The seamless integration of the knowledge graph and semantic descriptor helps eliminate the dynamic objects and improves the accuracy of tracking and positioning of robots in dynamic environments. Experiments are conducted using data acquired from healthcare facilities, and semantic maps are established to meet the needs of robots for delivering medical services. In addition, to compare with the state-of-the-art methods, a publicly available dataset is used in our evaluation. Compared with the state-of-the-art methods, our proposed method demonstrated great improvement with respect to both accuracy and robustness in dynamic environments. The computational efficiency is also competitive.Entities:
Keywords: COVID-19 pandemic; Dynamic scenes; Knowledge graph; Semantic descriptors; Visual SLAM
Year: 2021 PMID: 33495660 PMCID: PMC7816967 DOI: 10.1016/j.patcog.2021.107822
Source DB: PubMed Journal: Pattern Recognit ISSN: 0031-3203 Impact factor: 7.740
Fig. 1Framework diagram of our system.
Fig. 2An illustration of a knowledge graph.
Fig. 3Semantic map and descriptor. (a) RGB image (b) semantic map (c) semantic descriptor. and in (a) are two key points and the zoom-in views of these two points and the neighborhood patches are shown in (c).
Fig. 4Two examples of the semantic descriptor. The left and right are semantic descriptors of and respectively.
Fig. 5Flowchart of dynamic object detection and rejection.
Fig. 6Examples of semantic maps. From top to bottom, the panels on each row depict the RGB image, the depth image, the rendered point cloud with color, and the semantic map. The left column shows a case with only stationary objects and the right column shows a case with non-stationary objects, in which the non-stationary objects are removed.
Fig. 7Robot Trajectory and health facility scenes.
Trajectory error in terms of average RMSE (aRMSE), absolute error distance, and median error distance.
| ORB-SLAM2 | Ours | |||||
|---|---|---|---|---|---|---|
| Sequence | aRMSE | Mean (STD) | Median | aRMSE | Mean (STD) | Median |
| fr3-walking-static | 0.3575 | 0.3243 (0.1490) | 0.2870 | 0.0104 | 0.0090 (0.0052) | 0.0079 |
| fr3-walking-xyz | 0.6770 | 0.5826 (0.3423) | 0.5223 | 0.0164 | 0.0139 (0.0087) | 0.0116 |
| fr3-walking-half | 0.5186 | 0.4567 (0.2424) | 0.4282 | 0.0923 | 0.0857 (0.0344) | 0.0838 |
| fr3-sitting-static | 0.0082 | 0.0072 (0.0039) | 0.0066 | 0.0065 | 0.0060 (0.0033) | 0.0054 |
| fr3-sitting-xyz | 0.0094 | 0.0079 (0.0051) | 0.0070 | 0.0088 | 0.0079 (0.0043) | 0.0070 |
| fr3-sitting-half | 0.0205 | 0.0159 (0.0130) | 0.0136 | 0.0145 | 0.0125 (0.0074) | 0.0115 |
The aRMSE of ATE for TUM RGB-D dataset in dynamic environments.
| Sequences | Sun | Li | Wang | Wang | Ours |
|---|---|---|---|---|---|
| fr3-walking-static | 0.0656 | 0.0261 | 0.3080 | ||
| fr3-walking-xyz | 0.0932 | 0.0601 | 0.3047 | ||
| fr3-walking-half | 0.1252 | 0.3116 | 0.0923 | ||
| fr3-sitting-static | - | - | 0.0078 | ||
| fr3-sitting-xyz | 0.0482 | 0.0397 | - | ||
| fr3-sitting-half | 0.0470 | 0.0432 | 0.0217 |
aRMSE of translational drift (RPE) for TUM RGB-D dataset in dynamic environments [m/s].
| Sequences | Sun | Li | Kim | Wang | Ours |
|---|---|---|---|---|---|
| fr3-walking-static | 0.0842 | 0.1339 | 0.1881 | ||
| fr3-walking-xyz | 0.1214 | 0.2326 | 0.2158 | ||
| fr3-walking-half | 0.1672 | 0.1738 | 0.1908 | ||
| fr3-sitting-static | - | 0.0231 | 0.0248 | ||
| fr3-sitting-xyz | 0.0330 | 0.0219 | 0.0482 | ||
| fr3-sitting-half | 0.0458 | 0.0389 | 0.0589 | 0.0245 |
aRMSE of rotational drift (RPE) for TUM RGB-D dataset in dynamic environments[/s].
| Sequences | Sun | Li | Kim | Wang | Ours |
|---|---|---|---|---|---|
| fr3-walking-static | 2.0487 | 2.0833 | 3.2101 | ||
| fr3-walking-xyz | 3.2346 | 4.3911 | 3.6476 | ||
| fr3-walking-half | 5.0108 | 4.2863 | 3.3321 | ||
| fr3-sitting-static | - | 0.7228 | 0.6997 | ||
| fr3-sitting-xyz | 0.9828 | 0.8466 | 1.3885 | ||
| fr3-sitting-half | 2.3748 | 1.8836 | 2.8804 |
RMSE of the trajectories with different values of parameters.
| Sequences | |||
|---|---|---|---|
| fr3-walking-static | 0.0198 | 0.0163 | |
| fr3-walking-xyz | 0.0382 | 0.0285 | |
| fr3-walking-half | 0.3020 | 0.2062 | |
| fr3-sitting-static | 0.0075 | 0.0074 | |
| fr3-sitting-xyz | 0.0093 | 0.0088 | |
| fr3-sitting-half | 0.0181 | 0.0145 |
Fig. 8Robot trajectory from test case fr3-walking-xyz.
Fig. 9Robot trajectory from test case fr3-walking-halfsphere.
Fig. 10Robot trajectory from test case fr3-sitting-static.
Absolute trajectory error of our algorithm with and without distance calculation.
| Ours without distance calculation | Ours with distance calculation | |||||||
|---|---|---|---|---|---|---|---|---|
| Sequence | RMSE | Mean | Median | S.D. | RMSE | Mean | Median | S.D. |
| fr3-walking-static | 0.0104 | 0.0090 | 0.0079 | 0.0052 | 0.0095 | 0.0083 | 0.0074 | 0.0045 |
| fr3-walking-xyz | 0.0164 | 0.0139 | 0.0116 | 0.0087 | 0.0158 | 0.0131 | 0.0107 | 0.0079 |
| fr3-walking-half | 0.0923 | 0.0857 | 0.0838 | 0.0344 | 0.0916 | 0.0850 | 0.0827 | 0.0332 |
| fr3-sitting-static | 0.0065 | 0.0060 | 0.0054 | 0.0033 | 0.0058 | 0.0051 | 0.0049 | 0.0030 |
| fr3-sitting-xyz | 0.0088 | 0.0079 | 0.0070 | 0.0043 | 0.0077 | 0.0072 | 0.0062 | 0.0045 |
| fr3-sitting-half | 0.0145 | 0.0125 | 0.0115 | 0.0074 | 0.0154 | 0.0133 | 0.0121 | 0.0078 |
Average time expense (in milliseconds).
| Case | ORB-SLAM2 | Berta | Yu | Ours |
|---|---|---|---|---|
| Dynamic | - | 235.98 | 48.31 | |
| Stationary | 3,362.22 | 55.19 | 1,021.37 |