| Literature DB >> 30082618 |
Jongmin Jeong1, Tae Sung Yoon2, Jin Bae Park3.
Abstract
Semantic 3D maps are required for various applications including robot navigation and surveying, and their importance has significantly increased. Generally, existing studies on semantic mapping were camera-based approaches that could not be operated in large-scale environments owing to their computational burden. Recently, a method of combining a 3D Lidar with a camera was introduced to address this problem, and a 3D Lidar and a camera were also utilized for semantic 3D mapping. In this study, our algorithm consists of semantic mapping and map refinement. In the semantic mapping, a GPS and an IMU are integrated to estimate the odometry of the system, and subsequently, the point clouds measured from a 3D Lidar are registered by using this information. Furthermore, we use the latest CNN-based semantic segmentation to obtain semantic information on the surrounding environment. To integrate the point cloud with semantic information, we developed incremental semantic labeling including coordinate alignment, error minimization, and semantic information fusion. Additionally, to improve the quality of the generated semantic map, the map refinement is processed in a batch. It enhances the spatial distribution of labels and removes traces produced by moving vehicles effectively. We conduct experiments on challenging sequences to demonstrate that our algorithm outperforms state-of-the-art methods in terms of accuracy and intersection over union.Entities:
Keywords: 3D Lidar; large-scale mapping; map refinement; moving vehicle removal; semantic mapping; semantic reconstruction
Year: 2018 PMID: 30082618 PMCID: PMC6111277 DOI: 10.3390/s18082571
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Example of a semantic 3D map generated by the proposed method.
Figure 2Flowchart of the semantic 3D mapping method.
Figure 3Example of 2D semantic segmentation: (Top) input image (Bottom) prediction.
Figure 4Coordinate alignment: labeled voxels are projected onto image.
Figure 5Result of the error minimization process. (First column) a set of voxels with the same label. (Second column) results of Euclidean clustering. (Third column) results obtained by the classifier.
Figure 6Example of the trace removal from the top view.
Figure 7Visualization of semantic 3D mapping results. Top view for the entire map and three close-up views with different scenarios.
Figure 8Effectiveness of the map refinement. (First Row) original images. (Second Row) 2D semantic segmentation. (Third Row) semantic 3D map without map refinement. (Bottom Row) semantic 3D map with map refinement.
Figure 9Comparison of 2D semantic segmentation and 3D semantic segmentation.
Quantitative results for 3D semantic segmentation on the Sengupta labelled dataset. The bold fonts indicate the best results.
| Method | Road | Sidewalk | Building | Fence | Pole | Vegetation | Vehicle | |
|---|---|---|---|---|---|---|---|---|
| Accuracy | Sengupta [ | 97.8 | 86.5 | 88.5 | 46.1 | 38.2 | 86.9 | 88.5 |
| Sengupta [ | 97.0 | 73.4 | 89.1 | 45.7 | 3.3 | 81.2 | 72.5 | |
| Vineet [ |
| 91.8 | 97.2 | 47.8 | 51.4 | 94.1 | 94.1 | |
| Yang [ |
| 93.8 | 98.2 | 84.7 | 66.3 |
| 95.5 | |
| Ours |
|
|
|
|
| 95.3 |
| |
| IoU | Sengupta [ | 96.3 | 68.4 | 83.8 | 45.2 | 28.9 | 74.3 | 63.5 |
| Sengupta [ | 87.8 | 49.1 | 73.8 | 43.7 | 1.9 | 65.2 | 55.8 | |
| Vineet [ | 94.7 | 73.8 | 88.3 | 46.3 | 41.7 | 83.2 | 79.5 | |
| Yang [ | 96.6 | 90.0 | 95.4 | 81.1 |
| 91.0 | 94.6 | |
| Ours |
|
|
|
| 59.4 |
|
|
Time analysis of the proposed algorithm.
| Method | Time(s) |
|---|---|
| 2D Semantic Segmentation | 0.2412 |
| Euclidean Clustering | 0.0898 |
| Random Forest | 0.1913 |
| Semantic information fusion | 0.0003 |