| Literature DB >> 31694330 |
Hyun-Koo Kim1, Kook-Yeol Yoo1, Ju H Park2, Ho-Youl Jung1.
Abstract
In this paper, we propose a method of generating a color image from light detection and ranging (LiDAR) 3D reflection intensity. The proposed method is composed of two steps: projection of LiDAR 3D reflection intensity into 2D intensity, and color image generation from the projected intensity by using a fully convolutional network (FCN). The color image should be generated from a very sparse projected intensity image. For this reason, the FCN is designed to have an asymmetric network structure, i.e., the layer depth of the decoder in the FCN is deeper than that of the encoder. The well-known KITTI dataset for various scenarios is used for the proposed FCN training and performance evaluation. Performance of the asymmetric network structures are empirically analyzed for various depth combinations for the encoder and decoder. Through simulations, it is shown that the proposed method generates fairly good visual quality of images while maintaining almost the same color as the ground truth image. Moreover, the proposed FCN has much higher performance than conventional interpolation methods and generative adversarial network based Pix2Pix. One interesting result is that the proposed FCN produces shadow-free and daylight color images. This result is caused by the fact that the LiDAR sensor data is produced by the light reflection and is, therefore, not affected by sunlight and shadow.Entities:
Keywords: LiDAR imaging; LiDAR sensor; advanced driver assistance system; asymmetric network model; image generation
Year: 2019 PMID: 31694330 PMCID: PMC6864548 DOI: 10.3390/s19214818
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Typical network architecture for gray-to-color image generation.
Figure 2The proposed color image-generation system architecture.
Figure 3Architecture design of our proposed image-generation network model.
Figure 4Encoder-Decoder-structured FCN models.
Figure 5Training and inference processes in the proposed network.
Evaluation dataset.
| Dataset | Category | Total | |||||||
|---|---|---|---|---|---|---|---|---|---|
| City | Residential | Road | |||||||
| Left | Right | Left | Right | Left | Right | Left | Right | Total (Ratio) | |
| Training set | 376 | 374 | 208 | 209 | 852 | 853 | 1436 | 1436 | 2872 (66.6%) |
| Validation set | 93 | 95 | 51 | 53 | 213 | 213 | 357 | 361 | 718 (16.7%) |
| Testing set | 94 | 93 | 53 | 51 | 214 | 213 | 361 | 357 | 718 (16.7%) |
| Total sets | 563 | 562 | 312 | 313 | 1279 | 1279 | 2154 | 2154 | 4308 (100.0%) |
Figure 6Histogram of the number of valid LiDAR points in 2D lidar reflection-intensity image.
Number of layers and parameters when varying the number of blocks ( and ) in symmetric structured networks.
| Network Model | Encoder | Decoder | Total | |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| M-1-1 | 12 | 395,424 | 11 | 984,419 | 23 | 1,379,843 |
| M-2-2 | 20 | 592,464 | 19 | 1,181,459 | 39 | 1,773,923 |
| M-3-3 | 28 | 789,504 | 27 | 1,378,499 | 55 | 2,168,003 |
| M-4-4 | 36 | 986,544 | 35 | 1,575,539 | 71 | 2,562,083 |
| M-5-5 | 44 | 1,183,584 | 43 | 1,772,579 | 87 | 2,956,163 |
| M-6-6 | 52 | 1,380,624 | 51 | 1,969,619 | 103 | 3,350,243 |
| M-7-7 | 60 | 1,577,664 | 59 | 2,166,659 | 119 | 3,744,323 |
| M-8-8 | 68 | 1,774,704 | 67 | 2,363,699 | 135 | 4,138,403 |
| M-9-9 | 76 | 1,971,744 | 75 | 2,560,739 | 151 | 4,532,483 |
| M-10-10 | 84 | 2,168,784 | 83 | 2,757,779 | 167 | 4,926,563 |
Performance results when varying the number of blocks ( and ) in symmetric structured networks. Bold-faced numbers indicate the top-ranked network model and its scores.
| Network Model | Validation Set | Test Set | Total | |||
|---|---|---|---|---|---|---|
| PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | |
| M-1-1 | 17.45 | 0.41 | 17.45 | 0.41 | 17.45 | 0.41 |
| M-2-2 | 18.50 | 0.46 | 18.45 | 0.46 | 18.48 | 0.46 |
| M-3-3 | 18.90 | 0.47 | 18.86 | 0.48 | 18.88 | 0.47 |
| M-4-4 | 19.12 | 0.48 | 19.05 | 0.48 | 19.09 | 0.48 |
| M-5-5 | 19.20 | 0.49 | 19.13 | 0.49 | 19.17 | 0.49 |
|
|
|
|
|
|
|
|
| M-7-7 | 19.19 | 0.48 | 19.16 | 0.48 | 19.18 | 0.48 |
| M-8-8 | 19.12 | 0.48 | 19.10 | 0.48 | 19.11 | 0.48 |
| M-9-9 | 18.85 | 0.47 | 18.83 | 0.47 | 18.84 | 0.47 |
| M-10-10 | 18.76 | 0.46 | 18.72 | 0.46 | 18.74 | 0.46 |
Number of layers and parameters when varying the number of blocks ( and ) in asymmetric structured networks.
|
|
|
|
|---|---|---|
| 4 | 39 | 1,773,923 |
| 5 | 47 | 1,970,963 |
| 6 | 55 | 2,168,003 |
| 7 | 63 | 2,365,043 |
| 8 | 71 | 2,562,083 |
| 9 | 79 | 2,759,123 |
| 10 | 87 | 2,956,163 |
| 11 | 95 | 3,153,203 |
| 12 | 103 | 3,350,243 |
| 13 | 111 | 3,547,283 |
| 14 | 119 | 3,744,323 |
| 15 | 127 | 3,941,363 |
| 16 | 135 | 4,138,403 |
Figure 7PSNR performance according to in a model with a fixed total number of layers.
Performance according to depth of decoder in the proposed asymmetric structured networks. Bold-faced numbers indicate the top-ranked network model and its scores.
| Network Model | Validation Set | Test Set | Total | |||
|---|---|---|---|---|---|---|
| PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | |
| M-3-1 | 18.55 | 0.46 | 18.55 | 0.46 | 18.55 | 0.46 |
| M-3-2 | 18.69 | 0.47 | 18.49 | 0.47 | 18.69 | 0.47 |
| M-3-3 | 18.90 | 0.47 | 18.86 | 0.47 | 18.88 | 0.47 |
| M-3-4 | 19.02 | 0.48 | 19.02 | 0.48 | 19.02 | 0.48 |
| M-3-5 | 19.15 | 0.49 | 19.13 | 0.49 | 19.14 | 0.49 |
| M-3-6 | 19.17 | 0.49 | 19.14 | 0.49 | 19.16 | 0.49 |
| M-3-7 | 19.27 | 0.49 | 19.24 | 0.49 | 19.26 | 0.49 |
| M-3-8 | 19.36 | 0.49 | 19.34 | 0.50 | 19.35 | 0.495 |
|
|
|
|
|
|
|
|
| M-3-10 | 19.33 | 0.49 | 19.34 | 0.49 | 19.34 | 0.49 |
| M-3-11 | 19.31 | 0.49 | 19.33 | 0.49 | 19.32 | 0.49 |
| M-3-12 | 18.55 | 0.45 | 18.54 | 0.45 | 18.55 | 0.45 |
| M-3-13 | 18.41 | 0.44 | 18.35 | 0.44 | 18.38 | 0.44 |
Performance results of the proposed method and conventional methods. Bold-faced numbers indicate the top-ranked method and its scores.
| Method | Validation Set | Test Set | Total | |||
|---|---|---|---|---|---|---|
| PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | |
| IDW interpolation | 11.15 | 0.20 | 11.14 | 0.20 | 11.15 | 0.20 |
| NN interpolation | 9.36 | 0.18 | 9.35 | 0.18 | 9.36 | 0.18 |
| Pix2Pix | 15.96 | 0.39 | 15.98 | 0.39 | 15.97 | 0.39 |
|
|
|
|
|
|
|
|
Figure 8Inference examples in the validation and test dataset. (a) Case images with bus and road lanes; (b) Case images captured at short distance; and (c) Case images with vehicles at various distances are shown.
Figure 9Additional inference examples of GT images with heavy shadows.