| Literature DB >> 35990884 |
Long Cheng1, Ni Liu2, Xusen Guo2, Yuhao Shen1, Zijun Meng1, Kai Huang2,3, Xiaoqin Zhang1.
Abstract
As bio-inspired vision devices, dynamic vision sensors (DVS) are being applied in more and more applications. Unlike normal cameras, pixels in DVS independently respond to the luminance change with asynchronous output spikes. Therefore, removing raindrops and streaks from DVS event videos is a new but challenging task as the conventional deraining methods are no longer applicable. In this article, we propose to perform the deraining process in the width and time (W-T) space. This is motivated by the observation that rain steaks exhibits discontinuity in the width and time directions while background moving objects are usually piecewise smooth along with both directions. The W-T space can fuse the discontinuity in both directions and thus transforms raindrops and streaks to approximately uniform noise that are easy to remove. The non-local means filter is adopted as background object motion has periodic patterns in the W-T space. A repairing method is also designed to restore edge details erased during the deraining process. Experimental results demonstrate that our approach can better remove rain noise than the four existing methods for traditional camera videos. We also study how the event buffer depth and event frame time affect the performance investigate the potential implementation of our approach to classic RGB images. A new real-world database for DVS deraining is also created and shared for public use.Entities:
Keywords: deraining; dynamic vision sensors; intelligent driving; outdoor vision systems; rain removal
Year: 2022 PMID: 35990884 PMCID: PMC9387434 DOI: 10.3389/fnbot.2022.928707
Source DB: PubMed Journal: Front Neurorobot ISSN: 1662-5218 Impact factor: 3.493
Figure 1The working principle of the CeleXTM chipset.
Figure 2A rainy DVS frame and the deraining results of two existing algorithms and our approach. Observe MCSC (Li M. et al., 2018) only removes part of the rain streaks and ReMAEN (Yang and Lu, 2019) failed to preserve most texture details of background objects. (A) Input. (B) MCSC. (C) ReMAEN. (D) Our.
Figure 3Transformation from normal space to the W-T space.
Figure 4(A–D) Four frames from a rainy DVS sequence comprising 60 images. (E) Their W-T perspective.
Figure 5The structure of WTSD. (A) The event frame buffer. (B) A transformed image in the W-T space. Rain streaks and system noise become approximately uniform noise. (C) The extracted motion curves. Note that the edges are hampered. (D) After processing all the images in the W-T space, rain free images in normal space are retrieved by performing the transformation reversely. (E) Using our repairing method, we get images having more edge details.
Remove rain and noise in the W-T space
| get |
| get |
| |
Figure 6The rainy input, rain removal results by different methods, and the ground truth (GT) of nine different frames. WTSD is our approach while MCSC, PMG, and ReMAEN are the ones for comparison. Note the part framed in red of PMG results. (A) Input. (B) MCSC. (C) PMG. (D) ReMAEN. (E) WTSD. (F) GT.
Quantitative comparison of different approaches.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Input | 2234.651 | 14.680 | 0.107 | 0.244 | 0.374 | 0.444 | 0.151 |
| MCSC | 1303.015 | 17.327 | 0.0574 | 0.823 | 0.755 | 0.756 | 0.787 |
| PMG | 1308.653 | 17.313 | 0.0546 | 0.801 | 0.746 | 0.738 | 0.680 |
| ReMAEN | 1045.639 | 20.267 | 0.10 | 0.906 |
| 0.765 | 0.401 |
| WTSD |
|
|
|
| 0.904 |
|
|
Values in boldface are the best ones.
Quantitative comparison of different event frame buffer depths.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 3 |
|
| 0.209 | 0.891 | 0.878 | 0.852 | 0.902 |
| 5 | 744.283 | 20.846 | 0.229 | 0.920 |
| 0.913 | 0.916 |
| 7 | 818.348 | 20.612 | 0.233 | 0.917 | 0.903 | 0.919 | 0.913 |
| 10 | 846.601 | 20.577 | 0.243 | 0.920 | 0.904 | 0.923 | 0.915 |
| 15 | 840.685 | 20.606 | 0.244 | 0.920 | 0.905 | 0.923 | 0.916 |
| 20 | 840.685 | 20.606 | 0.244 | 0.920 | 0.905 | 0.923 | 0.916 |
| 25 | 840.685 | 20.606 | 0.244 | 0.920 | 0.905 | 0.923 | 0.916 |
| 30 | 835.386 | 20.670 | 0.246 | 0.921 | 0.905 | 0.923 | 0.916 |
| 40 | 822.589 | 20.765 | 0.249 | 0.921 | 0.906 | 0.924 | 0.916 |
| 50 | 810.712 | 20.830 |
|
| 0.907 |
|
|
Values in boldface are the best ones.
Quantitative comparison of different event frame times T (ms).
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 30 | 646.495 | 20.461 |
| 0.933 | 0.913 | 0.937 | 0.929 |
| 15 | 477.290 | 21.707 | 0.302 |
|
| 0.962 |
|
| 5 |
|
| 0.181 | 0.942 |
|
| 0.927 |
Values in boldface are the best ones.
Quantitative comparison when adopting different denoising methods in the W-T space.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| NLM | 846.601 | 20.577 | 0.243 | 0.920 | 0.904 | 0.923 | 0.915 |
| NLM-R | 790.034 | 20.782 | 0.262 | 0.922 | 0.904 | 0.922 | 0.917 |
| NLM-RG |
|
|
|
|
|
|
|
| MF | 795.768 | 19.433 | 0.0837 | 0.828 | 0.810 | 0.751 | 0.857 |
| MF-R | 762.147 | 19.518 | 0.102 | 0.831 | 0.789 | 0.745 | 0.857 |
| MF-RG | 597.407 | 20.808 | 0.151 | 0.900 | 0.860 | 0.860 | 0.889 |
| SWM | 1325.803 | 17.001 | 0.0779 | 0.648 | 0.598 | 0.615 | 0.146 |
| SWM-R | 2221.985 | 14.705 | 0.106 | 0.246 | 0.375 | 0.445 | 0.154 |
| SWM-RG | 995.026 | 18.348 | 0.133 | 0.649 | 0.660 | 0.632 | 0.553 |
NLM-R denotes the NLM-based method for repairing. NLM-RG means the same method when adopting global deraining. Same notations are applied to the MF and SWF-based methods. Values in boldface are the best ones.
Figure 7(A) One image selected from a normal rainy video. (B) The corresponding output image from our approach. (C) One image in the W-T space sliced at a height of 200.