Literature DB >> 32491986

Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks.

Deng-Ping Fan, Zheng Lin, Zhao Zhang, Menglong Zhu, Ming-Ming Cheng.   

Abstract

The use of RGB-D information for salient object detection (SOD) has been extensively explored in recent years. However, relatively few efforts have been put toward modeling SOD in real-world human activity scenes with RGB-D. In this article, we fill the gap by making the following contributions to RGB-D SOD: 1) we carefully collect a new Salient Person (SIP) data set that consists of ~1 K high-resolution images that cover diverse real-world scenes from various viewpoints, poses, occlusions, illuminations, and background s; 2) we conduct a large-scale (and, so far, the most comprehensive) benchmark comparing contemporary methods, which has long been missing in the field and can serve as a baseline for future research, and we systematically summarize 32 popular models and evaluate 18 parts of 32 models on seven data sets containing a total of about 97k images; and 3) we propose a simple general architecture, called deep depth-depurator network (D3Net). It consists of a depth depurator unit (DDU) and a three-stream feature learning module (FLM), which performs low-quality depth map filtering and cross-modal feature learning, respectively. These components form a nested structure and are elaborately designed to be learned jointly. D3Net exceeds the performance of any prior contenders across all five metrics under consideration, thus serving as a strong model to advance research in this field. We also demonstrate that D3Net can be used to efficiently extract salient object masks from real scenes, enabling effective background-changing application with a speed of 65 frames/s on a single GPU. All the saliency maps, our new SIP data set, the D3Net model, and the evaluation tools are publicly available at https://github.com/DengPingFan/D3NetBenchmark.

Entities:  

Mesh:

Year:  2021        PMID: 32491986     DOI: 10.1109/TNNLS.2020.2996406

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  4 in total

Review 1.  Salient Object Detection Techniques in Computer Vision-A Survey.

Authors:  Ashish Kumar Gupta; Ayan Seal; Mukesh Prasad; Pritee Khanna
Journal:  Entropy (Basel)       Date:  2020-10-19       Impact factor: 2.524

Review 2.  RGB-D salient object detection: A survey.

Authors:  Tao Zhou; Deng-Ping Fan; Ming-Ming Cheng; Jianbing Shen; Ling Shao
Journal:  Comput Vis Media (Beijing)       Date:  2021-01-07

3.  Fast Detection of Tomato Sucker Using Semantic Segmentation Neural Networks Based on RGB-D Images.

Authors:  Truong Thi Huong Giang; Tran Quoc Khai; Dae-Young Im; Young-Jae Ryoo
Journal:  Sensors (Basel)       Date:  2022-07-08       Impact factor: 3.847

4.  Dynamic Knowledge Distillation with Noise Elimination for RGB-D Salient Object Detection.

Authors:  Guangyu Ren; Yinxiao Yu; Hengyan Liu; Tania Stathaki
Journal:  Sensors (Basel)       Date:  2022-08-18       Impact factor: 3.847

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.