Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Cross-Modal Attentional Context Learning for RGB-D Object Detection.

Literature DB >> 30387729

Cross-Modal Attentional Context Learning for RGB-D Object Detection.

Guanbin Li, Yukang Gan, Hejun Wu, Nong Xiao, Liang Lin.

Abstract

Recognizing objects from simultaneously sensed photometric (RGB) and depth channels is a fundamental yet practical problem in many machine vision applications, such as robot grasping and autonomous driving. In this paper, we address this problem by developing a cross-modal attentional context (CMAC) learning framework, which enables the full exploitation of the context information from both RGB and depth data. Compared to existing RGB-D object detection frameworks, our approach has several appealing properties. First, it consists of an attention-based global context model for exploiting adaptive contextual information and incorporating this information into a region-based CNN (e.g., fast RCNN) framework to achieve improved object detection performance. Second, our CMAC framework further contains a fine-grained object part attention module to harness multiple discriminative object parts inside each possible object region for superior local feature representation. While greatly improving the accuracy of RGB-D object detection, the effective cross-modal information fusion as well as attentional context modeling in our proposed model provide an interpretable visualization scheme. Experimental results demonstrate that the proposed method significantly improves upon the state of the art on all public benchmarks.

Year: 2018 PMID： 30387729 DOI： 10.1109/TIP.2018.2878956

Source DB: PubMed Journal: IEEE Trans Image Process ISSN： 1057-7149 Impact factor: 10.856

Keyword Cloud
Cited

2 in total

1. Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments.

Authors: Chi Xu; Jun Zhou; Wendi Cai; Yunkai Jiang; Yongbo Li; Yi Liu
Journal: Sensors (Basel) Date: 2020-11-07 Impact factor: 3.576

2. Developing a real-time social distancing detection system based on YOLOv4-tiny and bird-eye view for COVID-19.

Authors: Sergio Saponara; Abdussalam Elhanashi; Qinghe Zheng
Journal: J Real Time Image Process Date: 2022-02-22 Impact factor: 2.293

2 in total