Ente Guo, Zhifeng Chen, Yanlin Zhou, Dapeng Oliver Wu.
Abstract
Estimating image depth and agent egomotion is important for autonomous vehicles and robots to understand the surrounding environment and avoid collisions. Most existing unsupervised methods estimate depth and camera egomotion by minimizing the photometric error between adjacent frames. However, the photometric consistency assumption does not always hold in real scenes, for example under brightness changes, moving objects, and occlusion. To reduce the influence of brightness changes, we propose a feature pyramid matching loss (FPML) that captures the trainable feature error between the current frame and its adjacent frames and is therefore more robust than the photometric error. In addition, we propose an occlusion-aware mask (OAM) network that indicates occlusion according to changes in the masks, improving the estimation accuracy of depth and camera pose. The experimental results verify that the proposed unsupervised approach is highly competitive against state-of-the-art methods, both qualitatively and quantitatively. Specifically, our method reduces the absolute relative error (Abs Rel) by 0.017-0.088.
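The abstract does not give the exact formulation of the FPML, so the following is a minimal PyTorch sketch of the idea under stated assumptions: a small trainable encoder produces a feature pyramid for the current frame and for the adjacent frame warped into the current view (using the predicted depth and pose), and an L1 error is accumulated across pyramid levels, optionally weighted by a per-pixel validity mask such as one produced by an occlusion-aware mask network. The class name `FeaturePyramidMatchingLoss`, the level count, channel widths, and the nearest-neighbor mask down-sampling are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of a feature-pyramid matching loss in the spirit of
# the paper's FPML; layer choices, weights, and mask handling are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeaturePyramidMatchingLoss(nn.Module):
    """Compares learned multi-scale features of the current frame with
    features of the adjacent frame warped into the current view."""

    def __init__(self, num_levels=3, base_channels=16):
        super().__init__()
        # A small trainable encoder producing a feature pyramid;
        # each level halves the spatial resolution (assumption).
        layers, in_ch = [], 3
        for i in range(num_levels):
            out_ch = base_channels * (2 ** i)
            layers.append(nn.Sequential(
                nn.Conv2d(in_ch, out_ch, kernel_size=3, stride=2, padding=1),
                nn.ReLU(inplace=True)))
            in_ch = out_ch
        self.levels = nn.ModuleList(layers)

    def pyramid(self, img):
        # Collect feature maps at every pyramid level.
        feats, x = [], img
        for level in self.levels:
            x = level(x)
            feats.append(x)
        return feats

    def forward(self, current, warped_adjacent, valid_mask=None):
        # warped_adjacent: adjacent frame already warped into the current
        # view using the predicted depth and camera pose.
        # valid_mask (optional): per-pixel weights in [0, 1], e.g. from an
        # occlusion-aware mask network, down-sampled to each level.
        loss = 0.0
        for f_cur, f_adj in zip(self.pyramid(current),
                                self.pyramid(warped_adjacent)):
            diff = (f_cur - f_adj).abs()
            if valid_mask is not None:
                m = F.interpolate(valid_mask, size=diff.shape[-2:],
                                  mode="nearest")
                diff = diff * m  # broadcasts the 1-channel mask over features
            loss = loss + diff.mean()
        return loss

# Illustrative usage with random tensors at a KITTI-like resolution.
fpml = FeaturePyramidMatchingLoss()
current = torch.rand(1, 3, 128, 416)
warped = torch.rand(1, 3, 128, 416)   # adjacent frame after view synthesis
mask = torch.ones(1, 1, 128, 416)     # e.g. output of an OAM-style network
print(fpml(current, warped, mask))
```

Because the encoder is trainable, this loss is optimized jointly with the depth and pose networks, which is what distinguishes the feature-level comparison from a fixed photometric error.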
Keywords: feature pyramid matching loss; monocular depth estimation; occlusion-aware mask network; single camera egomotion
Year: 2021 PMID: 33573136 PMCID: PMC7866542 DOI: 10.3390/s21030923
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576