Literature DB >> 33816967

Online supervised attention-based recurrent depth estimation from monocular video.

Dmitrii Maslov1, Ilya Makarov1,2.   

Abstract

Autonomous driving highly depends on depth information for safe driving. Recently, major improvements have been taken towards improving both supervised and self-supervised methods for depth reconstruction. However, most of the current approaches focus on single frame depth estimation, where quality limit is hard to beat due to limitations of supervised learning of deep neural networks in general. One of the way to improve quality of existing methods is to utilize temporal information from frame sequences. In this paper, we study intelligent ways of integrating recurrent block in common supervised depth estimation pipeline. We propose a novel method, which takes advantage of the convolutional gated recurrent unit (convGRU) and convolutional long short-term memory (convLSTM). We compare use of convGRU and convLSTM blocks and determine the best model for real-time depth estimation task. We carefully study training strategy and provide new deep neural networks architectures for the task of depth estimation from monocular video using information from past frames based on attention mechanism. We demonstrate the efficiency of exploiting temporal information by comparing our best recurrent method with existing image-based and video-based solutions for monocular depth reconstruction. ©2020 Maslov and Makarov.

Entities:  

Keywords:  Augmented Reality; Autonomous Vehicles; Computer Science Methods; Computer Vision; Deep Convolutional Neural Networks; Depth Reconstruction; Recurrent Neural Networks

Year:  2020        PMID: 33816967      PMCID: PMC7924529          DOI: 10.7717/peerj-cs.317

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  4 in total

1.  Learning Depth from Single Monocular Images Using Deep Convolutional Neural Fields.

Authors:  Fayao Liu; Chunhua Shen; Guosheng Lin; Ian Reid
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-12-03       Impact factor: 6.226

2.  Depth Transfer: Depth Extraction from Video Using Non-Parametric Sampling.

Authors:  Kevin Karsch; Ce Liu; Sing Bing Kang
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2014-11       Impact factor: 6.226

3.  Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding.

Authors:  Chenxu Luo; Zhenheng Yang; Peng Wang; Yang Wang; Wei Xu; Ramkant Nevatia; Alan Yuille
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-07-23       Impact factor: 6.226

4.  Deep Ordinal Regression Network for Monocular Depth Estimation.

Authors:  Huan Fu; Mingming Gong; Chaohui Wang; Kayhan Batmanghelich; Dacheng Tao
Journal:  Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit       Date:  2018-12-17
  4 in total
  1 in total

1.  Self-supervised recurrent depth estimation with attention mechanisms.

Authors:  Ilya Makarov; Maria Bakhanova; Sergey Nikolenko; Olga Gerasimova
Journal:  PeerJ Comput Sci       Date:  2022-01-31
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.