Literature DB >> 33466401

Towards a Robust Visual Place Recognition in Large-Scale vSLAM Scenarios Based on a Deep Distance Learning.

Liang Chen1, Sheng Jin1, Zhoujun Xia1.   

Abstract

The application of deep learning is blooming in the field of visual place recognition, which plays a critical role in visual Simultaneous Localization and Mapping (vSLAM) applications. The use of convolutional neural networks (CNNs) achieve better performance than handcrafted feature descriptors. However, visual place recognition is still a challenging task due to two major problems, i.e., perceptual aliasing and perceptual variability. Therefore, designing a customized distance learning method to express the intrinsic distance constraints in the large-scale vSLAM scenarios is of great importance. Traditional deep distance learning methods usually use the triplet loss which requires the mining of anchor images. This may, however, result in very tedious inefficient training and anomalous distance relationships. In this paper, a novel deep distance learning framework for visual place recognition is proposed. Through in-depth analysis of the multiple constraints of the distance relationship in the visual place recognition problem, the multi-constraint loss function is proposed to optimize the distance constraint relationships in the Euclidean space. The new framework can support any kind of CNN such as AlexNet, VGGNet and other user-defined networks to extract more distinguishing features. We have compared the results with the traditional deep distance learning method, and the results show that the proposed method can improve the performance by 19-28%. Additionally, compared to some contemporary visual place recognition techniques, the proposed method can improve the performance by 40%/36% and 27%/24% in average on VGGNet/AlexNet using the New College and the TUM datasets, respectively. It's verified the method is capable to handle appearance changes in complex environments.

Entities:  

Keywords:  CNN; deep distance learning; multi-constraint loss; vSLAM; visual place recognition

Year:  2021        PMID: 33466401      PMCID: PMC7796086          DOI: 10.3390/s21010310

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  6 in total

1.  Factors of Transferability for a Generic ConvNet Representation.

Authors:  Hossein Azizpour; Ali Sharif Razavian; Josephine Sullivan; Atsuto Maki; Stefan Carlsson
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-11-12       Impact factor: 6.226

2.  End-to-End Comparative Attention Networks for Person Re-Identification.

Authors: 
Journal:  IEEE Trans Image Process       Date:  2017-05-03       Impact factor: 10.856

3.  NetVLAD: CNN Architecture for Weakly Supervised Place Recognition.

Authors:  Relja Arandjelovic; Petr Gronat; Akihiko Torii; Tomas Pajdla; Josef Sivic
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-06-01       Impact factor: 6.226

4.  SIFT Meets CNN: A Decade Survey of Instance Retrieval.

Authors:  Liang Zheng; Yi Yang; Qi Tian
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-05       Impact factor: 6.226

5.  Large-Scale Place Recognition Based on Camera-LiDAR Fused Descriptor.

Authors:  Shaorong Xie; Chao Pan; Yaxin Peng; Ke Liu; Shihui Ying
Journal:  Sensors (Basel)       Date:  2020-05-19       Impact factor: 3.576

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.