Literature DB >> 33401740

Single- and Cross-Modality Near Duplicate Image Pairs Detection via Spatial Transformer Comparing CNN.

Yi Zhang1, Shizhou Zhang1, Ying Li1,2, Yanning Zhang1.   

Abstract

Recently, both single modality and cross modality near-duplicate image detection tasks have received wide attention in the community of pattern recognition and computer vision. Existing deep neural networks-based methods have achieved remarkable performance in this task. However, most of the methods mainly focus on the learning of each image from the image pair, thus leading to less use of the information between the near duplicate image pairs to some extent. In this paper, to make more use of the correlations between image pairs, we propose a spatial transformer comparing convolutional neural network (CNN) model to compare near-duplicate image pairs. Specifically, we firstly propose a comparing CNN framework, which is equipped with a cross-stream to fully learn the correlation information between image pairs, while considering the features of each image. Furthermore, to deal with the local deformations led by cropping, translation, scaling, and non-rigid transformations, we additionally introduce a spatial transformer comparing CNN model by incorporating a spatial transformer module to the comparing CNN architecture. To demonstrate the effectiveness of the proposed method on both the single-modality and cross-modality (Optical-InfraRed) near-duplicate image pair detection tasks, we conduct extensive experiments on three popular benchmark datasets, namely CaliforniaND (ND means near duplicate), Mir-Flickr Near Duplicate, and TNO Multi-band Image Data Collection. The experimental results show that the proposed method can achieve superior performance compared with many state-of-the-art methods on both tasks.

Entities:  

Keywords:  comparing CNN; near duplicate image pairs; spatial transformer network

Year:  2021        PMID: 33401740      PMCID: PMC7794762          DOI: 10.3390/s21010255

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  6 in total

1.  Aggregating local image descriptors into compact codes.

Authors:  Hervé Jégou; Florent Perronnin; Matthijs Douze; Jorge Sánchez; Patrick Pérez; Cordelia Schmid
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2012-09       Impact factor: 6.226

2.  Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification.

Authors:  Ruimao Zhang; Liang Lin; Rui Zhang; Wangmeng Zuo; Lei Zhang
Journal:  IEEE Trans Image Process       Date:  2015-08-11       Impact factor: 10.856

3.  Scale-rotation invariant pattern entropy for keypoint-based near-duplicate detection.

Authors:  Wan-Lei Zhao; Chong-Wah Ngo
Journal:  IEEE Trans Image Process       Date:  2009-02       Impact factor: 10.856

4.  Edge-SIFT: discriminative binary descriptor for scalable partial-duplicate mobile search.

Authors:  Shiliang Zhang; Qi Tian; Ke Lu; Qingming Huang; Wen Gao
Journal:  IEEE Trans Image Process       Date:  2013-03-07       Impact factor: 10.856

5.  Coupled binary embedding for large-scale image retrieval.

Authors:  Liang Zheng; Shengjin Wang; Qi Tian
Journal:  IEEE Trans Image Process       Date:  2014-06-12       Impact factor: 10.856

6.  The TNO Multiband Image Data Collection.

Authors:  Alexander Toet
Journal:  Data Brief       Date:  2017-09-22
  6 in total
  1 in total

1.  Transformer-Based Deep Learning Network for Tooth Segmentation on Panoramic Radiographs.

Authors:  Chen Sheng; Lin Wang; Zhenhuan Huang; Tian Wang; Yalin Guo; Wenjie Hou; Laiqing Xu; Jiazhu Wang; Xue Yan
Journal:  J Syst Sci Complex       Date:  2022-10-14       Impact factor: 1.272

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.