Literature DB >> 23787339

Learning to relate images.

Roland Memisevic1.   

Abstract

A fundamental operation in many vision tasks, including motion understanding, stereopsis, visual odometry, or invariant recognition, is establishing correspondences between images or between images and data from other modalities. Recently, there has been increasing interest in learning to infer correspondences from data using relational, spatiotemporal, and bilinear variants of deep learning methods. These methods use multiplicative interactions between pixels or between features to represent correlation patterns across multiple images. In this paper, we review the recent work on relational feature learning, and we provide an analysis of the role that multiplicative interactions play in learning to encode relations. We also discuss how square-pooling and complex cell models can be viewed as a way to represent multiplicative interactions and thereby as a way to encode relations.

Mesh:

Year:  2013        PMID: 23787339     DOI: 10.1109/TPAMI.2013.53

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  3 in total

1.  AI, visual imagery, and a case study on the challenges posed by human intelligence tests.

Authors:  Maithilee Kunda
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-24       Impact factor: 11.205

2.  An Embodied Multi-Sensor Fusion Approach to Visual Motion Estimation Using Unsupervised Deep Networks.

Authors:  E Jared Shamwell; William D Nothwang; Donald Perlis
Journal:  Sensors (Basel)       Date:  2018-05-04       Impact factor: 3.576

3.  Toward a Unified Sub-symbolic Computational Theory of Cognition.

Authors:  Martin V Butz
Journal:  Front Psychol       Date:  2016-06-21
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.