Literature DB >> 10770830

Minimizing binding errors using learned conjunctive features.

B W Mel1, J Fiser.   

Abstract

We have studied some of the design trade-offs governing visual representations based on spatially invariant conjunctive feature detectors, with an emphasis on the susceptibility of such systems to false-positive recognition errors-Malsburg's classical binding problem. We begin by deriving an analytical model that makes explicit how recognition performance is affected by the number of objects that must be distinguished, the number of features included in the representation, the complexity of individual objects, and the clutter load, that is, the amount of visual material in the field of view in which multiple objects must be simultaneously recognized, independent of pose, and without explicit segmentation. Using the domain of text to model object recognition in cluttered scenes, we show that with corrections for the nonuniform probability and nonindependence of text features, the analytical model achieves good fits to measured recognition rates in simulations involving a wide range of clutter loads, word size, and feature counts. We then introduce a greedy algorithm for feature learning, derived from the analytical model, which grows a representation by choosing those conjunctive features that are most likely to distinguish objects from the cluttered backgrounds in which they are embedded. We show that the representations produced by this algorithm are compact, decorrelated, and heavily weighted toward features of low conjunctive order. Our results provide a more quantitative basis for understanding when spatially invariant conjunctive features can support unambiguous perception in multiobject scenes, and lead to several insights regarding the properties of visual representations optimized for specific recognition tasks.

Entities:  

Mesh:

Year:  2000        PMID: 10770830     DOI: 10.1162/089976600300015574

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  7 in total

1.  Invariant Visual Object and Face Recognition: Neural and Computational Bases, and a Model, VisNet.

Authors:  Edmund T Rolls
Journal:  Front Comput Neurosci       Date:  2012-06-19       Impact factor: 2.380

2.  Multimap formation in visual cortex.

Authors:  Rishabh Jain; Rachel Millin; Bartlett W Mel
Journal:  J Vis       Date:  2015       Impact factor: 2.240

3.  Category selectivity in the ventral visual pathway confers robustness to clutter and diverted attention.

Authors:  Leila Reddy; Nancy Kanwisher
Journal:  Curr Biol       Date:  2007-11-08       Impact factor: 10.834

4.  Physiologically inspired model for the visual recognition of transitive hand actions.

Authors:  Falk Fleischer; Vittorio Caggiano; Peter Thier; Martin A Giese
Journal:  J Neurosci       Date:  2013-04-10       Impact factor: 6.167

Review 5.  The neural basis of visual object learning.

Authors:  Hans P Op de Beeck; Chris I Baker
Journal:  Trends Cogn Sci       Date:  2009-11-27       Impact factor: 20.229

6.  A dual-route approach to orthographic processing.

Authors:  Jonathan Grainger; Johannes C Ziegler
Journal:  Front Psychol       Date:  2011-04-13

7.  Ventral-stream-like shape representation: from pixel intensity values to trainable object-selective COSFIRE models.

Authors:  George Azzopardi; Nicolai Petkov
Journal:  Front Comput Neurosci       Date:  2014-07-30       Impact factor: 2.380

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.