Literature DB >> 12816566

Learning optimized features for hierarchical models of invariant object recognition.

Heiko Wersing1, Edgar Körner.   

Abstract

There is an ongoing debate over the capabilities of hierarchical neural feedforward architectures for performing real-world invariant object recognition. Although a variety of hierarchical models exists, appropriate supervised and unsupervised learning methods are still an issue of intense research. We propose a feedforward model for recognition that shares components like weight sharing, pooling stages, and competitive nonlinearities with earlier approaches but focuses on new methods for learning optimal feature-detecting cells in intermediate stages of the hierarchical network. We show that principles of sparse coding, which were previously mostly applied to the initial feature detection stages, can also be employed to obtain optimized intermediate complex features. We suggest a new approach to optimize the learning of sparse features under the constraints of a weight-sharing or convolutional architecture that uses pooling operations to achieve gradual invariance in the feature hierarchy. The approach explicitly enforces symmetry constraints like translation invariance on the feature set. This leads to a dimension reduction in the search space of optimal features and allows determining more efficiently the basis representatives, which achieve a sparse decomposition of the input. We analyze the quality of the learned feature representation by investigating the recognition performance of the resulting hierarchical network on object and face databases. We show that a hierarchy with features learned on a single object data set can also be applied to face recognition without parameter changes and is competitive with other recent machine learning recognition approaches. To investigate the effect of the interplay between sparse coding and processing nonlinearities, we also consider alternative feedforward pooling nonlinearities such as presynaptic maximum selection and sum-of-squares integration. The comparison shows that a combination of strong competitive nonlinearities with sparse coding offers the best recognition performance in the difficult scenario of segmentation-free recognition in cluttered surround. We demonstrate that for both learning and recognition, a precise segmentation of the objects is not necessary.

Mesh:

Year:  2003        PMID: 12816566     DOI: 10.1162/089976603321891800

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  11 in total

1.  Visual object categorization in birds and primates: integrating behavioral, neurobiological, and computational evidence within a "general process" framework.

Authors:  Fabian A Soto; Edward A Wasserman
Journal:  Cogn Affect Behav Neurosci       Date:  2012-03       Impact factor: 3.282

2.  A feedforward architecture accounts for rapid categorization.

Authors:  Thomas Serre; Aude Oliva; Tomaso Poggio
Journal:  Proc Natl Acad Sci U S A       Date:  2007-04-02       Impact factor: 11.205

3.  Biased Competition in Visual Processing Hierarchies: A Learning Approach Using Multiple Cues.

Authors:  Alexander R T Gepperth; Sven Rebhan; Stephan Hasler; Jannik Fritsch
Journal:  Cognit Comput       Date:  2011-01-19       Impact factor: 5.418

4.  What are the Visual Features Underlying Rapid Object Recognition?

Authors:  Sébastien M Crouzet; Thomas Serre
Journal:  Front Psychol       Date:  2011-11-15

5.  Modeling invariant object processing based on tight integration of simulated and empirical data in a Common Brain Space.

Authors:  Judith C Peters; Joel Reithler; Rainer Goebel
Journal:  Front Comput Neurosci       Date:  2012-03-09       Impact factor: 2.380

6.  A model of the ventral visual system based on temporal stability and local memory.

Authors:  Reto Wyss; Peter König; Paul F M J Verschure
Journal:  PLoS Biol       Date:  2006-04-18       Impact factor: 8.029

7.  Towards a mathematical theory of cortical micro-circuits.

Authors:  Dileep George; Jeff Hawkins
Journal:  PLoS Comput Biol       Date:  2009-10-09       Impact factor: 4.475

8.  A spiking neural network based cortex-like mechanism and application to facial expression recognition.

Authors:  Si-Yao Fu; Guo-Sheng Yang; Xin-Kai Kuai
Journal:  Comput Intell Neurosci       Date:  2012-10-30

9.  Unsupervised invariance learning of transformation sequences in a model of object recognition yields selectivity for non-accidental properties.

Authors:  Sarah M Parker; Thomas Serre
Journal:  Front Comput Neurosci       Date:  2015-10-07       Impact factor: 2.380

10.  Toward Self-Referential Autonomous Learning of Object and Situation Models.

Authors:  Florian Damerow; Andreas Knoblauch; Ursula Körner; Julian Eggert; Edgar Körner
Journal:  Cognit Comput       Date:  2016-04-27       Impact factor: 5.418

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.