Literature DB >> 30559207

Brain-inspired automated visual object discovery and detection.

Lichao Chen1, Sudhir Singh1, Thomas Kailath2, Vwani Roychowdhury3.   

Abstract

Despite significant recent progress, machine vision systems lag considerably behind their biological counterparts in performance, scalability, and robustness. A distinctive hallmark of the brain is its ability to automatically discover and model objects, at multiscale resolutions, from repeated exposures to unlabeled contextual data and then to be able to robustly detect the learned objects under various nonideal circumstances, such as partial occlusion and different view angles. Replication of such capabilities in a machine would require three key ingredients: (i) access to large-scale perceptual data of the kind that humans experience, (ii) flexible representations of objects, and (iii) an efficient unsupervised learning algorithm. The Internet fortunately provides unprecedented access to vast amounts of visual data. This paper leverages the availability of such data to develop a scalable framework for unsupervised learning of object prototypes-brain-inspired flexible, scale, and shift invariant representations of deformable objects (e.g., humans, motorcycles, cars, airplanes) comprised of parts, their different configurations and views, and their spatial relationships. Computationally, the object prototypes are represented as geometric associative networks using probabilistic constructs such as Markov random fields. We apply our framework to various datasets and show that our approach is computationally scalable and can construct accurate and operational part-aware object models much more efficiently than in much of the recent computer vision literature. We also present efficient algorithms for detection and localization in new scenes of objects and their partial views.

Entities:  

Keywords:  brain memory models; brain-inspired learning; brain-inspired object models; computer vision; machine learning

Mesh:

Year:  2018        PMID: 30559207      PMCID: PMC6320548          DOI: 10.1073/pnas.1802103115

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  9 in total

1.  Atoms of recognition in human and computer vision.

Authors:  Shimon Ullman; Liav Assif; Ethan Fetaya; Daniel Harari
Journal:  Proc Natl Acad Sci U S A       Date:  2016-02-16       Impact factor: 11.205

2.  Visual Turing test for computer vision systems.

Authors:  Donald Geman; Stuart Geman; Neil Hallonquist; Laurent Younes
Journal:  Proc Natl Acad Sci U S A       Date:  2015-03-09       Impact factor: 11.205

Review 3.  Representation learning: a review and new perspectives.

Authors:  Yoshua Bengio; Aaron Courville; Pascal Vincent
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-08       Impact factor: 6.226

4.  Reverse engineering the cognitive brain.

Authors:  Gert Cauwenberghs
Journal:  Proc Natl Acad Sci U S A       Date:  2013-09-12       Impact factor: 11.205

Review 5.  Learning multiple layers of representation.

Authors:  Geoffrey E Hinton
Journal:  Trends Cogn Sci       Date:  2007-10       Impact factor: 20.229

6.  Stimulus-selective properties of inferior temporal neurons in the macaque.

Authors:  R Desimone; T D Albright; C G Gross; C Bruce
Journal:  J Neurosci       Date:  1984-08       Impact factor: 6.167

7.  From simple innate biases to complex visual concepts.

Authors:  Shimon Ullman; Daniel Harari; Nimrod Dorfman
Journal:  Proc Natl Acad Sci U S A       Date:  2012-09-24       Impact factor: 11.205

Review 8.  Visual object recognition.

Authors:  N K Logothetis; D L Sheinberg
Journal:  Annu Rev Neurosci       Date:  1996       Impact factor: 12.449

9.  Articulated human detection with flexible mixtures of parts.

Authors:  Yi Yang; Deva Ramanan
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-12       Impact factor: 6.226

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.