Literature DB >> 32413803

Local features and global shape information in object classification by deep convolutional neural networks.

Nicholas Baker1, Hongjing Lu2, Gennady Erlikhman3, Philip J Kellman3.   

Abstract

Deep convolutional neural networks (DCNNs) show impressive similarities to the human visual system. Recent research, however, suggests that DCNNs have limitations in recognizing objects by their shape. We tested the hypothesis that DCNNs are sensitive to an object's local contour features but have no access to global shape information that predominates human object recognition. We employed transfer learning to assess local and global shape processing in trained networks. In Experiment 1, we used restricted and unrestricted transfer learning to retrain AlexNet, VGG-19, and ResNet-50 to classify circles and squares. We then probed these networks with stimuli with conflicting global shape and local contour information. We presented networks with overall square shapes comprised of curved elements and circles comprised of corner elements. Networks classified the test stimuli by local contour features rather than global shapes. In Experiment 2, we changed the training data to include circles and squares comprised of different elements so that the local contour features of the object were uninformative. This considerably increased the network's tendency to produce global shape responses, but deeper analyses in Experiment 3 revealed the network still showed no sensitivity to the spatial configuration of local elements. These findings demonstrate that DCNNs' performance is an inversion of human performance with respect to global and local shape processing. Whereas abstract relations of elements predominate in human perception of shape, DCNNs appear to extract only local contour fragments, with no representation of how they spatially relate to each other to form global shapes.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Deep learning; Global and local features; Object recognition; Shape

Mesh:

Year:  2020        PMID: 32413803      PMCID: PMC8158065          DOI: 10.1016/j.visres.2020.04.003

Source DB:  PubMed          Journal:  Vision Res        ISSN: 0042-6989            Impact factor:   1.886


  36 in total

1.  Deriving behavioural receptive fields for visually completed contours.

Authors:  J M Gold; R F Murray; P J Bennett; A B Sekuler
Journal:  Curr Biol       Date:  2000-06-01       Impact factor: 10.834

Review 2.  A theory of visual interpolation in object perception.

Authors:  P J Kellman; T F Shipley
Journal:  Cogn Psychol       Date:  1991-04       Impact factor: 3.468

3.  The reviewing of object files: object-specific integration of information.

Authors:  D Kahneman; A Treisman; B J Gibbs
Journal:  Cogn Psychol       Date:  1992-04       Impact factor: 3.468

4.  Performance-optimized hierarchical models predict neural responses in higher visual cortex.

Authors:  Daniel L K Yamins; Ha Hong; Charles F Cadieu; Ethan A Solomon; Darren Seibert; James J DiCarlo
Journal:  Proc Natl Acad Sci U S A       Date:  2014-05-08       Impact factor: 11.205

5.  Contour integration by the human visual system: evidence for a local "association field".

Authors:  D J Field; A Hayes; R F Hess
Journal:  Vision Res       Date:  1993-01       Impact factor: 1.886

Review 6.  Towards a unified perspective of object shape and motion processing in human dorsal cortex.

Authors:  Gennady Erlikhman; Gideon P Caplovitz; Gennadiy Gurariy; Jared Medina; Jacqueline C Snow
Journal:  Conscious Cogn       Date:  2018-05-18

7.  The emergence of kind-based object individuation in infancy.

Authors:  Fei Xu; Susan Carey; Nina Quint
Journal:  Cogn Psychol       Date:  2004-09       Impact factor: 3.468

8.  Deep neural networks rival the representation of primate IT cortex for core visual object recognition.

Authors:  Charles F Cadieu; Ha Hong; Daniel L K Yamins; Nicolas Pinto; Diego Ardila; Ethan A Solomon; Najib J Majaj; James J DiCarlo
Journal:  PLoS Comput Biol       Date:  2014-12-18       Impact factor: 4.475

9.  Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence.

Authors:  Radoslaw Martin Cichy; Aditya Khosla; Dimitrios Pantazis; Antonio Torralba; Aude Oliva
Journal:  Sci Rep       Date:  2016-06-10       Impact factor: 4.379

View more
  6 in total

1.  The Dorsal Visual Pathway Represents Object-Centered Spatial Relations for Object Recognition.

Authors:  Vladislav Ayzenberg; Marlene Behrmann
Journal:  J Neurosci       Date:  2022-05-04       Impact factor: 6.709

2.  Perception of an object's global shape is best described by a model of skeletal structure in human infants.

Authors:  Vladislav Ayzenberg; Stella Lourenco
Journal:  Elife       Date:  2022-05-25       Impact factor: 8.713

3.  ImageNet-trained deep neural networks exhibit illusion-like response to the Scintillating grid.

Authors:  Eric D Sun; Ron Dekel
Journal:  J Vis       Date:  2021-10-05       Impact factor: 2.240

4.  From photos to sketches - how humans and deep neural networks process objects across different levels of visual abstraction.

Authors:  Johannes J D Singer; Katja Seeliger; Tim C Kietzmann; Martin N Hebart
Journal:  J Vis       Date:  2022-02-01       Impact factor: 2.240

5.  Deep learning models fail to capture the configural nature of human shape perception.

Authors:  Nicholas Baker; James H Elder
Journal:  iScience       Date:  2022-08-11

6.  General object-based features account for letter perception.

Authors:  Daniel Janini; Chris Hamblin; Arturo Deza; Talia Konkle
Journal:  PLoS Comput Biol       Date:  2022-09-26       Impact factor: 4.779

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.