Literature DB >> 34000564

Arguments for the unsuitability of convolutional neural networks for non-local tasks.

Sebastian Stabinger1, David Peer2, Antonio Rodríguez-Sánchez2.   

Abstract

Convolutional neural networks have established themselves over the past years as the state of the art method for image classification, and for many datasets, they even surpass humans in categorizing images. Unfortunately, the same architectures perform much worse when they have to compare parts of an image to each other to correctly classify this image. Until now, no well-formed theoretical argument has been presented to explain this deficiency. In this paper, we will argue that convolutional layers are of little use for such problems, since comparison tasks are global by nature, but convolutional layers are local by design. We will use this insight to reformulate a comparison task into a sorting task and use findings on sorting networks to propose a lower bound for the number of parameters a neural network needs to solve comparison tasks in a generalizable way. We will use this lower bound to argue that attention, as well as iterative/recurrent processing, is needed to prevent a combinatorial explosion.
Copyright © 2021 The Author(s). Published by Elsevier Ltd.. All rights reserved.

Entities:  

Keywords:  Attention; Convolutional neural networks; Locality; Relational reasoning; Sorting networks

Year:  2021        PMID: 34000564     DOI: 10.1016/j.neunet.2021.05.001

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  1 in total

1.  Evaluating the progress of deep learning for visual relational concepts.

Authors:  Sebastian Stabinger; David Peer; Justus Piater; Antonio Rodríguez-Sánchez
Journal:  J Vis       Date:  2021-10-05       Impact factor: 2.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.