Literature DB >> 23800216

Where do features come from?

Geoffrey Hinton1.   

Abstract

It is possible to learn multiple layers of non-linear features by backpropagating error derivatives through a feedforward neural network. This is a very effective learning procedure when there is a huge amount of labeled training data, but for many learning tasks very few labeled examples are available. In an effort to overcome the need for labeled data, several different generative models were developed that learned interesting features by modeling the higher order statistical structure of a set of input vectors. One of these generative models, the restricted Boltzmann machine (RBM), has no connections between its hidden units and this makes perceptual inference and learning much simpler. More significantly, after a layer of hidden features has been learned, the activities of these features can be used as training data for another RBM. By applying this idea recursively, it is possible to learn a deep hierarchy of progressively more complicated features without requiring any labeled data. This deep hierarchy can then be treated as a feedforward neural network which can be discriminatively fine-tuned using backpropagation. Using a stack of RBMs to initialize the weights of a feedforward neural network allows backpropagation to work effectively in much deeper networks and it leads to much better generalization. A stack of RBMs can also be used to initialize a deep Boltzmann machine that has many hidden layers. Combining this initialization method with a new method for fine-tuning the weights finally leads to the first efficient way of training Boltzmann machines with many hidden layers and millions of weights.
Copyright © 2013 Cognitive Science Society, Inc.

Keywords:  Backpropagation; Boltzmann machines; Contrastive divergence; Deep learning; Distributed representations; Learning features; Learning graphical models; Variational learning

Mesh:

Year:  2013        PMID: 23800216     DOI: 10.1111/cogs.12049

Source DB:  PubMed          Journal:  Cogn Sci        ISSN: 0364-0213


  8 in total

1.  Spatial generalization in operant learning: lessons from professional basketball.

Authors:  Tal Neiman; Yonatan Loewenstein
Journal:  PLoS Comput Biol       Date:  2014-05-22       Impact factor: 4.475

2.  Analyzing Distributional Learning of Phonemic Categories in Unsupervised Deep Neural Networks.

Authors:  Okko Räsänen; Tasha Nagamine; Nima Mesgarani
Journal:  Cogsci       Date:  2016-08

3.  Modeling language and cognition with deep unsupervised learning: a tutorial overview.

Authors:  Marco Zorzi; Alberto Testolin; Ivilin P Stoianov
Journal:  Front Psychol       Date:  2013-08-20

4.  The Role of Architectural and Learning Constraints in Neural Network Models: A Case Study on Visual Space Coding.

Authors:  Alberto Testolin; Michele De Filippo De Grazia; Marco Zorzi
Journal:  Front Comput Neurosci       Date:  2017-03-21       Impact factor: 2.380

5.  Comparing deep belief networks with support vector machines for classifying gene expression data from complex disorders.

Authors:  Johannes Smolander; Matthias Dehmer; Frank Emmert-Streib
Journal:  FEBS Open Bio       Date:  2019-06-07       Impact factor: 2.693

6.  Deep generative learning of location-invariant visual word recognition.

Authors:  Maria Grazia Di Bono; Marco Zorzi
Journal:  Front Psychol       Date:  2013-09-19

Review 7.  Computational Foundations of Natural Intelligence.

Authors:  Marcel van Gerven
Journal:  Front Comput Neurosci       Date:  2017-12-07       Impact factor: 2.380

8.  Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients.

Authors:  Johannes Smolander; Alexey Stupnikov; Galina Glazko; Matthias Dehmer; Frank Emmert-Streib
Journal:  BMC Cancer       Date:  2019-12-03       Impact factor: 4.430

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.