Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Where do features come from?

Literature DB >> 23800216

Where do features come from?

Abstract

It is possible to learn multiple layers of non-linear features by backpropagating error derivatives through a feedforward neural network. This is a very effective learning procedure when there is a huge amount of labeled training data, but for many learning tasks very few labeled examples are available. In an effort to overcome the need for labeled data, several different generative models were developed that learned interesting features by modeling the higher order statistical structure of a set of input vectors. One of these generative models, the restricted Boltzmann machine (RBM), has no connections between its hidden units and this makes perceptual inference and learning much simpler. More significantly, after a layer of hidden features has been learned, the activities of these features can be used as training data for another RBM. By applying this idea recursively, it is possible to learn a deep hierarchy of progressively more complicated features without requiring any labeled data. This deep hierarchy can then be treated as a feedforward neural network which can be discriminatively fine-tuned using backpropagation. Using a stack of RBMs to initialize the weights of a feedforward neural network allows backpropagation to work effectively in much deeper networks and it leads to much better generalization. A stack of RBMs can also be used to initialize a deep Boltzmann machine that has many hidden layers. Combining this initialization method with a new method for fine-tuning the weights finally leads to the first efficient way of training Boltzmann machines with many hidden layers and millions of weights.

Keywords: Backpropagation; Boltzmann machines; Contrastive divergence; Deep learning; Distributed representations; Learning features; Learning graphical models; Variational learning

Mesh：

Year: 2013 PMID： 23800216 DOI： 10.1111/cogs.12049

Source DB: PubMed Journal: Cogn Sci ISSN： 0364-0213

Keyword Cloud
Cited

8 in total

1. Spatial generalization in operant learning: lessons from professional basketball.

Authors: Tal Neiman; Yonatan Loewenstein
Journal: PLoS Comput Biol Date: 2014-05-22 Impact factor: 4.475

2. Analyzing Distributional Learning of Phonemic Categories in Unsupervised Deep Neural Networks.

Authors: Okko Räsänen; Tasha Nagamine; Nima Mesgarani
Journal: Cogsci Date: 2016-08

3. Modeling language and cognition with deep unsupervised learning: a tutorial overview.

Authors: Marco Zorzi; Alberto Testolin; Ivilin P Stoianov
Journal: Front Psychol Date: 2013-08-20

4. The Role of Architectural and Learning Constraints in Neural Network Models: A Case Study on Visual Space Coding.

Authors: Alberto Testolin; Michele De Filippo De Grazia; Marco Zorzi
Journal: Front Comput Neurosci Date: 2017-03-21 Impact factor: 2.380

5. Comparing deep belief networks with support vector machines for classifying gene expression data from complex disorders.

Authors: Johannes Smolander; Matthias Dehmer; Frank Emmert-Streib
Journal: FEBS Open Bio Date: 2019-06-07 Impact factor: 2.693

Where do features come from?

1. Spatial generalization in operant learning: lessons from professional basketball.

2. Analyzing Distributional Learning of Phonemic Categories in Unsupervised Deep Neural Networks.

3. Modeling language and cognition with deep unsupervised learning: a tutorial overview.

4. The Role of Architectural and Learning Constraints in Neural Network Models: A Case Study on Visual Space Coding.

5. Comparing deep belief networks with support vector machines for classifying gene expression data from complex disorders.

6. Deep generative learning of location-invariant visual word recognition.

Review 7. Computational Foundations of Natural Intelligence.

8. Comparing biological information contained in mRNA and non-coding RNAs for classification of lung cancer patients.