Training products of experts by minimizing contrastive divergence.

Abstract

It is possible to combine multiple latent-variable models of the same data by multiplying their probability distributions together and then renormalizing. This way of combining individual "expert" models makes it hard to generate samples from the combined model but easy to infer the values of the latent variables of each expert, because the combination rule ensures that the latent variables of different experts are conditionally independent when given the data. A product of experts (PoE) is therefore an interesting candidate for a perceptual system in which rapid inference is vital and generation is unnecessary. Training a PoE by maximizing the likelihood of the data is difficult because it is hard even to approximate the derivatives of the renormalization term in the combination rule. Fortunately, a PoE can be trained using a different objective function called "contrastive divergence" whose derivatives with regard to the parameters can be approximated accurately and efficiently. Examples are presented of contrastive divergence learning using several types of expert on several types of data.
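The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation. The function names (`product_of_experts`, `cd1_update`), the use of NumPy, the learning rate, and the choice of a bias-free binary restricted Boltzmann machine as the example PoE are all assumptions made for illustration. The first function multiplies discrete expert distributions and renormalizes. The second performs one contrastive-divergence update using a single Gibbs reconstruction step, relying on the fact that the hidden units are conditionally independent given the data.

```python
import numpy as np


def product_of_experts(expert_probs):
    """Combine discrete distributions by multiplying them and renormalizing.

    expert_probs: array-like of shape (n_experts, n_states); each row is one
    expert's distribution over the same set of states.
    """
    expert_probs = np.asarray(expert_probs, dtype=float)
    unnormalized = np.prod(expert_probs, axis=0)  # multiply the experts
    return unnormalized / unnormalized.sum()      # renormalize


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def cd1_update(W, v_data, rng, lr=0.1):
    """One CD-1 step for a bias-free binary RBM (hypothetical example model).

    W: (n_visible, n_hidden) weights; v_data: (batch, n_visible) binary data.
    Returns the updated weight matrix.
    """
    # Inference is cheap: each hidden unit depends only on the data vector.
    h_prob = sigmoid(v_data @ W)
    h_sample = (rng.random(h_prob.shape) < h_prob).astype(float)

    # A single Gibbs step produces a "reconstruction" of the data.
    v_recon_prob = sigmoid(h_sample @ W.T)
    v_recon = (rng.random(v_recon_prob.shape) < v_recon_prob).astype(float)
    h_recon_prob = sigmoid(v_recon @ W)

    # Approximate gradient: data statistics minus reconstruction statistics.
    batch = v_data.shape[0]
    grad = (v_data.T @ h_prob - v_recon.T @ h_recon_prob) / batch
    return W + lr * grad


if __name__ == "__main__":
    # Two experts over three states; a state vetoed by one expert gets zero mass.
    print(product_of_experts([[0.5, 0.5, 0.0], [0.2, 0.4, 0.4]]))

    rng = np.random.default_rng(0)
    W = np.zeros((4, 2))
    v = np.array([[1, 0, 1, 0], [0, 1, 0, 1]], dtype=float)
    W = cd1_update(W, v, rng)
    print(W.shape)
```

In the first example the combined distribution is [1/3, 2/3, 0]: the third state has zero probability because one expert assigns it none, which shows how a single expert can veto a state in a product model.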

Year: 2002 PMID: 12180402 DOI: 10.1162/089976602760128018

Source DB: PubMed Journal: Neural Comput ISSN: 0899-7667 Impact factor: 2.026

Cited

153 in total

1. Ontology-based Deep Learning for Human Behavior Prediction with Explanations in Health Social Networks.

Authors: Nhathai Phan; Dejing Dou; Hao Wang; David Kil; Brigitte Piniewski
Journal: Inf Sci (N Y) Date: 2016-08-16 Impact factor: 6.795

2. Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors: Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal: PLoS Comput Biol Date: 2018-11-05 Impact factor: 4.475

3. Reconstruction and stability of secondary structure elements in the context of protein structure prediction.

Authors: Alexei A Podtelezhnikov; David L Wild
Journal: Biophys J Date: 2009-06-03 Impact factor: 4.033

4. An algorithm to improve speech recognition in noise for hearing-impaired listeners.

Authors: Eric W Healy; Sarah E Yoho; Yuxuan Wang; DeLiang Wang
Journal: J Acoust Soc Am Date: 2013-10 Impact factor: 1.840

5. Gaussian-binary restricted Boltzmann machines for modeling natural image statistics.

Authors: Jan Melchior; Nan Wang; Laurenz Wiskott
Journal: PLoS One Date: 2017-02-02 Impact factor: 3.240

6. Task-dependent recurrent dynamics in visual cortex.

Authors: Satohiro Tajima; Kowa Koida; Chihiro I Tajima; Hideyuki Suzuki; Kazuyuki Aihara; Hidehiko Komatsu
Journal: Elife Date: 2017-07-24 Impact factor: 8.140

7. Inferring Generative Model Structure with Static Analysis.

Authors: Paroma Varma; Bryan He; Payal Bajaj; Imon Banerjee; Nishith Khandwala; Daniel L Rubin; Christopher Ré
Journal: Adv Neural Inf Process Syst Date: 2017-12

8. Neural Quadratic Discriminant Analysis: Nonlinear Decoding with V1-Like Computation.

Authors: Marino Pagan; Eero P Simoncelli; Nicole C Rust
Journal: Neural Comput Date: 2016-09-14 Impact factor: 2.026

9. Distributed Bayesian Computation and Self-Organized Learning in Sheets of Spiking Neurons with Local Lateral Inhibition.

Authors: Johannes Bill; Lars Buesing; Stefan Habenschuss; Bernhard Nessler; Wolfgang Maass; Robert Legenstein
Journal: PLoS One Date: 2015-08-18 Impact factor: 3.240

10. Learning to represent visual input. (Review)

Authors: Geoffrey E Hinton
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2010-01-12 Impact factor: 6.237