
Training products of experts by minimizing contrastive divergence.

Geoffrey E Hinton

Abstract

It is possible to combine multiple latent-variable models of the same data by multiplying their probability distributions together and then renormalizing. This way of combining individual "expert" models makes it hard to generate samples from the combined model but easy to infer the values of the latent variables of each expert, because the combination rule ensures that the latent variables of different experts are conditionally independent when given the data. A product of experts (PoE) is therefore an interesting candidate for a perceptual system in which rapid inference is vital and generation is unnecessary. Training a PoE by maximizing the likelihood of the data is difficult because it is hard even to approximate the derivatives of the renormalization term in the combination rule. Fortunately, a PoE can be trained using a different objective function called "contrastive divergence" whose derivatives with regard to the parameters can be approximated accurately and efficiently. Examples are presented of contrastive divergence learning using several types of expert on several types of data.
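The combination rule described in the abstract (multiply the experts' distributions, then renormalize) can be illustrated with a minimal sketch over a tiny discrete state space. This is an illustration only, not code from the paper: the function name `product_of_experts` and the two example experts are hypothetical, and each expert is assumed to be a plain list of probabilities over the same three states, with no latent variables.

```python
from math import prod


def product_of_experts(experts):
    """Combine experts by multiplying their probabilities per state,
    then dividing by the sum over all states (the renormalization term).

    `experts` is a list of probability lists over the same discrete states.
    (Hypothetical helper for illustration; not taken from the paper.)
    """
    n_states = len(experts[0])
    # Unnormalized product of the experts' probabilities for each state.
    unnormalized = [prod(e[s] for e in experts) for s in range(n_states)]
    # Renormalization term: sum of the products over every possible state.
    z = sum(unnormalized)
    return [u / z for u in unnormalized]


# Two made-up experts over three states; they only agree on state 1.
expert_a = [0.5, 0.4, 0.1]
expert_b = [0.1, 0.4, 0.5]
combined = product_of_experts([expert_a, expert_b])
print(combined)
```

In this toy case state 1 ends up with roughly 0.62 of the probability mass, more than either expert assigned to it alone, because it is the only state that both experts consider plausible. The sum `z` is cheap here because there are only three states; for high-dimensional data that sum ranges over every possible data vector, which is the renormalization term the abstract describes as hard to differentiate.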

Year:  2002        PMID: 12180402     DOI: 10.1162/089976602760128018

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  153 in total

1.  Ontology-based Deep Learning for Human Behavior Prediction with Explanations in Health Social Networks.

Authors:  Nhathai Phan; Dejing Dou; Hao Wang; David Kil; Brigitte Piniewski
Journal:  Inf Sci (N Y)       Date:  2016-08-16       Impact factor: 6.795

2.  Synthetic protein alignments by CCMgen quantify noise in residue-residue contact prediction.

Authors:  Susann Vorberg; Stefan Seemayer; Johannes Söding
Journal:  PLoS Comput Biol       Date:  2018-11-05       Impact factor: 4.475

3.  Reconstruction and stability of secondary structure elements in the context of protein structure prediction.

Authors:  Alexei A Podtelezhnikov; David L Wild
Journal:  Biophys J       Date:  2009-06-03       Impact factor: 4.033

4.  An algorithm to improve speech recognition in noise for hearing-impaired listeners.

Authors:  Eric W Healy; Sarah E Yoho; Yuxuan Wang; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2013-10       Impact factor: 1.840

5.  Gaussian-binary restricted Boltzmann machines for modeling natural image statistics.

Authors:  Jan Melchior; Nan Wang; Laurenz Wiskott
Journal:  PLoS One       Date:  2017-02-02       Impact factor: 3.240

6.  Task-dependent recurrent dynamics in visual cortex.

Authors:  Satohiro Tajima; Kowa Koida; Chihiro I Tajima; Hideyuki Suzuki; Kazuyuki Aihara; Hidehiko Komatsu
Journal:  Elife       Date:  2017-07-24       Impact factor: 8.140

7.  Inferring Generative Model Structure with Static Analysis.

Authors:  Paroma Varma; Bryan He; Payal Bajaj; Imon Banerjee; Nishith Khandwala; Daniel L Rubin; Christopher Ré
Journal:  Adv Neural Inf Process Syst       Date:  2017-12

8.  Neural Quadratic Discriminant Analysis: Nonlinear Decoding with V1-Like Computation.

Authors:  Marino Pagan; Eero P Simoncelli; Nicole C Rust
Journal:  Neural Comput       Date:  2016-09-14       Impact factor: 2.026

9.  Distributed Bayesian Computation and Self-Organized Learning in Sheets of Spiking Neurons with Local Lateral Inhibition.

Authors:  Johannes Bill; Lars Buesing; Stefan Habenschuss; Bernhard Nessler; Wolfgang Maass; Robert Legenstein
Journal:  PLoS One       Date:  2015-08-18       Impact factor: 3.240

10.  Learning to represent visual input. (Review)

Authors:  Geoffrey E Hinton
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-01-12       Impact factor: 6.237

