Literature DB >> 18390314

Adaptive importance sampling to accelerate training of a neural probabilistic language model.

Y Bengio1, J S Senecal.   

Abstract

Previous work on statistical language modeling has shown that it is possible to train a feedforward neural network to approximate probabilities over sequences of words, resulting in significant error reduction when compared to standard baseline models based on n-grams. However, training the neural network model with the maximum-likelihood criterion requires computations proportional to the number of words in the vocabulary. In this paper, we introduce adaptive importance sampling as a way to accelerate training of the model. The idea is to use an adaptive n-gram model to track the conditional distributions produced by the neural network. We show that a very significant speedup can be obtained on standard problems.

Entities:  

Mesh:

Year:  2008        PMID: 18390314     DOI: 10.1109/TNN.2007.912312

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  2 in total

1.  The importance of Term Weighting in semantic understanding of text: A review of techniques.

Authors:  R N Rathi; A Mustafi
Journal:  Multimed Tools Appl       Date:  2022-04-13       Impact factor: 2.577

2.  Fallback Variable History NNLMs: Efficient NNLMs by precomputation and stochastic training.

Authors:  Francisco J Zamora-Martínez; Salvador España-Boquera; Maria Jose Castro-Bleda; Adrian Palacios-Corella
Journal:  PLoS One       Date:  2018-07-26       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.