Literature DB >> 26625430

Bayesian Recurrent Neural Network for Language Modeling.

Jen-Tzung Chien, Yuan-Chu Ku.   

Abstract

A language model (LM) is calculated as the probability of a word sequence that provides the solution to word prediction for a variety of information systems. A recurrent neural network (RNN) is powerful to learn the large-span dynamics of a word sequence in the continuous space. However, the training of the RNN-LM is an ill-posed problem because of too many parameters from a large dictionary size and a high-dimensional hidden layer. This paper presents a Bayesian approach to regularize the RNN-LM and apply it for continuous speech recognition. We aim to penalize the too complicated RNN-LM by compensating for the uncertainty of the estimated model parameters, which is represented by a Gaussian prior. The objective function in a Bayesian classification network is formed as the regularized cross-entropy error function. The regularized model is constructed not only by calculating the regularized parameters according to the maximum a posteriori criterion but also by estimating the Gaussian hyperparameter by maximizing the marginal likelihood. A rapid approximation to a Hessian matrix is developed to implement the Bayesian RNN-LM (BRNN-LM) by selecting a small set of salient outer-products. The proposed BRNN-LM achieves a sparser model than the RNN-LM. Experiments on different corpora show the robustness of system performance by applying the rapid BRNN-LM under different conditions.

Entities:  

Year:  2015        PMID: 26625430     DOI: 10.1109/TNNLS.2015.2499302

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  2 in total

1.  Bayesian Recurrent Neural Network Models for Forecasting and Quantifying Uncertainty in Spatial-Temporal Data.

Authors:  Patrick L McDermott; Christopher K Wikle
Journal:  Entropy (Basel)       Date:  2019-02-15       Impact factor: 2.524

2.  Probabilistic Deep Learning to Quantify Uncertainty in Air Quality Forecasting.

Authors:  Abdulmajid Murad; Frank Alexander Kraemer; Kerstin Bach; Gavin Taylor
Journal:  Sensors (Basel)       Date:  2021-11-30       Impact factor: 3.576

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.