Literature DB >> 30799908

Machine Learning on Sequential Data Using a Recurrent Weighted Average.

Jared Ostmeyer1, Lindsay Cowell1.   

Abstract

Recurrent Neural Networks (RNN) are a type of statistical model designed to handle sequential data. The model reads a sequence one symbol at a time. Each symbol is processed based on information collected from the previous symbols. With existing RNN architectures, each symbol is processed using only information from the previous processing step. To overcome this limitation, we propose a new kind of RNN model that computes a recurrent weighted average (RWA) over every past processing step. Because the RWA can be computed as a running average, the computational overhead scales like that of any other RNN architecture. The approach essentially reformulates the attention mechanism into a stand-alone model. The performance of the RWA model is assessed on the variable copy problem, the adding problem, classification of artificial grammar, classification of sequences by length, and classification of the MNIST images (where the pixels are read sequentially one at a time). On almost every task, the RWA model is found to fit the data significantly faster than a standard LSTM model.

Entities:  

Keywords:  Attention Mechanism; Recurrent Neural Network; Sequences

Year:  2018        PMID: 30799908      PMCID: PMC6380500          DOI: 10.1016/j.neucom.2018.11.066

Source DB:  PubMed          Journal:  Neurocomputing        ISSN: 0925-2312            Impact factor:   5.719


  3 in total

1.  Learning to forget: continual prediction with LSTM.

Authors:  F A Gers; J Schmidhuber; F Cummins
Journal:  Neural Comput       Date:  2000-10       Impact factor: 2.026

2.  Learning long-term dependencies with gradient descent is difficult.

Authors:  Y Bengio; P Simard; P Frasconi
Journal:  IEEE Trans Neural Netw       Date:  1994

3.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

  3 in total
  2 in total

1.  Splice-site identification for exon prediction using bidirectional LSTM-RNN approach.

Authors:  Noopur Singh; Ravindra Nath; Dev Bukhsh Singh
Journal:  Biochem Biophys Rep       Date:  2022-05-26

2.  Travel demand and distance analysis for free-floating car sharing based on deep learning method.

Authors:  Chen Zhang; Jie He; Ziyang Liu; Lu Xing; Yinhai Wang
Journal:  PLoS One       Date:  2019-10-16       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.