Literature DB >> 18263528

Learning long-term dependencies in NARX recurrent neural networks.

T Lin1, B G Horne, P Tino, C L Giles.   

Abstract

It has previously been shown that gradient-descent learning algorithms for recurrent neural networks can perform poorly on tasks that involve long-term dependencies, i.e. those problems for which the desired output depends on inputs presented at times far in the past. We show that the long-term dependencies problem is lessened for a class of architectures called nonlinear autoregressive models with exogenous (NARX) recurrent neural networks, which have powerful representational capabilities. We have previously reported that gradient descent learning can be more effective in NARX networks than in recurrent neural network architectures that have "hidden states" on problems including grammatical inference and nonlinear system identification. Typically, the network converges much faster and generalizes better than other networks. The results in this paper are consistent with this phenomenon. We present some experimental results which show that NARX networks can often retain information for two to three times as long as conventional recurrent neural networks. We show that although NARX networks do not circumvent the problem of long-term dependencies, they can greatly improve performance on long-term dependency problems. We also describe in detail some of the assumptions regarding what it means to latch information robustly and suggest possible ways to loosen these assumptions.

Year:  1996        PMID: 18263528     DOI: 10.1109/72.548162

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  13 in total

1.  Segmenting and classifying activities in robot-assisted surgery with recurrent neural networks.

Authors:  Robert DiPietro; Narges Ahmidi; Anand Malpani; Madeleine Waldram; Gyusung I Lee; Mija R Lee; S Swaroop Vedula; Gregory D Hager
Journal:  Int J Comput Assist Radiol Surg       Date:  2019-04-29       Impact factor: 2.924

2.  Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting.

Authors:  Martin Wöllmer; Erik Marchi; Stefano Squartini; Björn Schuller
Journal:  Cogn Neurodyn       Date:  2011-08-09       Impact factor: 5.082

3.  Hybrid methodology for tuberculosis incidence time-series forecasting based on ARIMA and a NAR neural network.

Authors:  K W Wang; C Deng; J P Li; Y Y Zhang; X Y Li; M C Wu
Journal:  Epidemiol Infect       Date:  2017-01-24       Impact factor: 4.434

4.  Assessing the Health of LiFePO₄ Traction Batteries through Monotonic Echo State Networks.

Authors:  Luciano Sánchez; David Anseán; José Otero; Inés Couso
Journal:  Sensors (Basel)       Date:  2017-12-21       Impact factor: 3.576

5.  Wavelet-Like Transform to Optimize the Order of an Autoregressive Neural Network Model to Predict the Dissolved Gas Concentration in Power Transformer Oil from Sensor Data.

Authors:  Francisco Elânio Bezerra; Fernando André Zemuner Garcia; Silvio Ikuyo Nabeta; Gilberto Francisco Martha de Souza; Ivan Eduardo Chabu; Josemir Coelho Santos; Shigueru Nagao Junior; Fabio Henrique Pereira
Journal:  Sensors (Basel)       Date:  2020-05-11       Impact factor: 3.576

Review 6.  Massive MIMO Systems for 5G and Beyond Networks-Overview, Recent Trends, Challenges, and Future Research Direction.

Authors:  Robin Chataut; Robert Akl
Journal:  Sensors (Basel)       Date:  2020-05-12       Impact factor: 3.576

7.  Deep Learning for Stock Market Prediction.

Authors:  M Nabipour; P Nayyeri; H Jabani; A Mosavi; E Salwana; Shahab S
Journal:  Entropy (Basel)       Date:  2020-07-30       Impact factor: 2.524

Review 8.  On Training Efficiency and Computational Costs of a Feed Forward Neural Network: A Review.

Authors:  Antonino Laudani; Gabriele Maria Lozito; Francesco Riganti Fulginei; Alessandro Salvini
Journal:  Comput Intell Neurosci       Date:  2015-08-31

9.  EMG-Based Continuous and Simultaneous Estimation of Arm Kinematics in Able-Bodied Individuals and Stroke Survivors.

Authors:  Jie Liu; Sang Hoon Kang; Dali Xu; Yupeng Ren; Song Joo Lee; Li-Qun Zhang
Journal:  Front Neurosci       Date:  2017-08-25       Impact factor: 4.677

10.  Novel Model Based on Artificial Neural Networks to Predict Short-Term Temperature Evolution in Museum Environment.

Authors:  Alessandro Bile; Hamed Tari; Andreas Grinde; Francesca Frasca; Anna Maria Siani; Eugenio Fazio
Journal:  Sensors (Basel)       Date:  2022-01-13       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.