Literature DB >> 18267787

Learning long-term dependencies with gradient descent is difficult.

Y Bengio1, P Simard, P Frasconi.   

Abstract

Recurrent neural networks can be used to map input sequences to output sequences, such as for recognition, production or prediction problems. However, practical difficulties have been reported in training recurrent neural networks to perform tasks in which the temporal contingencies present in the input/output sequences span long intervals. We show why gradient based learning algorithms face an increasingly difficult problem as the duration of the dependencies to be captured increases. These results expose a trade-off between efficient learning by gradient descent and latching on information for long periods. Based on an understanding of this problem, alternatives to standard gradient descent are considered.

Entities:  

Year:  1994        PMID: 18267787     DOI: 10.1109/72.279181

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  218 in total

1.  Initialization and self-organized optimization of recurrent neural network connectivity.

Authors:  Joschka Boedecker; Oliver Obst; N Michael Mayer; Minoru Asada
Journal:  HFSP J       Date:  2009-10-26

2.  Neural Tree Indexers for Text Understanding.

Authors:  Tsendsuren Munkhdalai; Hong Yu
Journal:  Proc Conf Assoc Comput Linguist Meet       Date:  2017-04

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  Machine Learning on Sequential Data Using a Recurrent Weighted Average.

Authors:  Jared Ostmeyer; Lindsay Cowell
Journal:  Neurocomputing       Date:  2018-11-29       Impact factor: 5.719

Review 5.  Evolutionary aspects of reservoir computing.

Authors:  Luís F Seoane
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2019-06-10       Impact factor: 6.237

6.  Deep Learning for Classification of Normal Swallows in Adults.

Authors:  Joshua M Dudik; James L Coyle; Amro El-Jaroudi; Zhi-Hong Mao; Mingui Sun; Ervin Sejdić
Journal:  Neurocomputing       Date:  2018-01-31       Impact factor: 5.719

7.  Mobile sensor based human activity recognition: distinguishing of challenging activities by applying long short-term memory deep learning modified by residual network concept.

Authors:  Seyed Vahab Shojaedini; Mohamad Javad Beirami
Journal:  Biomed Eng Lett       Date:  2020-06-29

8.  Artificial Neural Network-Based System for PET Volume Segmentation.

Authors:  Mhd Saeed Sharif; Maysam Abbod; Abbes Amira; Habib Zaidi
Journal:  Int J Biomed Imaging       Date:  2010-09-26

9.  A modular kernel approach for integrative analysis of protein domain boundaries.

Authors:  Paul D Yoo; Bing Bing Zhou; Albert Y Zomaya
Journal:  BMC Genomics       Date:  2009-12-03       Impact factor: 3.969

10.  Ab initio and homology based prediction of protein domains by recursive neural networks.

Authors:  Ian Walsh; Alberto J M Martin; Catherine Mooney; Enrico Rubagotti; Alessandro Vullo; Gianluca Pollastri
Journal:  BMC Bioinformatics       Date:  2009-06-26       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.