Literature DB >> 31703175

Toward Training Recurrent Neural Networks for Lifelong Learning.

Shagun Sodhani1, Sarath Chandar2, Yoshua Bengio3.   

Abstract

Catastrophic forgetting and capacity saturation are the central challenges of any parametric lifelong learning system. In this work, we study these challenges in the context of sequential supervised learning with an emphasis on recurrent neural networks. To evaluate the models in the lifelong learning setting, we propose a curriculum-based, simple, and intuitive benchmark where the models are trained on tasks with increasing levels of difficulty. To measure the impact of catastrophic forgetting, the model is tested on all the previous tasks as it completes any task. As a step toward developing true lifelong learning systems, we unify gradient episodic memory (a catastrophic forgetting alleviation approach) and Net2Net (a capacity expansion approach). Both models are proposed in the context of feedforward networks, and we evaluate the feasibility of using them for recurrent networks. Evaluation on the proposed benchmark shows that the unified model is more suitable than the constituent models for lifelong learning setting.

Entities:  

Year:  2019        PMID: 31703175     DOI: 10.1162/neco_a_01246

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  5 in total

1.  Continual Sequence Modeling With Predictive Coding.

Authors:  Louis Annabi; Alexandre Pitti; Mathias Quoy
Journal:  Front Neurorobot       Date:  2022-05-23       Impact factor: 3.493

2.  Catastrophic Forgetting in Deep Graph Networks: A Graph Classification Benchmark.

Authors:  Antonio Carta; Andrea Cossu; Federico Errica; Davide Bacciu
Journal:  Front Artif Intell       Date:  2022-02-04

3.  Introducing principles of synaptic integration in the optimization of deep neural networks.

Authors:  Giorgia Dellaferrera; Stanisław Woźniak; Giacomo Indiveri; Angeliki Pantazi; Evangelos Eleftheriou
Journal:  Nat Commun       Date:  2022-04-07       Impact factor: 17.694

4.  GrapHD: Graph-Based Hyperdimensional Memorization for Brain-Like Cognitive Learning.

Authors:  Prathyush Poduval; Haleh Alimohamadi; Ali Zakeri; Farhad Imani; M Hassan Najafi; Tony Givargis; Mohsen Imani
Journal:  Front Neurosci       Date:  2022-02-04       Impact factor: 4.677

5.  Exploring the associative learning capabilities of the segmented attractor network for lifelong learning.

Authors:  Alexander Jones; Rashmi Jha
Journal:  Front Artif Intell       Date:  2022-08-01
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.