Literature DB >> 28292907

Overcoming catastrophic forgetting in neural networks.

James Kirkpatrick1, Razvan Pascanu2, Neil Rabinowitz2, Joel Veness2, Guillaume Desjardins2, Andrei A Rusu2, Kieran Milan2, John Quan2, Tiago Ramalho2, Agnieszka Grabska-Barwinska2, Demis Hassabis2, Claudia Clopath3, Dharshan Kumaran2, Raia Hadsell2.   

Abstract

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks that they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on a hand-written digit dataset and by learning several Atari 2600 games sequentially.

Keywords:  artificial intelligence; continual learning; deep learning; stability plasticity; synaptic consolidation

Mesh:

Year:  2017        PMID: 28292907      PMCID: PMC5380101          DOI: 10.1073/pnas.1611835114

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  19 in total

Review 1.  An integrative theory of prefrontal cortex function.

Authors:  E K Miller; J D Cohen
Journal:  Annu Rev Neurosci       Date:  2001       Impact factor: 12.449

2.  Multiple model-based reinforcement learning.

Authors:  Kenji Doya; Kazuyuki Samejima; Ken-ichi Katagiri; Mitsuo Kawato
Journal:  Neural Comput       Date:  2002-06       Impact factor: 2.026

3.  Catastrophic forgetting in connectionist networks.

Authors: 
Journal:  Trends Cogn Sci       Date:  1999-04       Impact factor: 20.229

4.  Using noise to compute error surfaces in connectionist networks: a novel means of reducing catastrophic forgetting.

Authors:  Robert M French; Nick Chater
Journal:  Neural Comput       Date:  2002-07       Impact factor: 2.026

5.  Cascade models of synaptically stored memories.

Authors:  Stefano Fusi; Patrick J Drew; L F Abbott
Journal:  Neuron       Date:  2005-02-17       Impact factor: 17.173

Review 6.  Connectionist models of recognition memory: constraints imposed by learning and forgetting functions.

Authors:  R Ratcliff
Journal:  Psychol Rev       Date:  1990-04       Impact factor: 8.934

7.  Branch-specific dendritic Ca(2+) spikes cause persistent synaptic plasticity.

Authors:  Joseph Cichon; Wen-Biao Gan
Journal:  Nature       Date:  2015-03-30       Impact factor: 49.962

8.  Stably maintained dendritic spines are associated with lifelong memories.

Authors:  Guang Yang; Feng Pan; Wen-Biao Gan
Journal:  Nature       Date:  2009-11-29       Impact factor: 49.962

9.  Synapses with short-term plasticity are optimal estimators of presynaptic membrane potentials.

Authors:  Jean-Pascal Pfister; Peter Dayan; Máté Lengyel
Journal:  Nat Neurosci       Date:  2010-09-19       Impact factor: 24.884

10.  Tag-trigger-consolidation: a model of early and late long-term-potentiation and depression.

Authors:  Claudia Clopath; Lorric Ziegler; Eleni Vasilaki; Lars Büsing; Wulfram Gerstner
Journal:  PLoS Comput Biol       Date:  2008-12-26       Impact factor: 4.475

View more
  85 in total

1.  mPFC spindle cycles organize sparse thalamic activation and recently active CA1 cells during non-REM sleep.

Authors:  Carmen Varela; Matthew A Wilson
Journal:  Elife       Date:  2020-06-11       Impact factor: 8.140

2.  Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation.

Authors:  Micah J Sheller; G Anthony Reina; Brandon Edwards; Jason Martin; Spyridon Bakas
Journal:  Brainlesion       Date:  2019-01-26

3.  The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima.

Authors:  Yu Feng; Yuhai Tu
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-02       Impact factor: 11.205

Review 4.  Spine dynamics in the brain, mental disorders and artificial neural networks.

Authors:  Haruo Kasai; Noam E Ziv; Hitoshi Okazaki; Sho Yagishita; Taro Toyoizumi
Journal:  Nat Rev Neurosci       Date:  2021-05-28       Impact factor: 34.870

5.  Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization.

Authors:  Nicolas Y Masse; Gregory D Grant; David J Freedman
Journal:  Proc Natl Acad Sci U S A       Date:  2018-10-12       Impact factor: 11.205

6.  How to study the neural mechanisms of multiple tasks.

Authors:  Guangyu Robert Yang; Michael W Cole; Kanaka Rajan
Journal:  Curr Opin Behav Sci       Date:  2019-09-09

Review 7.  Reevaluating the Role of Persistent Neural Activity in Short-Term Memory.

Authors:  Nicolas Y Masse; Matthew C Rosen; David J Freedman
Journal:  Trends Cogn Sci       Date:  2020-01-29       Impact factor: 20.229

8.  A modeling framework for adaptive lifelong learning with transfer and savings through gating in the prefrontal cortex.

Authors:  Ben Tsuda; Kay M Tye; Hava T Siegelmann; Terrence J Sejnowski
Journal:  Proc Natl Acad Sci U S A       Date:  2020-11-05       Impact factor: 11.205

Review 9.  If deep learning is the answer, what is the question?

Authors:  Andrew Saxe; Stephanie Nelli; Christopher Summerfield
Journal:  Nat Rev Neurosci       Date:  2020-11-16       Impact factor: 34.870

10.  Reply to Huszár: The elastic weight consolidation penalty is empirically valid.

Authors:  James Kirkpatrick; Razvan Pascanu; Neil Rabinowitz; Joel Veness; Guillaume Desjardins; Andrei A Rusu; Kieran Milan; John Quan; Tiago Ramalho; Agnieszka Grabska-Barwinska; Demis Hassabis; Claudia Clopath; Dharshan Kumaran; Raia Hadsell
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-20       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.