Overcoming catastrophic forgetting in neural networks.

James Kirkpatrick¹, Razvan Pascanu², Neil Rabinowitz², Joel Veness², Guillaume Desjardins², Andrei A Rusu², Kieran Milan², John Quan², Tiago Ramalho², Agnieszka Grabska-Barwinska², Demis Hassabis², Claudia Clopath³, Dharshan Kumaran², Raia Hadsell².

Abstract

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks that they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on a hand-written digit dataset and by learning several Atari 2600 games sequentially.
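The idea of "selectively slowing down learning on the weights important for those tasks" can be illustrated with a quadratic penalty that pulls each weight back toward its old-task value in proportion to an importance score. What follows is a minimal sketch under stated assumptions, not the authors' implementation: the importance scores, the toy new-task loss, and the names `penalized_grad`, `train`, and `lam` are all hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: a per-weight quadratic penalty that anchors
# weights to their old-task values. All numbers here are made up and are
# not estimated from any data or taken from the paper.

def penalized_grad(theta, theta_old, importance, target_new, lam):
    """Gradient of
         0.5 * sum((theta - target_new)^2)                      (toy new-task loss)
       + 0.5 * lam * sum(importance * (theta - theta_old)^2)    (anchoring penalty)
    """
    return [
        (t - tn) + lam * imp * (t - to)
        for t, to, imp, tn in zip(theta, theta_old, importance, target_new)
    ]


def train(theta_old, importance, target_new, lam, lr=0.1, steps=500):
    """Plain gradient descent on the penalized loss, starting from theta_old."""
    theta = list(theta_old)
    for _ in range(steps):
        grad = penalized_grad(theta, theta_old, importance, target_new, lam)
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta


theta_old = [1.0, 1.0]     # weights after the (hypothetical) old task
importance = [10.0, 0.0]   # assumed: weight 0 matters for the old task, weight 1 does not
target_new = [0.0, 0.0]    # the toy new task would like both weights at zero

theta = train(theta_old, importance, target_new, lam=1.0)
theta_unprotected = train(theta_old, importance, target_new, lam=0.0)
```

With these assumed values, the weight marked as important settles near 10/11 (about 0.91), close to its old value of 1.0, while the unimportant weight moves essentially all the way to the new-task optimum at 0. Setting `lam=0.0` removes the protection, so both weights drift to 0.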

Keywords: artificial intelligence; continual learning; deep learning; stability plasticity; synaptic consolidation

Year: 2017 PMID: 28292907 PMCID: PMC5380101 DOI: 10.1073/pnas.1611835114

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205

References: 19 in total

1. An integrative theory of prefrontal cortex function. [Review]

Authors: E K Miller; J D Cohen
Journal: Annu Rev Neurosci Date: 2001 Impact factor: 12.449

2. Multiple model-based reinforcement learning.

Authors: Kenji Doya; Kazuyuki Samejima; Ken-ichi Katagiri; Mitsuo Kawato
Journal: Neural Comput Date: 2002-06 Impact factor: 2.026

3. Catastrophic forgetting in connectionist networks.

Authors:
Journal: Trends Cogn Sci Date: 1999-04 Impact factor: 20.229

4. Using noise to compute error surfaces in connectionist networks: a novel means of reducing catastrophic forgetting.

Authors: Robert M French; Nick Chater
Journal: Neural Comput Date: 2002-07 Impact factor: 2.026

5. Cascade models of synaptically stored memories.

Authors: Stefano Fusi; Patrick J Drew; L F Abbott
Journal: Neuron Date: 2005-02-17 Impact factor: 17.173

6. Connectionist models of recognition memory: constraints imposed by learning and forgetting functions. [Review]

Authors: R Ratcliff
Journal: Psychol Rev Date: 1990-04 Impact factor: 8.934

7. Branch-specific dendritic Ca²⁺ spikes cause persistent synaptic plasticity.

Authors: Joseph Cichon; Wen-Biao Gan
Journal: Nature Date: 2015-03-30 Impact factor: 49.962

8. Stably maintained dendritic spines are associated with lifelong memories.

Authors: Guang Yang; Feng Pan; Wen-Biao Gan
Journal: Nature Date: 2009-11-29 Impact factor: 49.962

9. Synapses with short-term plasticity are optimal estimators of presynaptic membrane potentials.

Authors: Jean-Pascal Pfister; Peter Dayan; Máté Lengyel
Journal: Nat Neurosci Date: 2010-09-19 Impact factor: 24.884

10. Tag-trigger-consolidation: a model of early and late long-term-potentiation and depression.

Authors: Claudia Clopath; Lorric Ziegler; Eleni Vasilaki; Lars Büsing; Wulfram Gerstner
Journal: PLoS Comput Biol Date: 2008-12-26 Impact factor: 4.475

Cited by: 85 in total

1. mPFC spindle cycles organize sparse thalamic activation and recently active CA1 cells during non-REM sleep.

Authors: Carmen Varela; Matthew A Wilson
Journal: Elife Date: 2020-06-11 Impact factor: 8.140

2. Multi-Institutional Deep Learning Modeling Without Sharing Patient Data: A Feasibility Study on Brain Tumor Segmentation.

Authors: Micah J Sheller; G Anthony Reina; Brandon Edwards; Jason Martin; Spyridon Bakas
Journal: Brainlesion Date: 2019-01-26

3. The inverse variance-flatness relation in stochastic gradient descent is critical for finding flat minima.

Authors: Yu Feng; Yuhai Tu
Journal: Proc Natl Acad Sci U S A Date: 2021-03-02 Impact factor: 11.205

4. Spine dynamics in the brain, mental disorders and artificial neural networks. [Review]

Authors: Haruo Kasai; Noam E Ziv; Hitoshi Okazaki; Sho Yagishita; Taro Toyoizumi
Journal: Nat Rev Neurosci Date: 2021-05-28 Impact factor: 34.870

5. Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization.

Authors: Nicolas Y Masse; Gregory D Grant; David J Freedman
Journal: Proc Natl Acad Sci U S A Date: 2018-10-12 Impact factor: 11.205

6. How to study the neural mechanisms of multiple tasks.

Authors: Guangyu Robert Yang; Michael W Cole; Kanaka Rajan
Journal: Curr Opin Behav Sci Date: 2019-09-09

7. Reevaluating the Role of Persistent Neural Activity in Short-Term Memory. [Review]

Authors: Nicolas Y Masse; Matthew C Rosen; David J Freedman
Journal: Trends Cogn Sci Date: 2020-01-29 Impact factor: 20.229

8. A modeling framework for adaptive lifelong learning with transfer and savings through gating in the prefrontal cortex.

Authors: Ben Tsuda; Kay M Tye; Hava T Siegelmann; Terrence J Sejnowski
Journal: Proc Natl Acad Sci U S A Date: 2020-11-05 Impact factor: 11.205

9. If deep learning is the answer, what is the question? [Review]

Authors: Andrew Saxe; Stephanie Nelli; Christopher Summerfield
Journal: Nat Rev Neurosci Date: 2020-11-16 Impact factor: 34.870

10. Reply to Huszár: The elastic weight consolidation penalty is empirically valid.

Authors: James Kirkpatrick; Razvan Pascanu; Neil Rabinowitz; Joel Veness; Guillaume Desjardins; Andrei A Rusu; Kieran Milan; John Quan; Tiago Ramalho; Agnieszka Grabska-Barwinska; Demis Hassabis; Claudia Clopath; Dharshan Kumaran; Raia Hadsell
Journal: Proc Natl Acad Sci U S A Date: 2018-02-20 Impact factor: 11.205