
      Overcoming catastrophic forgetting in neural networks

      research-article


          Significance

          Deep neural networks are currently the most successful machine-learning technique for solving a variety of tasks, including language translation, image classification, and image generation. One weakness of such models is that, unlike humans, they are unable to learn multiple tasks sequentially. In this work, we propose a practical solution to train such models sequentially by protecting the weights important for previous tasks. This approach, inspired by synaptic consolidation in neuroscience, enables state-of-the-art results on multiple reinforcement learning problems experienced sequentially.
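
          As a concrete reading of "protecting the weights important for previous tasks": the consolidation idea can be expressed as a quadratic penalty that anchors each weight to the value it had after the previous task, scaled by an estimate of how important that weight was (for example, a diagonal Fisher information term). The notation below is a sketch rather than a quotation from the paper; λ is a regularization strength chosen by the practitioner.

          L(θ) = L_B(θ) + Σ_i (λ / 2) · F_i · (θ_i − θ*_{A,i})²

          Here L_B is the loss on the new task B, θ*_{A,i} is weight i after training on the old task A, and F_i is its importance estimate; a large F_i slows learning on that weight, while a small F_i leaves it free to change.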

          Abstract

          The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Until now neural networks have not been capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks that they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on a hand-written digit dataset and by learning several Atari 2600 games sequentially.
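
          To make "selectively slowing down learning on the weights important for those tasks" concrete, the sketch below shows one way to implement an importance-weighted quadratic penalty in PyTorch. It is illustrative only: the model, the data loaders, and the regularization strength lam are placeholder assumptions rather than details taken from the paper, and the importance estimate is a rough diagonal empirical Fisher.

          import torch
          import torch.nn.functional as F

          def fisher_diagonal(model, data_loader):
              """Rough per-weight importance for the old task: average squared
              gradient of the log-likelihood (diagonal empirical Fisher)."""
              fisher = {n: torch.zeros_like(p) for n, p in model.named_parameters()}
              model.eval()
              for inputs, targets in data_loader:
                  model.zero_grad()
                  log_probs = F.log_softmax(model(inputs), dim=1)
                  F.nll_loss(log_probs, targets).backward()
                  for n, p in model.named_parameters():
                      if p.grad is not None:
                          fisher[n] += p.grad.detach() ** 2
              return {n: f / max(len(data_loader), 1) for n, f in fisher.items()}

          def consolidation_penalty(model, fisher, anchor, lam=1000.0):
              """(lam / 2) * sum_i F_i * (theta_i - anchor_i)^2:
              a large F_i slows learning on weight i, a small F_i leaves it free."""
              penalty = 0.0
              for n, p in model.named_parameters():
                  penalty = penalty + (fisher[n] * (p - anchor[n]) ** 2).sum()
              return 0.5 * lam * penalty

          # Usage sketch (model, task_a_loader, x_b, y_b are placeholders):
          # anchor = {n: p.detach().clone() for n, p in model.named_parameters()}
          # fisher = fisher_diagonal(model, task_a_loader)
          # loss_b = F.cross_entropy(model(x_b), y_b) + consolidation_penalty(model, fisher, anchor)
          # loss_b.backward()  # then step the optimizer on the combined loss as usual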


          Most cited references (14)


          A Practical Bayesian Framework for Backpropagation Networks


            Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory.

            Damage to the hippocampal system disrupts recent memory but leaves remote memory intact. The account presented here suggests that memories are first stored via synaptic changes in the hippocampal system, that these changes support reinstatement of recent memories in the neocortex, that neocortical synapses change a little on each reinstatement, and that remote memory is based on accumulated neocortical changes. Models that learn via changes to connections help explain this organization. These models discover the structure in ensembles of items if learning of each item is gradual and interleaved with learning about other items. This suggests that the neocortex learns slowly to discover the structure in ensembles of experiences. The hippocampal system permits rapid learning of new items without disrupting this structure, and reinstatement of new memories interleaves them with others to integrate them into structured neocortical memory systems.

              Catastrophic forgetting in connectionist networks.

              R. French (1999)
              All natural cognitive systems, and, in particular, our own, gradually forget previously learned information. Plausible models of human cognition should therefore exhibit similar patterns of gradual forgetting of old information as new information is acquired. Only rarely does new learning in natural cognitive systems completely disrupt or erase previously learned information; that is, natural cognitive systems do not, in general, forget 'catastrophically'. Unfortunately, though, catastrophic forgetting does occur under certain circumstances in distributed connectionist networks. The very features that give these networks their remarkable abilities to generalize, to function in the presence of degraded input, and so on, are found to be the root cause of catastrophic forgetting. The challenge in this field is to discover how to keep the advantages of distributed connectionist networks while avoiding the problem of catastrophic forgetting. In this article the causes, consequences and numerous solutions to the problem of catastrophic forgetting in neural networks are examined. The review will consider how the brain might have overcome this problem and will also explore the consequences of this solution for distributed connectionist networks.

                Author and article information

                Journal
                Proceedings of the National Academy of Sciences of the United States of America (Proc. Natl. Acad. Sci. U.S.A.; PNAS)
                Publisher: National Academy of Sciences
                ISSN: 0027-8424 (print); 1091-6490 (online)
                Published online: 14 March 2017; issue date: 28 March 2017
                Volume 114, Issue 13, Pages 3521-3526
                Affiliations
                1. DeepMind, London EC4 5TW, United Kingdom
                2. Bioengineering Department, Imperial College London, London SW7 2AZ, United Kingdom
                Author notes
                To whom correspondence should be addressed. Email: kirkpatrick@google.com.

                Edited by James L. McClelland, Stanford University, Stanford, CA, and approved February 13, 2017 (received for review July 19, 2016)

                Author contributions: J.K., R.P., N.R., D.H., C.C., D.K., and R.H. designed research; J.K., R.P., N.R., J.V., G.D., A.A.R., K.M., J.Q., T.R., and A.G.-B. performed research; and J.K., R.P., N.R., D.K., and R.H. wrote the paper.

                Article
                DOI: 10.1073/pnas.1611835114
                PMCID: PMC5380101
                PMID: 28292907

                Freely available online through the PNAS open access option.

                Pages: 6
                Categories
                Biological Sciences: Neuroscience
                Physical Sciences: Applied Mathematics

                Keywords: deep learning, synaptic consolidation, artificial intelligence, stability plasticity, continual learning
