
Sparse Attentive Backtracking: Temporal Credit Assignment Through Reminding

      Preprint


          Abstract

Learning long-term dependencies in extended temporal sequences requires credit assignment to events far back in the past. The most common method for training recurrent neural networks, back-propagation through time (BPTT), requires credit information to be propagated backwards through every single step of the forward computation, potentially over thousands or millions of time steps. This becomes computationally expensive or even infeasible when used with long sequences. Importantly, biological brains are unlikely to perform such detailed reverse replay over very long sequences of internal states (consider days, months, or years). However, humans are often reminded of past memories or mental states which are associated with the current mental state. We consider the hypothesis that such memory associations between past and present could be used for credit assignment through arbitrarily long sequences, propagating the credit assigned to the current state to the associated past state. Based on this principle, we study a novel algorithm which only back-propagates through a few of these temporal skip connections, realized by a learned attention mechanism that associates current states with relevant past states. We demonstrate in experiments that our method matches or outperforms regular BPTT and truncated BPTT in tasks involving particularly long-term dependencies, but without requiring the biologically implausible backward replay through the whole history of states. Additionally, we demonstrate that the proposed method transfers to longer sequences significantly better than LSTMs trained with BPTT and LSTMs trained with full self-attention.
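The mechanism the abstract describes can be made concrete with a short sketch. The following is a minimal, illustrative PyTorch implementation of the idea, not the authors' code: an RNN whose update attends over a memory of past hidden states, keeps only the top-k attention edges, and detaches the ordinary recurrent path so that gradient reaches distant time steps only through those sparse skip connections. The names (SparseAttnRNN, top_k) and the choice of a GRU cell are assumptions for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SparseAttnRNN(nn.Module):
    """Illustrative sketch of sparse attentive backtracking (hypothetical names)."""

    def __init__(self, input_size, hidden_size, top_k=3):
        super().__init__()
        self.cell = nn.GRUCell(input_size, hidden_size)
        self.query = nn.Linear(hidden_size, hidden_size)
        self.top_k = top_k

    def forward(self, x):
        # x: (seq_len, batch, input_size)
        h = x.new_zeros(x.size(1), self.cell.hidden_size)
        memory, outputs = [], []
        for t in range(x.size(0)):
            # Detach the step-to-step recurrent path. (The paper keeps short
            # truncated-BPTT windows; full detachment keeps this sketch short.)
            h = self.cell(x[t], h.detach())
            if memory:
                mem = torch.stack(memory)                    # (t, batch, hidden)
                scores = (mem * self.query(h)).sum(-1)       # (t, batch)
                k = min(self.top_k, len(memory))
                top, idx = scores.topk(k, dim=0)             # sparse selection
                attn = F.softmax(top, dim=0).unsqueeze(-1)   # (k, batch, 1)
                idx = idx.unsqueeze(-1).expand(-1, -1, mem.size(-1))
                picked = mem.gather(0, idx)                  # (k, batch, hidden)
                # Only these k skip connections remain in the autograd graph,
                # so credit flows back to the selected past states alone.
                h = h + (attn * picked).sum(0)
            memory.append(h)
            outputs.append(h)
        return torch.stack(outputs)
```

With this structure, back-propagation touches at most top_k past states per step instead of the full history. The published method adds further machinery beyond this sketch, such as writing to memory only periodically and propagating gradient a few local steps around each attended state.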

Most cited references (12)


          Show and tell: A neural image caption generator


            Hybrid computing using a neural network with dynamic external memory


Hippocampal replay is not a simple function of experience

Replay of behavioral sequences in the hippocampus during sharp wave ripple complexes (SWRs) provides a potential mechanism for memory consolidation and the learning of knowledge structures. Current hypotheses imply that replay should straightforwardly reflect recent experience. However, we find these hypotheses to be incompatible with the content of replay on a task with two distinct behavioral sequences (A and B). We observed forward and backward replay of B even when rats had been performing A for >10 min. Furthermore, replay of nonlocal sequence B occurred more often when B was infrequently experienced. Neither forward nor backward sequences preferentially represented highly experienced trajectories within a session. Additionally, we observed the construction of never-experienced novel-path sequences. These observations challenge the idea that sequence activation during SWRs is a simple replay of recent experience. Instead, replay reflected all physically available trajectories within the environment, suggesting a potential role in active learning and maintenance of the cognitive map.

                Author and article information

Journal: arXiv preprint
Publication date: 11 September 2018
arXiv ID: 1809.03702
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Custom metadata: To appear as a Spotlight presentation at NIPS 2018
Subject classes: cs.LG, stat.ML
Keywords: Machine learning, Artificial intelligence
