The Missing Link Between Memory and Reinforcement Learning-Reference-Cited by-同舟云学术

The Missing Link Between Memory and Reinforcement Learning

Published:2020-12-10 Issue: Volume:11 Page:
ISSN:1664-1078
Container-title:Frontiers in Psychology
language:
Short-container-title:Front. Psychol.

Author:

Balkenius Christian,Tjøstheim Trond A.,Johansson Birger,Wallin Annika,Gärdenfors Peter

Abstract

Reinforcement learning systems usually assume that a value function is defined over all states (or state-action pairs) that can immediately give the value of a particular state or action. These values are used by a selection mechanism to decide which action to take. In contrast, when humans and animals make decisions, they collect evidence for different alternatives over time and take action only when sufficient evidence has been accumulated. We have previously developed a model of memory processing that includes semantic, episodic and working memory in a comprehensive architecture. Here, we describe how this memory mechanism can support decision making when the alternatives cannot be evaluated based on immediate sensory information alone. Instead we first imagine, and then evaluate a possible future that will result from choosing one of the alternatives. Here we present an extended model that can be used as a model for decision making that depends on accumulating evidence over time, whether that information comes from the sequential attention to different sensory properties or from internal simulation of the consequences of making a particular choice. We show how the new model explains both simple immediate choices, choices that depend on multiple sensory factors and complicated selections between alternatives that require forward looking simulations based on episodic and semantic memory structures. In this framework, vicarious trial and error is explained as an internal simulation that accumulates evidence for a particular choice. We argue that a system like this forms the “missing link” between more traditional ideas of semantic and episodic memory, and the associative nature of reinforcement learning.

Funder

Marianne and Marcus Wallenberg Foundation

Publisher

Frontiers Media SA

Subject

General Psychology

Reference56 articles.

1. Synaptic depression and cortical gain control;Abbott;Science,1997

2. Latching dynamics in neural networks with synaptic depression;Aguilar;PLoS ONE,2017

3. Dynamics of pattern formation in lateral-inhibition type neural fields;Amari;Biol. Cybern,1977

4. Adaptive gain and the role of the locus coeruleus-norepinephrine system in optimal performance;Aston-Jones;J. Compar. Neurol,2005

5. Role of locus coeruleus in attention and behavioral flexibility;Aston-Jones;Biol. Psychiatry,1999

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A System-Level Brain Model for Enactive Haptic Perception in a Humanoid Robot;Artificial Neural Networks and Machine Learning – ICANN 2023;2023

2. How Working Memory and Reinforcement Learning Are Intertwined: A Cognitive, Neural, and Computational Perspective;Journal of Cognitive Neuroscience;2021-12-23

3. Direct Approach or Detour: A Comparative Model of Inhibition and Neural Ensemble Size in Behavior Selection;Frontiers in Systems Neuroscience;2021-11-09