Searching for rewards in graph-structured spaces
Author(s) -
Charley M. Wu,
Eric Schulz,
Samuel J. Gershman
Publication year - 2019
Publication title -
2022 conference on cognitive computational neuroscience
Language(s) - English
Resource type - Conference proceedings
DOI - 10.32470/ccn.2019.1041-0
Subject(s) - computer science , graph , theoretical computer science
How do people generalize and explore structured spaces? We study human behavior on a multi-armed bandit task, where rewards are influenced by the connectivity structure of a graph. A detailed predictive model comparison shows that a Gaussian Process regression model using a diffusion kernel is able to best describe participant choices, and also predict judgments about expected reward and confidence. This model unifies psychological models of function learning with the Successor Representation used in reinforcement learning, thereby building a bridge between different models of generalization.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom