Dopamine: Generalization and Bonuses

Loading...
Thumbnail Image
Penn collection
Statistics Papers
Degree type
Discipline
Subject
dopamine
reinforcement learning
exploration
temporal difference
generalization
Biochemistry, Biophysics, and Structural Biology
Statistics and Probability
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Kakade, Sham
Dayan, Peter
Contributor
Abstract

In the temporal difference model of primate dopamine neurons, their phasic activity reports a prediction error for future reward. This model is supported by a wealth of experimental data. However, in certain circumstances, the activity of the dopamine cells seems anomalous under the model, as they respond in particular ways to stimuli that are not obviously related to predictions of reward. In this paper, we address two important sets of anomalies, those having to do with generalization and novelty. Generalization responses are treated as the natural consequence of partial information; novelty responses are treated by the suggestion that dopamine cells multiplex information about reward bonuses, including exploration bonuses and shaping bonuses. We interpret this additional role for dopamine in terms of the mechanistic attentional and psychomotor effects of dopamine, having the computational role of guiding exploration.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2002-06-01
Journal title
Neural Networks
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation
Collection