Simple Reinforcement Learning Agents: Pareto Beats Nash in an Algorithmic Game Theory Study

Kimbrough, Steven. O; Lu, Ming

Simple Reinforcement Learning Agents: Pareto Beats Nash in an Algorithmic Game Theory Study

Files

simple_rl_agents_final.pdf (671.67 KB)

Penn collection

Operations, Information and Decisions Papers

Subject

Q-learning
algorithmic game theory
games
learning and games
Other Social and Behavioral Sciences
Set Theory
Theory and Algorithms

Permalink

https://repository.upenn.edu/handle/20.500.14332/42284

View all metadata

Author

Kimbrough, Steven. O

Lu, Ming

Abstract

Repeated play in games by simple adaptive agents is investigated. The agents use Q-learning, a special form of reinforcement learning, to direct learning of behavioral strategies in a number of 2×2 games. The agents are able effectively to maximize the total wealth extracted. This often leads to Pareto optimal outcomes. When the rewards signals are sufficiently clear, Pareto optimal outcomes will largely be achieved. The effect can select Pareto outcomes that are not Nash equilibria and it can select Pareto optimal outcomes among Nash equilibria.

Publication date

2005-03-01

Journal title

Information Systems and e-Business Management

Collection

Articles