Exploration in Metric State Spaces
Loading...
Penn collection
Statistics Papers
Degree type
Discipline
Subject
Statistics and Probability
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Kakade, Sham M
Kearns, Michael J
Langford, John
Contributor
Abstract
We present metric- E3 a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows the construction of accurate local models. The algorithm is a generalization of the E3 algorithm of Kearns and Singh, and assumes a black box for approximate planning. Unlike the original E3 , metric-E3 finds a near optimal policy in an amount of time that does not directly depend on the size of the state space, but instead depends on the covering number of the state space. Informally, the covering number is the number of neighborhoods required for accurate local modeling.
Advisor
Date of presentation
2003-01-01
Conference name
Statistics Papers
Conference dates
2023-05-17T15:27:41.000