Competitive Analysis of the Explore/Exploit Tradeoff

Langford, John; Zinkevich, Martin; Kakade, Sham M

Competitive Analysis of the Explore/Exploit Tradeoff

Files

Explore_exploit_tradeoff.pdf (194.12 KB)

Penn collection

Statistics Papers

Subject

Statistics and Probability

Permalink

https://repository.upenn.edu/handle/20.500.14332/47464

View all metadata

Author

Langford, John

Zinkevich, Martin

Kakade, Sham M

Abstract

We investigate the explore/exploit trade-off in reinforcement learning using competitive analysis applied to an abstract model. We state and prove lower and upper bounds on the competitive ratio. The essential conclusion of our analysis is that optimizing the explore/exploit trade-off is much easier with a few pieces of extra knowledge such as the stopping time or upper and lower bounds on the value of the optimal exploitation policy.

Date of presentation

2002-01-01

Conference name

Statistics Papers

Conference dates

2023-05-17T15:27:37.000

Collection

Presentations