Competing in the Dark: An Efficient Algorithm for Bandit Linear Optimization
Penn collection
Statistics Papers
Subject
Statistics and Probability
Author
Abernethy, Jacob D
Hazan, Elad
Rakhlin, Alexander
Abstract
We introduce an efficient algorithm for the problem of online linear optimization in the bandit setting which achieves the optimal O*(√T) regret. The setting is a natural generalization of the non-stochastic multi-armed bandit problem, and the existence of an efficient optimal algorithm has been posed as an open problem in a number of recent papers. We show how the difficulties encountered by previous approaches are overcome by the use of a self-concordant potential function. Our approach presents a novel connection between online learning and interior point methods.
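The abstract only sketches the approach, so here is a minimal, self-contained sketch of the main idea: follow-the-regularized-leader with a self-concordant barrier as regularizer, playing a random point on the Dikin ellipsoid around the FTRL center and building a one-point unbiased estimate of the loss vector. This is not the paper's exact algorithm (which handles arbitrary convex bodies via any self-concordant barrier); it specializes to the unit Euclidean ball with the barrier R(x) = -log(1 - ||x||^2), and the function names, step size eta, and synthetic loss sequence are illustrative assumptions.

```python
# Hedged sketch of barrier-based bandit linear optimization on the unit ball.
# Assumptions (not from the record): the barrier choice, eta, and the loss data.
import numpy as np

def hessian_ball_barrier(x):
    """Hessian of R(x) = -log(1 - ||x||^2) on the open unit ball."""
    s = 1.0 - x @ x
    n = len(x)
    return (2.0 / s) * np.eye(n) + (4.0 / s**2) * np.outer(x, x)

def ftrl_step_ball(g, eta):
    """argmin_x  eta*<g, x> + R(x) over the ball; closed form along -g."""
    norm = np.linalg.norm(g)
    if norm < 1e-12:
        return np.zeros_like(g)
    a = eta * norm
    t = (np.sqrt(1.0 + a * a) - 1.0) / a   # solves 2t/(1 - t^2) = a
    return -t * g / norm

def bandit_ftrl_ball(loss_vectors, eta=0.05, seed=0):
    """Run the sketch against a sequence of hidden loss vectors."""
    rng = np.random.default_rng(seed)
    T, n = loss_vectors.shape
    cum_est = np.zeros(n)          # running sum of estimated loss vectors
    total_loss = 0.0
    for t in range(T):
        x = ftrl_step_ball(cum_est, eta)                  # FTRL center
        lam, E = np.linalg.eigh(hessian_ball_barrier(x))  # Dikin-ellipsoid axes
        i = rng.integers(n)
        eps = rng.choice([-1.0, 1.0])
        y = x + eps * E[:, i] / np.sqrt(lam[i])           # play a point on the ellipsoid
        ell = loss_vectors[t] @ y                         # only this scalar is observed
        total_loss += ell
        cum_est += n * ell * eps * np.sqrt(lam[i]) * E[:, i]  # unbiased estimate of f_t
    return total_loss

if __name__ == "__main__":
    # Synthetic example: losses drift slowly around a fixed direction.
    rng = np.random.default_rng(0)
    T, n = 2000, 5
    L = np.tile(rng.standard_normal(n), (T, 1)) + 0.1 * rng.standard_normal((T, n))
    print("average per-round loss:", bandit_ftrl_ball(L, eta=0.05) / T)
```

The estimate n * ell * eps * sqrt(lam_i) * e_i averages to the true loss vector over the random choice of axis and sign, which is what lets the full-information FTRL analysis carry over to the bandit setting in this sketch.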
Date of presentation
2009-01-01
Comments
At the time of publication, author Alexander Rakhlin was affiliated with the University of California, Berkeley. Currently, he is a faculty member in the Statistics Department at the University of Pennsylvania.