Online Learning: Beyond Regret

Rakhlin, Alexander; Sridharan, Karthik; Tewari, Ambuj

Files

rakhlin11a.pdf (619.69 KB)

Penn collection

Statistics Papers

Subject

Computer Sciences
Statistics and Probability

Permalink

https://repository.upenn.edu/handle/20.500.14332/47486

Abstract

We study online learnability of a wide class of problems, extending the results of Rakhlin et al. (2010a) to general notions of performance measure well beyond external regret. Our framework simultaneously captures such well-known notions as internal and general Φ-regret, learning with non-additive global cost functions, Blackwell's approachability, calibration of forecasters, and more. We show that learnability in all these situations is due to control of the same three quantities: a martingale convergence term, a term describing the ability to perform well if the future is known, and a generalization of the sequential Rademacher complexity studied in Rakhlin et al. (2010a). Since we directly study the complexity of the problem instead of focusing on efficient algorithms, we are able to improve and extend many known results that were previously derived via an algorithmic construction.

Date of presentation

2011-01-01

Conference name

Statistics Papers

Conference dates

2023-05-17T15:26:53.000

Collection

Presentations