Graphical Models for Bandit Problems

Loading...
Thumbnail Image

Related Collections

Degree type

Discipline

Subject

Computer Sciences

Funder

Grant number

License

Copyright date

Distributor

Related resources

Author

Amin, Kareem
Syed, Umar

Contributor

Abstract

We introduce a rich class of graphical models for multi-armed bandit problems that permit both the state or context space and the action space to be very large, yet succinctly specify the payoffs for any context-action pair. Our main result is an algorithm for such models whose regret is bounded by the number of parameters and whose running time depends only on the treewidth of the graph substructure induced by the action space.

Advisor

Date of presentation

2011-07-01

Conference name

Departmental Papers (CIS)

Conference dates

2023-05-17T07:13:35.000

Conference location

Date Range for Data Collection (Start Date)

Date Range for Data Collection (End Date)

Digital Object Identifier

Series name and number

Volume number

Issue number

Publisher

Publisher DOI

Journal Issues

Comments

Amin, K., Kearns, M., & Syed, U., Graphical Models for Bandit Problems, 27th Conference on Uncertainty in Artificial Intelligence, July 2011, http://arxiv.org/abs/1202.3782

Recommended citation

Collection