Graphical Models for Bandit Problems

Penn collection
Departmental Papers (CIS)
Subject
Computer Sciences
Author
Amin, Kareem
Kearns, Michael
Syed, Umar
Abstract

We introduce a rich class of graphical models for multi-armed bandit problems that permit both the state or context space and the action space to be very large, yet succinctly specify the payoffs for any context-action pair. Our main result is an algorithm for such models whose regret is bounded by the number of parameters and whose running time depends only on the treewidth of the graph substructure induced by the action space.

Date of presentation
2011-07-01
Conference name
Departmental Papers (CIS)
Conference dates
2023-05-17T07:13:35.000
Comments
Amin, K., Kearns, M., & Syed, U., Graphical Models for Bandit Problems, 27th Conference on Uncertainty in Artificial Intelligence, July 2011, http://arxiv.org/abs/1202.3782