Markov Decision Problems Where Means Bound Variances

Arlotto, Alessandro; Gans, Noah; Steele, John M

Markov Decision Problems Where Means Bound Variances

Files

ArlottoGansSteele_MDPsWhereMeansBoundVariances.pdf (386.99 KB)

Penn collection

Operations, Information and Decisions Papers

Subject

Markov decision problems
variance bounds
optimal total reward
Business Administration, Management, and Operations
Finance and Financial Management
Operations and Supply Chain Management

Permalink

https://repository.upenn.edu/handle/20.500.14332/42016

View all metadata

Author

Arlotto, Alessandro

Gans, Noah

Steele, John M

Abstract

We identify a rich class of finite-horizon Markov decision problems (MDPs) for which the variance of the optimal total reward can be bounded by a simple linear function of its expected value. The class is characterized by three natural properties: reward nonnegativity and boundedness, existence of a do-nothing action, and optimal action monotonicity. These properties are commonly present and typically easy to check. Implications of the class properties and of the variance bound are illustrated by examples of MDPs from operations research, operations management, financial engineering, and combinatorial optimization.

Publication date

2014-07-01

Journal title

Operations Research

Collection

Articles