A Central Limit Theorem for Temporally Non-Homogeneous Markov Chains with Applications to Dynamic Programming

Penn collection
Statistics Papers
Subject
non-homogeneous Markov chain
central limit theorem
Markov decision problem
sequential decision
dynamic inventory management
alternating subsequence
Business
Statistics and Probability
Author
Arlotto, Alessandro
Steele, J Michael
Abstract

We prove a central limit theorem for a class of additive processes that arise naturally in the theory of finite horizon Markov decision problems. The main theorem generalizes a classic result of Dobrushin (1956) for temporally non-homogeneous Markov chains, and the principal innovation is that here the summands are permitted to depend on both the current state and a bounded number of future states of the chain. We show through several examples that this added flexibility gives one a direct path to asymptotic normality of the optimal total reward of finite horizon Markov decision problems. The same examples also explain why such results are not easily obtained by alternative Markovian techniques such as enlargement of the state space.
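
As a rough illustration of the setting the abstract describes, the additive process and the form of the conclusion might be written as follows. The notation is assumed here for illustration and is not taken from this record, and the conditions under which the limit holds are not stated in the abstract, so they are omitted.

```latex
% Assumed notation (hypothetical, for illustration only):
%   X_{n,1}, X_{n,2}, ...  a temporally non-homogeneous Markov chain (row n of an array),
%   m >= 0                 a fixed bound on the number of future states used,
%   f_{n,i}                real-valued functions of m+1 consecutive states.
\[
  S_n \;=\; \sum_{i=1}^{n} f_{n,i}\bigl(X_{n,i},\, X_{n,i+1},\, \ldots,\, X_{n,i+m}\bigr),
  \qquad
  \frac{S_n - \mathbb{E}[S_n]}{\sqrt{\operatorname{Var}(S_n)}} \;\Rightarrow\; N(0,1)
  \quad \text{as } n \to \infty .
\]
```

In this sketch, taking m = 0 makes each summand depend on the current state only, which corresponds to the classical setting attributed to Dobrushin (1956) in the abstract.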

Publication date
2016-11-01
Journal title
Mathematics of Operations Research