Near-Term Liability of Exploitation: Exploration and Exploitation in Multistage Problems

Penn collection
Management Papers
Degree type
Discipline
Subject
exploration and exploitation
maximization
multistage problems
reinforcement learning
softmax choice rule
Business Administration, Management, and Operations
Organizational Behavior and Theory
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Fang, Christina
Levinthal, Daniel A
Contributor
Abstract

The classic trade-off between exploration and exploitation reflects the tension between gaining new information about alternatives to improve future returns and using the information currently available to improve present returns. By considering these issues in the context of a multistage, as opposed to a repeated, problem environment, we show that exploratory behavior has value quite apart from its role in revising beliefs. We show that even if current beliefs provide an unbiased characterization of the problem environment, maximizing with respect to these beliefs may lead to an inferior expected payoff relative to other mechanisms that make less aggressive use of the organization's beliefs. Search can lead to more robust actions in multistage decision problems than maximization, a benefit quite apart from its role in the updating of beliefs.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2009-05-01
Journal title
Organization Science
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation
Collection