User-Centric Operational Decision Making in Distributed Information Retrieval

Hosanagar, Kartik

Files

User_Centric_Operational_Decision_Making_in_Distri.pdf (383.22 KB)

Penn collection

Operations, Information and Decisions Papers

Subject

distributed information retrieval (IR)
personalization
utility theory
optimal operational decisions
source selection
query termination
stochastic modeling
Other Education
Other Social and Behavioral Sciences

Permalink

https://repository.upenn.edu/handle/20.500.14332/42130

Author

Hosanagar, Kartik

Abstract

Information specialists in enterprises regularly use distributed information retrieval (DIR) systems that query a large number of information retrieval (IR) systems, merge the retrieved results, and display them to users. There can be considerable heterogeneity in the quality of results returned by different IR servers. Further, because different servers handle collections of different sizes and have different processing and bandwidth capacities, there can be considerable heterogeneity in their response times. The broker in the DIR system has to decide which servers to query, how long to wait for responses, and which retrieved results to display based on the benefits and costs imposed on users. The benefit of querying more servers and waiting longer is the ability to retrieve more documents. The costs may take the form of access fees charged by IR servers or the cost to users of waiting for the servers to respond. We formulate the broker’s decision problem as a stochastic mixed-integer program and present analytical solutions for the problem. Using data gathered from FedStats, a system that queries IR engines of several U.S. federal agencies, we demonstrate that the technique can significantly increase the utility users derive from DIR systems. Finally, simulations suggest that the technique can be applied to solve the broker’s decision problem in more complex decision environments.
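
The trade-off described in the abstract (more servers and longer waits retrieve more documents, but incur access fees and waiting costs) can be illustrated with a toy calculation. The sketch below is not the paper's stochastic mixed-integer program or its analytical solution. It assumes, purely for illustration, that each server's response time is exponentially distributed, that a server's results are worth a fixed benefit if they arrive before a single deadline, and that the broker always waits until that deadline. The names Server, expected_utility, best_plan, and all parameters are hypothetical.

```python
from dataclasses import dataclass
from itertools import combinations
from math import exp


@dataclass(frozen=True)
class Server:
    name: str
    benefit: float  # assumed value to the user of this server's results
    fee: float      # assumed access fee charged per query
    rate: float     # rate of the assumed exponential response-time distribution


def expected_utility(servers, deadline, wait_cost):
    """Expected utility of querying `servers` and waiting until `deadline`.

    A server contributes its benefit only if it responds before the deadline,
    which happens with probability 1 - exp(-rate * deadline) under the
    exponential assumption. Fees are paid for every queried server, and the
    user pays `wait_cost` per unit of time for the full deadline.
    """
    gain = sum(s.benefit * (1.0 - exp(-s.rate * deadline)) for s in servers)
    fees = sum(s.fee for s in servers)
    return gain - fees - wait_cost * deadline


def best_plan(servers, deadlines, wait_cost):
    """Brute-force search over server subsets and candidate deadlines.

    Returns (utility, subset, deadline). Querying no server is the baseline
    and yields zero utility.
    """
    best = (0.0, (), 0.0)
    for k in range(1, len(servers) + 1):
        for subset in combinations(servers, k):
            for t in deadlines:
                u = expected_utility(subset, t, wait_cost)
                if u > best[0]:
                    best = (u, subset, t)
    return best
```

A call such as best_plan([Server("fast", 10.0, 1.0, 2.0), Server("pricey", 1.0, 5.0, 1.0)], [0.5, 1.0, 2.0], wait_cost=1.0) returns the highest-utility subset and deadline among the candidates. Exhaustive search is exponential in the number of servers, so it only serves to make the objective concrete for a handful of servers.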

Publication date

2011-12-01

Journal title

Information Systems Research

Collection

Articles