MOTC: An Interactive Aid for Multidimensional Hypothesis Generation

Penn collection
Operations, Information and Decisions Papers
Discipline
Subject
data mining
data visualization
hypotheses exploration
knowledge discovery in databases
OLAP
prediction analysis
Databases and Information Systems
Other Computer Sciences
Quantitative, Qualitative, Comparative, and Historical Methodologies
Statistical Theory
Author
Balachandran, Krishnamohan
Buzydlowski, Jan
Dworman, Garett
Kimbrough, Steven O.
Shafer, Tate
Vachula, William J
Abstract

The paper reports on conceptual development in the areas of database mining and knowledge discovery in databases (KDD). Our efforts have also led to a prototype implementation, called MOTC, for exploring hypothesis space in large and complex data sets. Our KDD conceptual development rests on two main principles. First, we use the crosstab representation for working with qualitative data. This is by now standard in on-line analytical processing (OLAP) applications, and we reaffirm it with additional reasons. Second, and innovatively, we use prediction analysis as a measure of goodness for hypotheses. Prediction analysis is an established statistical technique for analysis of associations among qualitative variables. It generalizes and subsumes a large number of other such measures of association, depending on specific assumptions the user is willing to make. As such, it provides a very useful framework for exploring hypothesis space in a KDD context. The paper illustrates these points with an extensive discussion of MOTC.

Publication date
1999
Journal title
Journal of Management Information Systems