Operations, Information and Decisions Papers

Document Type

Journal Article

Date of this Version

1999

Publication Source

Journal of Management Information Systems

Volume

16

Issue

1

Start Page

17

Last Page

36

DOI

10.1080/07421222.1999.11518232

Abstract

The paper reports on conceptual development in the areas of database mining and knowledge discovery in databases (KDD). Our efforts have also led to a prototype implementation, called MOTC, for exploring hypothesis space in large and complex data sets. Our KDD conceptual development rests on two main principles. First, we use the crosstab representation for working with qualitative data. This is by now standard in on-line analytical processing (OLAP) applications, and we reaffirm it with additional reasons. Second, and innovatively, we use prediction analysis as a measure of goodness for hypotheses. Prediction analysis is an established statistical technique for analysis of associations among qualitative variables. It generalizes and subsumes a large number of other such measures of association, depending on specific assumptions the user is willing to make. As such, it provides a very useful framework for exploring hypothesis space in a KDD context. The paper illustrates these points with an extensive discussion of MOTC.

Copyright/Permission Statement

This is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Management Information Systems on 02 Dec 2015, available online: http://wwww.tandfonline.com/10.1080/07421222.1999.11518232

Keywords

data mining, data visualition, hypotheses exploration, knowledge discovery in databases, OLAP, prediction analysis

Embargo Date

6-2-2017

 

Date Posted: 27 November 2017

This document has been peer reviewed.