Technical Reports (CIS)

PennAspect: A Two-Way Aspect Model Implementation

Andrew I. Schein, University of Pennsylvania
Alexandrin Popescul
Lyle H. Ungar, University of Pennsylvania

Document Type Technical Report

University of Pennsylvania Department of Computer and Information Science Technical Report No. MS-CIS-01-25.

Abstract

The two-way aspect model is a latent class statistical mixture model for performing soft clustering of co-occurrence data observations. It acts on data such as document/word pairs (words occurring in documents) or movie/people pairs (people see certain movies) to produce their joint distribution estimate. This document describes our software immplementation of the aspect model available under GNU Public License (included with the distribution). We call this package PennAspect. The distribution is packaged as Java source and class files. The software comes with no guarantees of any kind. We welcome user feedback and comments. To down load PennAspect, visit: http://www.cis.upenn.edu/datamining/software_distPennAspect/index.html.

 

Date Posted: 22 June 2007