Multiway Clustering for Creating Biomedical Term Sets

Loading...
Thumbnail Image
Penn collection
Statistics Papers
Degree type
Discipline
Subject
Computer Sciences
Other Statistics and Probability
Funder
Grant number
License
Copyright date
Distributor
Related resources
Author
Kandylas, Vasileios
Ungar, Lyle
Sandler, Ted
Jensen, Shane T
Contributor
Abstract

We present an EM-based clustering method that can be used for constructing or augmenting ontologies such as MeSH. Our algorithm simultaneously clusters verbs and nouns using both verb-noun and noun-noun co-occurrence pairs. This strategy provides greater coverage of words than using either set of pairs alone, since not all words appear in both datasets. We demonstrate it on data extracted from Medline and evaluate the results using MeSH and Wordnet.

Advisor
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Publication date
2008-01-01
Journal title
2013 IEEE International Conference on Bioinformatics and Biomedicine 2008
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Recommended citation
Collection