Discourse Indicators for Content Selection in Summaization

Loading...
Thumbnail Image
Penn collection
Departmental Papers (CIS)
Degree type
Discipline
Subject
Computer Sciences
Funder
Grant number
License
Copyright date
Distributor
Related resources
Contributor
Abstract

We present analyses aimed at eliciting which specific aspects of discourse provide the strongest indication for text importance. In the context of content selection for single document summarization of news, we examine the benefits of both the graph structure of text provided by discourse relations and the semantic sense of these relations. We find that structure information is the most robust indicator of importance. Semantic sense only provides constraints on content selection but is not indicative of important content by itself. However, sense features complement structure information and lead to improved performance. Further, both types of discourse information prove complementary to non-discourse features. While our results establish the usefulness of discourse features, we also find that lexical overlap provides a simple and cheap alternative to discourse for computing text structure with comparable performance for the task of content selection.

Advisor
Date of presentation
2010-09-01
Conference name
Departmental Papers (CIS)
Conference dates
2023-05-17T07:16:39.000
Conference location
Date Range for Data Collection (Start Date)
Date Range for Data Collection (End Date)
Digital Object Identifier
Series name and number
Volume number
Issue number
Publisher
Publisher DOI
Journal Issue
Comments
Louis, A., Joshi, A., & Nenkova, A., Discourse Indicators for Content Selection in Summarization, The 11th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Sept. 2010, doi: http://www.aclweb.org/anthology/W10-4327
Recommended citation
Collection