Automatic Induction of Rules for Text Simplification

Chandrasekar, R.; Srinivas, B.

Automatic Induction of Rules for Text Simplification

Files

96_30.pdf (165.19 KB)

Penn collection

IRCS Technical Reports Series

Subject

Cognitive Neuroscience
Theory and Algorithms

Permalink

https://repository.upenn.edu/handle/20.500.14332/37520

View all metadata

Author

Chandrasekar, R.

Srinivas, B.

Abstract

Long and complicated sentences pose various problems to many state-of-the-art natural language technologies. We have been exploring methods to automatically transform such sentences as to make them simpler. These methods involve the use of a rule-based system, driven by the syntax of the text in the domain of interest. Hand-crafting rules for every domain is time-consuming and impractical. This paper describes an algorithm and an implementation by which generalized rules for simplification are automatically induced from annotated training material with a novel partial parsing technique which combines constituent structure and dependency information. This algorithm described in the paper employs example-based generalizations on linguistically-motivated structures.

Publication date

1996-12-01

Comments

University of Pennsylvania Institute for Research in Cognitive Science Technical Report No. IRCS-96-30.

Collection

Reports