Sparsity in Dependency Grammar Induction

Taskar, Ben; Pereira, Fernando CN; Graca, Joao V; Gillenwater, Jennifer; Ganchev, Kuzman

Sparsity in Dependency Grammar Induction

Files

acl10.pdf (282.21 KB)

Penn collection

Center for Human Modeling and Simulation

Subject

Computer Sciences

Permalink

https://repository.upenn.edu/handle/20.500.14332/36402

View all metadata

Author

Taskar, Ben

Pereira, Fernando CN

Graca, Joao V

Gillenwater, Jennifer

Ganchev, Kuzman

Abstract

A strong inductive bias is essential in unsupervised grammar induction. We explore a particular sparsity bias in dependency grammars that encourages a small number of unique dependency types. Specifically, we investigate sparsity-inducing penalties on the posterior distributions of parent-child POS tag pairs in the posterior regularization (PR) framework of Graça et al. (2007). In experiments with 12 languages, we achieve substantial gains over the standard expectation maximization (EM) baseline, with average improvement in attachment accuracy of 6.3%. Further, our method outperforms models based on a standard Bayesian sparsity-inducing prior by an average of 4.9%. On English in particular, we show that our approach improves on several other state-of-the-art techniques.

Publication date

2010-07-01

Collection

Reports