Learning Tractable Word Alignment Models with Complex Constraints

Ganchev, Kuzman; Graça, João V.; Taskar, Ben

Learning Tractable Word Alignment Models with Complex Constraints

dc.contributor.author	Ganchev, Kuzman
dc.contributor.author	Graça, João V.
dc.contributor.author	Taskar, Ben
dc.date	2023-05-17T06:02:46.000
dc.date.accessioned	2023-05-22T19:19:48Z
dc.date.available	2023-05-22T19:19:48Z
dc.date.issued	2010-03-10
dc.date.submitted	2011-02-01T10:26:29-08:00
dc.description.abstract	Word-level alignment of bilingual text is a critical resource for a growing variety of tasks. Probabilistic models for word alignment present a fundamental trade-off between richness of captured constraints and correlations versus efficiency and tractability of inference. In this article, we use the Posterior Regularization framework (Graça, Ganchev, and Taskar 2007) to incorporate complex constraints into probabilistic models during learning without changing the efficiency of the underlying model. We focus on the simple and tractable hidden Markov model, and present an efficient learning algorithm for incorporating approximate bijectivity and symmetry constraints. Models estimated with these constraints produce a significant boost in performance as measured by both precision and recall of manually annotated alignments for six language pairs. We also report experiments on two different tasks where word alignments are required: phrase-based machine translation and syntax transfer, and show promising improvements over standard methods.
dc.description.comments	Suggested Citation: J. Graça, K. Ganchev and B. Taskar. (2010). "Learning TractableWord AlignmentModels with Complex Constraints." Computational Linguistics. Vol. 36(3). p. 481-504. © 2010 MIT Press http://www.mitpressjournals.org/loi/coli
dc.identifier.uri	https://repository.upenn.edu/handle/20.500.14332/34820
dc.legacy.articleid	1068
dc.legacy.fields	true
dc.legacy.fulltexturl	https://repository.upenn.edu/cgi/viewcontent.cgi?article=1068&context=grasp_papers&unstamped=1
dc.source.issue	66
dc.source.journal	Lab Papers (GRASP)
dc.source.peerreviewed	true
dc.source.status	published
dc.subject.other	Engineering
dc.title	Learning Tractable Word Alignment Models with Complex Constraints
dc.type	Article
digcom.identifier	grasp_papers/66
digcom.identifier.contextkey	1756674
digcom.identifier.submissionpath	grasp_papers/66
digcom.type	article
dspace.entity.type	Publication
relation.isAuthorOfPublication	5b360bfb-5497-43dc-9a15-0236987ccc59
relation.isAuthorOfPublication	5b360bfb-5497-43dc-9a15-0236987ccc59
relation.isAuthorOfPublication	48084f74-55a3-43da-96d7-8a01c512b3b9
relation.isAuthorOfPublication.latestForDiscovery	5b360bfb-5497-43dc-9a15-0236987ccc59
upenn.schoolDepartmentCenter	Lab Papers (GRASP)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Learning_Tractable_Word_Alignment.pdf
Size:: 684.46 KB
Format:: Adobe Portable Document Format

Download

Collection

Articles

Learning Tractable Word Alignment Models with Complex Constraints

Files

Original bundle

Collection

Usage statistics

Penn's Heritage