Exploiting Cross-Lingual Representations For Natural Language Processing

Upadhyay, Shyam

Exploiting Cross-Lingual Representations For Natural Language Processing

Files

Upadhyay_upenngdas_0175C_13618.pdf (8.92 MB)

Degree type

Doctor of Philosophy (PhD)

Graduate group

Computer and Information Science

Subject

cross-lingual
low supervision
multilingual
natural language processing
representation learning
Computer Sciences

Copyright date

2019-08-27T20:19:00-07:00

Permalink

https://repository.upenn.edu/handle/20.500.14332/30270

View all metadata

Author

Upadhyay, Shyam

Abstract

Traditional approaches to supervised learning require a generous amount of labeled data for good generalization. While such annotation-heavy approaches have proven useful for some Natural Language Processing (NLP) tasks in high-resource languages (like English), they are unlikely to scale to languages where collecting labeled data is di cult and time-consuming. Translating supervision available in English is also not a viable solution, because developing a good machine translation system requires expensive to annotate resources which are not available for most languages. In this thesis, I argue that cross-lingual representations are an effective means of extending NLP tools to languages beyond English without resorting to generous amounts of annotated data or expensive machine translation. These representations can be learned in an inexpensive manner, often from signals completely unrelated to the task of interest. I begin with a review of different ways of inducing such representations using a variety of cross-lingual signals and study algorithmic approaches of using them in a diverse set of downstream tasks. Examples of such tasks covered in this thesis include learning representations to transfer a trained model across languages for document classification, assist in monolingual lexical semantics like word sense induction, identify asymmetric lexical relationships like hypernymy between words in different languages, or combining supervision across languages through a shared feature space for cross-lingual entity linking. In all these applications, the representations make information expressed in other languages available in English, while requiring minimal additional supervision in the language of interest.

Advisor

Dan Roth

Date of degree

2019-01-01

Collection

Dissertations and Theses