Domain Adaptation With Coupled Subspaces

Blitzer, John; Foster, Dean P; Kakade, Sham M

Domain Adaptation With Coupled Subspaces

Files

blitzer11a.pdf (1.21 MB)

Penn collection

Statistics Papers

Subject

Computer Sciences
Statistics and Probability

Permalink

https://repository.upenn.edu/handle/20.500.14332/47481

View all metadata

Author

Blitzer, John

Foster, Dean P

Kakade, Sham M

Abstract

Domain adaptation algorithms address a key issue in applied machine learning: How can we train a system under a source distribution but achieve high performance under a different target distribution? We tackle this question for divergent distributions where crucial predictive target features may not even have support under the source distribution. In this setting, the key intuition is that that if we can link target-specific features to source features, we can learn effectively using only source labeled data. We formalize this intuition, as well as the assumptions under which such coupled learning is possible. This allows us to give finite sample target error bounds (using only source training data) and an algorithm which performs at the state-of-the-art on two natural language processing adaptation tasks which are characterized by novel target features.

Publication date

2011-01-01

Collection

Working Papers