
Technical Reports (CIS)
Title
Further Results and Analysis of Icelandic Part of Speech Tagging
Document Type
Technical Report
Date of this Version
April 2008
Abstract
Data-driven POS tagging has achieved good performance for English, but can still lag behind linguistic rule-based taggers for morphologically complex languages, such as Icelandic. We extend a statistical tagger to handle fine-grained tagsets and improve over the best Icelandic POS tagger. Additionally, we develop a case tagger for non-local case and gender decisions. An error analysis of our system suggests future directions. This paper presents further results and analysis extending the original work (Dredze and Wallenberg, 2008).
Date Posted: 05 May 2008

Comments
University of Pennsylvania Department of Computer and Information Science Technical Report No. MS-CIS-08-13.