Part-of-Speech Tagging using Conditional Random Fields: Exploiting Sub-Label Dependencies for Improved Accuracy
Author(s) -
Miikka Silfverberg,
Teemu Ruokolainen,
Krister Lindén,
Mikko Kurimo
Publication year - 2014
Language(s) - English
Resource type - Conference proceedings
DOI - 10.3115/v1/p14-2043
Subject(s) - conditional random field , computer science , speech recognition , artificial intelligence , part of speech tagging , pattern recognition (psychology) , natural language processing , part of speech
We discuss part-of-speech (POS) tagging in presence of large, fine-grained label sets using conditional random fields (CRFs). We propose improving tagging accuracy by utilizing dependencies within sub-components of the fine-grained labels. These sub-label dependencies are incorporated into the CRF model via a (relatively) straightforward feature extraction scheme. Experiments on five languages show that the approach can yield significant improvement in tagging accuracy in case the labels have sufficiently rich inner structure.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom