Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints
Author(s) -
Nikola Mrkšić,
Ivan Vulić,
Diarmuid Ó Séaghdha,
Ira Leviant,
Roi Reichart,
Milica Gašić,
Anna Korhonen,
Steve Young
Publication year - 2017
Publication title -
transactions of the association for computational linguistics
Language(s) - English
Resource type - Journals
ISSN - 2307-387X
DOI - 10.1162/tacl_a_00063
Subject(s) - computer science , word (group theory) , task (project management) , natural language processing , artificial intelligence , quality (philosophy) , vector space , resource (disambiguation) , similarity (geometry) , construct (python library) , semantic similarity , image (mathematics) , programming language , linguistics , mathematics , geometry , management , epistemology , economics , computer network , philosophy
We present Attract-Repel, an algorithm for improving the semantic quality of word vectors by injecting constraints extracted from lexical resources. Attract-Repel facilitates the use of constraints from mono- and cross-lingual resources, yielding semantically specialized cross-lingual vector spaces. Our evaluation shows that the method can make use of existing cross-lingual lexicons to construct high-quality vector spaces for a plethora of different languages, facilitating semantic transfer from high- to lower-resource ones. The effectiveness of our approach is demonstrated with state-of-the-art results on semantic similarity datasets in six languages. We next show that Attract-Repel-specialized vectors boost performance in the downstream task of dialogue state tracking (DST) across multiple languages. Finally, we show that cross-lingual vector spaces produced by our algorithm facilitate the training of multilingual DST models, which brings further performance improvements.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom