z-logo
open-access-imgOpen Access
Learning Continuous Phrase Representations for Translation Modeling
Author(s) -
Jianfeng Gao,
Xiaodong He,
Wen-tau Yih,
Li Deng
Publication year - 2014
Language(s) - English
Resource type - Conference proceedings
DOI - 10.3115/v1/p14-1066
Subject(s) - phrase , computer science , translation (biology) , machine translation , artificial intelligence , natural language processing , projection (relational algebra) , example based machine translation , space (punctuation) , artificial neural network , vector space , speech recognition , algorithm , mathematics , biochemistry , chemistry , messenger rna , gene , operating system , geometry
This paper tackles the sparsity problem in estimating phrase translation probabilities by learning continuous phrase representations, whose distributed nature enables the sharing of related phrases in their representations. A pair of source and target phrases are projected into continuous-valued vector representations in a low-dimensional latent space, where their translation score is computed by the distance between the pair in this new space. The projection is performed by a neural network whose weights are learned on parallel training data. Experimental evaluation has been performed on two WMT translation tasks. Our best result improves the performance of a state-of-the-art phrase-based statistical machine translation system trained on WMT 2012 French-English data by up to 1.3 BLEU points.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom