z-logo
open-access-imgOpen Access
Are Multilingual Neural Machine Translation Models Better at Capturing Linguistic Features?
Author(s) -
David Mareček,
Hande Çelikkanat,
Miikka Silfverberg,
Vinit Ravishankar,
Jörg Tiedemann
Publication year - 2020
Publication title -
˜the œprague bulletin of mathematical linguistics
Language(s) - English
Resource type - Journals
eISSN - 1804-0462
pISSN - 0032-6585
DOI - 10.14712/00326585.009
Subject(s) - machine translation , computer science , natural language processing , artificial intelligence , translation (biology) , linguistics , chemistry , philosophy , biochemistry , messenger rna , gene
We investigate the effect of training NMTmodels on multiple target languages. We hypothesize that the integration of multiple languages and the increase of linguistic diversity will lead to a stronger representation of syntactic and semantic features captured by the model. We test our hypothesis on two different NMT architectures: The widely-used Transformer architecture and the Attention Bridge architecture. We train models on Europarl data and quantify the level of syntactic and semantic information discovered by the models using three different methods: SentEval linguistic probing tasks, an analysis of the attention structures regarding the inherent phrase and dependency information and a structural probe on contextualized word representations. Our results show evidence that with growing number of target languages the Attention Bridge model increasingly picks up certain linguistic properties including some syntactic and semantic aspects of the sentence whereas Transformermodels are largely unaffected. The latter also applies to phrase structure and syntactic dependencies that do not seem to be developing in sentence representationswhen increasing the linguistic diversity in training to translate. This is rather surprising and may hint on the relatively little influence of grammatical structure on language understanding.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom