z-logo
open-access-imgOpen Access
Better Evaluation of ASR in Speech Translation Context Using Word Embeddings
Author(s) -
Ngoc-Tien Le,
Christophe Servan,
Benjamin Lecouteux,
Laurent Besacier
Publication year - 2016
Publication title -
interspeech 2022
Language(s) - English
Resource type - Conference proceedings
DOI - 10.21437/interspeech.2016-464
Subject(s) - computer science , speech recognition , word (group theory) , natural language processing , translation (biology) , context (archaeology) , speech translation , artificial intelligence , machine translation , linguistics , philosophy , paleontology , biology , biochemistry , chemistry , messenger rna , gene
This paper investigates the evaluation of ASR in spoken language translation context. More precisely, we propose a simple extension of WER metric in order to penalize differently substitution errors according to their context using word embeddings. For instance, the proposed metric should catch near matches (mainly morphological variants) and penalize less this kind of error which has a more limited impact on translation performance. Our experiments show that the correlation of the new proposed metric with SLT performance is better than the one of WER. Oracle experiments are also conducted and show the ability of our metric to find better hypotheses (to be translated) in the ASR N-best. Finally, a preliminary experiment where ASR tuning is based on our new metric shows encouraging results. For reproductible experiments, the code allowing to call our modified WER and the corpora used are made available to the research community.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom