Adapting to All Domains at Once: Rewarding Domain Invariance in SMT | Zendy

Hoang Manh Cuong | Zendy; Khalil Sima’an | Zendy; Ivan Titov | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Adapting to All Domains at Once: Rewarding Domain Invariance in SMT

Author(s) -

Hoang Manh Cuong,

Khalil Sima’an,

Ivan Titov

Publication year - 2016

Publication title -

transactions of the association for computational linguistics

Language(s) - English

Resource type - Journals

ISSN - 2307-387X

DOI - 10.1162/tacl_a_00086

Subject(s) - computer science , intuition , safer , domain (mathematical analysis) , artificial intelligence , domain adaptation , encode , machine learning , feature (linguistics) , computer security , mathematics , classifier (uml) , psychology , mathematical analysis , biochemistry , chemistry , linguistics , philosophy , gene , cognitive science

Existing work on domain adaptation for statistical machine translation has consistently assumed access to a small sample from the test distribution (target domain) at training time. In practice, however, the target domain may not be known at training time or it may change to match user needs. In such situations, it is natural to push the system to make safer choices, giving higher preference to domain-invariant translations, which work well across domains, over risky domain-specific alternatives. We encode this intuition by (1) inducing latent subdomains from the training data only; (2) introducing features which measure how specialized phrases are to individual induced sub-domains; (3) estimating feature weights on out-of-domain data (rather than on the target domain). We conduct experiments on three language pairs and a number of different domains. We observe consistent improvements over a baseline which does not explicitly reward domain invariance.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research