Learning Neural Sequence-to-Sequence Models from Weak Feedback with Bipolar Ramp Loss | Zendy

Laura Jehl | Zendy; Carolin Lawrence | Zendy; Stefan Riezler | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Learning Neural Sequence-to-Sequence Models from Weak Feedback with Bipolar Ramp Loss

Author(s) -

Laura Jehl,

Carolin Lawrence,

Stefan Riezler

Publication year - 2019

Publication title -

transactions of the association for computational linguistics

Language(s) - English

Resource type - Journals

ISSN - 2307-387X

DOI - 10.1162/tacl_a_00265

Subject(s) - computer science , machine translation , metric (unit) , machine learning , artificial intelligence , security token , sequence (biology) , parsing , task (project management) , supervised learning , artificial neural network , operations management , computer security , management , biology , economics , genetics

In many machine learning scenarios, supervision by gold labels is not available and consequently neural models cannot be trained directly by maximum likelihood estimation (MLE). In a weak supervision scenario, metric-augmented objectives can be employed to assign feedback to model outputs, which can be used to extract a supervision signal for training. We present several objectives for two separate weakly supervised tasks, machine translation and semantic parsing. We show that objectives should actively discourage negative outputs in addition to promoting a surrogate gold structure. This notion of bipolarity is naturally present in ramp loss objectives, which we adapt to neural models. We show that bipolar ramp loss objectives outperform other non-bipolar ramp loss objectives and minimum risk training (MRT) on both weakly supervised tasks, as well as on a supervised machine translation task. Additionally, we introduce a novel token-level ramp loss objective, which is able to outperform even the best sequence-level ramp loss on both weakly supervised tasks.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research