PARENTing via Model-Agnostic Reinforcement Learning to Correct Pathological Behaviors in Data-to-Text Generation
Author(s) -
Clément Rebuffel,
Laure Soulier,
Geoffrey Scoutheeten,
Patrick Gallinari
Publication year - 2020
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - reinforcement learning , computer science , reinforcement , artificial intelligence , pathological , machine learning , cognitive psychology , psychology , social psychology , mathematics , mathematical analysis
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom