Open Access
Sample Efficient On-line Learning of Optimal Dialogue Policies with Kalman Temporal Differences
Author(s) - Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan
Publication year - 2011
Publication title - HAL (Le Centre pour la Communication Scientifique Directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - reinforcement learning, dialog box, computer science, task (project management), sample (material), policy learning, artificial intelligence, domain (mathematical analysis), machine learning, transfer of learning, mathematics, chemistry, chromatography, mathematical analysis, management, world wide web, economics
