z-logo
open-access-imgOpen Access
Sample Efficient On-line Learning of Optimal Dialogue Policies with Kalman Temporal Differences
Author(s) -
Olivier Pietquin,
Matthieu Geist,
Senthilkumar Chandramohan
Publication year - 2011
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - reinforcement learning , dialog box , computer science , task (project management) , sample (material) , policy learning , artificial intelligence , domain (mathematical analysis) , machine learning , transfer of learning , mathematics , chemistry , chromatography , mathematical analysis , management , world wide web , economics

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here