
Sample Efficient On-line Learning of Optimal Dialogue Policies with Kalman Temporal Differences
Author(s) -
Olivier Pietquin,
Matthieu Geist,
Senthilkumar Chandramohan
Publication year - 2011
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - reinforcement learning , dialog box , computer science , task (project management) , sample (material) , policy learning , artificial intelligence , domain (mathematical analysis) , machine learning , transfer of learning , mathematics , chemistry , chromatography , mathematical analysis , management , world wide web , economics