
Multiple-step greedy policies in online and approximate reinforcement learning
Author(s) - Yonathan Efroni, Gal Dalal, Bruno Scherrer, Shie Mannor
Publication year - 2018
Publication title - hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - reinforcement learning, computer science, greedy algorithm, artificial intelligence, mathematical optimization, algorithm, mathematics