
Trading off rewards and errors in multi-armed bandits
Author(s) -
Akram Erraqabi,
Alessandro Lazaric,
Michal Valko,
Emma Brunskill,
Yun-En Liu
Publication year - 2017
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - computer science , maximization , utility maximization , artificial intelligence , machine learning , mathematical optimization , economics , mathematics , mathematical economics