Trading off rewards and errors in multi-armed bandits
Author(s) -
Akram Erraqabi,
Alessandro Lazaric,
Michal Vaľko,
Emma Brunskill,
Yun-En Liu
Publication year - 2017
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - computer science , maximization , utility maximization , artificial intelligence , machine learning , mathematical optimization , economics , mathematics , mathematical economics
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom