Learning from eXtreme Bandit Feedback
Author(s) -
Romain Lopez,
Inderjit S. Dhillon,
Michael I. Jordan
Publication year - 2021
Publication title -
proceedings of the aaai conference on artificial intelligence
Language(s) - English
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v35i10.17058
Subject(s) - estimator , computer science , pruning , machine learning , variance (accounting) , benchmark (surveying) , sampling (signal processing) , artificial intelligence , multi armed bandit , thompson sampling , supervised learning , minimum variance unbiased estimator , mathematics , statistics , bayesian probability , artificial neural network , filter (signal processing) , regret , accounting , geodesy , agronomy , business , computer vision , biology , geography
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom