Statistically Efficient, Polynomial-Time Algorithms for Combinatorial Semi-Bandits
Author(s) -
Thibaut Cuvelier,
Richard Combes,
Éric Gourdin
Publication year - 2021
Publication title -
proceedings of the acm on measurement and analysis of computing systems
Language(s) - English
Resource type - Journals
ISSN - 2476-1249
DOI - 10.1145/3447387
Subject(s) - regret , combinatorics , mathematics , function (biology) , time complexity , exponential time hypothesis , maximization , computational complexity theory , discrete mathematics , exponential function , algorithm , set (abstract data type) , approximation algorithm , polynomial , constraint (computer aided design) , mathematical optimization , computer science , statistics , mathematical analysis , geometry , evolutionary biology , biology , programming language
We consider combinatorial semi-bandits over a set of arms X \subset \0,1\ ^d where rewards are uncorrelated across items. For this problem, the algorithm ESCB yields the smallest known regret bound R(T) = O( d (łn m)^2 (łn T) / Δ_\min ) after T rounds, where m = \max_x \in X 1^\top x. However, ESCB it has computational complexity O(|X|), which is typically exponential in d, and cannot be used in large dimensions. We propose the first algorithm that is both computationally and statistically efficient for this problem with regret R(T) = O( d (łn m)^2 (łn T) / Δ_\min ) and computational asymptotic complexity O(δ_T^-1 poly(d)), where δ_T is a function which vanishes arbitrarily slowly. Our approach involves carefully designing AESCB, an approximate version of ESCB with the same regret guarantees. We show that, whenever budgeted linear maximization over X can be solved up to a given approximation ratio, AESCB is implementable in polynomial time O(δ_T^-1 poly(d)) by repeatedly maximizing a linear function over X subject to a linear budget constraint, and showing how to solve these maximization problems efficiently.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom