
Multi-dimensional Arms for Combinatorial Multi-armed Bandit
Author(s) -
Qi Li,
Lijun Cai
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3590438
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
The multi-armed bandit (MAB) problem with concave rewards has indeed become a significant area of research in recent years. In MAB problem, maximizing total rewards within a fixed number of pulls is a common objective. Previous works have indeed made substantial advancements in designing efficient online selection algorithms. However, the limitation of these works is that they fail to achieve a sublinear regret bound with multi-dimensional arms. In this paper, we study a combinatorial MAB problem with concave rewards and multi-dimensional arms for online advertising delivery. At first, we treat a multi-dimensional arm as a vector and adapt the choice of arm based on contextual information. Moreover, we utilize the confidence ellipsoid to construct optimistic estimate. Based on optimistic estimate, we combine online convex optimization with bandit methods to design selection algorithm. Our algorithm’s can achieve a sublinear regret bound of O (√ T ) with probability guarantees in T selection rounds. At last, we evaluate the performance of our algorithm through extensive simulations and demonstrate that it is better than the baseline algorithms.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom