Sequential decision making with vector outcomes | Zendy

Yossi Azar | Zendy; Uriel Felge | Zendy; Michal Feldman | Zendy; Moshe Tennenholtz | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Sequential decision making with vector outcomes

Author(s) -

Yossi Azar,

Uriel Felge,

Michal Feldman,

Moshe Tennenholtz

Publication year - 2014

Publication title -

citeseer x (the pennsylvania state university)

Language(s) - English

Resource type - Conference proceedings

DOI - 10.1145/2554797.2554817

Subject(s) - computer science , artificial intelligence , machine learning

We study a multi-round optimization setting in which in each round a player may select one of several actions, and each action produces an outcome vector, not observable to the player until the round ends. The final payoff for the player is computed by applying some known function f to the sum of all outcome vectors (e.g., the minimum of all coordinates of the sum). We show that standard notions of performance measure (such as comparison to the best single action) used in related expert and bandit settings (in which the payoff in each round is scalar) are not useful in our vector setting. Instead, we propose a different performance measure, and design algorithms that have vanishing regret with respect to our new measure.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research