Defensive Universal Learning with Experts | Zendy

Jan Poland | Zendy; Marcus Hütter | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Defensive Universal Learning with Experts

Author(s) -

Jan Poland,

Marcus Hütter

Publication year - 2005

Publication title -

lecture notes in computer science

Language(s) - English

Resource type - Book series

SCImago Journal Rank - 0.249

H-Index - 400

eISSN - 1611-3349

pISSN - 0302-9743

ISBN - 3-540-29242-X

DOI - 10.1007/11564089_28

Subject(s) - computer science , adversary , convergence (economics) , class (philosophy) , artificial intelligence , theoretical computer science , computer security , economics , economic growth

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedback from the actions actually chosen (bandit setup), (b) it can be applied with countably infinite expert classes, and (c) it copes with losses that may grow in time appropriately slowly. We prove loss bounds against an adaptive adversary. From this, we obtain a master algorithm for “reactive” experts problems, which means that the master’s actions may influence the behavior of the adversary. Our algorithm can significantly outperform standard experts algorithms on such problems. Finally, we combine it with a universal expert class. The resulting universal learner performs – in a certain sense – almost as well as any computable strategy, for any online decision problem. We also specify the (worst-case) convergence speed, which is very slow.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research