Bias Adaptive Statistical Inference Learning Agents for Learning from Human Feedback | Zendy

Jonathan I Watson | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Bias Adaptive Statistical Inference Learning Agents for Learning from Human Feedback

Author(s) -

Jonathan I Watson

Publication year - 2021

Publication title -

proceedings of the ... international florida artificial intelligence research society conference

Language(s) - English

Resource type - Journals

eISSN - 2334-0762

pISSN - 2334-0754

DOI - 10.32473/flairs.v34i1.128471

Subject(s) - computer science , oracle , inference , distortion (music) , artificial intelligence , heuristic , parametric statistics , value (mathematics) , signal (programming language) , machine learning , algorithm , mathematics , statistics , amplifier , computer network , software engineering , bandwidth (computing) , programming language

We present a novel technique for learning behaviors from ahuman provided feedback signal that is distorted by system-atic bias. Our technique, which we refer to as BASIL, modelsthe feedback signal as being separable into a heuristic evalu-ation of the utility of an action and a bias value that is drawnfrom a parametric distribution probabilistically, where thedistribution is defined by unknown parameters. We presentthe general form of the technique as well as a specific algo-rithm for integrating the technique with the TAMER algo-rithm for bias values drawn from a normal distribution. Wetest our algorithm against standard TAMER in the domain ofTetris using a synthetic oracle that provides feedback undervarying levels of distortion. We find our algorithm can learnvery quickly under bias distortions that entirely stymie thelearning of classic TAMER.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore