
LEARNING FROM DEMONSTRATIONS WITH SACR2: SOFT ACTOR-CRITIC WITH REWARD RELABELING
Author(s) -
Jesús Bujalance Martín,
Raphaël Chekroun,
Fabien Moutarde
Publication year - 2021
Publication title -
hal (le centre pour la communication scientifique directe)
Language(s) - English
Resource type - Conference proceedings
Subject(s) - reinforcement learning , inefficiency , computer science , artificial intelligence , process (computing) , task (project management) , robotics , temporal difference learning , focus (optics) , markov decision process , isolation (microbiology) , sample (material) , human–computer interaction , machine learning , robot , engineering , markov process , statistics , physics , chemistry , mathematics , microbiology and biotechnology , systems engineering , optics , chromatography , economics , biology , microeconomics , operating system