
Joint Optimization of Concave Scalarized Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm
Author(s) -
Qinbo Bai,
Mridul Agarwal,
Vaneet Aggarwal
Publication year - 2022
Publication title -
journal of artificial intelligence research/the journal of artificial intelligence research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.79
H-Index - 123
eISSN - 1943-5037
pISSN - 1076-9757
DOI - 10.1613/jair.1.13981
Subject(s) - reinforcement learning , convergence (economics) , mathematical optimization , estimator , term (time) , function (biology) , gradient method , computer science , mathematics , joint (building) , algorithm , artificial intelligence , statistics , physics , quantum mechanics , evolutionary biology , economics , biology , economic growth , architectural engineering , engineering