Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Author(s) -
Hongyao Tang,
Zhaopeng Meng,
Guangyong Chen,
Pengfei Chen,
Chen Chen,
Yaodong Yang,
Luo Zhang,
Wulong Liu,
Jianye Hao
Publication year - 2021
Publication title -
proceedings of the aaai conference on artificial intelligence
Language(s) - English
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v35i11.17182
Subject(s) - bellman equation , reinforcement learning , computer science , trajectory , representation (politics) , value (mathematics) , function (biology) , latent variable , mathematical optimization , temporal difference learning , artificial intelligence , machine learning , mathematics , physics , astronomy , evolutionary biology , politics , political science , law , biology
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom