
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
Author(s) -
Hongyao Tang,
Jianye Hao,
Guangyong Chen,
Pengfei Chen,
Chen Chen,
Yaodong Yang,
Luo Zhang,
Wulong Liu,
Zhaopeng Meng
Publication year - 2021
Publication title -
Proceedings of the ... AAAI Conference on Artificial Intelligence
Language(s) - English
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v35i11.17182
Subject(s) - Bellman equation, reinforcement learning, computer science, trajectory, representation (politics), value (mathematics), function (biology), latent variable, mathematical optimization, temporal difference learning, artificial intelligence, machine learning, mathematics, physics, astronomy, evolutionary biology, politics, political science, law, biology