z-logo
open-access-imgOpen Access
Chaining Value Functions for Off-Policy Learning
Author(s) -
Simon Schmitt,
John ShaweTaylor,
Hado van Hasselt
Publication year - 2022
Publication title -
proceedings of the ... aaai conference on artificial intelligence
Language(s) - Uncategorized
Resource type - Journals
eISSN - 2374-3468
pISSN - 2159-5399
DOI - 10.1609/aaai.v36i8.20792
Subject(s) - chaining , reinforcement learning , computer science , bootstrapping (finance) , counterfactual conditional , value (mathematics) , process (computing) , artificial intelligence , econometrics , machine learning , mathematics , counterfactual thinking , psychology , philosophy , epistemology , psychotherapist , operating system

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here