Flexible Heuristic Dynamic Programming for Reinforcement Learning in Quad-Rotors
Author(s) -
Alexander Helmer,
Coen C. de Visser,
Erik-Jan Van Kampen
Publication year - 2018
Publication title -
2018 aiaa information systems-aiaa infotech @ aerospace
Language(s) - English
Resource type - Conference proceedings
DOI - 10.2514/6.2018-2134
Subject(s) - reinforcement learning , markov decision process , curse of dimensionality , state space , computer science , q learning , action (physics) , heuristic , artificial intelligence , dynamic programming , bellman equation , process (computing) , state (computer science) , machine learning , markov process , mathematical optimization , mathematics , algorithm , statistics , physics , quantum mechanics , operating system
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom