z-logo
open-access-imgOpen Access
Learning aquatic locomotion with animats
Author(s) -
Dennis G. Wilson,
Jean Disset,
Sylvain CussatBlanc,
Yves Duthen,
Hervé Luga
Publication year - 2017
Language(s) - English
Resource type - Conference proceedings
DOI - 10.7551/ecal_a_092
Subject(s) - hebbian theory , computer science , reinforcement learning , artificial intelligence , cma es , artificial neural network , adaptation (eye) , spiking neural network , spike timing dependent plasticity , parameterized complexity , controller (irrigation) , machine learning , evolution strategy , neuroscience , synaptic plasticity , evolutionary algorithm , algorithm , agronomy , biochemistry , chemistry , receptor , biology
One of the challenges of researching spiking neural networks (SNN) is translation from temporal spiking behavior to classic controller output. While many encoding schemes exist to facilitate this translation, there are few benchmarks for neural networks that inherently utilize a temporal controller. In this work, we consider the common reinforcement problem of animat locomotion in an environment suited for evaluating SNNs. Using this problem, we explore novel methods of reward distribution as they impacts learning. Hebbian learning, in the form of spike time dependent plasticity (STDP), is modulated by a dopamine signal and affected by reward-induced neural activity. Different reward strategies are parameterized and the Covariance Matrix Adaptation Evolution Strategy (CMA-ES) is used to find the best strategies for fixed animat morphologies. The contribution of this work is two-fold: to cast the problem of animat locomotion in a form directly applicable to simple temporal controllers, and to demonstrate n...

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom