
Developing Monte Carlo Simulator of Reinforcement Learning Type
Author(s) -
Georgi Tsochev
Publication year - 2020
Publication title -
problems of engineering cybernetics and robotics
Language(s) - English
Resource type - Journals
eISSN - 2738-7364
pISSN - 2738-7356
DOI - 10.7546/pecr.73.20.04
Subject(s) - monte carlo method , python (programming language) , computer science , hybrid monte carlo , simulation , markov chain monte carlo , mathematics , statistics , operating system
Monte Carlo methods are a way to solve the reinforcement learning problem based on average test results. To ensure that well-defined results are available, Monte Carlo methods are used only for episodic tasks. The Monte Carlo term is often used more widely in any valuation method whose operation involves significant participation on a random basis. Here it is specifically used for methods based on the average of full results (as opposed to methods that are learned from incomplete results). The paper describes a simulator for estimating raindrops in a specific area using the package matlib. Keywords: Monte Carlo, reinforcement learning, simulation, matlib, python