
Balance Control of an Inverted Pendulum on a Quadruped Robot by Reinforcement Learning
Author(s) -
Shang-Yu Wu,
Xianqing Lei,
Linqi Ye
Publication year - 2022
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/2187/1/012024
Subject(s) - inverted pendulum , robot , reinforcement learning , terrain , control theory (sociology) , computer science , stability (learning theory) , flexibility (engineering) , adaptability , nonlinear system , set (abstract data type) , simulation , control (management) , artificial intelligence , mathematics , physics , ecology , statistics , quantum mechanics , machine learning , biology , programming language
When a quadruped robot crosses some uneven terrain, its stability is a very important consideration. In practical applications, it is often difficult for researchers to obtain accurate models of quadruped robot systems and need quadruped robots to complete high-stability tasks. Balancing an inverted pendulum on a quadruped robot is a good way to test their balancing ability, which is of great research and practical value. According to the nonlinear, uncertain, and strong coupling characteristics of the quadruped robot inverted pendulum system, this paper proposes a quadruped robot balancing inverted pendulum algorithm based on reinforcement learning, the Q-learning algorithm. The advantage of this algorithm is that it can learn effective balancing policy directly from experience, and it does not depend on the accurate model of the quadruped robot. It has the characteristics of high efficiency and flexibility. Through a lot of training, it can break through the performance limit brought by model error to traditional methods. In this paper, a comparative experiment of a set of balanced inverted pendulums of quadruped robots with different sizes is completed in the V-REP simulation software. The experimental results show that the algorithm can effectively improve the balance ability of quadruped robots, and it also shows that the algorithm has good adaptability.