Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes-Reference-Cited by-同舟云学术

Reinforcement learning approach to control an inverted pendulum: A general framework for educational purposes

Published:2023-02-13 Issue:2 Volume:18 Page:e0280071
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Israilov Sardor,Fu Li,Sánchez-Rodríguez Jesús^ORCID,Fusco Franco,Allibert Guillaume,Raufaste Christophe^ORCID,Argentina Médéric^ORCID

Abstract

Machine learning is often cited as a new paradigm in control theory, but is also often viewed as empirical and less intuitive for students than classical model-based methods. This is particularly the case for reinforcement learning, an approach that does not require any mathematical model to drive a system inside an unknown environment. This lack of intuition can be an obstacle to design experiments and implement this approach. Reversely there is a need to gain experience and intuition from experiments. In this article, we propose a general framework to reproduce successful experiments and simulations based on the inverted pendulum, a classic problem often used as a benchmark to evaluate control strategies. Two algorithms (basic Q-Learning and Deep Q-Networks (DQN)) are introduced, both in experiments and in simulation with a virtual environment, to give a comprehensive understanding of the approach and discuss its implementation on real systems. In experiments, we show that learning over a few hours is enough to control the pendulum with high accuracy. Simulations provide insights about the effect of each physical parameter and tests the feasibility and robustness of the approach.

Funder

ANR

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference26 articles.

1. History of Inverted-Pendulum Systems;KH Lundberg;IFAC Proceedings Volumes,2010

2. The inverted pendulum benchmark in nonlinear control theory: a survey;O Boubaker;International Journal of Advanced Robotic Systems,2013

3. Sugihara T, Nakamura Y, Inoue H. Real-time humanoid motion generation through ZMP manipulation based on inverted pendulum control. In: IEEE International Conference on Robotics and Automation. vol. 2; 2002. p. 1404–1409.

4. Lee GH, Jung S. Design and control of an inverted pendulum system for intelligent mechatronics system control education. In: IEEE/ASME International Conference on Advanced Intelligent Mechatronics; 2008. p. 1254–1259.

5. Lazarini AZN, de Souza Ribeiro JM, Jorgetto MFC. Low cost implementation of a inverted pendulum control system. In: 11th IEEE/IAS International Conference on Industry Applications; 2014. p. 1–5.

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Model Predictive Control-Based Reinforcement Learning;2024 IEEE International Symposium on Circuits and Systems (ISCAS);2024-05-19

2. Study of Q-learning and deep Q-network learning control for a rotary inverted pendulum system;Discover Applied Sciences;2024-02-02

3. Comprehensive Review of Metaheuristic Algorithms (MAs) for Optimal Control (OCl) Improvement;Archives of Computational Methods in Engineering;2024-01-31

4. Reliability evaluation of reinforcement learning methods for mechanical systems with increasing complexity;Multibody System Dynamics;2023-12-22

5. Robust Control of An Inverted Pendulum System Based on Policy Iteration in Reinforcement Learning;Applied Sciences;2023-12-12