End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks-Reference-Cited by-同舟云学术

End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks

Published:2019-07-17 Issue: Volume:33 Page:3387-3395
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Cheng Richard,Orosz Gábor,Murray Richard M.,Burdick Joel W.

Abstract

Reinforcement Learning (RL) algorithms have found limited success beyond simulated applications, and one main reason is the absence of safety guarantees during the learning process. Real world systems would realistically fail or break before an optimal controller can be learned. To address this issue, we propose a controller architecture that combines (1) a model-free RL-based controller with (2) model-based controllers utilizing control barrier functions (CBFs) and (3) online learning of the unknown system dynamics, in order to ensure safety during learning. Our general framework leverages the success of RL algorithms to learn high-performance controllers, while the CBF-based controllers both guarantee safety and guide the learning process by constraining the set of explorable polices. We utilize Gaussian Processes (GPs) to model the system dynamics and its uncertainties. Our novel controller synthesis algorithm, RL-CBF, guarantees safety with high probability during the learning process, regardless of the RL algorithm used, and demonstrates greater policy exploration efficiency. We test our algorithm on (1) control of an inverted pendulum and (2) autonomous carfollowing with wireless vehicle-to-vehicle communication, and show that our algorithm attains much greater sample efficiency in learning than other state-of-the-art algorithms and maintains safety during the entire learning process.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 202 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A data-driven solution for intelligent power allocation of connected hybrid electric vehicles inspired by offline deep reinforcement learning in V2X scenario;Applied Energy;2024-10

2. Risk-Informed Model-Free Safe Control of Linear Parameter-Varying Systems;IEEE/CAA Journal of Automatica Sinica;2024-09

3. Invariant set estimation for piecewise affine dynamical systems using piecewise affine barrier function;European Journal of Control;2024-09

4. Safety reinforcement learning control via transfer learning;Automatica;2024-08

5. Safe Deep Reinforcement Learning-Based Controller (SDRLC) for Autonomous Navigation of Planetary Rovers;2024 IEEE Space, Aerospace and Defence Conference (SPACE);2024-07-22