Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions

D. Du, S. Han, Naiming Qi, Haitham Bou Ammar, Jun Wang, W. Pan

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

40 Downloads (Pure)

Abstract

Reinforcement learning (RL) exhibits impressive performance when managing complicated control tasks for robots. However, its wide application to physical robots is limited by the absence of strong safety guarantees. To overcome this challenge, this paper explores the control Lyapunov barrier function (CLBF) to analyze the safety and reachability solely based on data without explicitly employing a dynamic model. We also proposed the Lyapunov barrier actor-critic (LBAC), a model-free RL algorithm, to search for a controller that satisfies the data-based approximation of the safety and reachability conditions. The proposed approach is demonstrated through simulation and real-world robot control experiments, i.e., a 2D quadrotor navigation task. The experimental findings reveal this approach's effectiveness in reachability and safety, surpassing other model-free RL methods.
Original languageEnglish
Title of host publicationProceedings of the IEEE International Conference on Robotics and Automation (ICRA 2023)
PublisherIEEE
Pages9442-9448
ISBN (Print)979-8-3503-2365-8
DOIs
Publication statusPublished - 2023
EventICRA 2023: International Conference on Robotics and Automation - London, United Kingdom
Duration: 29 May 20232 Jun 2023

Conference

ConferenceICRA 2023: International Conference on Robotics and Automation
Country/TerritoryUnited Kingdom
CityLondon
Period29/05/232/06/23

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Fingerprint

Dive into the research topics of 'Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions'. Together they form a unique fingerprint.

Cite this