Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots

Xinglong Zhang, Yaoqian Peng, W. Pan, Xin Xu, Haibin Xie

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

14 Downloads (Pure)

Abstract

Distributed model predictive control (DMPC) concerns how to online control multiple robotic systems with constraints effectively. However, the nonlinearity, nonconvexity, and strong interconnections of dynamic system models and constraints can make the real-time and real-world DMPC implementations nontrivial. Reinforcement learning (RL) algorithms are promising for control policy design. However, how to ensure safety in terms of state constraints in RL remains a significant issue. This paper proposes a barrier function-based safe reinforcement learning algorithm for DMPC of nonlinear multi-robot systems under state constraints. The proposed approach is composed of several local learning-based MPC regulators. Each regulator, associated with a local system, learns and deploys the local control policy using a safe reinforcement learning algorithm in a distributed manner, i.e., with state information only among the neighbor agents. As a prominent feature of the proposed algorithm, we present a novel barrier-based policy structure to ensure safety, which has a clear mechanistic interpretation. Both simulated and real-world experiments on the formation control of mobile robots with collision avoidance show the effectiveness of the proposed safe reinforcement learning algorithm for DMPC.
Original languageEnglish
Title of host publicationProceedings of the International Conference on Robotics and Automation (ICRA 2022)
EditorsGeorge J. Pappas, Vijay Kumar
PublisherIEEE
Pages5532-5538
ISBN (Electronic)978-1-7281-9681-7
ISBN (Print)978-1-7281-9680-0
DOIs
Publication statusPublished - 2022
Event2022 International Conference on Robotics and Automation (ICRA) - Philadelphia, United States
Duration: 23 May 202227 May 2022
Conference number: 39

Conference

Conference2022 International Conference on Robotics and Automation (ICRA)
Abbreviated titleICRA 2022
Country/TerritoryUnited States
CityPhiladelphia
Period23/05/2227/05/22

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • Regulators
  • Heuristic algorithms
  • Reinforcement learning
  • Prediction algorithms
  • Formation control
  • Safety
  • Mobile robots

Fingerprint

Dive into the research topics of 'Barrier Function-based Safe Reinforcement Learning for Formation Control of Mobile Robots'. Together they form a unique fingerprint.

Cite this