Abstract
Distributed model predictive control (DMPC) concerns how to online control multiple robotic systems with constraints effectively. However, the nonlinearity, nonconvexity, and strong interconnections of dynamic system models and constraints can make the real-time and real-world DMPC implementations nontrivial. Reinforcement learning (RL) algorithms are promising for control policy design. However, how to ensure safety in terms of state constraints in RL remains a significant issue. This paper proposes a barrier function-based safe reinforcement learning algorithm for DMPC of nonlinear multi-robot systems under state constraints. The proposed approach is composed of several local learning-based MPC regulators. Each regulator, associated with a local system, learns and deploys the local control policy using a safe reinforcement learning algorithm in a distributed manner, i.e., with state information only among the neighbor agents. As a prominent feature of the proposed algorithm, we present a novel barrier-based policy structure to ensure safety, which has a clear mechanistic interpretation. Both simulated and real-world experiments on the formation control of mobile robots with collision avoidance show the effectiveness of the proposed safe reinforcement learning algorithm for DMPC.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Conference on Robotics and Automation (ICRA 2022) |
Editors | George J. Pappas, Vijay Kumar |
Publisher | IEEE |
Pages | 5532-5538 |
ISBN (Electronic) | 978-1-7281-9681-7 |
ISBN (Print) | 978-1-7281-9680-0 |
DOIs | |
Publication status | Published - 2022 |
Event | 2022 International Conference on Robotics and Automation (ICRA) - Philadelphia, United States Duration: 23 May 2022 → 27 May 2022 Conference number: 39 |
Conference
Conference | 2022 International Conference on Robotics and Automation (ICRA) |
---|---|
Abbreviated title | ICRA 2022 |
Country/Territory | United States |
City | Philadelphia |
Period | 23/05/22 → 27/05/22 |
Bibliographical note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-careOtherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
Keywords
- Regulators
- Heuristic algorithms
- Reinforcement learning
- Prediction algorithms
- Formation control
- Safety
- Mobile robots