Learning scalable and efficient communication policies for multi-robot collision avoidance

Álvaro Serra-Gómez*, Hai Zhu, B.F. Ferreira de Brito, Wendelin Böhmer, Javier Alonso-Mora

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

24 Downloads (Pure)

Abstract

Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of “when” and “with whom” to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots’ states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.

Original languageEnglish
Pages (from-to)1275-1297
Number of pages23
JournalAutonomous Robots
Volume47
Issue number8
DOIs
Publication statusPublished - 2023

Keywords

  • Aerial robots
  • Collision avoidance
  • Multi-agent reinforcement learning
  • Multi-robot communication
  • Multi-robot systems

Fingerprint

Dive into the research topics of 'Learning scalable and efficient communication policies for multi-robot collision avoidance'. Together they form a unique fingerprint.

Cite this