Learning scalable and efficient communication policies for multi-robot collision avoidance

Álvaro Serra-Gómez; Hai Zhu; B.F. Ferreira de Brito; Wendelin Böhmer; Javier Alonso-Mora

doi:10.1007/s10514-023-10127-3

Álvaro Serra-Gómez^*, Hai Zhu, B.F. Ferreira de Brito, Wendelin Böhmer, Javier Alonso-Mora

^*Corresponding author for this work

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of “when” and “with whom” to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots’ states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.

Original language	English
Pages (from-to)	1275-1297
Number of pages	23
Journal	Autonomous Robots
Volume	47
Issue number	8
DOIs	https://doi.org/10.1007/s10514-023-10127-3
Publication status	Published - 2023

Keywords

Aerial robots
Collision avoidance
Multi-agent reinforcement learning
Multi-robot communication
Multi-robot systems

Access to Document

10.1007/s10514-023-10127-3

s10514-023-10127-3: Final published version, 2.71 MB, Licence: CC BY

Cite this

@article{d0853c0f0c0f4ae681cf66ad45f8eb67,
  title = "Learning scalable and efficient communication policies for multi-robot collision avoidance",
  abstract = "Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of ``when'' and ``with whom'' to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots{\textquoteright} states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.",
  keywords = "Aerial robots, Collision avoidance, Multi-agent reinforcement learning, Multi-robot communication, Multi-robot systems",
  author = "{\'A}lvaro Serra-G{\'o}mez and Hai Zhu and {Ferreira de Brito}, B.F. and Wendelin B{\"o}hmer and Javier Alonso-Mora",
  year = "2023",
  doi = "10.1007/s10514-023-10127-3",
  language = "English",
  volume = "47",
  pages = "1275--1297",
  journal = "Autonomous Robots",
  issn = "0929-5593",
  publisher = "Springer",
  number = "8",
}

TY - JOUR

T1 - Learning scalable and efficient communication policies for multi-robot collision avoidance

AU - Serra-Gómez, Álvaro

AU - Zhu, Hai

AU - Ferreira de Brito, B.F.

AU - Böhmer, Wendelin

AU - Alonso-Mora, Javier

PY - 2023

Y1 - 2023

N2 - Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of “when” and “with whom” to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots’ states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.

AB - Decentralized multi-robot systems typically perform coordinated motion planning by constantly broadcasting their intentions to avoid collisions. However, the risk of collision between robots varies as they move and communication may not always be needed. This paper presents an efficient communication method that addresses the problem of “when” and “with whom” to communicate in multi-robot collision avoidance scenarios. In this approach, each robot learns to reason about other robots’ states and considers the risk of future collisions before asking for the trajectory plans of other robots. We introduce a new neural architecture for the learned communication policy which allows our method to be scalable. We evaluate and verify the proposed communication strategy in simulation with up to twelve quadrotors, and present results on the zero-shot generalization/robustness capabilities of the policy in different scenarios. We demonstrate that our policy (learned in a simulated environment) can be successfully transferred to real robots.

KW - Aerial robots

KW - Collision avoidance

KW - Multi-agent reinforcement learning

KW - Multi-robot communication

KW - Multi-robot systems

UR - http://www.scopus.com/inward/record.url?scp=85168347475&partnerID=8YFLogxK

U2 - 10.1007/s10514-023-10127-3

DO - 10.1007/s10514-023-10127-3

M3 - Article

AN - SCOPUS:85168347475

SN - 0929-5593

VL - 47

SP - 1275

EP - 1297

JO - Autonomous Robots

JF - Autonomous Robots

IS - 8

ER -
