Robust Event-Driven Interactions in Cooperative Multi-agent Learning

Daniel Jarne Ornia; Manuel Mazo

doi:10.1007/978-3-031-15839-1_16

Robust Event-Driven Interactions in Cooperative Multi-agent Learning

^*Corresponding author for this work

Team Manuel Mazo Jr

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

19 Downloads (Pure)

Abstract

We present an approach to safely reduce the communication required between agents in a Multi-Agent Reinforcement Learning system by exploiting the inherent robustness of the underlying Markov Decision Process. We compute robustness certificate functions (off-line), that give agents a conservative indication of how far their state measurements can deviate before they need to update other agents in the system with new measurements. This results in fully distributed decision functions, enabling agents to decide when it is necessary to communicate state variables. We derive bounds on the optimality of the resulting systems in terms of the discounted sum of rewards obtained, and show these bounds are a function of the design parameters. Additionally, we extend the results for the case where the robustness surrogate functions are learned from data, and present experimental results demonstrating a significant reduction in communication events between agents.

Original language	English
Title of host publication	Formal Modeling and Analysis of Timed Systems
Subtitle of host publication	20th International Conference, FORMATS 2022, Warsaw, Poland, September 13–15, 2022, Proceedings
Editors	Sergiy Bogomolov, David Parker
Publisher	Springer
Pages	281-297
ISBN (Electronic)	978-3-031-15839-1
ISBN (Print)	978-3-031-15838-4
DOIs	https://doi.org/10.1007/978-3-031-15839-1_16
Publication status	Published - 2022
Event	20th International Conference on Formal Modeling and Analysis of Timed Systems, FORMATS 2022 - Warsaw, Poland Duration: 13 Sept 2022 → 15 Sept 2022

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	13465 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	20th International Conference on Formal Modeling and Analysis of Timed Systems, FORMATS 2022
Country/Territory	Poland
City	Warsaw
Period	13/09/22 → 15/09/22

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Event-Triggered Communication
Multi-Agent Systems
Reinforcement Learning

Access to Document

10.1007/978-3-031-15839-1_16

978-3-031-15839-1_16Final published version, 790 KB

Cite this

Jarne Ornia, D., & Mazo, M. (2022). Robust Event-Driven Interactions in Cooperative Multi-agent Learning. In S. Bogomolov, & D. Parker (Eds.), Formal Modeling and Analysis of Timed Systems: 20th International Conference, FORMATS 2022, Warsaw, Poland, September 13–15, 2022, Proceedings (pp. 281-297). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13465 LNCS). Springer. https://doi.org/10.1007/978-3-031-15839-1_16

Jarne Ornia, Daniel ; Mazo, Manuel. / Robust Event-Driven Interactions in Cooperative Multi-agent Learning. Formal Modeling and Analysis of Timed Systems: 20th International Conference, FORMATS 2022, Warsaw, Poland, September 13–15, 2022, Proceedings. editor / Sergiy Bogomolov ; David Parker. Springer, 2022. pp. 281-297 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{1b8e09ee2e8e48e4ae9c77966fd13e44,

title = "Robust Event-Driven Interactions in Cooperative Multi-agent Learning",

abstract = "We present an approach to safely reduce the communication required between agents in a Multi-Agent Reinforcement Learning system by exploiting the inherent robustness of the underlying Markov Decision Process. We compute robustness certificate functions (off-line), that give agents a conservative indication of how far their state measurements can deviate before they need to update other agents in the system with new measurements. This results in fully distributed decision functions, enabling agents to decide when it is necessary to communicate state variables. We derive bounds on the optimality of the resulting systems in terms of the discounted sum of rewards obtained, and show these bounds are a function of the design parameters. Additionally, we extend the results for the case where the robustness surrogate functions are learned from data, and present experimental results demonstrating a significant reduction in communication events between agents.",

keywords = "Event-Triggered Communication, Multi-Agent Systems, Reinforcement Learning",

author = "{Jarne Ornia}, Daniel and Manuel Mazo",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.; 20th International Conference on Formal Modeling and Analysis of Timed Systems, FORMATS 2022 ; Conference date: 13-09-2022 Through 15-09-2022",

year = "2022",

doi = "10.1007/978-3-031-15839-1_16",

language = "English",

isbn = "978-3-031-15838-4",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "281--297",

editor = "Sergiy Bogomolov and David Parker",

booktitle = "Formal Modeling and Analysis of Timed Systems",

}

Jarne Ornia, D & Mazo, M 2022, Robust Event-Driven Interactions in Cooperative Multi-agent Learning. in S Bogomolov & D Parker (eds), Formal Modeling and Analysis of Timed Systems: 20th International Conference, FORMATS 2022, Warsaw, Poland, September 13–15, 2022, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13465 LNCS, Springer, pp. 281-297, 20th International Conference on Formal Modeling and Analysis of Timed Systems, FORMATS 2022, Warsaw, Poland, 13/09/22. https://doi.org/10.1007/978-3-031-15839-1_16

Robust Event-Driven Interactions in Cooperative Multi-agent Learning. / Jarne Ornia, Daniel ; Mazo, Manuel.
Formal Modeling and Analysis of Timed Systems: 20th International Conference, FORMATS 2022, Warsaw, Poland, September 13–15, 2022, Proceedings. ed. / Sergiy Bogomolov; David Parker. Springer, 2022. p. 281-297 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13465 LNCS).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Robust Event-Driven Interactions in Cooperative Multi-agent Learning

AU - Jarne Ornia, Daniel

AU - Mazo, Manuel

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2022

Y1 - 2022

N2 - We present an approach to safely reduce the communication required between agents in a Multi-Agent Reinforcement Learning system by exploiting the inherent robustness of the underlying Markov Decision Process. We compute robustness certificate functions (off-line), that give agents a conservative indication of how far their state measurements can deviate before they need to update other agents in the system with new measurements. This results in fully distributed decision functions, enabling agents to decide when it is necessary to communicate state variables. We derive bounds on the optimality of the resulting systems in terms of the discounted sum of rewards obtained, and show these bounds are a function of the design parameters. Additionally, we extend the results for the case where the robustness surrogate functions are learned from data, and present experimental results demonstrating a significant reduction in communication events between agents.

AB - We present an approach to safely reduce the communication required between agents in a Multi-Agent Reinforcement Learning system by exploiting the inherent robustness of the underlying Markov Decision Process. We compute robustness certificate functions (off-line), that give agents a conservative indication of how far their state measurements can deviate before they need to update other agents in the system with new measurements. This results in fully distributed decision functions, enabling agents to decide when it is necessary to communicate state variables. We derive bounds on the optimality of the resulting systems in terms of the discounted sum of rewards obtained, and show these bounds are a function of the design parameters. Additionally, we extend the results for the case where the robustness surrogate functions are learned from data, and present experimental results demonstrating a significant reduction in communication events between agents.

KW - Event-Triggered Communication

KW - Multi-Agent Systems

KW - Reinforcement Learning

UR - http://www.scopus.com/inward/record.url?scp=85137977563&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-15839-1_16

DO - 10.1007/978-3-031-15839-1_16

M3 - Conference contribution

AN - SCOPUS:85137977563

SN - 978-3-031-15838-4

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 281

EP - 297

BT - Formal Modeling and Analysis of Timed Systems

A2 - Bogomolov, Sergiy

A2 - Parker, David

PB - Springer

T2 - 20th International Conference on Formal Modeling and Analysis of Timed Systems, FORMATS 2022

Y2 - 13 September 2022 through 15 September 2022

ER -

Jarne Ornia D , Mazo M. Robust Event-Driven Interactions in Cooperative Multi-agent Learning. In Bogomolov S, Parker D, editors, Formal Modeling and Analysis of Timed Systems: 20th International Conference, FORMATS 2022, Warsaw, Poland, September 13–15, 2022, Proceedings. Springer. 2022. p. 281-297. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-15839-1_16