Constraint Propagation and Reverse Multi-Agent Learning

Research output: Contribution to conferencePaper

9 Downloads (Pure)

Abstract

The development of multi-agent reinforcement learning has been largely driven by the question of how to design learning algorithms to reach some particular notion of optimality of strategies, e.g. Nash equilibria. The set of optimal strategies is not known before the execution of the learning algorithm,
however we can often immediately identify a set of clearly undesirable outcomes. Therefore, we propose to consider a dual problem: given a collection of agent algorithms and a collection of unwanted strategy profiles, can one identify a set
of starting strategies that invariably lead there? This leads us to study the algorithmic problem of backpropagation of con-straints defining the forbidden region by learning dynamics,

through the lens of set-valued maps and interval arithmetics.
Original languageEnglish
Number of pages4
Publication statusPublished - 2021
EventCOMARL AAAI 2021: Spring Symposium Series - Stanford University, Palo Alto, United States
Duration: 22 Mar 202123 Mar 2021

Conference

ConferenceCOMARL AAAI 2021
CountryUnited States
CityPalo Alto
Period22/03/2123/03/21

Bibliographical note

Accepted author manuscript

Fingerprint

Dive into the research topics of 'Constraint Propagation and Reverse Multi-Agent Learning'. Together they form a unique fingerprint.

Cite this