Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms

Frits de Nijs; Erwin Walraven; Mathijs M. de Weerdt; Matthijs T.J. Spaan

doi:10.1613/JAIR.1.12233

Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms

Frits de Nijs, Erwin Walraven, Mathijs M. de Weerdt, Matthijs T.J. Spaan

Algorithmics

Research output: Contribution to journal › Article › Scientific › peer-review

7 Citations (Scopus)

30 Downloads (Pure)

Abstract

In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.

Original language	English
Pages (from-to)	955-1001
Number of pages	47
Journal	Journal of Artificial Intelligence Research
Volume	70
DOIs	https://doi.org/10.1613/JAIR.1.12233
Publication status	Published - 2021

Access to Document

10.1613/JAIR.1.12233Licence: Other

12233-Article (PDF)-26325-1-10-20210308Final published version, 568 KBLicence: Other

Cite this

@article{feb2cb7e3a694b50bad5aed99d251d4d,

title = "Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms",

abstract = "In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.",

author = "{de Nijs}, Frits and Erwin Walraven and {de Weerdt}, {Mathijs M.} and Spaan, {Matthijs T.J.}",

year = "2021",

doi = "10.1613/JAIR.1.12233",

language = "English",

volume = "70",

pages = "955--1001",

journal = "Journal of Artificial Intelligence Research",

issn = "1076-9757",

publisher = "Morgan Kaufmann Publishers",

}

TY - JOUR

T1 - Constrained multiagent Markov decision processes

T2 - A taxonomy of problems and algorithms

AU - de Nijs, Frits

AU - Walraven, Erwin

AU - de Weerdt, Mathijs M.

AU - Spaan, Matthijs T.J.

PY - 2021

Y1 - 2021

N2 - In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.

AB - In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.

UR - http://www.scopus.com/inward/record.url?scp=85103679548&partnerID=8YFLogxK

U2 - 10.1613/JAIR.1.12233

DO - 10.1613/JAIR.1.12233

M3 - Article

AN - SCOPUS:85103679548

SN - 1076-9757

VL - 70

SP - 955

EP - 1001

JO - Journal of Artificial Intelligence Research

JF - Journal of Artificial Intelligence Research

ER -

Constrained multiagent Markov decision processes: A taxonomy of problems and algorithms

Abstract

Access to Document

Other files and links

Fingerprint

Cite this