Deep coordination graphs

Wendelin Böhmer; Vitaly Kurin; Shimon Whiteson

Deep coordination graphs

Wendelin Böhmer^*, Vitaly Kurin, Shimon Whiteson

^*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

34 Citations (Scopus)

Abstract

This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible tradeoff between representational capacity and generalization by factoring the joint value function of all agents according to a coordination graph into payoffs between pairs of agents. The value can be maximized by local message passing along the graph, which allows training of the value function end-to-end with Q-learning. Payoff functions are approximated with deep neural networks that employ parameter sharing and low-rank approximations to significantly improve sample efficiency. We show that DCG can solve predatorprey tasks that highlight the relative overgeneralization pathology, as well as challenging StarCraft II micromanagement tasks.

Original language	English
Title of host publication	37th International Conference on Machine Learning, ICML 2020
Editors	Hal Daume, Aarti Singh
Publisher	International Machine Learning Society (IMLS)
Pages	957-968
Number of pages	12
ISBN (Electronic)	9781713821120
Publication status	Published - 2020
Externally published	Yes
Event	37th International Conference on Machine Learning, ICML 2020 - Virtual, Online Duration: 12 Jul 2020 → 18 Jul 2020

Publication series

Name	37th International Conference on Machine Learning, ICML 2020
Volume	PartF168147-2

Conference

Conference	37th International Conference on Machine Learning, ICML 2020
City	Virtual, Online
Period	12/07/20 → 18/07/20

Cite this

@inproceedings{1095262df0dd477d9e3e9fd470f2392b,

title = "Deep coordination graphs",

abstract = "This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible tradeoff between representational capacity and generalization by factoring the joint value function of all agents according to a coordination graph into payoffs between pairs of agents. The value can be maximized by local message passing along the graph, which allows training of the value function end-to-end with Q-learning. Payoff functions are approximated with deep neural networks that employ parameter sharing and low-rank approximations to significantly improve sample efficiency. We show that DCG can solve predatorprey tasks that highlight the relative overgeneralization pathology, as well as challenging StarCraft II micromanagement tasks.",

author = "Wendelin B{\"o}hmer and Vitaly Kurin and Shimon Whiteson",

year = "2020",

language = "English",

series = "37th International Conference on Machine Learning, ICML 2020",

publisher = "International Machine Learning Society (IMLS)",

pages = "957--968",

editor = "Hal Daume and Aarti Singh",

booktitle = "37th International Conference on Machine Learning, ICML 2020",

note = "37th International Conference on Machine Learning, ICML 2020 ; Conference date: 12-07-2020 Through 18-07-2020",

}

Böhmer, W, Kurin, V & Whiteson, S 2020, Deep coordination graphs. in H Daume & A Singh (eds), 37th International Conference on Machine Learning, ICML 2020. 37th International Conference on Machine Learning, ICML 2020, vol. PartF168147-2, International Machine Learning Society (IMLS), pp. 957-968, 37th International Conference on Machine Learning, ICML 2020, Virtual, Online, 12/07/20.

Deep coordination graphs. / Böhmer, Wendelin; Kurin, Vitaly; Whiteson, Shimon.
37th International Conference on Machine Learning, ICML 2020. ed. / Hal Daume; Aarti Singh. International Machine Learning Society (IMLS), 2020. p. 957-968 (37th International Conference on Machine Learning, ICML 2020; Vol. PartF168147-2).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Deep coordination graphs

AU - Böhmer, Wendelin

AU - Kurin, Vitaly

AU - Whiteson, Shimon

PY - 2020

Y1 - 2020

N2 - This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible tradeoff between representational capacity and generalization by factoring the joint value function of all agents according to a coordination graph into payoffs between pairs of agents. The value can be maximized by local message passing along the graph, which allows training of the value function end-to-end with Q-learning. Payoff functions are approximated with deep neural networks that employ parameter sharing and low-rank approximations to significantly improve sample efficiency. We show that DCG can solve predatorprey tasks that highlight the relative overgeneralization pathology, as well as challenging StarCraft II micromanagement tasks.

AB - This paper introduces the deep coordination graph (DCG) for collaborative multi-agent reinforcement learning. DCG strikes a flexible tradeoff between representational capacity and generalization by factoring the joint value function of all agents according to a coordination graph into payoffs between pairs of agents. The value can be maximized by local message passing along the graph, which allows training of the value function end-to-end with Q-learning. Payoff functions are approximated with deep neural networks that employ parameter sharing and low-rank approximations to significantly improve sample efficiency. We show that DCG can solve predatorprey tasks that highlight the relative overgeneralization pathology, as well as challenging StarCraft II micromanagement tasks.

UR - http://www.scopus.com/inward/record.url?scp=85098424450&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85098424450

T3 - 37th International Conference on Machine Learning, ICML 2020

SP - 957

EP - 968

BT - 37th International Conference on Machine Learning, ICML 2020

A2 - Daume, Hal

A2 - Singh, Aarti

PB - International Machine Learning Society (IMLS)

T2 - 37th International Conference on Machine Learning, ICML 2020

Y2 - 12 July 2020 through 18 July 2020

ER -

Deep coordination graphs

Abstract

Publication series

Conference

Other files and links

Fingerprint

Cite this