Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal; Christian A. Schroeder de Witt; Bei Peng; Wendelin Böhmer; Shimon Whiteson; Fei Sha

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal, Christian A. Schroeder de Witt, Bei Peng, Wendelin Böhmer, Shimon Whiteson, Fei Sha

Algorithmics

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

38 Downloads (Pure)

Abstract

Real world multi-agent tasks often involve varying types and quantities of agents and non-agent entities; however, agents within these tasks rarely need to consider all others at all times in order to act effectively. Factored value function approaches have historically leveraged such independences to improve learning efficiency, but these approaches typically rely on domain knowledge to select fixed subsets of state features to include in each factor. We propose to utilize value function factoring with random subsets of entities in each factor as an auxiliary objective in order to disentangle value predictions from irrelevant entities. This factoring approach is instantiated through a simple attention mechanism masking procedure. We hypothesize that such an approach helps agents learn more effectively in multi-agent settings by discovering common trajectories across episodes within sub-groups of agents/entities. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging StarCraft micromanagement tasks.

Original language	English
Title of host publication	Proceedings of the 37th International Conference on Machine Learning, ICML 2020
Editors	Marina Meila, Tong Zhang
Pages	4596-4606
Number of pages	11
Volume	139
Publication status	Published - 2021
Event	International Conference on Machine Learning: 2021 - Duration: 18 Jul 2021 → 24 Jul 2021 Conference number: 38th https://icml.cc/Conferences/2021

Conference

Conference	International Conference on Machine Learning
Abbreviated title	ICML
Period	18/07/21 → 24/07/21
Internet address	https://icml.cc/Conferences/2021

Access to Document

iqbal21a(1)Final published version, 2.74 MB

Cite this

@inproceedings{18584fec0d9a4865929fdd493b4b9e30,

title = "Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning",

abstract = "Real world multi-agent tasks often involve varying types and quantities of agents and non-agent entities; however, agents within these tasks rarely need to consider all others at all times in order to act effectively. Factored value function approaches have historically leveraged such independences to improve learning efficiency, but these approaches typically rely on domain knowledge to select fixed subsets of state features to include in each factor. We propose to utilize value function factoring with random subsets of entities in each factor as an auxiliary objective in order to disentangle value predictions from irrelevant entities. This factoring approach is instantiated through a simple attention mechanism masking procedure. We hypothesize that such an approach helps agents learn more effectively in multi-agent settings by discovering common trajectories across episodes within sub-groups of agents/entities. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging StarCraft micromanagement tasks.",

author = "Shariq Iqbal and Witt, {Christian A. Schroeder de} and Bei Peng and Wendelin B{\"o}hmer and Shimon Whiteson and Fei Sha",

year = "2021",

language = "English",

volume = "139",

pages = "4596--4606",

editor = "Meila, {Marina } and Zhang, {Tong }",

booktitle = "Proceedings of the 37th International Conference on Machine Learning, ICML 2020",

note = "International Conference on Machine Learning : 2021, ICML ; Conference date: 18-07-2021 Through 24-07-2021",

url = "https://icml.cc/Conferences/2021",

}

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning. / Iqbal, Shariq; Witt, Christian A. Schroeder de; Peng, Bei et al.
Proceedings of the 37th International Conference on Machine Learning, ICML 2020. ed. / Marina Meila; Tong Zhang. Vol. 139 2021. p. 4596-4606.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

AU - Iqbal, Shariq

AU - Witt, Christian A. Schroeder de

AU - Peng, Bei

AU - Böhmer, Wendelin

AU - Whiteson, Shimon

AU - Sha, Fei

N1 - Conference code: 38th

PY - 2021

Y1 - 2021

N2 - Real world multi-agent tasks often involve varying types and quantities of agents and non-agent entities; however, agents within these tasks rarely need to consider all others at all times in order to act effectively. Factored value function approaches have historically leveraged such independences to improve learning efficiency, but these approaches typically rely on domain knowledge to select fixed subsets of state features to include in each factor. We propose to utilize value function factoring with random subsets of entities in each factor as an auxiliary objective in order to disentangle value predictions from irrelevant entities. This factoring approach is instantiated through a simple attention mechanism masking procedure. We hypothesize that such an approach helps agents learn more effectively in multi-agent settings by discovering common trajectories across episodes within sub-groups of agents/entities. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging StarCraft micromanagement tasks.

AB - Real world multi-agent tasks often involve varying types and quantities of agents and non-agent entities; however, agents within these tasks rarely need to consider all others at all times in order to act effectively. Factored value function approaches have historically leveraged such independences to improve learning efficiency, but these approaches typically rely on domain knowledge to select fixed subsets of state features to include in each factor. We propose to utilize value function factoring with random subsets of entities in each factor as an auxiliary objective in order to disentangle value predictions from irrelevant entities. This factoring approach is instantiated through a simple attention mechanism masking procedure. We hypothesize that such an approach helps agents learn more effectively in multi-agent settings by discovering common trajectories across episodes within sub-groups of agents/entities. Our approach, Randomized Entity-wise Factorization for Imagined Learning (REFIL), outperforms all strong baselines by a significant margin in challenging StarCraft micromanagement tasks.

UR - https://proceedings.mlr.press/v139/

M3 - Conference contribution

VL - 139

SP - 4596

EP - 4606

BT - Proceedings of the 37th International Conference on Machine Learning, ICML 2020

A2 - Meila, Marina

A2 - Zhang, Tong

T2 - International Conference on Machine Learning

Y2 - 18 July 2021 through 24 July 2021

ER -

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Abstract

Conference

Access to Document

Other files and links

Fingerprint

Cite this