Learning to Play Trajectory Games Against Opponents with Unknown Objectives

Xinjie Liu; Lasse Peters; Javier Alonso-Mora

doi:10.1109/LRA.2023.3280809

Learning to Play Trajectory Games Against Opponents with Unknown Objectives

Xinjie Liu, Lasse Peters, Javier Alonso-Mora

Learning & Autonomous Control

Research output: Contribution to journal › Article › Scientific › peer-review

2 Citations (Scopus)

32 Downloads (Pure)

Abstract

Many autonomous agents, such as intelligent vehicles, are inherently required to interact with one another. Game theory provides a natural mathematical tool for robot motion planning in such interactive settings. However, tractable algorithms for such problems usually rely on a strong assumption, namely that the objectives of all players in the scene are known. To make such tools applicable for ego-centric planning with only local information, we propose an adaptive model-predictive game solver, which jointly infers other players' objectives online and computes a corresponding generalized Nash equilibrium (GNE) strategy. The adaptivity of our approach is enabled by a differentiable trajectory game solver whose gradient signal is used for maximum likelihood estimation (MLE) of opponents' objectives. This differentiability of our pipeline facilitates direct integration with other differentiable elements, such as neural networks (NNs). Furthermore, in contrast to existing solvers for cost inference in games, our method handles not only partial state observations but also general inequality constraints. In two simulated traffic scenarios, we find superior performance of our approach over both existing game-theoretic methods and non-game-theoretic model-predictive control (MPC) approaches. We also demonstrate our approach's real-time planning capabilities and robustness in two-player hardware experiments.

Original language	English
Pages (from-to)	4139-4146
Journal	IEEE Robotics and Automation Letters
Volume	8
Issue number	7
DOIs	https://doi.org/10.1109/LRA.2023.3280809
Publication status	Published - 2023

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Collision avoidance
Games
human-aware motion planning
integrated planning and learning
Maximum likelihood estimation
multi-robot systems
Optimization
Planning
Robots
Trajectory
Trajectory games

Access to Document

10.1109/LRA.2023.3280809

Learning_to_Play_Trajectory_Games_Against_Opponents_With_Unknown_ObjectivesFinal published version, 2.79 MB

Cite this

@article{fafde1eca4194ad7817ed8a4e384ca6c,

title = "Learning to Play Trajectory Games Against Opponents with Unknown Objectives",

abstract = "Many autonomous agents, such as intelligent vehicles, are inherently required to interact with one another. Game theory provides a natural mathematical tool for robot motion planning in such interactive settings. However, tractable algorithms for such problems usually rely on a strong assumption, namely that the objectives of all players in the scene are known. To make such tools applicable for ego-centric planning with only local information, we propose an adaptive model-predictive game solver, which jointly infers other players' objectives online and computes a corresponding generalized Nash equilibrium (GNE) strategy. The adaptivity of our approach is enabled by a differentiable trajectory game solver whose gradient signal is used for maximum likelihood estimation (MLE) of opponents' objectives. This differentiability of our pipeline facilitates direct integration with other differentiable elements, such as neural networks (NNs). Furthermore, in contrast to existing solvers for cost inference in games, our method handles not only partial state observations but also general inequality constraints. In two simulated traffic scenarios, we find superior performance of our approach over both existing game-theoretic methods and non-game-theoretic model-predictive control (MPC) approaches. We also demonstrate our approach's real-time planning capabilities and robustness in two-player hardware experiments.",

keywords = "Collision avoidance, Games, human-aware motion planning, integrated planning and learning, Maximum likelihood estimation, multi-robot systems, Optimization, Planning, Robots, Trajectory, Trajectory games",

author = "Xinjie Liu and Lasse Peters and Javier Alonso-Mora",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2023",

doi = "10.1109/LRA.2023.3280809",

language = "English",

volume = "8",

pages = "4139--4146",

journal = "IEEE Robotics and Automation Letters",

issn = "2377-3766",

publisher = "Institute of Electrical and Electronics Engineers (IEEE)",

number = "7",

}

TY - JOUR

T1 - Learning to Play Trajectory Games Against Opponents with Unknown Objectives

AU - Liu, Xinjie

AU - Peters, Lasse

AU - Alonso-Mora, Javier

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2023

Y1 - 2023

N2 - Many autonomous agents, such as intelligent vehicles, are inherently required to interact with one another. Game theory provides a natural mathematical tool for robot motion planning in such interactive settings. However, tractable algorithms for such problems usually rely on a strong assumption, namely that the objectives of all players in the scene are known. To make such tools applicable for ego-centric planning with only local information, we propose an adaptive model-predictive game solver, which jointly infers other players' objectives online and computes a corresponding generalized Nash equilibrium (GNE) strategy. The adaptivity of our approach is enabled by a differentiable trajectory game solver whose gradient signal is used for maximum likelihood estimation (MLE) of opponents' objectives. This differentiability of our pipeline facilitates direct integration with other differentiable elements, such as neural networks (NNs). Furthermore, in contrast to existing solvers for cost inference in games, our method handles not only partial state observations but also general inequality constraints. In two simulated traffic scenarios, we find superior performance of our approach over both existing game-theoretic methods and non-game-theoretic model-predictive control (MPC) approaches. We also demonstrate our approach's real-time planning capabilities and robustness in two-player hardware experiments.

AB - Many autonomous agents, such as intelligent vehicles, are inherently required to interact with one another. Game theory provides a natural mathematical tool for robot motion planning in such interactive settings. However, tractable algorithms for such problems usually rely on a strong assumption, namely that the objectives of all players in the scene are known. To make such tools applicable for ego-centric planning with only local information, we propose an adaptive model-predictive game solver, which jointly infers other players' objectives online and computes a corresponding generalized Nash equilibrium (GNE) strategy. The adaptivity of our approach is enabled by a differentiable trajectory game solver whose gradient signal is used for maximum likelihood estimation (MLE) of opponents' objectives. This differentiability of our pipeline facilitates direct integration with other differentiable elements, such as neural networks (NNs). Furthermore, in contrast to existing solvers for cost inference in games, our method handles not only partial state observations but also general inequality constraints. In two simulated traffic scenarios, we find superior performance of our approach over both existing game-theoretic methods and non-game-theoretic model-predictive control (MPC) approaches. We also demonstrate our approach's real-time planning capabilities and robustness in two-player hardware experiments.

KW - Collision avoidance

KW - Games

KW - human-aware motion planning

KW - integrated planning and learning

KW - Maximum likelihood estimation

KW - multi-robot systems

KW - Optimization

KW - Planning

KW - Robots

KW - Trajectory

KW - Trajectory games

UR - http://www.scopus.com/inward/record.url?scp=85161017236&partnerID=8YFLogxK

U2 - 10.1109/LRA.2023.3280809

DO - 10.1109/LRA.2023.3280809

M3 - Article

AN - SCOPUS:85161017236

SN - 2377-3766

VL - 8

SP - 4139

EP - 4146

JO - IEEE Robotics and Automation Letters

JF - IEEE Robotics and Automation Letters

IS - 7

ER -

Learning to Play Trajectory Games Against Opponents with Unknown Objectives

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this