Decision-theoretic planning under uncertainty with information rewards for active cooperative perception

Matthijs T.J. Spaan; Tiago S. Veiga; Pedro U. Lima

doi:10.1007/s10458-014-9279-8

Corresponding author: Matthijs T.J. Spaan

Algorithmics

Research output: Contribution to journal › Article › Scientific › peer-review

49 Citations (Scopus)

Abstract

Partially observable Markov decision processes (POMDPs) provide a principled framework for modeling an agent’s decision-making problem when the agent needs to consider noisy state estimates. POMDP policies take into account an action’s influence on the environment as well as the potential information gain. This is a crucial feature for robotic agents which generally have to consider the effect of actions on sensing. However, building POMDP models which reward information gain directly is not straightforward, but is important in domains such as robot-assisted surveillance in which the value of information is hard to quantify. Common techniques for uncertainty reduction such as expected entropy minimization lead to non-standard POMDPs that are hard to solve. We present the POMDP with Information Rewards (POMDP-IR) modeling framework, which rewards an agent for reaching a certain level of belief regarding a state feature. By remaining in the standard POMDP setting we can exploit many known results as well as successful approximate algorithms. We demonstrate our ideas in a toy problem as well as in real robot-assisted surveillance, showcasing their use for active cooperative perception scenarios. Finally, our experiments show that the POMDP-IR framework compares favorably with a related approach on benchmark domains.
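
The idea of rewarding an agent for reaching a certain level of belief about a state feature can be pictured with a small numerical sketch. The code below is an illustration under stated assumptions, not the formulation used in the article: it assumes a single binary state feature, a noisy binary sensor, and a hypothetical "commit" action that pays `r_correct` when the feature is actually true and costs `r_incorrect` otherwise. All function names and numeric values are made up for the example. Under these assumptions, committing has positive expected reward only once the belief exceeds `r_incorrect / (r_correct + r_incorrect)`.

```python
def belief_update(belief, observed_true, hit_rate=0.8, false_alarm=0.2):
    """Bayes update of P(feature is true) after one noisy binary observation.

    hit_rate and false_alarm are assumed sensor parameters (illustrative values).
    """
    like_true = hit_rate if observed_true else 1.0 - hit_rate
    like_false = false_alarm if observed_true else 1.0 - false_alarm
    numerator = like_true * belief
    return numerator / (numerator + like_false * (1.0 - belief))


def expected_commit_reward(belief, r_correct=1.0, r_incorrect=3.0):
    """Expected reward of the hypothetical commit action under the current belief."""
    return belief * r_correct - (1.0 - belief) * r_incorrect


def commit_threshold(r_correct=1.0, r_incorrect=3.0):
    """Belief level above which committing has positive expected reward."""
    return r_incorrect / (r_correct + r_incorrect)


# Start from a uniform belief and receive three positive observations in a row.
belief = 0.5
history = [belief]
for _ in range(3):
    belief = belief_update(belief, observed_true=True)
    history.append(belief)

for step, b in enumerate(history):
    print(step, round(b, 3), round(expected_commit_reward(b), 3))
```

Because the reward depends only on the state and the chosen action, its expectation is linear in the belief, which is what allows a model of this kind to stay within the standard POMDP setting that the abstract refers to.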

Original language	English
Pages (from-to)	1157-1185
Number of pages	29
Journal	Autonomous Agents and Multi-Agent Systems
Volume	29
Issue number	6
DOIs	https://doi.org/10.1007/s10458-014-9279-8
Publication status	Published - 2015

Keywords

Active cooperative perception
Partially observable Markov decision processes
Planning under uncertainty for robots

Cite this

@article{1b6a0295119342e6b1a9ef0a44da0668,
  title = "Decision-theoretic planning under uncertainty with information rewards for active cooperative perception",
  abstract = "Partially observable Markov decision processes (POMDPs) provide a principled framework for modeling an agent{\textquoteright}s decision-making problem when the agent needs to consider noisy state estimates. POMDP policies take into account an action{\textquoteright}s influence on the environment as well as the potential information gain. This is a crucial feature for robotic agents which generally have to consider the effect of actions on sensing. However, building POMDP models which reward information gain directly is not straightforward, but is important in domains such as robot-assisted surveillance in which the value of information is hard to quantify. Common techniques for uncertainty reduction such as expected entropy minimization lead to non-standard POMDPs that are hard to solve. We present the POMDP with Information Rewards (POMDP-IR) modeling framework, which rewards an agent for reaching a certain level of belief regarding a state feature. By remaining in the standard POMDP setting we can exploit many known results as well as successful approximate algorithms. We demonstrate our ideas in a toy problem as well as in real robot-assisted surveillance, showcasing their use for active cooperative perception scenarios. Finally, our experiments show that the POMDP-IR framework compares favorably with a related approach on benchmark domains.",
  keywords = "Active cooperative perception, Partially observable Markov decision processes, Planning under uncertainty for robots",
  author = "Spaan, {Matthijs T.J.} and Veiga, {Tiago S.} and Lima, {Pedro U.}",
  year = "2015",
  doi = "10.1007/s10458-014-9279-8",
  language = "English",
  volume = "29",
  pages = "1157--1185",
  journal = "Autonomous Agents and Multi-Agent Systems",
  issn = "1387-2532",
  publisher = "Springer",
  number = "6",
}

TY - JOUR
T1 - Decision-theoretic planning under uncertainty with information rewards for active cooperative perception
AU - Spaan, Matthijs T.J.
AU - Veiga, Tiago S.
AU - Lima, Pedro U.
PY - 2015
Y1 - 2015

AB - Partially observable Markov decision processes (POMDPs) provide a principled framework for modeling an agent’s decision-making problem when the agent needs to consider noisy state estimates. POMDP policies take into account an action’s influence on the environment as well as the potential information gain. This is a crucial feature for robotic agents which generally have to consider the effect of actions on sensing. However, building POMDP models which reward information gain directly is not straightforward, but is important in domains such as robot-assisted surveillance in which the value of information is hard to quantify. Common techniques for uncertainty reduction such as expected entropy minimization lead to non-standard POMDPs that are hard to solve. We present the POMDP with Information Rewards (POMDP-IR) modeling framework, which rewards an agent for reaching a certain level of belief regarding a state feature. By remaining in the standard POMDP setting we can exploit many known results as well as successful approximate algorithms. We demonstrate our ideas in a toy problem as well as in real robot-assisted surveillance, showcasing their use for active cooperative perception scenarios. Finally, our experiments show that the POMDP-IR framework compares favorably with a related approach on benchmark domains.

KW - Active cooperative perception
KW - Partially observable Markov decision processes
KW - Planning under uncertainty for robots
UR - http://www.scopus.com/inward/record.url?scp=84942504056&partnerID=8YFLogxK
U2 - 10.1007/s10458-014-9279-8
DO - 10.1007/s10458-014-9279-8
M3 - Article
AN - SCOPUS:84942504056
SN - 1387-2532
VL - 29
SP - 1157
EP - 1185
JO - Autonomous Agents and Multi-Agent Systems
JF - Autonomous Agents and Multi-Agent Systems
IS - 6
ER -
