Learning What to Attend to: Using Bisimulation Metrics to Explore and Improve Upon What a Deep Reinforcement Learning Agent Learns

N. Albers, M. Suau de Castro, F.A. Oliehoek

Research output: Contribution to conference › Abstract › Scientific


Abstract

Recent years have seen a surge of algorithms and architectures for deep Reinforcement Learning (RL), many of which have shown remarkable success for various problems. Yet, little work has attempted to relate the performance of these algorithms and architectures to what the resulting deep RL agents actually learn, and whether this corresponds to what they should ideally learn. Such a comparison may allow for both an improved understanding of why certain algorithms or network architectures perform better than others and the development of methods that specifically address discrepancies between what is and what should be learned.
Original language: English
Number of pages: 3
Publication status: Published - 2020
Event: BNAIC/BENELEARN 2020 - Leiden, Netherlands
Duration: 19 Nov 2020 to 20 Nov 2020

Conference

Conference: BNAIC/BENELEARN 2020
Country/Territory: Netherlands
City: Leiden
Period: 19/11/20 to 20/11/20

Keywords

  • Deep Reinforcement Learning
  • Representation Learning
  • Bisimulation Metrics
  • Markovianity
