Aligning Human Preferences with Baseline Objectives in Reinforcement Learning

Daniel Marta, Simon Holk, Christian Pek, Jana Tumova, Iolanda Leite

Research output: Contribution to conference > Paper > peer-review
