Safe Policy Improvement with an Estimated Baseline Policy

Thiago D. Simão, Romain Laroche, Rémi Tachet des Combes

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

24 Downloads (Pure)

Fingerprint

Dive into the research topics of 'Safe Policy Improvement with an Estimated Baseline Policy'. Together they form a unique fingerprint.

INIS

Computer Science