Structure Learning for Safe Policy Improvement

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

8 Citations (Scopus)

2023
Safe Online and Offline Reinforcement Learning
Simão, T. D., 2023, 128 p.
Research output: Thesis › Dissertation (TU Delft)

Open Access
200 Downloads (Pure)
2020
Safe Policy Improvement with an Estimated Baseline Policy
Simão, T. D., Laroche, R. & Tachet des Combes, R., 2020, Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems. Richland, SC, p. 1269–1277 9 p. (AAMAS '20).
Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Open Access
24 Downloads (Pure)
2019
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
Simão, T. D. & Spaan, M. T. J., 2019, 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019. American Association for Artificial Intelligence (AAAI), p. 4967–4974 8 p.
Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review
19 Citations (Scopus)
