Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Search results

  • 2023

    Safe Online and Offline Reinforcement Learning

    Simão, T. D., 2023, 128 p.

    Research output: ThesisDissertation (TU Delft)

    Open Access
    File
    189 Downloads (Pure)
  • 2020

    Safe Policy Improvement with an Estimated Baseline Policy

    Simão, T. D., Laroche, R. & Tachet des Combes, R., 2020, Proceedings of the 19th International Conference on Autonomous Agents and MultiAgent Systems. Richland, SC, p. 1269–1277 9 p. (AAMAS '20).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    Open Access
    File
    21 Downloads (Pure)
  • 2019

    Safe Policy Improvement with Baseline Bootstrapping in Factored Environments

    Simão, T. D. & Spaan, M. T. J., 2019, 33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019. American Association for Artificial Intelligence (AAAI), p. 4967-4974 8 p. (33rd AAAI Conference on Artificial Intelligence, AAAI 2019, 31st Innovative Applications of Artificial Intelligence Conference, IAAI 2019 and the 9th AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019).

    Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

    16 Citations (Scopus)