If you made any changes in Pure these will be visible here soon.

Research Output

  • 3 Conference contribution
2020

Generalized Optimistic Q-Learning with Provable Efficiency

Neustroev, G. & de Weerdt, M., May 2020, Proceedings of AAMAS'20. An, B., Yorke-Smith, N., El Fallah Seghrouchni, A. & Sukthankar, G. (eds.). International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), p. 913-921 9 p.

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Open Access

Interval Q-Learning: Balancing Deep and Wide Exploration

Neustroev, G., Ponnambalam, C., de Weerdt, M. & Spaan, M., 2020, Adaptive and Learning Agents Workshop. 7 p.

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Open Access
File
2 Downloads (Pure)
2019

Discovery of Optimal Solution Horizons in Non-Stationary Markov Decision Processes with Unbounded Rewards

Neustroev, G., de Weerdt, M. & Verzijlbergh, R., 2019, Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling. Benton, J., Lipovetzky, N., Onaindia, E., Smith, D. E. & Srivastava, S. (eds.). Association for the Advancement of Artificial Intelligence (AAAI), Vol. 29. p. 292-300 9 p.

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

Open Access
File
21 Downloads (Pure)