Non-deterministic policy improvement stabilizes approximated reinforcement learning

Wendelin Böhmer, Rong Guo, Klaus Obermayer

Research output: Other contributionScientific

Original languageEnglish
Publication statusPublished - 2016
Externally publishedYes

Cite this