A survey of actor-critic reinforcement learning: standard and natural policy gradients

I Grondman, IL Busoniu, GAD Lopes, R Babuska

Research output: Contribution to journal › Article › Scientific › peer-review

676 Citations (Scopus)
Original language: English
Pages (from-to): 1291-1307
Number of pages: 17
Journal: IEEE Transactions on Systems, Man and Cybernetics, Part C: Applications and Reviews
Issue number: 6
Publication status: Published - 2012
