A survey of actor-critic reinforcement learning: standard and natural policy gradients

I Grondman, IL Busoniu, GAD Lopes, R Babuska

Research output: Contribution to journalArticleScientificpeer-review

587 Citations (Scopus)
Original languageEnglish
Pages (from-to)1291-1307
Number of pages17
JournalIEEE Transactions on Systems, Man and Cybernetics, Part C: Applications and Reviews
Volume42
Issue number6
DOIs
Publication statusPublished - 2012

Cite this