TY - JOUR
T1 - Deep Reinforcement Learning Versus Evolution Strategies
T2 - A Comparative Survey
AU - Majid, Amjad Yousef
AU - Saaybi, Serge
AU - Francois-Lavet, Vincent
AU - Venkatesha Prasad, Ranga
AU - Verhoeven, Chris
PY - 2023
Y1 - 2023
N2 - Deep reinforcement learning (DRL) and evolution strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist. To get insights into the strengths and weaknesses of DRL versus ESs, an analysis of their respective capabilities and limitations is provided. After presenting their fundamental concepts and algorithms, a comparison is provided on key aspects, such as scalability, exploration, adaptation to dynamic environments, and multiagent learning. Current research challenges are also discussed, including sample efficiency, exploration versus exploitation, dealing with sparse rewards, and learning to plan. Then, the benefits of hybrid algorithms that combine DRL and ESs are highlighted.
AB - Deep reinforcement learning (DRL) and evolution strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist. To get insights into the strengths and weaknesses of DRL versus ESs, an analysis of their respective capabilities and limitations is provided. After presenting their fundamental concepts and algorithms, a comparison is provided on key aspects, such as scalability, exploration, adaptation to dynamic environments, and multiagent learning. Current research challenges are also discussed, including sample efficiency, exploration versus exploitation, dealing with sparse rewards, and learning to plan. Then, the benefits of hybrid algorithms that combine DRL and ESs are highlighted.
KW - Deep learning
KW - Deep reinforcement learning (DRL)
KW - Evolution (biology)
KW - evolution strategies (ESs)
KW - exploration
KW - Games
KW - meta-learning
KW - multiagent
KW - Optimization
KW - parallelism
KW - Q-learning
KW - Robots
KW - Scalability
UR - http://www.scopus.com/inward/record.url?scp=85159827870&partnerID=8YFLogxK
U2 - 10.1109/TNNLS.2023.3264540
DO - 10.1109/TNNLS.2023.3264540
M3 - Article
AN - SCOPUS:85159827870
SN - 2162-237X
SP - 1
EP - 19
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
ER -