Facial feedback for reinforcement learning: a case study and offline analysis using the TAMER framework

Guangliang Li; Hamdi Dibeklioğlu; Shimon Whiteson; Hayley Hung

doi:10.1007/s10458-020-09447-w

Corresponding author: Guangliang Li

Pattern Recognition and Bioinformatics

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative feedback provided by a human user. Previous research showed that humans give copious feedback early in training but very sparsely thereafter. In this article, we investigate the potential of agent learning from trainers’ facial expressions by interpreting them as evaluative feedback. To do so, we implemented TAMER, a popular interactive reinforcement learning method, in a reinforcement-learning benchmark problem, Infinite Mario, and conducted the first large-scale study of TAMER, involving 561 participants. Using a CNN–RNN model that we designed, our analysis shows that telling trainers to use facial expressions, and introducing competition, can improve the accuracy of estimating positive and negative feedback from facial expressions. In addition, our results from a simulation experiment show that learning solely from feedback predicted from facial expressions is possible and that, with strong/effective prediction models or a regression method, facial responses would significantly improve the performance of agents. Furthermore, our experiment supports previous studies demonstrating the importance of bi-directional feedback and competitive elements in the training interface.

Original language	English
Article number	22
Number of pages	29
Journal	Autonomous Agents and Multi-Agent Systems
Volume	34
Issue number	1
DOIs	https://doi.org/10.1007/s10458-020-09447-w
Publication status	Published - 2020

Keywords

Facial expressions
Human agent interaction
Interactive reinforcement learning
Reinforcement learning

Access to Document

10.1007/s10458-020-09447-w

Li2020_Article_FacialFeedbackForReinforcement (Final published version, 1.75 MB; Licence: CC BY)

Cite this

@article{c28ce47a70644c7fae37721d24e62eda,
title = "Facial feedback for reinforcement learning: a case study and offline analysis using the TAMER framework",

abstract = "Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative feedback provided by a human user. Previous research showed that humans give copious feedback early in training but very sparsely thereafter. In this article, we investigate the potential of agent learning from trainers{\textquoteright} facial expressions via interpreting them as evaluative feedback. To do so, we implemented TAMER which is a popular interactive reinforcement learning method in a reinforcement-learning benchmark problem—Infinite Mario, and conducted the first large-scale study of TAMER involving 561 participants. With designed CNN–RNN model, our analysis shows that telling trainers to use facial expressions and competition can improve the accuracies for estimating positive and negative feedback using facial expressions. In addition, our results with a simulation experiment show that learning solely from predicted feedback based on facial expressions is possible and using strong/effective prediction models or a regression method, facial responses would significantly improve the performance of agents. Furthermore, our experiment supports previous studies demonstrating the importance of bi-directional feedback and competitive elements in the training interface.",

keywords = "Facial expressions, Human agent interaction, Interactive reinforcement learning, Reinforcement learning",
author = "Guangliang Li and Hamdi Dibeklioğlu and Shimon Whiteson and Hayley Hung",
year = "2020",
doi = "10.1007/s10458-020-09447-w",
language = "English",
volume = "34",
journal = "Autonomous Agents and Multi-Agent Systems",
issn = "1387-2532",
publisher = "Springer",
number = "1",
}

TY - JOUR
T1 - Facial feedback for reinforcement learning
T2 - a case study and offline analysis using the TAMER framework
AU - Li, Guangliang
AU - Dibeklioğlu, Hamdi
AU - Whiteson, Shimon
AU - Hung, Hayley
PY - 2020
Y1 - 2020

N2 - Interactive reinforcement learning provides a way for agents to learn to solve tasks from evaluative feedback provided by a human user. Previous research showed that humans give copious feedback early in training but very sparsely thereafter. In this article, we investigate the potential of agent learning from trainers’ facial expressions via interpreting them as evaluative feedback. To do so, we implemented TAMER which is a popular interactive reinforcement learning method in a reinforcement-learning benchmark problem—Infinite Mario, and conducted the first large-scale study of TAMER involving 561 participants. With designed CNN–RNN model, our analysis shows that telling trainers to use facial expressions and competition can improve the accuracies for estimating positive and negative feedback using facial expressions. In addition, our results with a simulation experiment show that learning solely from predicted feedback based on facial expressions is possible and using strong/effective prediction models or a regression method, facial responses would significantly improve the performance of agents. Furthermore, our experiment supports previous studies demonstrating the importance of bi-directional feedback and competitive elements in the training interface.

KW - Facial expressions
KW - Human agent interaction
KW - Interactive reinforcement learning
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85079570004&partnerID=8YFLogxK
U2 - 10.1007/s10458-020-09447-w
DO - 10.1007/s10458-020-09447-w
M3 - Article
AN - SCOPUS:85079570004
SN - 1387-2532
VL - 34
JO - Autonomous Agents and Multi-Agent Systems
JF - Autonomous Agents and Multi-Agent Systems
IS - 1
M1 - 22
ER -
