Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback

Carlos Celemin; Jens Kober

doi:10.1007/s00521-022-08118-z

Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback

Carlos Celemin^*, Jens Kober

^*Corresponding author for this work

Learning & Autonomous Control

Research output: Contribution to journal › Article › Scientific › peer-review

6 Citations (Scopus)

16 Downloads (Pure)

Abstract

In order to deploy robots that could be adapted by non-expert users, interactive imitation learning (IIL) methods must be flexible regarding the interaction preferences of the teacher and avoid assumptions of perfect teachers (oracles), while considering they make mistakes influenced by diverse human factors. In this work, we propose an IIL method that improves the human–robot interaction for non-expert and imperfect teachers in two directions. First, uncertainty estimation is included to endow the agents with a lack of knowledge awareness (epistemic uncertainty) and demonstration ambiguity awareness (aleatoric uncertainty), such that the robot can request human input when it is deemed more necessary. Second, the proposed method enables the teachers to train with the flexibility of using corrective demonstrations, evaluative reinforcements, and implicit positive feedback. The experimental results show an improvement in learning convergence with respect to other learning methods when the agent learns from highly ambiguous teachers. Additionally, in a user study, it was found that the components of the proposed method improve the teaching experience and the data efficiency of the learning process.

Original language	English
Pages (from-to)	16821-16839
Journal	Neural Computing and Applications
Volume	35
Issue number	23
DOIs	https://doi.org/10.1007/s00521-022-08118-z
Publication status	Published - 2023

Keywords

Active learning
Corrective demonstrations
Human reinforcement
Interactive imitation learning
Uncertainty

Access to Document

10.1007/s00521-022-08118-z

s00521-022-08118-zFinal published version, 1.33 MBLicence: CC BY

Cite this

@article{010c1cec38094a4bb971bc499462763d,

title = "Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback",

abstract = "In order to deploy robots that could be adapted by non-expert users, interactive imitation learning (IIL) methods must be flexible regarding the interaction preferences of the teacher and avoid assumptions of perfect teachers (oracles), while considering they make mistakes influenced by diverse human factors. In this work, we propose an IIL method that improves the human–robot interaction for non-expert and imperfect teachers in two directions. First, uncertainty estimation is included to endow the agents with a lack of knowledge awareness (epistemic uncertainty) and demonstration ambiguity awareness (aleatoric uncertainty), such that the robot can request human input when it is deemed more necessary. Second, the proposed method enables the teachers to train with the flexibility of using corrective demonstrations, evaluative reinforcements, and implicit positive feedback. The experimental results show an improvement in learning convergence with respect to other learning methods when the agent learns from highly ambiguous teachers. Additionally, in a user study, it was found that the components of the proposed method improve the teaching experience and the data efficiency of the learning process.",

keywords = "Active learning, Corrective demonstrations, Human reinforcement, Interactive imitation learning, Uncertainty",

author = "Carlos Celemin and Jens Kober",

year = "2023",

doi = "10.1007/s00521-022-08118-z",

language = "English",

volume = "35",

pages = "16821--16839",

journal = "Neural Computing and Applications",

issn = "0941-0643",

publisher = "Springer",

number = "23",

}

TY - JOUR

T1 - Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback

AU - Celemin, Carlos

AU - Kober, Jens

PY - 2023

Y1 - 2023

N2 - In order to deploy robots that could be adapted by non-expert users, interactive imitation learning (IIL) methods must be flexible regarding the interaction preferences of the teacher and avoid assumptions of perfect teachers (oracles), while considering they make mistakes influenced by diverse human factors. In this work, we propose an IIL method that improves the human–robot interaction for non-expert and imperfect teachers in two directions. First, uncertainty estimation is included to endow the agents with a lack of knowledge awareness (epistemic uncertainty) and demonstration ambiguity awareness (aleatoric uncertainty), such that the robot can request human input when it is deemed more necessary. Second, the proposed method enables the teachers to train with the flexibility of using corrective demonstrations, evaluative reinforcements, and implicit positive feedback. The experimental results show an improvement in learning convergence with respect to other learning methods when the agent learns from highly ambiguous teachers. Additionally, in a user study, it was found that the components of the proposed method improve the teaching experience and the data efficiency of the learning process.

AB - In order to deploy robots that could be adapted by non-expert users, interactive imitation learning (IIL) methods must be flexible regarding the interaction preferences of the teacher and avoid assumptions of perfect teachers (oracles), while considering they make mistakes influenced by diverse human factors. In this work, we propose an IIL method that improves the human–robot interaction for non-expert and imperfect teachers in two directions. First, uncertainty estimation is included to endow the agents with a lack of knowledge awareness (epistemic uncertainty) and demonstration ambiguity awareness (aleatoric uncertainty), such that the robot can request human input when it is deemed more necessary. Second, the proposed method enables the teachers to train with the flexibility of using corrective demonstrations, evaluative reinforcements, and implicit positive feedback. The experimental results show an improvement in learning convergence with respect to other learning methods when the agent learns from highly ambiguous teachers. Additionally, in a user study, it was found that the components of the proposed method improve the teaching experience and the data efficiency of the learning process.

KW - Active learning

KW - Corrective demonstrations

KW - Human reinforcement

KW - Interactive imitation learning

KW - Uncertainty

UR - http://www.scopus.com/inward/record.url?scp=85146280338&partnerID=8YFLogxK

U2 - 10.1007/s00521-022-08118-z

DO - 10.1007/s00521-022-08118-z

M3 - Article

AN - SCOPUS:85146280338

SN - 0941-0643

VL - 35

SP - 16821

EP - 16839

JO - Neural Computing and Applications

JF - Neural Computing and Applications

IS - 23

ER -

Knowledge- and ambiguity-aware robot learning from corrective and evaluative feedback

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this