Imitation Learning with Inconsistent Demonstrations through Uncertainty-based Data Manipulation

Peter Valletta; Rodrigo Pérez-Dattari; Jens Kober

doi:10.1109/ICRA48506.2021.9561686

Learning & Autonomous Control

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Aleatoric uncertainty estimation, based on the observed training data, is applied for the detection of conflicts in a demonstration data set. The particular focus of this paper is the resolution of conflicting data resulting from scenarios with equivalent action choices, such as obstacle avoidance, path planning or multiple joint configurations. In terms of the estimated uncertainty, the proposed algorithm aims to decrease this otherwise irreducible value through direct alteration of the accrued data set and to provide data that a policy-learning neural network is able to fit appropriately. The proposed algorithm was validated with real robot scenarios while learning from inconsistent demonstrations, where the resulting policies consistently achieved their prescribed objectives. A video showing our method and experiments can be found at: https://youtu.be/oGYnzlW9Ncw.

Original language	English
Title of host publication	Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2021
Publisher	IEEE
Pages	3655-3661
ISBN (Electronic)	978-1-7281-9077-8
DOIs	https://doi.org/10.1109/ICRA48506.2021.9561686
Publication status	Published - 2021
Event	2021 IEEE International Conference on Robotics and Automation, ICRA 2021, Xi'an, China (30 May 2021 → 5 Jun 2021)

Conference

Conference	2021 IEEE International Conference on Robotics and Automation, ICRA 2021
Country/Territory	China
City	Xi'an
Period	30/05/21 → 5/06/21

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Access to Document

10.1109/ICRA48506.2021.9561686

Imitation_Learning_with_Inconsistent_Demonstrations_through_Uncertainty-based_Data_Manipulation (Final published version, 3.89 MB)

Cite this

@inproceedings{165bfbfe4b0845bdb3b35a0e89b94a6a,
  title = "Imitation Learning with Inconsistent Demonstrations through Uncertainty-based Data Manipulation",
  abstract = "Aleatoric uncertainty estimation, based on the observed training data, is applied for the detection of conflicts in a demonstration data set. The particular focus of this paper is the resolution of conflicting data resulting from scenarios with equivalent action choices, such as obstacle avoidance, path planning or multiple joint configurations. In terms of the estimated uncertainty, the proposed algorithm aims to decrease this otherwise irreducible value through direct alteration of the accrued data set and to provide data that a policy-learning neural network is able to fit appropriately. The proposed algorithm was validated with real robot scenarios while learning from inconsistent demonstrations, where the resulting policies consistently achieved their prescribed objectives. A video showing our method and experiments can be found at: https://youtu.be/oGYnzlW9Ncw.",
  author = "Peter Valletta and Rodrigo P{\'e}rez-Dattari and Jens Kober",
  note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.; 2021 IEEE International Conference on Robotics and Automation, ICRA 2021 ; Conference date: 30-05-2021 Through 05-06-2021",
  year = "2021",
  doi = "10.1109/ICRA48506.2021.9561686",
  language = "English",
  pages = "3655--3661",
  booktitle = "Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2021",
  publisher = "IEEE",
  address = "United States",
}

Valletta, P, Pérez-Dattari, R & Kober, J 2021, Imitation Learning with Inconsistent Demonstrations through Uncertainty-based Data Manipulation. in Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2021. IEEE, pp. 3655-3661, 2021 IEEE International Conference on Robotics and Automation, ICRA 2021, Xi'an, China, 30/05/21. https://doi.org/10.1109/ICRA48506.2021.9561686

Imitation Learning with Inconsistent Demonstrations through Uncertainty-based Data Manipulation. / Valletta, Peter; Pérez-Dattari, Rodrigo; Kober, Jens.
Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2021. IEEE, 2021. p. 3655-3661.

TY  - GEN
T1  - Imitation Learning with Inconsistent Demonstrations through Uncertainty-based Data Manipulation
AU  - Valletta, Peter
AU  - Pérez-Dattari, Rodrigo
AU  - Kober, Jens
N1  - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
PY  - 2021
Y1  - 2021
AB  - Aleatoric uncertainty estimation, based on the observed training data, is applied for the detection of conflicts in a demonstration data set. The particular focus of this paper is the resolution of conflicting data resulting from scenarios with equivalent action choices, such as obstacle avoidance, path planning or multiple joint configurations. In terms of the estimated uncertainty, the proposed algorithm aims to decrease this otherwise irreducible value through direct alteration of the accrued data set and to provide data that a policy-learning neural network is able to fit appropriately. The proposed algorithm was validated with real robot scenarios while learning from inconsistent demonstrations, where the resulting policies consistently achieved their prescribed objectives. A video showing our method and experiments can be found at: https://youtu.be/oGYnzlW9Ncw.
UR  - http://www.scopus.com/inward/record.url?scp=85125488068&partnerID=8YFLogxK
U2  - 10.1109/ICRA48506.2021.9561686
DO  - 10.1109/ICRA48506.2021.9561686
M3  - Conference contribution
AN  - SCOPUS:85125488068
SP  - 3655
EP  - 3661
BT  - Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2021
PB  - IEEE
T2  - 2021 IEEE International Conference on Robotics and Automation, ICRA 2021
Y2  - 30 May 2021 through 5 June 2021
ER  - 
