Aleatoric uncertainty estimation, based on the observed training data, is applied for the detection of conflicts in a demonstration data set. The particular focus of this paper is the resolution of conflicting data resulting from scenarios with equivalent action choices, such as obstacle avoidance, path planning or multiple joint configurations. In terms of the estimated uncertainty, the proposed algorithm aims to decrease this otherwise irreducible value through direct alteration of the accrued data set and to provide data that a policy-learning neural network is able to fit appropriately. The proposed algorithm was validated with real robot scenarios while learning from inconsistent demonstrations, where the resulting policies consistently achieved their prescribed objectives. A video showing our method and experiments can be found at: https://youtu.be/oGYnzlW9Ncw.
|Title of host publication||Proceedings of the IEEE International Conference on Robotics and Automation, ICRA 2021|
|Publication status||Published - 2021|
|Event||2021 IEEE International Conference on Robotics and Automation, ICRA 2021 - Xi'an, China|
Duration: 30 May 2021 → 5 Jun 2021
|Conference||2021 IEEE International Conference on Robotics and Automation, ICRA 2021|
|Period||30/05/21 → 5/06/21|
Bibliographical noteGreen Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.