Reinforcement learning of potential fields to achieve limit-cycle walking

D.S. Feirstein (student); Ivan Koryakovskiy; Jens Kober; Heike Vallery

doi:10.1016/j.ifacol.2016.07.994

Biomechatronics & Human-Machine Control

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

6 Citations (Scopus)

287 Downloads (Pure)

Abstract

Reinforcement learning is a powerful tool for deriving controllers for systems where no models are available. Policy search algorithms are particularly suitable for complex systems, because they keep learning time manageable and account for continuous state and action spaces. However, these algorithms demand more insight into the system to choose a suitable controller parameterization. This paper investigates a type of policy parameterization for impedance control that allows energy input to be implicitly bounded: potential fields. It presents a methodology for generating a potential field-constrained impedance controller by approximating example trajectories and subsequently improving the control policy using reinforcement learning. The potential field-constrained approximation is used as a policy parameterization for policy search reinforcement learning and is compared to its unconstrained counterpart. Simulations on a simple biped walking model show that, for both parameterizations, the learned controllers are able to surpass the potential field of gravity by generating a stable limit-cycle gait on flat ground. The potential field-constrained controller provides safety with a known energy bound while performing as well as the unconstrained policy.
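The idea that a potential field implicitly bounds energy input can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a single joint, a potential built from Gaussian radial basis functions, and hypothetical centers, width, and weights chosen only for illustration. The torque is the negative gradient of the potential, so the work done along any path equals a difference of potential values and cannot exceed the sum of the absolute weights.

```python
import math

# Hypothetical 1-D example: centers, width and weights are illustrative
# values, not parameters taken from the paper.
CENTERS = [-1.0, 0.0, 1.0]
WIDTH = 0.5
WEIGHTS = [0.8, -1.2, 0.5]


def potential(q, weights=WEIGHTS):
    """Potential U(q) as a weighted sum of Gaussian radial basis functions."""
    return sum(
        w * math.exp(-((q - c) ** 2) / (2.0 * WIDTH ** 2))
        for w, c in zip(weights, CENTERS)
    )


def torque(q, weights=WEIGHTS):
    """Control torque tau(q) = -dU/dq, computed analytically."""
    return sum(
        w * (q - c) / WIDTH ** 2 * math.exp(-((q - c) ** 2) / (2.0 * WIDTH ** 2))
        for w, c in zip(weights, CENTERS)
    )


def work_along_path(q_start, q_end, steps=20000):
    """Midpoint-rule integral of tau dq from q_start to q_end."""
    dq = (q_end - q_start) / steps
    return sum(torque(q_start + (k + 0.5) * dq) * dq for k in range(steps))


def energy_bound(weights=WEIGHTS):
    """Bound on injected work: every basis function lies in (0, 1]."""
    return sum(abs(w) for w in weights)


if __name__ == "__main__":
    w = work_along_path(-0.7, 1.3)
    print("work done by controller:", w)
    print("potential difference:   ", potential(-0.7) - potential(1.3))
    print("energy bound:           ", energy_bound())
```

In a policy search setting, the weights would play the role of the policy parameters, so the bound is known for any parameter vector the learner proposes. An unconstrained parameterization of the torque itself offers no such guarantee.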

Original language	English
Title of host publication	Proceedings of the 6th IFAC Workshop on Periodic Control Systems (PSYCO 2016)
Editors	Henk Nijmeijer
Publisher	Elsevier
Pages	113-118
DOIs	https://doi.org/10.1016/j.ifacol.2016.07.994
Publication status	Published - 2016
Event	PSYCO 2016: 6th IFAC Workshop on Periodic Control Systems, Eindhoven, Netherlands, 29 Jun 2016 → 1 Jul 2016

Publication series

Name	IFAC-PapersOnLine
Number	14
Volume	49
ISSN (Electronic)	2405-8963

Workshop

Workshop	PSYCO 2016: 6th IFAC Workshop on Periodic Control Systems
Country/Territory	Netherlands
City	Eindhoven
Period	29/06/16 → 1/07/16

Keywords

Energy Control
Limit cycles
Machine learning
Robot control
Walking

Access to Document

10.1016/j.ifacol.2016.07.994

IFAC2016_Reinforcement_Feirstein: Accepted author manuscript, 2.79 MB, Licence: CC BY-NC-ND

Cite this

@inproceedings{b46933034c1a40f6b20bf19ba845701f,

title = "Reinforcement learning of potential fields to achieve limit-cycle walking",

abstract = "Reinforcement learning is a powerful tool to derive controllers for systems where no models are available. Particularly policy search algorithms are suitable for complex systems, to keep learning time manageable and account for continuous state and action spaces. However, these algorithms demand more insight into the system to choose a suitable controller parameterization. This paper investigates a type of policy parameterization for impedance control that allows energy input to be implicitly bounded: Potential fields. In this work, a methodology for generating a potential field-constrained impedance controller via approximation of example trajectories, and subsequently improving the control policy using Reinforcement Learning, is presented. The potential field-constrained approximation is used as a policy parameterization for policy search reinforcement learning and is compared to its unconstrained counterpart. Simulations on a simple biped walking model show the learned controllers are able to surpass the potential field of gravity by generating a stable limit-cycle gait on flat ground for both parameterizations. The potential field-constrained controller provides safety with a known energy bound while performing equally well as the unconstrained policy.",

keywords = "Energy Control, Limit cycles, Machine learning, Robot control, Walking",

author = "Feirstein, D.S. and Ivan Koryakovskiy and Jens Kober and Heike Vallery",

year = "2016",

doi = "10.1016/j.ifacol.2016.07.994",

language = "English",

series = "IFAC-PapersOnLine",

publisher = "Elsevier",

number = "14",

volume = "49",

pages = "113--118",

editor = "Henk Nijmeijer",

booktitle = "Proceedings of the 6th IFAC Workshop on Periodic Control Systems (PSYCO 2016)",

note = "PSYCO 2016: 6th IFAC Workshop on Periodic Control Systems ; Conference date: 29-06-2016 Through 01-07-2016",

}

Feirstein, DS, Koryakovskiy, I, Kober, J & Vallery, H 2016, Reinforcement learning of potential fields to achieve limit-cycle walking. in H Nijmeijer (ed.), Proceedings of the 6th IFAC Workshop on Periodic Control Systems (PSYCO 2016). IFAC-PapersOnLine, no. 14, vol. 49, Elsevier, pp. 113-118, PSYCO 2016: 6th IFAC Workshop on Periodic Control Systems, Eindhoven, Netherlands, 29/06/16. https://doi.org/10.1016/j.ifacol.2016.07.994

Reinforcement learning of potential fields to achieve limit-cycle walking. / Feirstein, D.S.; Koryakovskiy, Ivan; Kober, Jens et al.
Proceedings of the 6th IFAC Workshop on Periodic Control Systems (PSYCO 2016). ed. / Henk Nijmeijer. Elsevier, 2016. p. 113-118 (IFAC-PapersOnLine; Vol. 49, No. 14).

TY - GEN

T1 - Reinforcement learning of potential fields to achieve limit-cycle walking

AU - Feirstein, D.S.

AU - Koryakovskiy, Ivan

AU - Kober, Jens

AU - Vallery, Heike

PY - 2016

Y1 - 2016

N2 - Reinforcement learning is a powerful tool to derive controllers for systems where no models are available. Particularly policy search algorithms are suitable for complex systems, to keep learning time manageable and account for continuous state and action spaces. However, these algorithms demand more insight into the system to choose a suitable controller parameterization. This paper investigates a type of policy parameterization for impedance control that allows energy input to be implicitly bounded: Potential fields. In this work, a methodology for generating a potential field-constrained impedance controller via approximation of example trajectories, and subsequently improving the control policy using Reinforcement Learning, is presented. The potential field-constrained approximation is used as a policy parameterization for policy search reinforcement learning and is compared to its unconstrained counterpart. Simulations on a simple biped walking model show the learned controllers are able to surpass the potential field of gravity by generating a stable limit-cycle gait on flat ground for both parameterizations. The potential field-constrained controller provides safety with a known energy bound while performing equally well as the unconstrained policy.

AB - Reinforcement learning is a powerful tool to derive controllers for systems where no models are available. Particularly policy search algorithms are suitable for complex systems, to keep learning time manageable and account for continuous state and action spaces. However, these algorithms demand more insight into the system to choose a suitable controller parameterization. This paper investigates a type of policy parameterization for impedance control that allows energy input to be implicitly bounded: Potential fields. In this work, a methodology for generating a potential field-constrained impedance controller via approximation of example trajectories, and subsequently improving the control policy using Reinforcement Learning, is presented. The potential field-constrained approximation is used as a policy parameterization for policy search reinforcement learning and is compared to its unconstrained counterpart. Simulations on a simple biped walking model show the learned controllers are able to surpass the potential field of gravity by generating a stable limit-cycle gait on flat ground for both parameterizations. The potential field-constrained controller provides safety with a known energy bound while performing equally well as the unconstrained policy.

KW - Energy Control

KW - Limit cycles

KW - Machine learning

KW - Robot control

KW - Walking

UR - http://resolver.tudelft.nl/uuid:b4693303-4c1a-40f6-b20b-f19ba845701f

UR - http://www.scopus.com/inward/record.url?scp=84990062929&partnerID=8YFLogxK

U2 - 10.1016/j.ifacol.2016.07.994

DO - 10.1016/j.ifacol.2016.07.994

M3 - Conference contribution

T3 - IFAC-PapersOnLine

SP - 113

EP - 118

BT - Proceedings of the 6th IFAC Workshop on Periodic Control Systems (PSYCO 2016)

A2 - Nijmeijer, Henk

PB - Elsevier

T2 - PSYCO 2016: 6th IFAC Workshop on Periodic Control Systems

Y2 - 29 June 2016 through 1 July 2016

ER -
