Safe Curriculum Learning for Linear Systems with Parametric Unknowns in Primary Flight Control

D.D.C. De Buysscher; T.S.C. Pollack; E. van Kampen

doi:10.2514/6.2022-0790

Safe Curriculum Learning for Linear Systems with Parametric Unknowns in Primary Flight Control

D.D.C. De Buysscher, T.S.C. Pollack, E. van Kampen

Control & Simulation

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

1 Citation (Scopus)

53 Downloads (Pure)

Abstract

Safe Curriculum Learning aims at improving safety and efficiency aspects of Reinforcement Learning (RL). Curricular RL approaches divide a task into stages of increasing complexity in order to increase efficiency. This paper proposes a black box safe curriculum learning architecture applicable to systems with parametric unknowns. The agent domain solely requires knowledge of the state and action spaces’ dimensions for a given task and system. By adding system identification capabilities to existing safe curriculum learning paradigms, the proposed architecture ensures safe learning of tracking tasks without requiring initial knowledge of the system dynamics. A model estimate is generated online to complement safety filters that rely on uncertain models for their safety guarantees. This research explicitly targets linearised systems with decoupled dynamics. The paradigm is initially verified on a mass-spring-damper system, after which it is applied to a quadrotor altitude and attitude tracking task. The RL agent is able to safely learn an optimal policy that can track an independent reference on each degree of freedom.

Original language	English
Title of host publication	AIAA SCITECH 2022 Forum
Number of pages	23
ISBN (Electronic)	978-1-62410-631-6
DOIs	https://doi.org/10.2514/6.2022-0790
Publication status	Published - 2022
Event	AIAA SCITECH 2022 Forum - virtual event Duration: 3 Jan 2022 → 7 Jan 2022

Publication series

Name	AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022

Conference

Conference	AIAA SCITECH 2022 Forum
Period	3/01/22 → 7/01/22

Access to Document

10.2514/6.2022-0790

6.2022-0790Final published version, 2.55 MB

Cite this

@inproceedings{104c2976037c48b2af3e6fcd1ed78df7,

title = "Safe Curriculum Learning for Linear Systems with Parametric Unknowns in Primary Flight Control",

abstract = "Safe Curriculum Learning aims at improving safety and efficiency aspects of Reinforcement Learning (RL). Curricular RL approaches divide a task into stages of increasing complexity in order to increase efficiency. This paper proposes a black box safe curriculum learning architecture applicable to systems with parametric unknowns. The agent domain solely requires knowledge of the state and action spaces{\textquoteright} dimensions for a given task and system. By adding system identification capabilities to existing safe curriculum learning paradigms, the proposed architecture ensures safe learning of tracking tasks without requiring initial knowledge of the system dynamics. A model estimate is generated online to complement safety filters that rely on uncertain models for their safety guarantees. This research explicitly targets linearised systems with decoupled dynamics. The paradigm is initially verified on a mass-spring-damper system, after which it is applied to a quadrotor altitude and attitude tracking task. The RL agent is able to safely learn an optimal policy that can track an independent reference on each degree of freedom.",

author = "{De Buysscher}, D.D.C. and T.S.C. Pollack and {van Kampen}, E.",

year = "2022",

doi = "10.2514/6.2022-0790",

language = "English",

series = "AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022",

booktitle = "AIAA SCITECH 2022 Forum",

note = "AIAA SCITECH 2022 Forum ; Conference date: 03-01-2022 Through 07-01-2022",

}

Safe Curriculum Learning for Linear Systems with Parametric Unknowns in Primary Flight Control. / De Buysscher, D.D.C.; Pollack, T.S.C.; van Kampen, E.
AIAA SCITECH 2022 Forum. 2022. AIAA 2022-0790 (AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Safe Curriculum Learning for Linear Systems with Parametric Unknowns in Primary Flight Control

AU - De Buysscher, D.D.C.

AU - Pollack, T.S.C.

AU - van Kampen, E.

PY - 2022

Y1 - 2022

N2 - Safe Curriculum Learning aims at improving safety and efficiency aspects of Reinforcement Learning (RL). Curricular RL approaches divide a task into stages of increasing complexity in order to increase efficiency. This paper proposes a black box safe curriculum learning architecture applicable to systems with parametric unknowns. The agent domain solely requires knowledge of the state and action spaces’ dimensions for a given task and system. By adding system identification capabilities to existing safe curriculum learning paradigms, the proposed architecture ensures safe learning of tracking tasks without requiring initial knowledge of the system dynamics. A model estimate is generated online to complement safety filters that rely on uncertain models for their safety guarantees. This research explicitly targets linearised systems with decoupled dynamics. The paradigm is initially verified on a mass-spring-damper system, after which it is applied to a quadrotor altitude and attitude tracking task. The RL agent is able to safely learn an optimal policy that can track an independent reference on each degree of freedom.

AB - Safe Curriculum Learning aims at improving safety and efficiency aspects of Reinforcement Learning (RL). Curricular RL approaches divide a task into stages of increasing complexity in order to increase efficiency. This paper proposes a black box safe curriculum learning architecture applicable to systems with parametric unknowns. The agent domain solely requires knowledge of the state and action spaces’ dimensions for a given task and system. By adding system identification capabilities to existing safe curriculum learning paradigms, the proposed architecture ensures safe learning of tracking tasks without requiring initial knowledge of the system dynamics. A model estimate is generated online to complement safety filters that rely on uncertain models for their safety guarantees. This research explicitly targets linearised systems with decoupled dynamics. The paradigm is initially verified on a mass-spring-damper system, after which it is applied to a quadrotor altitude and attitude tracking task. The RL agent is able to safely learn an optimal policy that can track an independent reference on each degree of freedom.

UR - http://www.scopus.com/inward/record.url?scp=85123358849&partnerID=8YFLogxK

U2 - 10.2514/6.2022-0790

DO - 10.2514/6.2022-0790

M3 - Conference contribution

T3 - AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022

BT - AIAA SCITECH 2022 Forum

T2 - AIAA SCITECH 2022 Forum

Y2 - 3 January 2022 through 7 January 2022

ER -

Safe Curriculum Learning for Linear Systems with Parametric Unknowns in Primary Flight Control

Abstract

Publication series

Conference

Access to Document

Other files and links

Fingerprint

Cite this