Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control

C. Teirlinck; E. van Kampen

doi:10.2514/6.2024-2406

Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control

Control & Simulation

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

46 Downloads (Pure)

Abstract

Recent advancements in fault-tolerant flight control have involved model-free offline and online Reinforcement Learning (RL) algorithms in order to provide robust and adaptive control to autonomous systems. Inspired by recent work on Incremental Dual Heuristic Programming (IDHP) and Soft Actor-Critic (SAC), this research proposes a hybrid SAC-IDHP framework aiming to combine adaptive online learning from IDHP with the high complexity generalization power of SAC in controlling a fully coupled system. The hybrid framework is implemented into the inner loop of a cascaded altitude controller for a high-fidelity, six-degree-of-freedom model of the Cessna Citation II PH-LAB research aircraft. Compared to SAC-only, the SAC-IDHP hybrid demonstrates an improvement in tracking performance of 0.74%, 5.46% and 0.82% in nMAE for nominal case, longitudinal and lateral failure cases respectively. Random online policy initialization is eliminated due to identity initialization of the hybrid policy, resulting in an argument for increased safety. Additionally, robustness to biased sensor noise, initial flight condition and random critic initialization is demonstrated.

Original language	English
Title of host publication	Proceedings of the AIAA SCITECH 2024 Forum
Publisher	American Institute of Aeronautics and Astronautics Inc. (AIAA)
Number of pages	22
ISBN (Electronic)	978-1-62410-711-5
DOIs	https://doi.org/10.2514/6.2024-2406
Publication status	Published - 2024
Event	AIAA SCITECH 2024 Forum - Orlando, United States Duration: 8 Jan 2024 → 12 Jan 2024

Conference

Conference	AIAA SCITECH 2024 Forum
Country/Territory	United States
City	Orlando
Period	8/01/24 → 12/01/24

Access to Document

10.2514/6.2024-2406

teirlinck-van-kampen-2024-hybrid-soft-actor-critic-and-incremental-dual-heuristic-programming-reinforcement-learningFinal published version, 2.82 MB

Cite this

@inproceedings{02616c7bffaa45f19b32875a6a9e3061,

title = "Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control",

abstract = "Recent advancements in fault-tolerant flight control have involved model-free offline and online Reinforcement Learning (RL) algorithms in order to provide robust and adaptive control to autonomous systems. Inspired by recent work on Incremental Dual Heuristic Programming (IDHP) and Soft Actor-Critic (SAC), this research proposes a hybrid SAC-IDHP framework aiming to combine adaptive online learning from IDHP with the high complexity generalization power of SAC in controlling a fully coupled system. The hybrid framework is implemented into the inner loop of a cascaded altitude controller for a high-fidelity, six-degree-of-freedom model of the Cessna Citation II PH-LAB research aircraft. Compared to SAC-only, the SAC-IDHP hybrid demonstrates an improvement in tracking performance of 0.74%, 5.46% and 0.82% in nMAE for nominal case, longitudinal and lateral failure cases respectively. Random online policy initialization is eliminated due to identity initialization of the hybrid policy, resulting in an argument for increased safety. Additionally, robustness to biased sensor noise, initial flight condition and random critic initialization is demonstrated.",

author = "C. Teirlinck and {van Kampen}, E.",

year = "2024",

doi = "10.2514/6.2024-2406",

language = "English",

booktitle = "Proceedings of the AIAA SCITECH 2024 Forum",

publisher = "American Institute of Aeronautics and Astronautics Inc. (AIAA)",

address = "United States",

note = "AIAA SCITECH 2024 Forum ; Conference date: 08-01-2024 Through 12-01-2024",

}

Teirlinck, C & van Kampen, E 2024, Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control. in Proceedings of the AIAA SCITECH 2024 Forum., AIAA 2024-2406, American Institute of Aeronautics and Astronautics Inc. (AIAA), AIAA SCITECH 2024 Forum, Orlando, Florida, United States, 8/01/24. https://doi.org/10.2514/6.2024-2406

Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control. / Teirlinck, C.; van Kampen, E.
Proceedings of the AIAA SCITECH 2024 Forum. American Institute of Aeronautics and Astronautics Inc. (AIAA), 2024. AIAA 2024-2406.

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control

AU - Teirlinck, C.

AU - van Kampen, E.

PY - 2024

Y1 - 2024

N2 - Recent advancements in fault-tolerant flight control have involved model-free offline and online Reinforcement Learning (RL) algorithms in order to provide robust and adaptive control to autonomous systems. Inspired by recent work on Incremental Dual Heuristic Programming (IDHP) and Soft Actor-Critic (SAC), this research proposes a hybrid SAC-IDHP framework aiming to combine adaptive online learning from IDHP with the high complexity generalization power of SAC in controlling a fully coupled system. The hybrid framework is implemented into the inner loop of a cascaded altitude controller for a high-fidelity, six-degree-of-freedom model of the Cessna Citation II PH-LAB research aircraft. Compared to SAC-only, the SAC-IDHP hybrid demonstrates an improvement in tracking performance of 0.74%, 5.46% and 0.82% in nMAE for nominal case, longitudinal and lateral failure cases respectively. Random online policy initialization is eliminated due to identity initialization of the hybrid policy, resulting in an argument for increased safety. Additionally, robustness to biased sensor noise, initial flight condition and random critic initialization is demonstrated.

AB - Recent advancements in fault-tolerant flight control have involved model-free offline and online Reinforcement Learning (RL) algorithms in order to provide robust and adaptive control to autonomous systems. Inspired by recent work on Incremental Dual Heuristic Programming (IDHP) and Soft Actor-Critic (SAC), this research proposes a hybrid SAC-IDHP framework aiming to combine adaptive online learning from IDHP with the high complexity generalization power of SAC in controlling a fully coupled system. The hybrid framework is implemented into the inner loop of a cascaded altitude controller for a high-fidelity, six-degree-of-freedom model of the Cessna Citation II PH-LAB research aircraft. Compared to SAC-only, the SAC-IDHP hybrid demonstrates an improvement in tracking performance of 0.74%, 5.46% and 0.82% in nMAE for nominal case, longitudinal and lateral failure cases respectively. Random online policy initialization is eliminated due to identity initialization of the hybrid policy, resulting in an argument for increased safety. Additionally, robustness to biased sensor noise, initial flight condition and random critic initialization is demonstrated.

U2 - 10.2514/6.2024-2406

DO - 10.2514/6.2024-2406

M3 - Conference contribution

BT - Proceedings of the AIAA SCITECH 2024 Forum

PB - American Institute of Aeronautics and Astronautics Inc. (AIAA)

T2 - AIAA SCITECH 2024 Forum

Y2 - 8 January 2024 through 12 January 2024

ER -

Hybrid Soft Actor-Critic and Incremental Dual Heuristic Programming Reinforcement Learning for Fault-Tolerant Flight Control

Abstract

Conference

Access to Document

Fingerprint

Cite this