Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability

Ye Zhou; Erik Jan van Kampen; Qiping Chu

doi:10.1016/j.ast.2020.106013

Ye Zhou*, Erik Jan van Kampen, Qiping Chu

*Corresponding author for this work

Control & Simulation

Research output: Contribution to journal › Article › Scientific › peer-review

26 Citations (Scopus)

Abstract

Heuristic dynamic programming is a class of reinforcement learning, which has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model to represent the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability, which is incremental model based heuristic dynamic programming. The trait of the online identification of the incremental model makes this method an option for fault-tolerant control and partially observable control problems. This study, therefore, also extends this method to deal with partial observability. The presented method has been validated on two different online tracking problems: missile fault-tolerant control with full-state measurements and also spacecraft attitude control disturbed with liquid sloshing under partially observable conditions. The results reveal that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances.

Original language	English
Article number	106013
Number of pages	14
Journal	Aerospace Science and Technology
Volume	105
DOIs	https://doi.org/10.1016/j.ast.2020.106013
Publication status	Published - 1 Oct 2020

Keywords

Adaptive nonlinear flight control
Heuristic dynamic programming
Incremental techniques
Online reinforcement learning
Partial observability

Access to Document

10.1016/j.ast.2020.106013

Cite this

@article{059cb7da66c9491b98a00f85cc786de7,
  title = "Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability",
  abstract = "Heuristic dynamic programming is a class of reinforcement learning, which has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model to represent the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability, which is incremental model based heuristic dynamic programming. The trait of the online identification of the incremental model makes this method an option for fault-tolerant control and partially observable control problems. This study, therefore, also extends this method to deal with partial observability. The presented method has been validated on two different online tracking problems: missile fault-tolerant control with full-state measurements and also spacecraft attitude control disturbed with liquid sloshing under partially observable conditions. The results reveal that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances.",
  keywords = "Adaptive nonlinear flight control, Heuristic dynamic programming, Incremental techniques, Online reinforcement learning, Partial observability",
  author = "Ye Zhou and {van Kampen}, {Erik Jan} and Qiping Chu",
  year = "2020",
  month = oct,
  day = "1",
  doi = "10.1016/j.ast.2020.106013",
  language = "English",
  volume = "105",
  journal = "Aerospace Science and Technology",
  issn = "1270-9638",
  publisher = "Elsevier",
}

TY  - JOUR
T1  - Incremental model based online heuristic dynamic programming for nonlinear adaptive tracking control with partial observability
AU  - Zhou, Ye
AU  - van Kampen, Erik Jan
AU  - Chu, Qiping
PY  - 2020/10/1
Y1  - 2020/10/1
N2  - Heuristic dynamic programming is a class of reinforcement learning, which has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model to represent the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability, which is incremental model based heuristic dynamic programming. The trait of the online identification of the incremental model makes this method an option for fault-tolerant control and partially observable control problems. This study, therefore, also extends this method to deal with partial observability. The presented method has been validated on two different online tracking problems: missile fault-tolerant control with full-state measurements and also spacecraft attitude control disturbed with liquid sloshing under partially observable conditions. The results reveal that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances.
AB  - Heuristic dynamic programming is a class of reinforcement learning, which has been introduced to aerospace engineering to solve nonlinear, optimal adaptive control problems. However, it requires an off-line learning stage to train a global system model to represent the system dynamics. This paper uses an incremental model in heuristic dynamic programming to improve the online learning ability, which is incremental model based heuristic dynamic programming. The trait of the online identification of the incremental model makes this method an option for fault-tolerant control and partially observable control problems. This study, therefore, also extends this method to deal with partial observability. The presented method has been validated on two different online tracking problems: missile fault-tolerant control with full-state measurements and also spacecraft attitude control disturbed with liquid sloshing under partially observable conditions. The results reveal that the proposed method outperforms the conventional heuristic dynamic programming method in fault-tolerant control tasks, deals with partial observability, and is robust to internal uncertainties and external disturbances.
KW  - Adaptive nonlinear flight control
KW  - Heuristic dynamic programming
KW  - Incremental techniques
KW  - Online reinforcement learning
KW  - Partial observability
UR  - http://www.scopus.com/inward/record.url?scp=85086580235&partnerID=8YFLogxK
U2  - 10.1016/j.ast.2020.106013
DO  - 10.1016/j.ast.2020.106013
M3  - Article
AN  - SCOPUS:85086580235
SN  - 1270-9638
VL  - 105
JO  - Aerospace Science and Technology
JF  - Aerospace Science and Technology
M1  - 106013
ER  -
