TY - GEN
T1 - Intelligent Adaptive Control Using LADP and IADP Applied to F-16 Aircraft with Imperfect Measurements
AU - de Alvear Cardenas, J.I.
AU - Sun, B.
AU - van Kampen, E.
N1 - Virtual/online event due to COVID-19
PY - 2021
Y1 - 2021
N2 - Linear Approximate Dynamic Programming (LADP) and Incremental Approximate Dynamic Programming (IADP) are Reinforcement Learning methods that seek to contribute to the field of Adaptive Flight Control. This paper assesses their performance and convergence, as well as the impact of sensor noise on policy convergence, online system identification, performance and control surface deflection. After summarising their theory and derivation with full state (FS) and output feedback (OPFB), they are implemented on the linearised longitudinal F16 model. In order to establish an objective performance comparison, their hyper-parameters were tuned with an evolutionary algorithm: Particle Swarm Optimisation (PSO). Results show that LADP and IADP have the same performance in the presence of FS feedback, whereas LADP outperforms IADP when only OPFB is available. Output noise causes LADP based on OPFB to diverge. In the case of IADP based on OPFB, sensor noise improves the performance due to a better exploration of the solution space. The present research aims at bridging the gap between the discussed ADP algorithms and real world systems.
AB - Linear Approximate Dynamic Programming (LADP) and Incremental Approximate Dynamic Programming (IADP) are Reinforcement Learning methods that seek to contribute to the field of Adaptive Flight Control. This paper assesses their performance and convergence, as well as the impact of sensor noise on policy convergence, online system identification, performance and control surface deflection. After summarising their theory and derivation with full state (FS) and output feedback (OPFB), they are implemented on the linearised longitudinal F16 model. In order to establish an objective performance comparison, their hyper-parameters were tuned with an evolutionary algorithm: Particle Swarm Optimisation (PSO). Results show that LADP and IADP have the same performance in the presence of FS feedback, whereas LADP outperforms IADP when only OPFB is available. Output noise causes LADP based on OPFB to diverge. In the case of IADP based on OPFB, sensor noise improves the performance due to a better exploration of the solution space. The present research aims at bridging the gap between the discussed ADP algorithms and real world systems.
UR - http://www.scopus.com/inward/record.url?scp=85100289653&partnerID=8YFLogxK
U2 - 10.2514/6.2021-1119
DO - 10.2514/6.2021-1119
M3 - Conference contribution
T3 - AIAA Scitech 2021 Forum
SP - 1
EP - 44
BT - AIAA Scitech 2021 Forum
PB - American Institute of Aeronautics and Astronautics Inc. (AIAA)
T2 - AIAA Scitech 2021 Forum
Y2 - 11 January 2021 through 21 January 2021
ER -