End-to-end neural network based optimal quadcopter control

Robin Ferede; Guido de Croon; Christophe De Wagter; Dario Izzo

doi:10.1016/j.robot.2023.104588

Corresponding author: Robin Ferede

Control & Simulation

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Developing optimal controllers for aggressive high-speed quadcopter flight poses significant challenges in robotics. Recent trends in the field involve utilizing neural network controllers trained through supervised or reinforcement learning. However, the sim-to-real transfer introduces a reality gap, requiring the use of robust inner-loop controllers during real flights, which limits the network's control authority and flight performance. In this paper, we investigate, for the first time, an end-to-end neural network controller, addressing the reality gap issue without being restricted by an inner-loop controller. The networks, referred to as G&CNets, are trained to learn an energy-optimal policy mapping the quadcopter's state to rpm commands using an optimal trajectory dataset. In hover-to-hover flights, we identified the unmodeled moments as a significant contributor to the reality gap. To mitigate this, we propose an adaptive control strategy that works by learning from optimal trajectories of a system affected by constant external pitch, roll and yaw moments. In real test flights, this model mismatch is estimated onboard and fed to the network to obtain the optimal rpm command. We demonstrate the effectiveness of our method by performing energy-optimal hover-to-hover flights with and without moment feedback. Finally, we compare the adaptive controller to a state-of-the-art differential-flatness-based controller in a consecutive waypoint flight and demonstrate the advantages of our method in terms of energy optimality and robustness.

Original language	English
Article number	104588
Number of pages	11
Journal	Robotics and Autonomous Systems
Volume	172
DOIs	https://doi.org/10.1016/j.robot.2023.104588
Publication status	Published - 2024

Funding

This work was supported by the European Space Agency. This research was co-funded under the Discovery programme of, and funded by, the European Space Agency.

Keywords

End-to-end control
G&CNet
Optimal control
Reality gap
Sim-to-real transfer
Supervised learning

Access to Document

10.1016/j.robot.2023.104588

1-s2.0-S0921889023002270-main: Final published version, 4.5 MB, Licence: CC BY

Cite this

@article{16169a19bf6b46818ecc18a9f2bd5e0f,

title = "End-to-end neural network based optimal quadcopter control",

abstract = "Developing optimal controllers for aggressive high-speed quadcopter flight poses significant challenges in robotics. Recent trends in the field involve utilizing neural network controllers trained through supervised or reinforcement learning. However, the sim-to-real transfer introduces a reality gap, requiring the use of robust inner-loop controllers during real flights, which limits the network's control authority and flight performance. In this paper, we investigate, for the first time, an end-to-end neural network controller, addressing the reality gap issue without being restricted by an inner-loop controller. The networks, referred to as G\&CNets, are trained to learn an energy-optimal policy mapping the quadcopter's state to rpm commands using an optimal trajectory dataset. In hover-to-hover flights, we identified the unmodeled moments as a significant contributor to the reality gap. To mitigate this, we propose an adaptive control strategy that works by learning from optimal trajectories of a system affected by constant external pitch, roll and yaw moments. In real test flights, this model mismatch is estimated onboard and fed to the network to obtain the optimal rpm command. We demonstrate the effectiveness of our method by performing energy-optimal hover-to-hover flights with and without moment feedback. Finally, we compare the adaptive controller to a state-of-the-art differential-flatness-based controller in a consecutive waypoint flight and demonstrate the advantages of our method in terms of energy optimality and robustness.",

keywords = "End-to-end control, G\&CNet, Optimal control, Reality gap, Sim-to-real transfer, Supervised learning",

author = "Robin Ferede and {de Croon}, Guido and {De Wagter}, Christophe and Dario Izzo",

year = "2024",

doi = "10.1016/j.robot.2023.104588",

language = "English",

volume = "172",

journal = "Robotics and Autonomous Systems",

issn = "0921-8890",

publisher = "Elsevier",

}

TY - JOUR

T1 - End-to-end neural network based optimal quadcopter control

AU - Ferede, Robin

AU - de Croon, Guido

AU - De Wagter, Christophe

AU - Izzo, Dario

PY - 2024

Y1 - 2024

AB - Developing optimal controllers for aggressive high-speed quadcopter flight poses significant challenges in robotics. Recent trends in the field involve utilizing neural network controllers trained through supervised or reinforcement learning. However, the sim-to-real transfer introduces a reality gap, requiring the use of robust inner loop controllers during real flights, which limits the network's control authority and flight performance. In this paper, we investigate for the first time, an end-to-end neural network controller, addressing the reality gap issue without being restricted by an inner-loop controller. The networks, referred to as G&CNets, are trained to learn an energy-optimal policy mapping the quadcopter's state to rpm commands using an optimal trajectory dataset. In hover-to-hover flights, we identified the unmodeled moments as a significant contributor to the reality gap. To mitigate this, we propose an adaptive control strategy that works by learning from optimal trajectories of a system affected by constant external pitch, roll and yaw moments. In real test flights, this model mismatch is estimated onboard and fed to the network to obtain the optimal rpm command. We demonstrate the effectiveness of our method by performing energy-optimal hover-to-hover flights with and without moment feedback. Finally, we compare the adaptive controller to a state-of-the-art differential-flatness-based controller in a consecutive waypoint flight and demonstrate the advantages of our method in terms of energy optimality and robustness.

KW - End-to-end control

KW - G&CNet

KW - Optimal control

KW - Reality gap

KW - Sim-to-real transfer

KW - Supervised learning

UR - http://www.scopus.com/inward/record.url?scp=85179484368&partnerID=8YFLogxK

U2 - 10.1016/j.robot.2023.104588

DO - 10.1016/j.robot.2023.104588

M3 - Article

AN - SCOPUS:85179484368

SN - 0921-8890

VL - 172

JO - Robotics and Autonomous Systems

JF - Robotics and Autonomous Systems

M1 - 104588

ER -
