Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles

Qingrui Zhang; Wei Pan; Vasso Reppa

doi:10.1109/CDC42340.2020.9304347

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

11 Citations (Scopus)

29 Downloads (Pure)

Abstract

This paper presents a novel model-reference reinforcement learning control method for uncertain autonomous surface vehicles. The proposed control combines a conventional model-based control method with deep reinforcement learning. With the conventional model-based control, we can ensure the learning-based control law provides closed-loop stability for the trajectory tracking control of the overall system, and increase the sample efficiency of the deep reinforcement learning. With reinforcement learning, we can directly learn a control law to compensate for modeling uncertainties. In the proposed control, a nominal system is employed for the design of a baseline control law using a conventional control approach. The nominal system also defines the desired performance for uncertain autonomous vehicles to follow. In comparison with traditional deep reinforcement learning methods, our proposed learning-based control can provide stability guarantees and better sample efficiency. We demonstrate the performance of the new algorithm via extensive simulation results.
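The structure described in the abstract (a baseline law designed on a nominal system, plus a learned term that compensates for modeling uncertainty) can be illustrated with a toy scalar example. This is only a sketch under stated assumptions: the abstract does not say how the two control terms are combined, so they are simply summed here; the learned term is a fixed proportional stand-in rather than a trained deep reinforcement learning policy; and every function name, gain, and model parameter is hypothetical and not taken from the paper.

```python
# Illustrative scalar sketch; every name and number here is hypothetical
# and none of it is taken from the paper.

def baseline_control(x_m, r, a_n, b_n, k=2.0):
    """Baseline law designed on the nominal model x_m' = a_n*x_m + b_n*u_b.

    It cancels the nominal dynamics and imposes x_m' = -k*(x_m - r),
    so the nominal state x_m converges to the reference r.
    """
    return (-a_n * x_m - k * (x_m - r)) / b_n


def compensation(x, x_m, gain=5.0):
    """Stand-in for the learned control term (NOT a trained RL policy).

    A proportional term on the error between the real state x and the
    nominal state x_m, used only to show where a learned law would plug in.
    """
    return -gain * (x - x_m)


def simulate(use_compensation=True, steps=2000, dt=0.01, r=1.0):
    """Forward-Euler simulation of the nominal model and the uncertain plant."""
    a_n, b_n = -1.0, 1.0   # nominal model used for the baseline design
    a, b = -0.5, 1.0       # "true" plant; a differs from a_n (the uncertainty)
    x = 0.0                # state of the uncertain plant
    x_m = 0.0              # state of the nominal (reference) system
    for _ in range(steps):
        u_b = baseline_control(x_m, r, a_n, b_n)
        u_l = compensation(x, x_m) if use_compensation else 0.0
        # Assumption: the overall control is the sum of the two terms.
        u = u_b + u_l
        x_m_next = x_m + dt * (a_n * x_m + b_n * u_b)
        x = x + dt * (a * x + b * u)
        x_m = x_m_next
    return x, x_m


if __name__ == "__main__":
    for flag in (False, True):
        x, x_m = simulate(use_compensation=flag)
        print(f"compensation={flag}: |x - x_m| = {abs(x - x_m):.4f}")
```

In this toy setting the compensated plant settles much closer to the nominal state than the uncompensated one. Nothing here reflects the paper's vehicle model, reward design, or network architecture.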

Original language	English
Title of host publication	Proceedings of the 59th IEEE Conference on Decision and Control, CDC 2020
Place of Publication	Piscataway, NJ, USA
Publisher	IEEE
Pages	5291-5296
ISBN (Electronic)	978-1-7281-7447-1
DOIs	https://doi.org/10.1109/CDC42340.2020.9304347
Publication status	Published - 2020
Event	59th IEEE Conference on Decision and Control, CDC 2020 - Virtual, Jeju Island, Korea, Republic of; Duration: 14 Dec 2020 → 18 Dec 2020

Conference

Conference	59th IEEE Conference on Decision and Control, CDC 2020
Country/Territory	Korea, Republic of
City	Virtual, Jeju Island
Period	14/12/20 → 18/12/20

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Access to Document

10.1109/CDC42340.2020.9304347

Model-Reference_Reinforcement_Learning_Control_of_Autonomous_Surface_Vehicles (Final published version, 1.46 MB)

Cite this

@inproceedings{28b8efc0253b48feb7b8cfe470c7473c,

title = "Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles",

abstract = "This paper presents a novel model-reference reinforcement learning control method for uncertain autonomous surface vehicles. The proposed control combines a conventional model-based control method with deep reinforcement learning. With the conventional model-based control, we can ensure the learning-based control law provides closed-loop stability for the trajectory tracking control of the overall system, and increase the sample efficiency of the deep reinforcement learning. With reinforcement learning, we can directly learn a control law to compensate for modeling uncertainties. In the proposed control, a nominal system is employed for the design of a baseline control law using a conventional control approach. The nominal system also defines the desired performance for uncertain autonomous vehicles to follow. In comparison with traditional deep reinforcement learning methods, our proposed learning-based control can provide stability guarantees and better sample efficiency. We demonstrate the performance of the new algorithm via extensive simulation results.",

author = "Qingrui Zhang and Wei Pan and Vasso Reppa",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. 59th IEEE Conference on Decision and Control, CDC 2020; conference date: 14-12-2020 through 18-12-2020",

year = "2020",

doi = "10.1109/CDC42340.2020.9304347",

language = "English",

pages = "5291--5296",

booktitle = "Proceedings of the 59th IEEE Conference on Decision and Control, CDC 2020",

publisher = "IEEE",

address = "Piscataway, NJ, USA",

}

Zhang, Q, Pan, W & Reppa, V 2020, Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles. in Proceedings of the 59th IEEE Conference on Decision and Control, CDC 2020. IEEE, Piscataway, NJ, USA, pp. 5291-5296, 59th IEEE Conference on Decision and Control, CDC 2020, Virtual, Jeju Island, Korea, Republic of, 14/12/20. https://doi.org/10.1109/CDC42340.2020.9304347

Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles. / Zhang, Qingrui; Pan, Wei; Reppa, Vasso.
Proceedings of the 59th IEEE Conference on Decision and Control, CDC 2020. Piscataway, NJ, USA: IEEE, 2020. p. 5291-5296.

TY - GEN

T1 - Model-Reference Reinforcement Learning Control of Autonomous Surface Vehicles

AU - Zhang, Qingrui

AU - Pan, Wei

AU - Reppa, Vasso

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2020

Y1 - 2020

AB - This paper presents a novel model-reference reinforcement learning control method for uncertain autonomous surface vehicles. The proposed control combines a conventional model-based control method with deep reinforcement learning. With the conventional model-based control, we can ensure the learning-based control law provides closed-loop stability for the trajectory tracking control of the overall system, and increase the sample efficiency of the deep reinforcement learning. With reinforcement learning, we can directly learn a control law to compensate for modeling uncertainties. In the proposed control, a nominal system is employed for the design of a baseline control law using a conventional control approach. The nominal system also defines the desired performance for uncertain autonomous vehicles to follow. In comparison with traditional deep reinforcement learning methods, our proposed learning-based control can provide stability guarantees and better sample efficiency. We demonstrate the performance of the new algorithm via extensive simulation results.

UR - http://www.scopus.com/inward/record.url?scp=85099877199&partnerID=8YFLogxK

U2 - 10.1109/CDC42340.2020.9304347

DO - 10.1109/CDC42340.2020.9304347

M3 - Conference contribution

AN - SCOPUS:85099877199

SP - 5291

EP - 5296

BT - Proceedings of the 59th IEEE Conference on Decision and Control, CDC 2020

PB - IEEE

CY - Piscataway, NJ, USA

T2 - 59th IEEE Conference on Decision and Control, CDC 2020

Y2 - 14 December 2020 through 18 December 2020

ER -
