Safe, Efficient, Comfort, and Energy-saving Automated Driving through Roundabout Based on Deep Reinforcement Learning

Henan Yuan, Penghui Li, Bart van Arem, Liujiang Kang, Haneen Farah, Yongqi Dong

Research output: Chapter in Book/Conference proceedings/Edited volumeConference contributionScientificpeer-review

42 Downloads (Pure)

Abstract

Traffic scenarios in roundabouts pose substantial complexity for automated driving. Manually mapping all possible scenarios into a state space is labor-intensive and challenging. Deep reinforcement learning (DRL) with its ability to learn from interacting with the environment emerges as a promising solution for training such automated driving models. This study explores, employs, and implements various DRL algorithms, namely Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO), and Trust Region Policy Optimization (TRPO) to instruct automated vehicles' driving through roundabouts. The driving state space, action space, and reward function are designed. The reward function considers safety, efficiency, comfort, and energy consumption to align with real-world requirements. All three tested DRL algorithms succeed in enabling automated vehicles to drive through the roundabout. To holistically evaluate the performance of these algorithms, this study establishes an evaluation methodology considering multiple indicators, i.e., safety, efficiency, comfort and energy consumption level. A method employing the Analytic Hierarchy Process is also developed to weigh these evaluation indicators. Experimental results on various testing scenarios reveal that the TRPO algorithm outperforms DDPG and PPO in terms of safety and efficiency, while PPO performs the best in terms of comfort level and energy consumption. Lastly, to verify the model's adaptability and robustness regarding other driving scenarios, this study also deploys the model trained by TRPO to a range of different testing scenarios, e.g., highway driving and merging. Experimental results demonstrate that the TRPO model trained on only roundabout driving scenarios exhibits a certain degree of proficiency in highway driving and merging scenarios. This study provides a foundation for the application of automated driving with DRL.
Original languageEnglish
Title of host publication2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)
PublisherIEEE
Pages6074-6079
Number of pages6
ISBN (Electronic)9798350399462
DOIs
Publication statusPublished - 2023
Event26th IEEE International Conference on Intelligent Transportation Systems, ITSC 2023 - Euskalduna Conference Centre, Bilbao, Spain
Duration: 24 Sept 202328 Sept 2023
https://2023.ieee-itsc.org/

Publication series

NameIEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
ISSN (Print)2153-0009
ISSN (Electronic)2153-0017

Conference

Conference26th IEEE International Conference on Intelligent Transportation Systems, ITSC 2023
Abbreviated titleIEEE ITSC 2023
Country/TerritorySpain
CityBilbao
Period24/09/2328/09/23
Internet address

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Fingerprint

Dive into the research topics of 'Safe, Efficient, Comfort, and Energy-saving Automated Driving through Roundabout Based on Deep Reinforcement Learning'. Together they form a unique fingerprint.

Cite this