Code underlying the publication: Safe, Efficient, Comfort, and Energy-Saving Automated Driving Through Roundabout Based on Deep Reinforcement Learning

DOI: 10.4121/c1020a3f-0053-491f-8ead-35d18819d37e.v1

The DOI displayed above is for this specific version of this dataset, which is currently the latest. Newer versions may be published in the future. For a link that will always point to the latest version, please use
DOI: 10.4121/c1020a3f-0053-491f-8ead-35d18819d37e

Datacite citation style

Yuan, Henan; Dong, Yongqi; Li, Penghui; Kang, Liujiang; Farah, Haneen et al. (2025): Code underlying the publication: Safe, Efficient, Comfort, and Energy-Saving Automated Driving Through Roundabout Based on Deep Reinforcement Learning. Version 1. 4TU.ResearchData. dataset. https://doi.org/10.4121/c1020a3f-0053-491f-8ead-35d18819d37e.v1


Dataset

Keywords

Deep reinforcement learning; DRL; Energy consumption; ITS; Merging; Road transportation; Roundabout; Safety; Testing

Licence

CC BY 4.0


by Henan Yuan, Yongqi Dong, Penghui Li, Liujiang Kang, Haneen Farah, Bart van Arem

This is the code related to the publication:

H. Yuan, P. Li, B. Van Arem, L. Kang, H. Farah and Y. Dong, "Safe, Efficient, Comfort, and Energy-Saving Automated Driving Through Roundabout Based on Deep Reinforcement Learning," 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), Bilbao, Spain, 2023, pp. 6074-6079, doi: 10.1109/ITSC57777.2023.10422488.

Keywords: Road transportation; Deep learning; Energy consumption; Merging; Reinforcement learning; Safety; Testing

The implementation is based on Python, Stable-Baselines3 (https://stable-baselines3.readthedocs.io/en/master/), and the highway-env simulation environment (https://github.com/Farama-Foundation/HighwayEnv).
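
For orientation, the following is a minimal sketch of how a Stable-Baselines3 agent can be trained on a highway-env roundabout scenario. It is not taken from the archived code: the environment ID `roundabout-v0`, the default hyperparameters, the function name `train_roundabout_agent`, and the step budget are illustrative assumptions, and environment registration details can differ between highway-env releases.

```python
# Sketch only: the environment ID, step budget, and default PPO settings are
# assumptions for illustration, not the configuration used in the publication.

def train_roundabout_agent(total_timesteps=10_000, env_id="roundabout-v0"):
    """Train a PPO agent on a highway-env roundabout scenario and return it."""
    # Imported inside the function so this file loads even when the
    # third-party packages are not installed.
    import gymnasium as gym
    import highway_env  # noqa: F401  (the import registers the environments)
    from stable_baselines3 import PPO

    env = gym.make(env_id)
    # MlpPolicy flattens the default kinematic observation into a vector.
    model = PPO("MlpPolicy", env, verbose=0)
    model.learn(total_timesteps=total_timesteps)
    env.close()
    return model

# Example call (requires gymnasium, highway-env, and stable-baselines3):
# model = train_roundabout_agent(total_timesteps=50_000)
# model.save("ppo_roundabout")  # hypothetical output file name
```

Note that TRPO is distributed in the companion sb3-contrib package rather than in Stable-Baselines3 itself, and that DDPG requires a continuous action space, so the environment would need to be configured accordingly. Which versions and settings the archived code relies on should be checked in its Readme.txt.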

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Traffic scenarios in roundabouts pose substantial complexity for automated driving. Manually mapping all possible scenarios into a state space is labor-intensive and challenging. Deep reinforcement learning (DRL), with its ability to learn from interacting with the environment, emerges as a promising solution for training such automated driving models. This study explores and implements three DRL algorithms, namely Deep Deterministic Policy Gradient (DDPG), Proximal Policy Optimization (PPO), and Trust Region Policy Optimization (TRPO), to train automated vehicles to drive through roundabouts. The driving state space, action space, and reward function are designed, with the reward function considering safety, efficiency, comfort, and energy consumption to align with real-world requirements. All three tested DRL algorithms succeed in enabling automated vehicles to drive through the roundabout. To holistically evaluate the performance of these algorithms, this study establishes an evaluation methodology considering multiple indicators, i.e., safety, efficiency, comfort, and energy consumption level. A method employing the Analytic Hierarchy Process (AHP) is also developed to weight these evaluation indicators. Experimental results on various testing scenarios reveal that the TRPO algorithm outperforms DDPG and PPO in terms of safety and efficiency, while PPO performs best in terms of comfort level and energy consumption. Lastly, to verify the model's adaptability and robustness in other driving scenarios, this study also deploys the model trained by TRPO to a range of different testing scenarios, e.g., highway driving and merging. Experimental results demonstrate that the TRPO model, trained only on roundabout driving scenarios, exhibits a certain degree of proficiency in highway driving and merging scenarios. This study provides a foundation for the application of automated driving with DRL.
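
To illustrate how the Analytic Hierarchy Process can turn pairwise judgements into indicator weights, here is a minimal sketch using the standard row geometric-mean approximation and Saaty's consistency ratio. The pairwise comparison values below are hypothetical and are not the judgements used in the publication; the function names are likewise illustrative.

```python
import math

# Saaty's random consistency index for small matrix sizes.
RANDOM_INDEX = {3: 0.58, 4: 0.90, 5: 1.12}


def ahp_weights(matrix):
    """Return normalized priority weights (row geometric-mean method)."""
    n = len(matrix)
    geo_means = [math.prod(row) ** (1.0 / n) for row in matrix]
    total = sum(geo_means)
    return [g / total for g in geo_means]


def consistency_ratio(matrix, weights):
    """Return the consistency ratio; values below 0.1 are usually accepted."""
    n = len(matrix)
    # Estimate the principal eigenvalue as the mean of (A w)_i / w_i.
    aw = [sum(matrix[i][j] * weights[j] for j in range(n)) for i in range(n)]
    lambda_max = sum(aw[i] / weights[i] for i in range(n)) / n
    consistency_index = (lambda_max - n) / (n - 1)
    return consistency_index / RANDOM_INDEX[n]


# Hypothetical judgements: entry [i][j] states how much more important
# indicator i is than indicator j (reciprocal matrix).
INDICATORS = ["safety", "efficiency", "comfort", "energy"]
PAIRWISE = [
    [1.0, 3.0, 5.0, 5.0],
    [1 / 3, 1.0, 2.0, 2.0],
    [1 / 5, 1 / 2, 1.0, 1.0],
    [1 / 5, 1 / 2, 1.0, 1.0],
]

weights = ahp_weights(PAIRWISE)
ratio = consistency_ratio(PAIRWISE, weights)
for name, weight in zip(INDICATORS, weights):
    print(f"{name}: {weight:.3f}")
print(f"consistency ratio: {ratio:.4f}")
```

The resulting weights could then be used to combine per-indicator scores into a single evaluation score; how the publication does this in detail is described in the paper itself.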

History

2025-02-20 first online, published, posted

Publisher

4TU.ResearchData

Format

py; txt; csv; avi

Associated peer-reviewed publication

Safe, Efficient, Comfort, and Energy-Saving Automated Driving Through Roundabout Based on Deep Reinforcement Learning

Funding

Safe and efficient operation of AutoMated and human drivEN vehicles in mixed traffic (grant code 17187), Applied and Technical Sciences (TTW), a subdomain of the Dutch Institute for Scientific Research (NWO)

Organizations

TU Delft, Faculty of Civil Engineering and Geosciences, Department of Transport and Planning
Beijing Jiaotong University, School of Traffic and Transportation

DATA

Files (2)

Readme.txt (3,230 bytes, MD5: 49ea28418caaba60f3fadb834905dcc9)
Safe, Efficient, Comfort, and Energy-Saving Automated Driving Through Roundabout Based on DRL.zip (92,119,671 bytes, MD5: c86bc940abe5034cf14c4a43eb887c60)
All files: 92,122,901 bytes unzipped
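
The MD5 checksums listed for the files can be used to confirm that a download is intact. The following is a minimal sketch using only the Python standard library; the helper names and the assumption that the files sit in the current working directory are illustrative.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_md5(path, expected_md5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected_md5.strip().lower()


# Example (assumes Readme.txt was downloaded to the current directory):
# verify_md5("Readme.txt", "49ea28418caaba60f3fadb834905dcc9")
```

Reading in chunks keeps memory use low for the roughly 92 MB archive.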