cff-version: 1.2.0
abstract: "<p>This is the code and data related to the publication:</p><p>Y. Dong, T. Datema, V. Wassenaar, J. Van de Weg, C. T. Kopar and H. Suleman, "Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers," 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), Bilbao, Spain, 2023, pp. 6165-6170, doi: 10.1109/ITSC57777.2023.10422159.</p><p>&nbsp;</p><p>&nbsp;keywords: {Training;Deep learning;Roads;Reinforcement learning;Automobiles;Task analysis;Optimization}</p><p><br></p><p>The implementation is based on Python, Stable-Baselines3 (https://stable-baselines3.readthedocs.io/en/master/) and Highway_env simulation environment https://github.com/Farama-Foundation/HighwayEnv</p><p><br></p><p>~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~</p><p>Developing and testing automated driving models in the real world might be challenging and even dangerous, while simulation can help with this, especially for challenging manoeuvres. Deep reinforcement learning (DRL) has the potential to tackle complex decision-making and controlling tasks through learning and interacting with the environment, thus it is suitable for developing automated driving while not being explored in detail yet. This study carried out a comprehensive study by implementing, evaluating, and comparing the two DRL algorithms, Deep Q-networks (DQN) and Trust Region Policy Optimization (TRPO), for training automated driving on the highway-env simulation platform. Effective and customized reward functions were developed and the implemented algorithms were evaluated in terms of onlane accuracy (how well the car drives on the road within the lane), efficiency (how fast the car drives), safety (how likely the car is to crash into obstacles), and comfort (how much the car makes jerks, e.g., suddenly accelerates or brakes). Results show that the TRPO-based models with modified reward functions delivered the best performance in most cases. Furthermore, to train a uniform driving model that can tackle various driving manoeuvres besides the specific ones, this study expanded the highway-env and developed an extra customized training environment, namely, ComplexRoads, integrating various driving manoeuvres and multiple road scenarios together. Models trained on the designed ComplexRoads environment can adapt well to other driving manoeuvres with promising overall performance. Lastly, several functionalities were added to the highway-env to implement this work. The codes are open on GitHub at https://github.com/alaineman/drlcarsim-paper.</p><p><br></p>"
authors:
  - family-names: Dong
    given-names: Yongqi
    orcid: "https://orcid.org/0000-0003-1159-9584"
  - family-names: Datema
    given-names: Tobias
  - family-names: Wassenaar
    given-names: Vincent
  - family-names: Van de Weg
    given-names: Joris
  - family-names: Tolga Kopar
    given-names: Cahit
  - family-names: Suleman
    given-names: Harim
title: "Code underlying the publication: Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers"
keywords:
version: 1
identifiers:
  - type: doi
    value: 10.4121/26e8f131-53f8-44b9-8ecf-249bfedb0154.v1
license: CC BY 4.0
date-released: 2025-02-20