Literature DB >> 34111168

Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization algorithm.

Nesma M Ashraf1, Reham R Mostafa2, Rasha H Sakr1, M Z Rashad1.   

Abstract

Deep Reinforcement Learning (DRL) enables agents to make decisions based on a well-designed reward function that suites a particular environment without any prior knowledge related to a given environment. The adaptation of hyperparameters has a great impact on the overall learning process and the learning processing times. Hyperparameters should be accurately estimated while training DRL algorithms, which is one of the key challenges that we attempt to address. This paper employs a swarm-based optimization algorithm, namely the Whale Optimization Algorithm (WOA), for optimizing the hyperparameters of the Deep Deterministic Policy Gradient (DDPG) algorithm to achieve the optimum control strategy in an autonomous driving control problem. DDPG is capable of handling complex environments, which contain continuous spaces for actions. To evaluate the proposed algorithm, the Open Racing Car Simulator (TORCS), a realistic autonomous driving simulation environment, was chosen to its ease of design and implementation. Using TORCS, the DDPG agent with optimized hyperparameters was compared with a DDPG agent with reference hyperparameters. The experimental results showed that the DDPG's hyperparameters optimization leads to maximizing the total rewards, along with testing episodes and maintaining a stable driving policy.

Entities:  

Year:  2021        PMID: 34111168     DOI: 10.1371/journal.pone.0252754

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  4 in total

1.  Eight pruning deep learning models for low storage and high-speed COVID-19 computed tomography lung segmentation and heatmap-based lesion localization: A multicenter study using COVLIAS 2.0.

Authors:  Mohit Agarwal; Sushant Agarwal; Luca Saba; Gian Luca Chabert; Suneet Gupta; Alessandro Carriero; Alessio Pasche; Pietro Danna; Armin Mehmedovic; Gavino Faa; Saurabh Shrivastava; Kanishka Jain; Harsh Jain; Tanay Jujaray; Inder M Singh; Monika Turk; Paramjit S Chadha; Amer M Johri; Narendra N Khanna; Sophie Mavrogeni; John R Laird; David W Sobel; Martin Miner; Antonella Balestrieri; Petros P Sfikakis; George Tsoulfas; Durga Prasanna Misra; Vikas Agarwal; George D Kitas; Jagjit S Teji; Mustafa Al-Maini; Surinder K Dhanjil; Andrew Nicolaides; Aditya Sharma; Vijay Rathore; Mostafa Fatemi; Azra Alizad; Pudukode R Krishnan; Rajanikant R Yadav; Frence Nagy; Zsigmond Tamás Kincses; Zoltan Ruzsa; Subbaram Naidu; Klaudija Viskovic; Manudeep K Kalra; Jasjit S Suri
Journal:  Comput Biol Med       Date:  2022-05-21       Impact factor: 6.698

2.  High-Performance Computing Analysis and Location Selection of Logistics Distribution Center Space Based on Whale Optimization Algorithm.

Authors:  Lijuan Yang; Xiedong Song
Journal:  Comput Intell Neurosci       Date:  2022-06-22

3.  A novel hybrid soft computing optimization framework for dynamic economic dispatch problem of complex non-convex contiguous constrained machines.

Authors:  Ijaz Ahmed; Um-E-Habiba Alvi; Abdul Basit; Tayyaba Khursheed; Alwena Alvi; Keum-Shik Hong; Muhammad Rehan
Journal:  PLoS One       Date:  2022-01-26       Impact factor: 3.240

4.  Android malware classification based on random vector functional link and artificial Jellyfish Search optimizer.

Authors:  Emad T Elkabbash; Reham R Mostafa; Sherif I Barakat
Journal:  PLoS One       Date:  2021-11-19       Impact factor: 3.240

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.