Skip to main content

ORIGINAL RESEARCH article

Front. Neurorobot.
Volume 18 - 2024 | doi: 10.3389/fnbot.2024.1466571
This article is part of the Research Topic Insights in Neurorobotics: 2023-2024 View all 6 articles

Noisy Dueling Double Deep Q-Network Algorithm for Autonomous Underwater Vehicle Path Planning

Provisionally accepted
Xu Liao Xu Liao 1,2Le Li Le Li 1*Chuangxia Huang Chuangxia Huang 2,3Xian Zhao Xian Zhao 1,2Shumin Tan Shumin Tan 1
  • 1 School of Meteorology and Oceanography, National University of Defense Technology,, Changsha, Anhui Province, China
  • 2 School of Mathematics and Statistics, Changsha University of Science and Technology,, Changsha, Anhui Province, China
  • 3 College of Science, Hunan University of Science and Engineering, Yongzhou, China

The final, formatted version of the article will be published soon.

    How to improve the success rate of autonomous underwater vehicle (AUV) path planning and reduce travel time as much as possible is a very challenging and crucial problem in the practical applications of AUV in the complex ocean current environment. Traditional reinforcement learning algorithms lack exploration of the environment, and the strategies learned by the agent may not generalize well to other different environments. To address these challenges, we propose a novel AUV path planning algorithm named the Noisy Dueling Double Deep Q-Network (ND3QN) algorithm by modifying the reward function and introducing a noisy network, which generalizes the traditional D3QN algorithm. Compared with the classical algorithm (e.g., Rapidly-exploring Random Trees Star (RRT*), DQN, and D3QN), with simulation experiments conducted in realistic terrain and ocean currents, the proposed ND3QN algorithm demonstrates the outstanding characteristics of a higher success rate of AUV path planning, shorter travel time, and smoother paths.

    Keywords: AUV, path planning, deep reinforcement learning, ND3QN, noisy network

    Received: 18 Jul 2024; Accepted: 23 Sep 2024.

    Copyright: © 2024 Liao, Li, Huang, Zhao and Tan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Le Li, School of Meteorology and Oceanography, National University of Defense Technology,, Changsha, Anhui Province, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.