An AUV Target-Tracking Method Combining Imitation Learning and Deep Reinforcement Learning-Reference-Cited by-同舟云学术

An AUV Target-Tracking Method Combining Imitation Learning and Deep Reinforcement Learning

Published:2022-03-07 Issue:3 Volume:10 Page:383
ISSN:2077-1312
Container-title:Journal of Marine Science and Engineering
language:en
Short-container-title:JMSE

Author:

Mao Yubing,Gao Farong^ORCID,Zhang Qizhong^ORCID,Yang Zhangyi

Abstract

This study aims to solve the problem of sparse reward and local convergence when using a reinforcement learning algorithm as the controller of an AUV. Based on the generative adversarial imitation (GAIL) algorithm combined with a multi-agent, a multi-agent GAIL (MAG) algorithm is proposed. The GAIL enables the AUV to directly learn from expert demonstrations, overcoming the difficulty of slow initial training of the network. Parallel training of multi-agents reduces the high correlation between samples to avoid local convergence. In addition, a reward function is designed to help training. Finally, the results show that in the unity simulation platform test, the proposed algorithm has a strong optimal decision-making ability in the tracking process.

Funder

Open Foundation of Key Laboratory of Submarine Geosciences, MNR

Publisher

MDPI AG

Subject

Ocean Engineering,Water Science and Technology,Civil and Structural Engineering

Link

https://www.mdpi.com/2077-1312/10/3/383/pdf

Reference52 articles.

1. Unmanned Underwater Vehicle;Chen,2014

2. Development of hovering control system for an underwater vehicle to perform core internal inspections

3. Terrain Correlation Correction Method for AUV Seabed Terrain Mapping