Research on Autonomous Manoeuvre Decision Making in Within-Visual-Range Aerial Two-Player Zero-Sum Games Based on Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Research on Autonomous Manoeuvre Decision Making in Within-Visual-Range Aerial Two-Player Zero-Sum Games Based on Deep Reinforcement Learning

Published:2024-07-10 Issue:14 Volume:12 Page:2160
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Lu Bo¹²,Ru Le¹²,Hu Shiguang¹²^ORCID,Wang Wenfei¹²,Xi Hailong¹²,Zhao Xiaolin¹²

Affiliation:

1. Equipment Management and UAV Engineering College, Air Force Engineering University, Xi’an 710051, China

2. National Key Lab of Unmanned Aerial Vehicle Technology, Air Force Engineering University, Xi’an 710051, China

Abstract

In recent years, with the accelerated development of technology towards automation and intelligence, autonomous decision-making capabilities in unmanned systems are poised to play a crucial role in contemporary aerial two-player zero-sum games (TZSGs). Deep reinforcement learning (DRL) methods enable agents to make autonomous manoeuvring decisions. This paper focuses on current mainstream DRL algorithms based on fundamental tactical manoeuvres, selecting a typical aerial TZSG scenario—within visual range (WVR) combat. We model the key elements influencing the game using a Markov decision process (MDP) and demonstrate the mathematical foundation for implementing DRL. Leveraging high-fidelity simulation software (Warsim v1.0), we design a prototypical close-range aerial combat scenario. Utilizing this environment, we train mainstream DRL algorithms and analyse the training outcomes. The effectiveness of these algorithms in enabling agents to manoeuvre in aerial TZSG autonomously is summarised, providing a foundational basis for further research.

Publisher

MDPI AG

Link

https://www.mdpi.com/2227-7390/12/14/2160/pdf

Reference43 articles.

1. Han, R., Chen, H., Liu, Q., and Huang, J. (2021, January 22–24). Research on Autonomous Air Combat Maneuver Decision Making Based on Reward Shaping and D3QN. Proceedings of the 2021 China Automation Conference, Beijing, China.

2. Genetic fuzzy based artificial intelligence for unmanned combat aerial vehicle control in simulated air combat missions;Ernest;J. Def. Manag.,2016

3. Optimal maneuver-based motion planning over terrain and threats using a dynamic hybrid PSO algorithm;Karimi;Aerospaceence Technol.,2013

4. Improving maneuver strategy in air combat by alternate freeze games with a deep reinforcement learning algorithm;Wang;Math. Probl. Eng.,2020

5. Guidance and control for own aircraft in the autonomous air combat: A historical review and future prospects;Dong;Proc. Inst. Mech. Eng.,2019