ε-Nash Equilibrium of Pursuer–Evader–Defender Missile Navigation Dynamic Games-Reference-Cited by-同舟云学术

ε-Nash Equilibrium of Pursuer–Evader–Defender Missile Navigation Dynamic Games

Published:2024-08-20 Issue: Volume: Page:1-23
ISSN:2301-3850
Container-title:Unmanned Systems
language:en
Short-container-title:Un. Sys.

Author:

Noriega-Marquez Sebastian¹^ORCID,Hernandez-Sanchez Alejandra²^ORCID,Chairez Isaac³^ORCID,Poznyak Alexander¹^ORCID

Affiliation:

1. Departamento de Control Automático Centro de Investigacion y, Estudios Avanzados del IPN (CINVESTAV-IPN), Gustavo A. Madero, 47368 Ciudad de México, CDMX, Mexico

2. Institute of Advanced Materials for Sustainable Manufacturing Tecnológico de Monterrey, 14380 Ciudad de México, CDMX, Mexico

3. Institute of Advanced Materials for Sustainable Manufacturing, Tecnológico de Monterrey, 45210 Zapopan, JA, Mexico

Abstract

This research is dedicated to developing a min–max robust control strategy for a dynamic game involving pursuers, evaders, and defenders in a multiple-missile scenario. The approach employs neural dynamic programming, utilizing multiple continuous differential neural networks (DNNs). The competitive controller devised addresses the robust optimization of a joint cost function that relies on the trajectories of the pursuer–evader–defender system, accommodating an uncertain mathematical model while adhering to control restrictions. The dynamic programming min–max formulation facilitates robust control by accounting for bounded modeling uncertainties and external disturbances for each game component. The value function of the Hamilton–Jacobi–Bellman (HJB) equation is approximated by a DNN, enabling the estimation of the closed-loop formulation for the joint dynamic game with state restrictions. The controller’s design is grounded in estimating the state trajectory under the worst possible uncertainties and perturbations, providing a robustness factor through the robust neural controller. The learning law class for the time-varying weights in the DNN is generated by studying the HJB partial differential equation for the missile motion for each player in the dynamic game. The controller incorporates the solution of the obtained learning laws and a time-varying Riccati equation, offering an online solution to the control implementation. A recurrent algorithm, based on the Kiefer–Wolfowitz method, adjusts the initial conditions for the weights to satisfy the final condition of the given cost function for the dynamic game. A numerical example is presented to validate the proposed robust control methodology, confirming the optimization solution based on the DNN approximation for Bellman’s value function.

Funder

Institute of Advanced Materials for Sustainable Manufacturing, Tecnológico de Monterrey under the Grant Challenge-Based Research Funding Program 2022

Publisher

World Scientific Pub Co Pte Ltd

Link

https://www.worldscientific.com/doi/pdf/10.1142/S2301385025500517