Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning

Author:

Jing Chonglin1,Wang Chaoli1,Song Hongkai2,Shi Yibo1,Hao Longyan1

Affiliation:

1. Department of Control Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China

2. Equipment Assets Management Office, Shanghai Jian Qiao University, Shanghai 201306, China

Abstract

This paper employs an integral reinforcement learning (IRL) method to investigate the optimal tracking control problem (OTCP) for nonlinear nonzero-sum (NZS) differential game systems with unknown drift dynamics. Unlike existing methods, which can only bound the tracking error, the proposed approach ensures that the tracking error asymptotically converges to zero. This study begins by constructing an augmented system using the tracking error and reference signal, transforming the original OTCP into solving the coupled Hamilton–Jacobi (HJ) equation of the augmented system. Because the HJ equation contains unknown drift dynamics and cannot be directly solved, the IRL method is utilized to convert the HJ equation into an equivalent equation without unknown drift dynamics. To solve this equation, a critic neural network (NN) is employed to approximate the complex value function based on the tracking error and reference information data. For the unknown NN weights, the least squares (LS) method is used to design an estimation law, and the convergence of the weight estimation error is subsequently proven. The approximate solution of optimal control converges to the Nash equilibrium, and the tracking error asymptotically converges to zero in the closed system. Finally, we validate the effectiveness of the proposed method in this paper based on MATLAB using the ode45 method and least squares method to execute Algorithm 2.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Reference44 articles.

1. Robust Adaptive Dynamic Programming for Large-Scale Systems with an Application to Multimachine Power Systems;Jiang;IEEE Trans. Circuits Syst. Express Briefs,2012

2. Decentralized Adaptive Optimal Control of Large-Scale Systems with Application to Power Systems;Bian;IEEE Trans. Ind. Electron.,2015

3. Kirk, D.E. (2004). Optimal Control Theory: An Introduction, Dover Publications.

4. Affine Quadratic Optimal Control and Aerospace Applications;Rodrigues;IEEE Trans. Aerosp. Electron. Syst.,2021

5. Inverse Optimal Adaptive Control of Canonical Nonlinear Systems with Dynamic Uncertainties and Its Application to Industrial Robots;Lu;IEEE Trans. Ind. Inf.,2024

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3