Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks
This paper considers the nearly optimal solution for discrete-time (DT) affine nonlinear control systems in the presence of partially unknown internal system dynamics and disturbances. The approach is based on the successive approximate solution of the Hamilton-Jacobi-Isaacs (HJI) equation, which appears in optimal control. A successive approximation approach for updating the control and disturbance inputs of DT nonlinear affine systems is proposed. Moreover, sufficient conditions for the convergence of the approximate HJI solution to the saddle point are derived, and an iterative approach to approximating the HJI equation using a neural network (NN) is presented. Then, the requirement of full knowledge of the internal dynamics of the nonlinear DT system is relaxed by using a second NN online approximator. The result is a closed-loop optimal NN controller obtained via offline learning. A numerical example is provided to illustrate the effectiveness of the approach. © 2013 IEEE.
Publication Source (Journal or Book title)
IEEE Transactions on Cybernetics
Mehraeen, S., Dierks, T., Jagannathan, S., & Crow, M. (2013). Zero-sum two-player game theoretic formulation of affine nonlinear discrete-time systems using neural networks. IEEE Transactions on Cybernetics, 43(6), 1641-1655. https://doi.org/10.1109/TSMCB.2012.2227253