Q-Learning for Feedback Nash Strategy of Finite-Horizon Nonzero-Sum Difference Games

- Zhang, Zhaorong; Xu, Juanjuan; Fu, Minyue