Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation

Cisneros-Velarde, Pedro; Koyejo, Sanmi

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2303

Computer Science > Machine Learning

Title: Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation

Authors: Pedro Cisneros-Velarde, Sanmi Koyejo

(Submitted on 1 Mar 2023)

Abstract: Nash Q-learning may be considered one of the first and most known algorithms in multi-agent reinforcement learning (MARL) for learning policies that constitute a Nash equilibrium of an underlying general-sum Markov game. Its original proof provided asymptotic guarantees and was for the tabular case. Recently, finite-sample guarantees have been provided using more modern RL techniques for the tabular case. Our work analyzes Nash Q-learning using linear function approximation -- a representation regime introduced when the state space is large or continuous -- and provides finite-sample guarantees that indicate its sample efficiency. We find that the obtained performance nearly matches an existing efficient result for single-agent RL under the same representation and has a polynomial gap when compared to the best-known result for the tabular case.

Comments:	25 pages. arXiv admin note: text overlap with arXiv:2205.15891
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2303.00177 [cs.LG]
	(or arXiv:2303.00177v1 [cs.LG] for this version)

Submission history

From: Pedro Cisneros-Velarde [view email]
[v1] Wed, 1 Mar 2023 02:09:49 GMT (42kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2303.00177

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation

Submission history