Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees.

Panageas, Ioannis; Skoulakis, Efstratios Panteleimon; Viano, Luca; Wang, Xiao; Cevher, Volkan

Panageas, Ioannis; Skoulakis, Efstratios Panteleimon; Viano, Luca; Wang, Xiao; Cevher, Volkan

2023

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Résumé

In this work, we introduce a new variant of online gradient descent, which provably converges to Nash Equilibria and simultaneously attains sublinear regret for the class of congestion games in the semi-bandit feedback setting. Our proposed method admits convergence rates depending only polynomially on the number of players and the number of facilities, but not on the size of the action set, which can be exponentially large in terms of the number of facilities. Moreover, the running time of our method has polynomial-time dependence on the implicit description of the game. As a result, our work answers an open question from (Cui et al., 2022).

Détails

Titre Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees.

Auteur(s) Panageas, Ioannis ; Skoulakis, Efstratios Panteleimon ; Viano, Luca ; Wang, Xiao ; Cevher, Volkan

Pagination 27

Présenté à 40th International Conference on Machine Learning (ICML), Honolulu, Hawaii, USA, July, 23-29, 2023

Date 2023

Mots-clés (libres)

ML-AI

Laboratoires LIONS

Le document apparaît dans Production scientifique et compétences > STI - Faculté des sciences et techniques de l'ingénieur > IEM - Institute of Electrical and Micro Engineering > LIONS - Laboratoire de systèmes d'information et d'inférence
Publications validées par des pairs
Papiers de conférence
Travail produit à l'EPFL

Date de création de la notice 2023-09-28

Files

Résumé

Détails

PDF