An Online Learning Approach to a Multi-player N-armed Functional Bandit

Sam O’Neill,Ovidiu Bagdasar,Antonio Liotta

An Online Learning Approach to a Multi-player N-armed Functional Bandit

2020

Sam O’Neill
Ovidiu Bagdasar
Antonio Liotta

Congestion games possess the property of emitting at least one pure Nash equilibrium and have a rich history of practical use in transport modelling. In this paper we approach the problem of modelling equilibrium within congestion games using a decentralised multi-player probabilistic approach via stochastic bandit feedback. Restricting the strategies available to players under the assumption of bounded rationality, we explore an online multiplayer exponential weights algorithm for unweighted atomic routing games and compare this with a \(\epsilon \)-greedy algorithm.

Keywords:

Artificial intelligence
Computer science
online learning
Mathematical optimization
Multi-armed bandit
Nash equilibrium
Bounded rationality
Probabilistic logic
Exponential function

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations