Position-Based Multiple-Play Bandits with Thompson Sampling.

Camille-Sovanneary Gauthier,Romaric Gaudel,Elisa Fromont

Position-Based Multiple-Play Bandits with Thompson Sampling.

2020

Camille-Sovanneary Gauthier
Romaric Gaudel
Elisa Fromont

Multiple-play bandits aim at displaying relevant items at relevant positions on a web page. We introduce a new bandit-based algorithm, PB-MHB, for online recommender systems which uses the Thompson sampling framework. This algorithm handles a display setting governed by the position-based model. Our sampling method does not require as input the probability of a user to look at a given position in the web page which is, in practice, very difficult to obtain. Experiments on simulated and real datasets show that our method, with fewer prior information, deliver better recommendations than state-of-the-art algorithms.

Keywords:

Web page
Recommender system
prior information
Thompson sampling
Computer science
Sampling (statistics)
Information retrieval

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations