Bandit algorithms for the multiple-play recommendation

Jonathan Louëdec,Max Chevalier,Aurélien Garivier,Josiane Mothe

Bandit algorithms for the multiple-play recommendation

2015

Jonathan Louëdec
Max Chevalier
Aurélien Garivier
Josiane Mothe

The multiple-play recommender systems (RS) are RS which recommend several items to the users. RS are based on learning models in order to choose the items to recommend. Among these models, the bandit algorithms offer the advantage to learn and exploite the learnt elements at the same time. Current approaches require running as many instances of a bandit algorithm as there are items to recommend. As opposed to that, we handle all recommendations simultaneously, by a single instance of a bandit algorithm. We show on two benchmark datasets (Movielens and Jester) that our method, MPB (Multiple Plays Bandit), obtains a learning rate about thirteen times faster while obtaining equivalent click-through rates. We also show that the choice of the bandit algorithm used impacts the level of improvement.

Keywords:

Recommender system
MovieLens
Algorithm
Computer science

Correction
Cite
Save
Machine Reading By IdeaReader

References

Citations