REVEAL 2020: Bandit and Reinforcement Learning from User Interactions

Thorsten Joachims,Yves Raimond,Olivier Koch,Maria Dimakopoulou,Flavian Vasile,Adith Swaminathan

REVEAL 2020: Bandit and Reinforcement Learning from User Interactions

2020

Thorsten Joachims
Yves Raimond
Olivier Koch
Maria Dimakopoulou
Flavian Vasile
Adith Swaminathan

The REVEAL workshop1 focuses on framing the recommendation problem as a one of making personalized interventions, e.g. deciding to recommend a particular item to a particular user. Moreover, these interventions sometimes depend on each other, where a stream of interactions occurs between the user and the system, and where each decision to recommend something will have an impact on future steps and long-term rewards. This framing creates a number of challenges we will discuss at the workshop. How can recommender systems be evaluated offline in such a context? How can we learn recommendation policies that are aware of these delayed consequences and outcomes?

Keywords:

Recommender system
Human–computer interaction
Computer science
Reinforcement learning

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations