Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit.

Nikolai Karpov,Qin Zhang

Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit.

2020

Nikolai Karpov
Qin Zhang

Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. However, it is a bit surprising that pure exploration, a basic problem in bandit theory, has not been well studied in MNL-bandit so far. In this paper we give efficient algorithms for pure exploration in MNL-bandit. Our algorithms achieve instance-sensitive pull complexities. We also complement the upper bounds by an almost matching lower bound.

Keywords:

Algorithm
online learning
Online advertising
Multinomial logistic regression
efficient algorithm
Fast fashion
Computer science
Upper and lower bounds

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations