Successive Halving Top-k Operator.

Michał Pietruszka, Lukasz Borchmann, Filip Graliński

2020

We propose a differentiable successive halving method for relaxing the top-k operator, which makes gradient-based optimization possible. Tournament-style selection avoids the need to perform softmax iteratively on the entire vector of scores. As a result, the method achieves a much better approximation of top-k at a lower computational cost than the previous approach.
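The tournament idea can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`soft_pair`, `successive_halving_topk`), the `temperature` parameter, the strongest-versus-weakest pairing rule, and the requirement that the input length equal k times a power of two are all assumptions made for illustration. Each round pairs candidates, blends every pair with a two-way softmax over their scores, and halves the pool until k candidates remain, so no softmax is ever taken over the full score vector.

```python
import math


def soft_pair(score_a, score_b, temperature=1.0):
    """Two-way softmax weights for a single match (numerically stabilised)."""
    m = max(score_a, score_b)
    ea = math.exp((score_a - m) / temperature)
    eb = math.exp((score_b - m) / temperature)
    total = ea + eb
    return ea / total, eb / total


def successive_halving_topk(scores, values, k, temperature=1.0):
    """Soft top-k by repeated pairwise tournaments (illustrative sketch).

    Assumption: len(scores) == k * 2**r for some integer r >= 0, so that
    every round halves the pool exactly. Returns k (score, value) pairs.
    """
    if len(scores) != len(values):
        raise ValueError("scores and values must have the same length")
    n = len(scores)
    if k < 1 or n % k != 0 or ((n // k) & (n // k - 1)) != 0:
        raise ValueError("len(scores) must equal k times a power of two")

    items = list(zip(scores, values))
    while len(items) > k:
        # Assumed pairing rule: strongest remaining versus weakest remaining.
        items.sort(key=lambda sv: sv[0], reverse=True)
        half = len(items) // 2
        next_round = []
        for i in range(half):
            (sa, va) = items[i]
            (sb, vb) = items[len(items) - 1 - i]
            wa, wb = soft_pair(sa, sb, temperature)
            # The "winner" is a convex blend, which keeps the step smooth
            # in the scores instead of making a hard selection.
            next_round.append((wa * sa + wb * sb, wa * va + wb * vb))
        items = next_round
    return items
```

With a small `temperature` the blend in each match approaches a hard selection of the higher-scoring candidate, while a large `temperature` averages the pair. Written with an automatic differentiation library in place of `math`, the same operations would let gradients flow back to the scores.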

Keywords: Algebra, Operator (computer programming), Computer science
