Photonic decision-making for arbitrary-number-armed bandit problem utilizing parallel chaos generation.

Jiafa Peng,Ning Jiang,Anke Zhao,Shiqin Liu,Yiqun Zhang,Kun Qiu,Qianwu Zhang

Photonic decision-making for arbitrary-number-armed bandit problem utilizing parallel chaos generation.

2021

In this paper, we propose and experimentally demonstrate a novel scheme that helps to solve an any-number-armed bandit problem by utilizing two parallel simultaneously-generated chaotic signals and the epsilon (ɛ)-greedy strategy. In the proposed scheme, two chaotic signals are experimentally generated, and then processed by an 8-bit analog-to-digital conversion (ADC) with 4 least significant bits (LSBs), to generate two amplitude-distribution-uniform sequences for decision-making. The correspondence between these two random sequences and different arms is established by a mapping rule designed in virtue of the ɛ-greedy-strategy. Based on this, decision-making for an exemplary 5-armed bandit problem is successfully performed, and moreover, the influences of the mapping rule and unknown reward probabilities on the correction decision rate (CDR) performance for the 4-armed to 7-armed bandit problems are investigated. This work provides a novel way for solving the arbitrary-number-armed bandit problem.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations