SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions

2018 
Although many reinforcement learning methods have been proposed for learning optimal solutions in single-agent continuous-action domains, multiagent coordination domains with continuous actions have received relatively little attention. In this paper, we propose a hierarchical method for independent learners, named Sample Continuous Coordination with recursive Frequency Maximum Q-Value (SCC-rFMQ), which divides the coordination problem into two layers. The first layer samples a finite action set from the continuous action space using a sampling mechanism with a variable exploratory rate, and the second layer evaluates the actions in the sampled set and updates the policy using a multiagent reinforcement learning coordination method. By building coordination mechanisms into both layers, SCC-rFMQ handles coordination problems in continuous-action cooperative Markov games effectively. Experimental results show that SCC-rFMQ outperforms other reinforcement learning algorithms on these coordination problems.
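To make the two-layer structure concrete, below is a minimal sketch in Python for two independent learners on a toy cooperative payoff. It is not the paper's algorithm: the payoff function, all constants, the shrinking-radius resampling rule, and names such as `resample`, `evaluate`, and `update` are illustrative assumptions; only the overall shape (layer 1 samples a finite action set with a variable exploratory rate, layer 2 applies an FMQ-style optimistic evaluation and a recursive frequency update) follows the abstract.

```python
import random

N_SAMPLES = 8        # size of each agent's sampled finite action set (layer 1)
ALPHA = 0.1          # Q-value learning rate (layer 2)
ALPHA_F = 0.1        # recursive update rate for the max-reward frequency
C = 10.0             # weight of the optimistic FMQ term
EPSILON = 0.1        # epsilon-greedy exploration over the sampled set
RESAMPLE_EVERY = 200 # how often layer 1 redraws the action set
EPISODES = 2000

def payoff(a1, a2):
    # Stand-in cooperative reward (not from the paper):
    # maximized when both agents pick 0.8.
    return -((a1 - 0.8) ** 2 + (a2 - 0.8) ** 2)

class Agent:
    def __init__(self):
        self.center = 0.5   # current best-guess action
        self.radius = 0.5   # variable exploratory rate (layer 1)
        self.resample()

    def resample(self):
        # Layer 1: draw a finite action set around the current best action,
        # clipped to [0, 1]; the radius controls how exploratory the draw is.
        self.actions = [min(1.0, max(0.0, random.gauss(self.center, self.radius)))
                        for _ in range(N_SAMPLES)]
        self.q = {a: 0.0 for a in self.actions}
        self.qmax = {a: float("-inf") for a in self.actions}
        self.freq = {a: 0.0 for a in self.actions}

    def evaluate(self, a):
        # FMQ-style evaluation: ordinary Q plus an optimistic term weighting
        # the best reward seen for `a` by how frequently it recurs.
        qmax = self.qmax[a] if self.qmax[a] > float("-inf") else 0.0
        return self.q[a] + C * self.freq[a] * qmax

    def act(self):
        if random.random() < EPSILON:
            return random.choice(self.actions)
        return max(self.actions, key=self.evaluate)

    def update(self, a, r):
        # Layer 2: independent-learner update with a recursively
        # maintained frequency of receiving the maximum reward.
        self.q[a] += ALPHA * (r - self.q[a])
        if r > self.qmax[a]:
            self.qmax[a], self.freq[a] = r, 1.0
        else:
            hit = 1.0 if abs(r - self.qmax[a]) < 1e-9 else 0.0
            self.freq[a] += ALPHA_F * (hit - self.freq[a])

agents = [Agent(), Agent()]
for t in range(1, EPISODES + 1):
    a1, a2 = agents[0].act(), agents[1].act()
    r = payoff(a1, a2)
    agents[0].update(a1, r)
    agents[1].update(a2, r)
    if t % RESAMPLE_EVERY == 0:
        for ag in agents:
            ag.center = max(ag.actions, key=ag.evaluate)
            ag.radius *= 0.8  # simplistic schedule; the paper varies this rate
            ag.resample()

print("final best actions:",
      [max(ag.actions, key=ag.evaluate) for ag in agents])
```

Run as-is, both agents typically converge toward the joint optimum near 0.8; in the paper's setting the exploratory rate is adapted rather than monotonically shrunk, which this sketch does not reproduce.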