Deep Mean Field Games for Learning Optimal Behavior Policy of Large Populations

Jiachen Yang, Xiaojing Ye, Rakshit Trivedi, Huan Xu, Hongyuan Zha

2018

We consider the problem of representing a large population's behavior policy that drives the evolution of the population distribution over a discrete state space. We motivate a discrete-time mean field game (MFG) as an interpretable, game-theoretic model for understanding the aggregate effect of individual actions and predicting the temporal evolution of population distributions. We synthesize MFG and Markov decision processes (MDP) by showing that a special MFG is reducible to an MDP. This broadens the scope of mean field game theory and allows us to infer MFG models of large real-world systems via deep inverse reinforcement learning. Our method learns both the reward function and the forward dynamics of an MFG from real data, and we report the first empirical test of a mean field game model of a real-world social media population.
