Agent Environment Cycle Games.

Justin K. Terry,Nathaniel Grammel,Benjamin Black,Ananth Hari,Caroline Horsch,Luis Santos

Agent Environment Cycle Games.

2020

Justin K. Terry
Nathaniel Grammel
Benjamin Black
Ananth Hari
Caroline Horsch
Luis Santos

Partially Observable Stochastic Games (POSGs), are the most general model of games used in Multi-Agent Reinforcement Learning (MARL), modeling actions and observations as happening sequentially for all agents. We introduce Agent Environment Cycle Games (AEC Games), a model of games based on sequential agent actions and observations. AEC Games can be thought of as sequential versions of POSGs, and we prove that they are equally powerful. We argue conceptually and through case studies that the AEC games model is useful in important scenarios in MARL for which the POSG model is not well suited. We additionally introduce "cyclically expansive curriculum learning," a new MARL curriculum learning method motivated by the AEC games model. It can be applied "for free," and experimentally we show this technique to achieve up to 35.1% more total reward on average.

Keywords:

Artificial intelligence
Curriculum
Reinforcement learning
Computer science
Expansive
learning methods

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations