Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma

Marc Harper,Vincent Anthony Knight,Martin Jones,Georgios Koutsovoulos,Nikoleta E. Glynatsi,Owen Campbell

Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma

2017

Marc Harper
Vincent Anthony Knight
Martin Jones
Georgios Koutsovoulos
Nikoleta E. Glynatsi
Owen Campbell

We present tournament results and several powerful strategies for the Iterated Prisoner's Dilemma created using reinforcement learning techniques (evolutionary and particle swarm algorithms). These strategies are trained to perform well against a corpus of over 170 distinct opponents, including many well-known and classic strategies. All the trained strategies win standard tournaments against the total collection of other opponents. The trained strategies and one particular human made designed strategy are the top performers in noisy tournaments also.

Keywords:

Iterated function
Tournament
Reinforcement learning
Dilemma
Machine learning
Prisoner's dilemma
Computer science
Particle swarm optimization
Artificial intelligence
Game theory
Evolutionary algorithm
Hidden Markov model
Artificial neural network

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations