Reinforcement learning produces dominant strategies for the Iterated Prisoner's Dilemma
2017
We present tournament results and several powerful strategies for the Iterated Prisoner's
Dilemma created using reinforcement learning techniques (evolutionary and particle swarm
algorithms). These strategies are trained to perform well against a corpus of over 170 distinct
opponents, including many well-known and classic strategies. All the trained strategies
win standard tournaments against the total collection of other opponents. The trained strategies
and one particular human made designed strategy are the top performers in noisy tournaments
also.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
54
References
27
Citations
NaN
KQI