Improving Model and Search for Computer Go.

Tristan Cazenave

Improving Model and Search for Computer Go.

2021

Tristan Cazenave

The standard for Deep Reinforcement Learning in games, following Alpha Zero, is to use residual networks and to increase the depth of the network to get better results. We propose to improve mobile networks as an alternative to residual networks and experimentally show the playing strength of the networks according to both their width and their depth. We also propose a generalization of the PUCT search algorithm that improves on PUCT.

Keywords:

Computer science
Reinforcement learning
Residual
Search algorithm
Artificial intelligence
Computer Go

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations