PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
-
Updated
Mar 24, 2023 - Python
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Code and other material for the book "Deep Learning and the Game of Go"
BetaGo: AlphaGo for the masses, live on GitHub.
A student implementation of Alpha Go Zero
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
A Go playing program implemented in Tensorflow roughly according to the architecture of AlphaGo. Current strength is 3~4 amateur dan.
A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games
AlphaGo-Zero-Gobang 是一个基于强化学习的五子棋(Gobang)模型,主要用以了解AlphaGo Zero的运行原理的Demo,即神经网络是如何指导MCTS做出决策的,以及如何自我对弈学习。源码+教程
Reinforcing Your Learning of Reinforcement Learning
Computer go engine using Monte-Carlo Tree Search written in Python3.
基於深度學習的 GTP 圍棋(围棋)引擎,KGS 指引文件以及演算法教學。
alphaGo版本的五子棋(gobang, gomoku)
A very fast implementation of AlphaZero, applied to games like Splendor, Santorini, The Little Prince, … Browser version available
基于miniGo的幻影围棋AI,2019中国计算机博弈大赛幻影围棋组冠军;AI of Phantom Go based on miniGo
[ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"
Add a description, image, and links to the alphago topic page so that developers can more easily learn about it.
To associate your repository with the alphago topic, visit your repo's landing page and select "manage topics."