An environment of the board game Go using OpenAI's Gym API
-
Updated
May 3, 2022 - Python
An environment of the board game Go using OpenAI's Gym API
Applying DeepMind's MuZero algorithm to the cart pole environment in gym
The board game Go implemented in JAX for fast game processing and machine learning training.
Reinforcement learning algorithm that blends the N-th order Markov property with abstract MDPs, PPO, and a hybrid model-free/model-based approach.
Add a description, image, and links to the mu-zero topic page so that developers can more easily learn about it.
To associate your repository with the mu-zero topic, visit your repo's landing page and select "manage topics."