mu-zero

Here are 4 public repositories matching this topic...

An environment of the board game Go using OpenAI's Gym API

Applying DeepMind's MuZero algorithm to the cart pole environment in gym

The board game Go implemented in JAX for fast game processing and machine learning training.

Reinforcement learning algorithm that blends the N-th order Markov property with abstract MDPs, PPO, and a hybrid model-free/model-based approach.

Add a description, image, and links to the mu-zero topic page so that developers can more easily learn about it.

To associate your repository with the mu-zero topic, visit your repo's landing page and select "manage topics."