kaesve / muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train both algorithms, pit them against each other, and investigate the reliability of learned MuZero MDP models.

reinforcement-learning deep-learning tensorflow deep-reinforcement-learning tf2 mcts alphazero tensorflow2 muzero

Updated Mar 28, 2021
Jupyter Notebook

yenw / computer-go-dataset

Datasets for computer Go

go sgf alphago computer-go tygem computer-go-dataset fineart alphazero minigo phoenixgo leelazero muzero golaxy elf-opengo

Updated Jun 12, 2024
C++

Zeta36 / muzero

A simple implementation of the MuZero algorithm for the game Connect 4

python jupyter-notebook pytorch deepmind muzero

Updated Aug 11, 2020
Jupyter Notebook

rlglab / minizero

MiniZero: An AlphaZero and MuZero Training Framework

go hex reinforcement-learning deep-reinforcement-learning mcts othello gomoku tictactoe atari monte-carlo-tree-search nogo board-games alphazero muzero gumbel-alphazero gumbel-muzero outer-open-gomoku killall-go

Updated Dec 17, 2024
C++

DHDev0 / Stochastic-muzero

PyTorch implementation of Stochastic MuZero for gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

machine-learning deep-reinforcement-learning pytorch transformer lstm rl monte-carlo-tree-search multilayer-perceptron gym-environments muzero arxiv-papers offline-reinforcement-learning resnetv2 online-reinforcement-learning stochastic-muzero muzero-stochastic

Updated Oct 20, 2023
Python

Hwhitetooth / jax_muzero

An implementation of MuZero in JAX.

reinforcement-learning deep-learning deep-reinforcement-learning jax model-based-reinforcement-learning muzero

Updated Nov 8, 2022
Python

hr0nix / omega

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

reinforcement-learning nethack mcts flax model-based-rl jax model-based-reinforcement-learning muzero minihack rlax

Updated Sep 19, 2022
Python

tuero / muzero-cpp

A C++ PyTorch implementation of MuZero

machine-learning reinforcement-learning cpp pytorch mcts alphazero libtorch muzero

Updated May 1, 2024
C++

sail-sg / rosmo

Code for "Efficient Offline Policy Optimization with a Learned Model" (ICLR 2023)

reinforcement-learning atari arcade-learning-environment model-based-rl jax model-based-reinforcement-learning bsuite muzero dm-haiku offline-rl offline-reinforcement-learning rl-unplugged muzero-unplugged

Updated Jul 18, 2023
Python

michaelnny / muzero

A PyTorch implementation of DeepMind's MuZero agent

reinforcement-learning pytorch model-based-rl alphazero muzero

Updated Dec 1, 2023
Python

DHDev0 / Muzero-unplugged

PyTorch implementation of MuZero Unplugged for gym environments. The algorithm supports a wide range of action and observation spaces, both discrete and continuous.

machine-learning reinforcement-learning deep-learning neural-network deep-reinforcement-learning python3 pytorch transformer lstm gym rl arxiv monte-carlo-tree-search gym-environments muzero arxiv-papers resnetv2 resnetv1 muzero-unplugged

Updated Feb 1, 2023
Python

bellerb / chappie.ai

Generalized AI, written in Python 3, for performing a multitude of tasks

ai ml python3 pytorch transformer mcts attention-mechanism chess-ai muzero perceiver perceiverio

Updated Oct 24, 2023
Jupyter Notebook

DHDev0 / Muzero

PyTorch implementation of MuZero for gym environments. It supports any Discrete, Box, and Box2D configuration for the action and observation spaces.

machine-learning reinforcement-learning deep-learning neural-network deep-reinforcement-learning python3 pytorch transformer lstm gym rl arxiv monte-carlo-tree-search gym-environments muzero arxiv-papers resnetv2 resnetv1

Updated Jan 24, 2023
Python

Itomigna2 / Muesli-lunarlander

PyTorch implementation of the Muesli RL algorithm for LunarLander-v2

reinforcement-learning deep-learning colab muesli model-based-rl lunarlander-v2 muzero

Updated Mar 18, 2024
Jupyter Notebook

rystrauss / dopamax

Reinforcement learning in pure JAX.

reinforcement-learning dqn mcts ddpg sac ppo podracer alphazero jax td3 muzero brax anakin dopamax

Updated Dec 19, 2024
Python

jianzhnie / RLZero

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

reinforcement-learning multi-agent mcts alpha-zero self-play muzero

Updated Oct 15, 2024
Python

muzero

Here are 40 public repositories matching this topic...

werner-duvaud / muzero-general

opendilab / LightZero

huawei-noah / xingtian

johan-gras / MuZero

kaesve / muzero

yenw / computer-go-dataset

Zeta36 / muzero

rlglab / minizero

DHDev0 / Stochastic-muzero

Hwhitetooth / jax_muzero

hr0nix / omega

tuero / muzero-cpp

sail-sg / rosmo

michaelnny / muzero

DHDev0 / Muzero-unplugged

bellerb / chappie.ai

DHDev0 / Muzero

Itomigna2 / Muesli-lunarlander

rystrauss / dopamax

jianzhnie / RLZero
