muzero
Here are 40 public repositories matching this topic...
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
-
Updated
Dec 24, 2024 - Python
A structured implementation of MuZero
-
Updated
Jun 4, 2022 - Python
A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.
-
Updated
Mar 28, 2021 - Jupyter Notebook
datasets for computer go
-
Updated
Jun 12, 2024 - C++
A simple implementation of MuZero algorithm for connect4 game
-
Updated
Aug 11, 2020 - Jupyter Notebook
MiniZero: An AlphaZero and MuZero Training Framework
-
Updated
Dec 17, 2024 - C++
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
-
Updated
Oct 20, 2023 - Python
An implementation of MuZero in JAX.
-
Updated
Nov 8, 2022 - Python
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
-
Updated
Sep 19, 2022 - Python
A C++ pytorch implementation of MuZero
-
Updated
May 1, 2024 - C++
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
-
Updated
Jul 18, 2023 - Python
A PyTorch implementation of DeepMind's MuZero agent
-
Updated
Dec 1, 2023 - Python
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
-
Updated
Feb 1, 2023 - Python
Generalized AI to perform a multitude of tasks written in python3
-
Updated
Oct 24, 2023 - Jupyter Notebook
Pytorch Implementation of MuZero for gym environment. It support any Discrete , Box and Box2D configuration for the action space and observation space.
-
Updated
Jan 24, 2023 - Python
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
-
Updated
Mar 18, 2024 - Jupyter Notebook
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
-
Updated
Oct 15, 2024 - Python
Improve this page
Add a description, image, and links to the muzero topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the muzero topic, visit your repo's landing page and select "manage topics."