Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
machine-learning
deep-reinforcement-learning
pytorch
transformer
lstm
rl
monte-carlo-tree-search
multilayer-perceptron
gym-environments
muzero
arxiv-papers
offline-reinforcement-learning
resnetv2
online-reinforcement-learning
stochastic-muzero
muzero-stochastic
-
Updated
Oct 20, 2023 - Python