DreamerV3

A TensorFlow/Keras implementation of the "DreamerV3" model-based reinforcement learning (RL) algorithm by Hafner et al. (2023).

For algorithm details, see the paper: https://arxiv.org/pdf/2301.04104v1.pdf

Results

Setup

# Install Atari env tools (the quotes keep shells such as zsh from
# expanding the square brackets):
pip install "gymnasium[atari]" gym==0.26.2 supersuit tinyscaler
# Clone this very repo.
git clone https://github.com/sven1977/dreamer_v3
cd dreamer_v3
# We use local imports, so make sure this directory is in your PYTHONPATH.
export PYTHONPATH="$(pwd)"

# Sorry, one more ugly hack: SuperSuit does not yet seem to work with the
# new gymnasium vector envs. Hence, you will have to change one of the
# SuperSuit source files. First, find out where SuperSuit is installed:
pip show supersuit
# You should see something like:
# Name: SuperSuit
# Version: 3.7.1
# ..
# Location: [some path P ...]

# Edit the source file:
vim [some path P ...]/supersuit/lambda_wrappers/observation_lambda.py
# Scroll down to `class gym_observation_lambda` and simplify its `reset()`
# method to:
# def reset(self, seed=None, return_info=False, options=None):
#     observation, info = self.env.reset(
#         seed=seed, options=options
#     )
#     observation = self._modify_observation(observation)
#     return observation, info
# Save your changes and exit vim.

# Run the Atari example.
python run_experiment.py -c examples/atari_100k.yaml --env ALE/Pong-v5
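
The SuperSuit step above asks you to read the install location from `pip show supersuit` and build the file path by hand. As an optional convenience, the lookup can be sketched in Python. This is a minimal sketch, not part of this repo: the helper name `find_package_file` is hypothetical, and it assumes SuperSuit is installed as a regular package directory with the file layout given in the step above.

```python
import importlib.util
from pathlib import Path
from typing import Optional


def find_package_file(package: str, relative_path: str) -> Optional[Path]:
    """Return the path of `relative_path` inside an installed package.

    Hypothetical helper (not part of this repo or of SuperSuit).
    Returns None if the package is not installed, is not a package
    directory, or does not contain the requested file.
    """
    # find_spec() locates a top-level package without importing it.
    spec = importlib.util.find_spec(package)
    if spec is None or not spec.submodule_search_locations:
        return None
    for location in spec.submodule_search_locations:
        candidate = Path(location) / relative_path
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    # Relative path taken from the setup step above.
    path = find_package_file(
        "supersuit", "lambda_wrappers/observation_lambda.py"
    )
    print(path if path else "supersuit (or the expected file) was not found")
```

If a path is printed, open that file in your editor and apply the `reset()` change described above. If nothing is found, fall back to the `pip show supersuit` output.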
