drq_jax

JAX implementation of Data-regularized Q (DrQ)

(This is my course project for a reinforcement learning lecture :)

python drq.py cfg=walker_walk train_seed=0

Running the code requires ≈38 GB GPU memory.

Because I have access to large-memory GPUs, I did not implement a memory-efficient replay buffer for image observations.

That is left for future work (next time for sure!)
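
For anyone who wants to try this before then, here is a minimal sketch of one common way to shrink an image replay buffer: keep frames as `uint8` (1 byte per pixel) and cast to `float32` only for the sampled batch, which makes frame storage 4x smaller than a `float32` buffer. This is not code from this repo. The class name `FrameReplayBuffer`, its fields, and the division by 255 in `sample` are assumptions for illustration, and the buffer lives in host memory as NumPy arrays.

```python
import numpy as np


class FrameReplayBuffer:
    """Hypothetical ring buffer that stores image observations as uint8."""

    def __init__(self, capacity, obs_shape, action_dim):
        self.capacity = capacity
        # uint8 frames use 1 byte per pixel instead of 4 for float32.
        self.obs = np.zeros((capacity, *obs_shape), dtype=np.uint8)
        self.next_obs = np.zeros((capacity, *obs_shape), dtype=np.uint8)
        self.actions = np.zeros((capacity, action_dim), dtype=np.float32)
        self.rewards = np.zeros((capacity,), dtype=np.float32)
        self.dones = np.zeros((capacity,), dtype=np.bool_)
        self.idx = 0   # next slot to write
        self.size = 0  # number of valid transitions

    def add(self, obs, action, reward, next_obs, done):
        self.obs[self.idx] = obs
        self.next_obs[self.idx] = next_obs
        self.actions[self.idx] = action
        self.rewards[self.idx] = reward
        self.dones[self.idx] = done
        # Overwrite the oldest transition once the buffer is full.
        self.idx = (self.idx + 1) % self.capacity
        self.size = min(self.size + 1, self.capacity)

    def sample(self, rng, batch_size):
        ids = rng.integers(0, self.size, size=batch_size)
        # Cast to float32 only for the sampled batch (assumed [0, 1] scaling).
        return {
            "obs": self.obs[ids].astype(np.float32) / 255.0,
            "next_obs": self.next_obs[ids].astype(np.float32) / 255.0,
            "actions": self.actions[ids],
            "rewards": self.rewards[ids],
            "dones": self.dones[ids],
        }
```

A batch is drawn with `buf.sample(np.random.default_rng(0), batch_size)` and can then be moved to the GPU for the update step. Storing `next_obs` separately still duplicates most frames; storing each frame once and indexing the following slot would save more, at the cost of extra bookkeeping at episode boundaries.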
