High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
python
machine-learning
reinforcement-learning
deep-learning
deep-reinforcement-learning
pytorch
gym
atari
actor-critic
ale
proximal-policy-optimization
ppo
advantage-actor-critic
a2c
wandb
phasic-policy-gradient
-
Updated
Sep 24, 2024 - Python