This repository contains modular implementations of Deep Reinforcement Learning algorithms.
- Banana Collector: DQN agent collects yellow bananas while avoiding blue bananas.
- Reacher: DDPG agent controls double-jointed arm to reach ball.
- Tennis: MADDPG agents control rackets to play Tennis.
To set up your python environment to run the code in this repository, follow the instructions below.
-
Install conda
-
Create and activate a new environment with Python 3.6.
- Linux or Mac:
conda create --name drlnd python=3.6 source activate drlnd
- Windows:
conda create --name drlnd python=3.6 activate drlnd
-
Follow the instructions in this repository to perform a minimal install of OpenAI gym.
-
Clone the repository and install several dependencies.
git clone https://github.com/Zulkhuu/reinforcement-learning.git
cd python
pip install .
- Create an IPython kernel for the
drlnd
environment.
python -m ipykernel install --user --name drlnd --display-name "drlnd"
- Before running code in a notebook, change the kernel to match the
drlnd
environment by using the drop-downKernel
menu.
- Collaboration and Competition - Train a pair of agents to play soccer
- UC Berkeley's Deep RL Bootcamp
- UC Berkeley's Deep Reinforcement Learning course
- David Silver's course on reinforcement learning
- DeepMind's Advanced Deep Learning & Reinforcement Learning course
- Udacity Deep Reinforcement Learning Nanodegree program
- Ian Goodfellow: The Deep Learning textbook
- Sutton & Barto: Reinforcement Learning: An Introduction
- Miguel Morales: Grokking Deep Reinforcement Learning
- Human-level control through deep reinforcement learning
- Deep Reinforcement Learning with Double Q-learning
- Dueling Network Architectures for Deep Reinforcement Learning
- Prioritized Experience Replay
- Noisy Networks for Exploration
- Rainbow: Combining Improvements in Deep Reinforcement Learning
- Distributional Reinforcement Learning with Quantile Regression
- Neural Episodic Control
- Deterministic Policy Gradient Algorithms
- Asynchronous Methods for Deep Reinforcement Learning
- Trust Region Policy Optimization
- Proximal Policy Optimization Algorithms
- Continuous control with Deep Reinforcement Learning
- Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments