DDPG_MountainCar

The mountain car continuous problem from gym was solved using DDPG, with neural networks as function aproximators. The solution is inspired in the DDPG algorithm, but using only low level information as inputs to the net, basically the net uses the position and velocity from the gym environment. The exploration is done by adding Ornstein-Uhlenbeck Noise to the process.

Requirements:

Numpy
Tensorflow
Open AI Gym

How to run

There is a Constant DEVICE = '/cpu:0', you if you have a gpu you can set it to DEVICE = '/gpu:0' and it will use tensorflow for training. To run the algorithm you can do:

python mountain.py

If there is a model saved in the folder it will load and start the training/testing. For testing set episilon = 0.

Sources:

gym Mountain car Continuous
sutton's book.
DDPG Continuous control with deep reinforcement learning
Pemami's blog
Implementation of the Ornstein-Uhlenbeck Noise
Blog about RL
Playing Torch w/ keras Good explanation of how everything works.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
README.md		README.md
actor.py		actor.py
critic.py		critic.py
mountain.py		mountain.py
ou_noise.py		ou_noise.py
replay_buffer.py		replay_buffer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DDPG_MountainCar

Requirements:

How to run

Sources:

About

Releases

Packages

Languages

marianodepaula/DDPG_MountainCar

Folders and files

Latest commit

History

Repository files navigation

DDPG_MountainCar

Requirements:

How to run

Sources:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages