Skip to content

Latest commit

 

History

History
26 lines (20 loc) · 977 Bytes

README.md

File metadata and controls

26 lines (20 loc) · 977 Bytes

tensowflow_ddpg

Deep Deterministic Policy Gradient

DDPG method is RL method that utilizes DPG Theorem, that can handle continuous action space. This implementation depends on Tensowflow and this RL model is experimented on aigym. Following papers are referred when implementing this model.

How to use

git clone https://github.com/sykwer/tensorflow_ddpg.git
cd tensowflow_ddpg
python main.py

Experiment

Experiment on InvertedPendulum-v2

InvertedPendulum-v2_experiment (episode - total reward)

Resources

If you can conprehend Japanese article, refer to following articles! I have written several articles to understand DDPG theorem.