This repository solves the inverted pendulum problem from classical control theory with a Deep Deterministic Policy Gradient (DDPG) model.
The model was trained on OpenAI's Pendulum-v0.
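For orientation, the reward that Pendulum-v0 returns can be sketched in plain Python. The environment observes (cos θ, sin θ, θ̇), accepts a torque clipped to [-2, 2], and penalizes the angle from upright, the angular velocity, and the applied torque. The function and constant names below are illustrative and are not taken from this repository.

```python
import math

MAX_TORQUE = 2.0  # Pendulum-v0 clips the applied torque to [-2, 2]


def angle_normalize(theta):
    """Wrap an angle in radians into the interval [-pi, pi)."""
    return ((theta + math.pi) % (2 * math.pi)) - math.pi


def pendulum_reward(theta, theta_dot, torque):
    """Reward for one step: the negative of a quadratic cost.

    theta is measured from the upright position, so the best possible
    reward (zero) is reached when the pendulum is upright, at rest,
    and no torque is applied.
    """
    torque = max(-MAX_TORQUE, min(MAX_TORQUE, torque))
    cost = (
        angle_normalize(theta) ** 2
        + 0.1 * theta_dot ** 2
        + 0.001 * torque ** 2
    )
    return -cost
```

Because the reward is never positive, episode returns that move closer to zero indicate that the agent is holding the pendulum upright with less effort.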
To change the model's parameters, edit the global variables in the train script (train_ddpg.py).
Note: the commands below are for Ubuntu 18.04.
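As a rough illustration of what such global variables typically control in a DDPG setup, here is a minimal sketch. All names and values below (GAMMA, TAU, ACTOR_LR, CRITIC_LR, BATCH_SIZE, soft_update, td_target) are assumptions made for this example; check train_ddpg.py for the actual ones.

```python
# Hypothetical hyperparameter globals; the real names and values
# are defined in train_ddpg.py and may differ.
GAMMA = 0.99       # discount factor for future rewards
TAU = 0.005        # soft-update rate for the target networks
ACTOR_LR = 1e-4    # learning rate of the actor (policy) network
CRITIC_LR = 1e-3   # learning rate of the critic (Q-value) network
BATCH_SIZE = 64    # transitions sampled from the replay buffer per step


def soft_update(target_weights, online_weights, tau=TAU):
    """Move each target weight a fraction tau toward its online weight.

    Implements target <- tau * online + (1 - tau) * target, the slow
    target-network tracking that DDPG uses to stabilize training.
    """
    return [
        tau * w + (1.0 - tau) * t
        for t, w in zip(target_weights, online_weights)
    ]


def td_target(reward, next_q, done, gamma=GAMMA):
    """Critic regression target: r + gamma * Q'(s', mu'(s')).

    The bootstrap term is dropped when the episode has terminated.
    """
    return reward + gamma * (1.0 - float(done)) * next_q
```

A smaller TAU makes the target networks change more slowly, which tends to stabilize the critic at the cost of slower learning; GAMMA close to 1 makes the agent weigh long-term reward more heavily.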
- Clone the repository
git clone
- Go to source root directory
cd pendulum_problem
- Run train script
python3 train_ddpg.py
The folder (old)pendulum_problem1.x contains an outdated implementation for TensorFlow 1.8.0.
Note: the code was tested with
- tensorflow == 2.0
- numpy == 1.18.1
- gym == 0.17.1