DQN

The DQN algorithm used for solving Gym's cartpole environment.
I changed the reward function:

reward = np.cos(2*next_state[3])

Requirements

There is a constant:

DEVICE = '/gpu:0'

Set it to '/cpu:0' if you don't have one.

And then run as:

$ python main.py

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
q_network.py		q_network.py
replay_buffer.py		replay_buffer.py