2048 Environment and DQN Algorithm implementation

Thanks to the author of gym-2048 (https://github.com/rgal/gym-2048), whose code is easy to understand and runs efficiently. I made a few small changes to make it a better RL environment, and I implemented DQN with several tricks using PyTorch.

Performance of environment

I evaluated a random policy over 1000 episodes to measure the environment's performance. The random policy can also serve as a baseline.
The main evaluation function is in base_agent.py.
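
A random-policy baseline of this kind can be sketched as follows. This is a minimal illustration and not the code in base_agent.py: `evaluate_random_policy` and the stand-in `CoinFlipEnv` are hypothetical names, and the environment is only assumed to expose a Gym-style `reset()` / `step(action)` interface that returns `(observation, reward, done, info)`.

```python
import random
import time


class CoinFlipEnv:
    """Hypothetical stand-in environment so the sketch runs on its own.

    Each step gives reward 1 and ends the episode with probability 0.1.
    """

    n_actions = 4  # 2048 has four moves: up, down, left, right

    def __init__(self, seed=0):
        self.rng = random.Random(seed)

    def reset(self):
        return 0

    def step(self, action):
        done = self.rng.random() < 0.1
        return 0, 1.0, done, {}


def evaluate_random_policy(env, episodes=1000, seed=0):
    """Run a uniformly random policy and return average per-episode statistics."""
    rng = random.Random(seed)
    total_steps, total_score = 0, 0.0
    start = time.perf_counter()
    for _ in range(episodes):
        env.reset()
        done = False
        while not done:
            action = rng.randrange(env.n_actions)
            _, reward, done, _ = env.step(action)
            total_score += reward
            total_steps += 1
    elapsed = time.perf_counter() - start
    return {
        "average episode time (s)": elapsed / episodes,
        "average step time (ms)": 1000.0 * elapsed / total_steps,
        "average total score": total_score / episodes,
        "average steps": total_steps / episodes,
    }


stats = evaluate_random_policy(CoinFlipEnv(), episodes=1000)
```

Swapping the stand-in for the real 2048 environment should only require that it offers the same assumed interface; statistics such as the highest score would need to be read from the environment's own state.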

(1) with rendering:

average episode time: 0.10279795455932617 s;
average step time: 0.7373 ms;
average highest score: 106.368;
average total score: 1078.252;
average steps: 139.417;

(2) without rendering:

average episode time: 0.03773710775375366 s;
average step time: 0.2671 ms;
average highest score: 108.24;
average total score: 1102.088;
average steps: 141.288;


Performance of Priority DQN

After training for 45k episodes, the maximum mean evaluation score is 7700 (each evaluation averages over 50 episodes).
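
Assuming "Priority DQN" here refers to DQN with prioritized experience replay, the sampling step can be sketched as below. This is a minimal, list-based illustration of proportional prioritization, not the implementation in Buffer_module.py; the function name `sample_prioritized` and the `alpha` and `beta` defaults are assumptions chosen for the example.

```python
import random


def sample_prioritized(priorities, batch_size, alpha=0.6, beta=0.4, rng=random):
    """Sample indices with probability proportional to priority**alpha.

    Returns the sampled indices and importance-sampling weights, normalised
    so that the largest weight in the batch is 1. Priorities must be > 0.
    """
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    probs = [s / total for s in scaled]
    indices = rng.choices(range(len(priorities)), weights=probs, k=batch_size)
    n = len(priorities)
    # Rarely sampled transitions get larger weights to correct the sampling bias.
    weights = [(n * probs[i]) ** (-beta) for i in indices]
    max_w = max(weights)
    return indices, [w / max_w for w in weights]


# Transition 2 has a much larger priority, so it is the most likely to be drawn.
indices, weights = sample_prioritized([0.1, 0.1, 5.0, 0.1], batch_size=8,
                                      rng=random.Random(0))
```

A production buffer would usually keep priorities in a sum tree so that sampling costs O(log n) instead of O(n); the list version is only meant to show the arithmetic.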

Update

Added a maximum number of steps and a maximum number of illegal steps per episode;
Added the DQN agent and training information;
Fixed a bug in the Double Q trick (the issue raised by mythsman).
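
For reference, the Double Q trick computes the bootstrap target by letting the online network choose the next action and the target network evaluate it. The sketch below shows that rule for a single transition in plain Python; it is not the code in dqn_agent.py and does not reproduce the bug that was fixed. The function name `double_q_target` and its parameters are hypothetical.

```python
def double_q_target(reward, done, next_q_online, next_q_target, gamma=0.99):
    """Double DQN target for one transition.

    The online network selects the next action; the target network
    evaluates it. Plain DQN would instead use max(next_q_target).
    """
    if done:
        # No bootstrapping from a terminal state.
        return reward
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    return reward + gamma * next_q_target[best_action]


# The online network prefers action 1, so the target network's value for
# action 1 (2.0) is used, not the target network's own maximum (5.0).
target = double_q_target(reward=1.0, done=False,
                         next_q_online=[0.5, 3.0, 1.0, 0.2],
                         next_q_target=[5.0, 2.0, 0.0, 1.0],
                         gamma=0.9)
```

Decoupling selection from evaluation in this way is what reduces the overestimation that a single max over noisy Q-values tends to produce.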

Repository: YangRui2015/2048_env

Folders and files

Latest commit

History

Repository files navigation

2048 Environment and DQN Algorithm implementation

Performance of environment

(1) with rendering:

(2) without rendering:

Performance of Priority DQN

Update

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages