Reinforcement learning for the game of Tetris, based on the least squares policy evaluation (LSPE) method (Nedić, A., Bertsekas, D.P., 2003. Least squares policy evaluation algorithms with linear function approximation. Discrete Event Dynamic Systems 13, 79–110. doi:10.1023/A:1022192903948).
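To give a feel for the cited method, here is a minimal sketch of a batch least squares policy evaluation iteration in pure Python. It is not the code in this repository: it assumes the simplest variant (λ = 0, unit step size), and it uses a two-state toy chain instead of Tetris. The names `solve`, `lspe0`, `features`, and `samples` are all hypothetical. Each pass refits the weights `w` so that `phi(s)·w` matches the one-step target `reward + discount * phi(s_next)·w_old` in the least squares sense.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def lspe0(samples, features, discount, n_iters=60):
    """Batch LSPE sketch (lambda = 0, unit step size) for a fixed policy.

    samples  -- list of (state, reward, next_state) transitions
    features -- hypothetical feature map: state -> list of floats
    """
    k = len(features(samples[0][0]))
    # B = sum over samples of phi(s) phi(s)^T; it does not depend on w.
    B = [[0.0] * k for _ in range(k)]
    for s, _, _ in samples:
        phi = features(s)
        for i in range(k):
            for j in range(k):
                B[i][j] += phi[i] * phi[j]
    w = [0.0] * k
    for _ in range(n_iters):
        # Right-hand side: sum of phi(s) * (reward + discount * phi(s')^T w_old).
        rhs = [0.0] * k
        for s, reward, s_next in samples:
            phi = features(s)
            target = reward + discount * dot(features(s_next), w)
            for i in range(k):
                rhs[i] += phi[i] * target
        w = solve(B, rhs)  # least squares fit to the current targets
    return w


# Toy chain (an assumption for illustration): state 0 -> 1 with reward 1,
# state 1 -> 0 with reward 0, discount factor 0.5.
def features(state):
    return [1.0, 0.0] if state == 0 else [1.0, 1.0]


samples = [(0, 1.0, 1), (1, 0.0, 0)]
w = lspe0(samples, features, discount=0.5)
values = [dot(features(s), w) for s in (0, 1)]
print(values)
```

For this chain the exact values satisfy V(0) = 1 + 0.5·V(1) and V(1) = 0.5·V(0), so V(0) = 4/3 and V(1) = 2/3. Because the two feature vectors span every value function on two states, the fitted values should approach these numbers. For Tetris the feature map would instead summarize the board, and the samples would come from simulated games under the policy being evaluated.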

A demo of the training result is available on YouTube.