Reinforcement learning for tetris game based on least squares policy evaluation method (Nedić, A., Bertsekas, D.P., 2003. Least squares policy evaluation algorithms with linear function approximation. Discrete Event Dynamic Systems 13, 79–110. doi:10.1023/A:1022192903948).
A demo of the training result is available at youtube.