NNE

Trains a multi-layer perceptron to play Tetris through an evolutionary algorithm (as opposed to Q-learning or gradient descent).

Overview

The neural network is fed the state of the game board (using an encoding where 0 means free block, 1 means falling block, and -1 means fixed block), and it outputs 4 neurons, each one indicating whether or not to exercise each of the game's inputs (i.e., buttons). The net is repeatedly fed the current game state, and the state is advanced according to its outputs until it loses.

Evolution happens through application of gaussian noise to the parameters of the network. An initial population is initialized randomly, an elite is passed on from one generation to the next (to ensure monotony), and the remainder of the population is generated through mutation of the best performing individuals of the previous population (selected at random from the top performers). This simple evolutionary algorithm was taken from Uber's blog cited below, as more sophisticated alternatives proved not to yield significantly better results.

Fitness of any given individual is a function of how many lines get cleared (clearing multiple lines at once rewards more points), and for how long the individual survives (to ensure a smooth fitness function). Encoding more information into the fitness function could yield better results (see Stevens et al below), but that feels like cheating, as ideally we would want the net to learn just from the score it achieved.

The project includes an ad-hoc implementation of Tetris and of multi-layer perceptrons, both using plain Javascript. The set of parameters can be specified when calling evolve (see js/evolution.js:162), it includes parameters such as max training time, topology of the network, mean and standard deviation, elitism size, etc.

Running

Just open index.html in your browser (tested using Firefox), during the evolutionary stage some stats will be printed to the developer console, and after this stage finishes you will be able to see the resulting neural network (the best performer of the last generation) playing on your screen. Once it loses you can press space to restart the game.

TODO

Use convolutional neural networks! Currently, the game board (a matrix) is being serialized, but maybe this is suboptimal.
Implement NEAT, although some results suggest that this might not help all that much.
Implement multi-threading, currently the game state is global and this forces us to do everything in a single thread.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
js		js
README.md		README.md
index.html		index.html
style.css		style.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NNE

Overview

Running

TODO

Bibliography

About

Releases

Packages

Languages

OctavioGalland/tetris-nn

Folders and files

Latest commit

History

Repository files navigation

NNE

Overview

Running

TODO

Bibliography

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages