This project applies machine learning to play the competitive multiplayer game Battlesnake.
NOTE: This project is still in development and is not yet optimized for competitive play.
Start by cloning the repository and opening the project in a DevContainer. This will ensure that you have all the necessary dependencies installed.
Try running scripts/ppo.py to train a model. Point a TensorBoard server at the TensorBoard log directory to monitor training progress. Refer to the stable-baselines3 documentation for more information about the PPO algorithm and the TensorBoard log format.
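For orientation, PPO training with TensorBoard logging in stable-baselines3 generally follows the pattern below. This is a generic sketch with a placeholder environment id and log directory, not the actual contents of scripts/ppo.py.

```python
# Generic stable-baselines3 PPO training sketch with TensorBoard logging.
# The environment id and log directory are placeholders, not the project's setup.
from stable_baselines3 import PPO

model = PPO(
    "MlpPolicy",
    "CartPole-v1",                    # stand-in for the Battlesnake training environment
    verbose=1,
    tensorboard_log="./tensorboard",  # directory to point the TensorBoard server at
)
model.learn(total_timesteps=100_000)

# Then monitor progress with: tensorboard --logdir ./tensorboard
```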
- bin: compiled executables and libraries, such as game engines
- ml_battlesnake: project package
- common: common code for all package modules
- deployment: utilities and application code for Battlesnake services and their deployment; will support development and production deployment of the ML Battlesnake once it is ready
- learning: training and evaluation of machine learning snakes
- scripts: scripts for running the project, including local deployment and training
- tests: unit tests for the project
- Create a multi-agent training environment with simultaneous actions and observations
- Wrap official Battlesnake game engine in a Python API for use in training environment
- Adapt Stable Baselines3 PPO algorithm to work with the environment
- Add observation preprocessing options to the environment
- Add reward shaping to the environment
- Add support for parallel environments to speed up data collection and reduce correlation between samples (see the sketch after this list)
- Add scripts and utilities for local deployment of the Battlesnake game engine, browser spectator, and snake servers
- Checkpoint and restore training progress
- Restart training from the last checkpoint if resource limits are exceeded
- Experiment training a model to play the game using self-play
- Reached ~120 mean episode length (game length) for each of the core game modes (solo, duel, and standard) before plateauing. Performance is still suboptimal in terms of observed snake strategy and mean rewards.
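The parallel-environment and checkpointing items above correspond to standard stable-baselines3 building blocks. The sketch below shows the generic single-agent pattern with a placeholder environment; the project's adapted multi-agent Battlesnake setup is more involved, so treat this only as an illustration of the underlying mechanism.

```python
# Sketch of parallel environments plus periodic checkpointing in stable-baselines3.
# Placeholder environment; the project's Battlesnake wrapper would be used instead.
from stable_baselines3 import PPO
from stable_baselines3.common.callbacks import CheckpointCallback
from stable_baselines3.common.env_util import make_vec_env
from stable_baselines3.common.vec_env import SubprocVecEnv

if __name__ == "__main__":
    # Several environment copies run in parallel processes, which speeds up data
    # collection and reduces correlation between the samples in each rollout.
    vec_env = make_vec_env("CartPole-v1", n_envs=8, vec_env_cls=SubprocVecEnv)

    # Write a checkpoint periodically so training can resume after an interruption
    # or after hitting a resource limit.
    checkpoint_cb = CheckpointCallback(
        save_freq=10_000, save_path="./checkpoints", name_prefix="ppo_snake"
    )

    model = PPO("MlpPolicy", vec_env, verbose=1)
    model.learn(total_timesteps=1_000_000, callback=checkpoint_cb)

    # Resuming from the latest checkpoint looks roughly like:
    # model = PPO.load("./checkpoints/ppo_snake_<steps>_steps", env=vec_env)
    # model.learn(total_timesteps=1_000_000, reset_num_timesteps=False)
```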
Plateauing during training is most likely due to the observation preprocessing and the network architecture: observations are currently encoded as a single-channel (grayscale) image, which the actor/critic networks in their current form cannot extract meaningful features from. The first few items below should address these issues and allow training to progress past the plateau; the remaining items should further improve training efficiency and performance.
- Swap the grayscale image encoding in range [0, 255] for a binary encoding with multiple channels, where each channel represents a different feature of the game state (see the sketch after this list)
- Add action masking to the environment to prevent snakes from moving into walls or other snakes, allowing the network to focus on learning less trivial behaviours
- Add a CNN for feature extraction
- Shape the actor/critic networks to suit the CNN feature extractor
- Tune hyperparameters for training
- Train 4+ snakes that imitate the best-performing snakes on the global leaderboard, for use as training opponents; this should mitigate the ray interference that the self-play training method could emphasize
- Add a new environment and matchmaking system for pitting the snake being trained against the imitation snakes (and potentially against its own previous versions)
- Train at scale
- Deploy for competitive play on the global leaderboard
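As a rough illustration of the multi-channel encoding and CNN feature-extraction items (the sketch referenced in the list above), the example below combines a binary, multi-plane board observation with a small custom stable-baselines3 feature extractor. The channel layout, board size, and layer sizes are illustrative assumptions, not decisions made in this project.

```python
# Hypothetical sketch: binary multi-channel board encoding fed through a small CNN
# feature extractor for stable-baselines3. Channel layout, board size, and layer
# sizes are illustrative assumptions, not project decisions.
import gymnasium as gym
import numpy as np
import torch
import torch.nn as nn
from stable_baselines3.common.torch_layers import BaseFeaturesExtractor

BOARD_SIZE = 11
# One binary plane per feature of the game state, for example:
# 0: own head, 1: own body, 2: enemy heads, 3: enemy bodies, 4: food
N_CHANNELS = 5

observation_space = gym.spaces.Box(
    low=0, high=1, shape=(N_CHANNELS, BOARD_SIZE, BOARD_SIZE), dtype=np.uint8
)

class SnakeCNN(BaseFeaturesExtractor):
    """Maps the stacked binary planes to a flat feature vector for the policy."""

    def __init__(self, observation_space: gym.spaces.Box, features_dim: int = 256):
        super().__init__(observation_space, features_dim)
        n_channels = observation_space.shape[0]
        self.cnn = nn.Sequential(
            nn.Conv2d(n_channels, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Flatten(),
        )
        # Infer the flattened size from a sample observation.
        with torch.no_grad():
            sample = torch.as_tensor(observation_space.sample()[None]).float()
            n_flat = self.cnn(sample).shape[1]
        self.linear = nn.Sequential(nn.Linear(n_flat, features_dim), nn.ReLU())

    def forward(self, observations: torch.Tensor) -> torch.Tensor:
        return self.linear(self.cnn(observations))

# The extractor would plug into PPO via policy_kwargs, and the actor/critic heads
# can be shaped around it, e.g.:
# policy_kwargs = dict(
#     features_extractor_class=SnakeCNN,
#     features_extractor_kwargs=dict(features_dim=256),
#     net_arch=dict(pi=[128], vf=[128]),
# )
```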