This repo contains the code associated with a series of blog posts about deploying machine learning models to live services using ONNX, TensorRT, Triton, gRPC, cats-effect, fs2 and other technologies/tools. An overview of the series can be found here.
The code was written and tested on an Ubuntu 22.04 machine with an Nvidia RTX 3060, as well as an Ubuntu 22.04 machine with an Nvidia RTX 3080, both running Nvidia driver version 535.129.03, CUDA compilation tools V12.2.140, and Docker version 24.0.7. You likely do not need these exact versions of the Nvidia driver and CUDA. Try to run the code, and if you run into problems, see the Nvidia installation instructions below.
The code was written using Scala 3.3+, sbt 1.9.7, Python 3.10+ and the Poetry dependency manager for Python. Instructions for installing Scala, sbt, Python and Poetry are easy to find online.
Expand for Nvidia installation details
In my experience, installing Nvidia tools can be tedious and error prone. Make sure to read all the documentation in each link so you know what you are doing. Download CUDA 12.2 using this link.
Select the options in the following order: Linux > x86_64 > Ubuntu > 22.04 > deb (network),
or adjust the selections to match your system. For me, the generated instructions look like this:
$ wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
$ sudo dpkg -i cuda-keyring_1.1-1_all.deb
$ sudo apt-get update
$ sudo apt-get -y install cuda
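After the install completes and you have rebooted, a quick sanity check (not part of Nvidia's generated instructions, just a common verification step) is to confirm that the driver is loaded:
$ nvidia-smi
It should print a table listing your GPU along with the driver and CUDA versions; if it does not, the driver install did not take.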
To install cuDNN you will need an Nvidia sign-in. It is still free; they just make you sign in.
Go to this link to download the current version of cuDNN. If you need an older version, visit this link. Select Local Installer for Ubuntu 22.04 x86_64 (Deb).
After that, follow the instructions linked here.
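For reference, the local deb install typically boils down to something like the following. The exact filename and package names depend on the cuDNN release you downloaded, so treat this as a rough sketch rather than copy-paste instructions:
$ sudo dpkg -i cudnn-local-repo-ubuntu2204-<version>_amd64.deb   # hypothetical filename; use the one you downloaded
$ sudo cp /var/cudnn-local-repo-*/cudnn-local-*-keyring.gpg /usr/share/keyrings/
$ sudo apt-get update
$ sudo apt-get install libcudnn8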
After rebooting your system run the command nvcc -V
and you should get output specifying your CUDA version. If nvcc -V
doesn't work, you might need to add export PATH="/usr/local/cuda/bin:$PATH"
to your ~/.bashrc; a quick way to do that is sketched below.
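Assuming CUDA was installed to the default /usr/local/cuda location, the following appends the export to your shell config and reloads it:
$ echo 'export PATH="/usr/local/cuda/bin:$PATH"' >> ~/.bashrc
$ source ~/.bashrc
$ nvcc -V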
If something goes wrong and you need to start from scratch, follow the instructions in this link to uninstall the Nvidia packages.
Finally, install Docker-Nvidia (the Nvidia Container Toolkit) using these instructions. A quick way to verify it is shown below.
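To check that Docker can see the GPU, you can run a throwaway CUDA container. The image tag here is just one that matches CUDA 12.2 on Ubuntu 22.04; any recent nvidia/cuda tag should work:
$ docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi
If this prints the same GPU table as running nvidia-smi on the host, the container toolkit is working.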
Installation assumes you have the Poetry build tool installed on your system. Poetry is not required to create a Python environment, but if you choose not to use it you will have to modify the instructions accordingly.
Simply run the following commands to install the dependencies specified in the pyproject.toml
file and to activate the created environment:
poetry install
poetry shell
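To confirm the environment was created and is active, a couple of standard Poetry commands are handy:
$ poetry env info
$ poetry run python --version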
Compile the project with the following command:
sbt compile
If there are no error messages, the installation was successful.
The easiest way to deploy the system locally is with Docker/Docker Compose.
Before you do that, though, you will need to configure the docker-compose.yaml
file by adding your client key and client secret. These are managed through GitHub (or whichever OAuth provider you implement) and are necessary for login.
Below is a snippet of the YAML file which shows the values that need to be configured.
services:
  ...
  server:
    image: mattlangsenkamp/scalamachinelearningdeployment:latest
    ports:
      - "8080:8080"
    depends_on:
      - grpctriton
    environment:
      - SERVER_HOST=0.0.0.0
      - TRITON_HOST=grpctriton
      - KEY=<add GITHUB_CLIENT_KEY here>
      - SECRET=<add GITHUB_CLIENT_SECRET>
      - LABELS_DIR=/labels.json
  ...
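If you would rather not hardcode the secrets in docker-compose.yaml, Docker Compose also supports variable substitution from a .env file placed next to the compose file. This is an optional variation on the snippet above; the variable names are only illustrative:
# .env (keep this file out of version control)
GITHUB_CLIENT_KEY=<your client key>
GITHUB_CLIENT_SECRET=<your client secret>
and in docker-compose.yaml:
    environment:
      - SERVER_HOST=0.0.0.0
      - TRITON_HOST=grpctriton
      - KEY=${GITHUB_CLIENT_KEY}
      - SECRET=${GITHUB_CLIENT_SECRET}
      - LABELS_DIR=/labels.json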
Once docker-compose.yaml
has been configured, simply run
docker compose up
and navigate to localhost:5713.
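If you prefer to run the stack in the background, the standard Compose commands for that are:
$ docker compose up -d           # start all services detached
$ docker compose logs -f server  # follow the server logs
$ docker compose down            # stop and remove the containers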