
PipelineAI: End-to-End ML and AI Platform for Real-time Spark and Tensorflow Data Pipelines

cdemutiis/pipeline

 
 


PipelineAI Open Source

Setup PipelineAI + Local

Setup PipelineAI + AWS (CPU)

Setup PipelineAI + AWS (GPU)

Setup PipelineAI + Google

Setup PipelineAI + Azure

Setup GPUs

Setup AWS + GPU

Setup Google + GPU

PipelineAI Products

Community Edition

Standalone Edition

Enterprise Edition

PipelineAI Workshops

TensorFlow + Spark + GPU

PipelineAI Core Features

Consistent, Immutable, Reproducible Model Runtimes

Consistent Model Environments

Every Model is burned into a separate Docker Image with its appropriate Python, C++, and Java/Scala Runtime Libraries.

We use this same Docker Image from Local Laptop to Production.

Supported Model Types

scikit, tensorflow, python, keras, pmml, spark, java, xgboost, R

More model samples are coming soon (e.g. R).

Nvidia GPU TensorFlow

Spark ML Scikit-Learn

R PMML

Xgboost Ensembles

Pre-Requisites

Docker

Python3 (Conda is Optional)
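Both prerequisites can be checked from Python before installing the CLI. The following is a minimal sketch, not part of PipelineAI; the helper name `check_prerequisites` is hypothetical.

```python
import shutil
import sys


def check_prerequisites() -> dict:
    """Report whether we run on Python 3 and whether `docker` is on PATH."""
    return {
        "python3": sys.version_info.major == 3,
        # shutil.which returns None when the executable cannot be found.
        "docker": shutil.which("docker") is not None,
    }


for name, ok in check_prerequisites().items():
    print(f"{name}: {'found' if ok else 'MISSING'}")
```

This only confirms that a `docker` executable exists; it does not verify that the Docker daemon is running.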

Install PipelineCLI

Note: This command line interface requires Python3 and Docker as detailed above.

pip install cli-pipeline==1.3.0 --ignore-installed --no-cache -U

Verify Successful PipelineCLI Installation

pipeline version

### EXPECTED OUTPUT ###
cli_version: 1.3.0
api_version: v1
capabilities_enabled: ['predict', 'server', 'version']
capabilities_disabled: ['train', 'cluster', 'optimize']
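If you want to check the enabled capabilities from a script, the output above is simple enough to parse. The sketch below assumes the `key: value` layout shown in the expected output; `parse_version_output` is a hypothetical helper, not a PipelineAI API.

```python
import ast


def parse_version_output(text: str) -> dict:
    """Parse `key: value` lines; values written as Python lists become lists."""
    parsed = {}
    for line in text.splitlines():
        if ":" not in line or line.startswith("#"):
            continue  # skip blank lines and banner lines
        key, _, value = line.partition(":")
        value = value.strip()
        parsed[key.strip()] = ast.literal_eval(value) if value.startswith("[") else value
    return parsed


# Sample text copied from the expected output above.
SAMPLE = """\
cli_version: 1.3.0
api_version: v1
capabilities_enabled: ['predict', 'server', 'version']
capabilities_disabled: ['train', 'cluster', 'optimize']
"""

info = parse_version_output(SAMPLE)
print(info["cli_version"], info["capabilities_enabled"])
```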

Review CLI Functionality

pipeline

### EXPECTED OUTPUT ###
Usage:       pipeline                    <-- This CLI Command

(Enterprise) pipeline cluster-describe   <-- Describe Model Cluster
             pipeline cluster-logs       <-- View Cluster Logs 
             pipeline cluster-proxy      <-- Secure Tunnel into Cluster 
             pipeline cluster-quarantine <-- Remove Instance from Cluster for Forensics
             pipeline cluster-rollback   <-- Rollback Cluster
             pipeline cluster-route      <-- Route Traffic across Model Versions (ie. Canary)
             pipeline cluster-scale      <-- Scale Cluster
             pipeline cluster-shell      <-- Shell into Cluster
             pipeline cluster-start      <-- Start Cluster 
             pipeline cluster-stop       <-- Stop Model Cluster
             pipeline cluster-upgrade    <-- Upgrade Cluster

(Standalone) pipeline optimize-model     <-- Optimize Model for Prediction

(Community)  pipeline predict-model      <-- Predict with Model
 
(Community)  pipeline server-build       <-- Build Model Server
             pipeline server-logs        <-- View Server Logs
             pipeline server-shell       <-- Shell into Server
             pipeline server-start       <-- Start Model Server
             pipeline server-stop        <-- Stop Model Server

(Standalone) pipeline train-model        <-- Train Model

(Community)  pipeline version            <-- View CLI Version

Prepare Model Samples

Clone the PipelineAI Models Repo

git clone https://github.com/PipelineAI/predict

Change into predict Directory

cd predict 

Model Predictions

Inspect Model Directory

ls -l ./models/tensorflow/mnist

### EXPECTED OUTPUT ###
pipeline_conda_environment.yml <-- Required.  Sets up the conda environment.
pipeline_install.sh            <-- Optional.  If file exists, we run it.
pipeline_predict.py            <-- Required.  `predict(request: bytes) -> bytes` is required.
versions/                      <-- Optional.  If directory exists, we start TensorFlow Serving.
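The required and optional entries listed above can be checked before building. This is a small sketch under the assumption that the two files marked "Required" are all the build needs; `missing_required` is a hypothetical helper.

```python
import os
import tempfile

# Entries marked "Required" and "Optional" in the listing above.
REQUIRED = ("pipeline_conda_environment.yml", "pipeline_predict.py")
OPTIONAL = ("pipeline_install.sh", "versions")


def missing_required(model_path: str) -> list:
    """Return the required entries that are absent from model_path."""
    present = set(os.listdir(model_path))
    return [name for name in REQUIRED if name not in present]


# Demonstration on a throwaway directory that contains only pipeline_predict.py.
with tempfile.TemporaryDirectory() as tmp:
    open(os.path.join(tmp, "pipeline_predict.py"), "w").close()
    print(missing_required(tmp))
```

For a real check, call `missing_required("./models/tensorflow/mnist")` from the predict directory.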

Inspect ./models/tensorflow/mnist/pipeline_predict.py

cat ./models/tensorflow/mnist/pipeline_predict.py

### EXPECTED OUTPUT ###
import os
import logging
from pipeline_model import TensorFlowServingModel
from pipeline_logger.kafka_handler import KafkaHandler
from pipeline_monitor import prometheus_monitor as monitor
from pipeline_logger import log

...

__all__ = ['predict'] <-- Optional.  Nice to have as a good Python citizen.

...

def _initialize_upon_import() -> TensorFlowServingModel:    <-- Optional.  Called once upon server startup.
    return TensorFlowServingModel(host='localhost',         <-- Optional.  TensorFlow Serving.
                                  port=9000,
                                  model_name='mnist',
                                  inputs_name='inputs',
                                  outputs_name='outputs',
                                  timeout=100)

_model = _initialize_upon_import()  <-- Optional.  Called once upon server startup. 

_labels = {'model_type': os.environ['PIPELINE_MODEL_TYPE'], <-- Optional.  Tag metrics.
           'model_name': os.environ['PIPELINE_MODEL_NAME'],
           'model_tag': os.environ['PIPELINE_MODEL_TAG']}

_logger = logging.getLogger('predict-logger')               <-- Optional.  Standard Python logging.

_logger_kafka_handler = KafkaHandler(host_list='localhost:9092', <-- Optional.  Expose prediction stream.
                                     topic='predictions')

_logger.addHandler(_logger_kafka_handler)

@log(labels=_labels, logger=_logger)                          <-- Optional.  Sample and compare predictions.
def predict(request: bytes) -> bytes:                         <-- Required.  Called on every prediction.

    with monitor(labels=_labels, name="transform_request"):   <-- Optional.  Expose fine-grained metrics.
        transformed_request = _transform_request(request)     <-- Optional.  Transform input (json) into TensorFlow (tensor).

    with monitor(labels=_labels, name="predict"):
        predictions = _model.predict(transformed_request)     <-- Optional.  Call predict() function.

    with monitor(labels=_labels, name="transform_response"):
        return _transform_response(predictions)               <-- Optional.  Transform TensorFlow (tensor) into output (json).
...
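Stripped of logging, monitoring, and TensorFlow Serving, the contract reduces to `predict(request: bytes) -> bytes`. The sketch below illustrates that shape with no PipelineAI dependencies. `_StubModel` is a hypothetical stand-in for a real model, and the `image` request key is an assumption made for illustration.

```python
import json


class _StubModel:
    """Hypothetical stand-in for a real model; always returns a uniform distribution."""

    def predict(self, image: list) -> list:
        return [0.1] * 10


_model = _StubModel()  # created once at import time, as in the sample above


def _transform_request(request: bytes) -> list:
    # Assumption: the request body is JSON with an "image" key.
    return json.loads(request.decode("utf-8"))["image"]


def _transform_response(predictions: list) -> bytes:
    return json.dumps({"outputs": predictions}).encode("utf-8")


def predict(request: bytes) -> bytes:
    """The only required function: bytes in, bytes out."""
    return _transform_response(_model.predict(_transform_request(request)))


print(predict(b'{"image": [0.0, 0.5, 1.0]}'))
```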

Build Example Model into Docker Image

pipeline server-build --model-type=tensorflow --model-name=mnist --model-tag=v1.3.0 --model-path=./models/tensorflow/mnist

model-path must be a relative path.
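The server commands in this guide all share the same three model flags, so they are easy to assemble from a script. The following sketch builds the argument list and rejects absolute model paths; `pipeline_command` is a hypothetical helper, and it uses only the flags that appear in this README.

```python
import os
import shlex


def pipeline_command(action: str, model_type: str, model_name: str,
                     model_tag: str, **extra: str) -> list:
    """Build an argv list such as ['pipeline', 'server-build', '--model-type=...']."""
    model_path = extra.get("model_path")
    if model_path is not None and os.path.isabs(model_path):
        raise ValueError("model-path must be a relative path")
    argv = ["pipeline", action,
            f"--model-type={model_type}",
            f"--model-name={model_name}",
            f"--model-tag={model_tag}"]
    # Extra keyword arguments become flags: model_path -> --model-path
    argv += [f"--{key.replace('_', '-')}={value}" for key, value in extra.items()]
    return argv


build = pipeline_command("server-build", "tensorflow", "mnist", "v1.3.0",
                         model_path="./models/tensorflow/mnist")
print(shlex.join(build))
# To actually run it: subprocess.run(build, check=True)
```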

Start the Model Server

pipeline server-start --model-type=tensorflow --model-name=mnist --model-tag=v1.3.0 --memory-limit=4G

If the port is already allocated, run docker ps, then docker rm -f <container-id>.

Monitor Runtime Logs

Wait for the model runtime to settle...

pipeline server-logs --model-type=tensorflow --model-name=mnist --model-tag=v1.3.0

### EXPECTED OUTPUT ###
...
2017-10-10 03:56:00.695  INFO 121 --- [     run-main-0] i.p.predict.jvm.PredictionServiceMain$   : Started PredictionServiceMain. in 7.566 seconds (JVM running for 20.739)
[debug] 	Thread run-main-0 exited.
[debug] Waiting for thread container-0 to terminate.
kafka is [UP]
...
INFO[0050] Completed initial partial maintenance sweep through 4 in-memory fingerprints in 40.002264633s.  source="storage.go:1398"
...

Press ctrl-c to exit the log view before proceeding.

PipelineAI Prediction CLI

Perform a Single Prediction

The first call takes 10-20x longer than subsequent calls (and may time out, causing a "fallback" message) due to lazy initialization and warm-up.

Try the call again if you see a "fallback" message.
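The retry advice can be automated. The sketch below assumes that a fallback response contains the word "fallback" somewhere in its text, which is an assumption about the message format. `predict_with_retry` is a hypothetical helper that takes any callable returning the response text.

```python
import time
from typing import Callable


def predict_with_retry(send: Callable[[], str], attempts: int = 3,
                       delay_seconds: float = 1.0) -> str:
    """Call send() until the response no longer mentions 'fallback'."""
    response = send()
    for _ in range(attempts - 1):
        if "fallback" not in response.lower():
            break
        time.sleep(delay_seconds)  # give the server time to finish warming up
        response = send()
    return response


# Usage with a fake sender: the first call falls back, the second succeeds.
_responses = iter(["fallback", '{"outputs": [0.1, 0.9]}'])
print(predict_with_retry(lambda: next(_responses), delay_seconds=0))
```

If every attempt falls back, the last fallback response is returned so the caller can inspect it.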

Before proceeding, make sure you hit ctrl-c after viewing the logs in the command above.

pipeline predict-model --model-type=tensorflow --model-name=mnist --model-tag=v1.3.0 --predict-server-url=http://localhost:6969 --test-request-path=./models/tensorflow/mnist/data/test_request.json

### Expected Output ###
{"outputs": [0.0022526539396494627, 2.63791100074684e-10, 0.4638307988643646, 0.21909376978874207, 3.2985670372909226e-07, 0.29357224702835083, 0.00019597385835368186, 5.230629176367074e-05, 0.020996594801545143, 5.426473762781825e-06]}

### Formatted Output ###
Digit  Confidence
=====  ==========
0      0.0022526539396494627
1      2.63791100074684e-10
2      0.4638307988643646      <-- Prediction
3      0.21909376978874207
4      3.2985670372909226e-07
5      0.29357224702835083 
6      0.00019597385835368186
7      5.230629176367074e-05
8      0.020996594801545143
9      5.426473762781825e-06
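The row marked as the prediction is simply the index with the highest confidence. A short sketch of that step, using the JSON from the expected output above:

```python
import json

# Response text copied from the expected output above.
raw = ('{"outputs": [0.0022526539396494627, 2.63791100074684e-10, '
       '0.4638307988643646, 0.21909376978874207, 3.2985670372909226e-07, '
       '0.29357224702835083, 0.00019597385835368186, 5.230629176367074e-05, '
       '0.020996594801545143, 5.426473762781825e-06]}')

outputs = json.loads(raw)["outputs"]

# The list index is the digit; pick the index with the largest confidence.
digit = max(range(len(outputs)), key=outputs.__getitem__)
print(f"Predicted digit: {digit} (confidence {outputs[digit]:.4f})")
# prints "Predicted digit: 2 (confidence 0.4638)"
```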

Perform 100 Predictions in Parallel (Mini Load Test)

pipeline predict-model --model-type=tensorflow --model-name=mnist --model-tag=v1.3.0 --predict-server-url=http://localhost:6969 --test-request-path=./models/tensorflow/mnist/data/test_request.json --test-request-concurrency=100
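To illustrate what issuing 100 requests in parallel involves, here is a minimal thread-pool sketch. It is not how the CLI implements `--test-request-concurrency`; `run_concurrently` is a hypothetical helper, and the sender used here is a stand-in that sleeps instead of calling the server.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List


def run_concurrently(send: Callable[[], str], concurrency: int) -> List[float]:
    """Invoke send() `concurrency` times in parallel; return per-call latencies in seconds."""

    def timed_call(_: int) -> float:
        start = time.perf_counter()
        send()
        return time.perf_counter() - start

    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        return list(pool.map(timed_call, range(concurrency)))


# Stand-in sender: sleeps 10 ms in place of a real prediction request.
latencies = run_concurrently(lambda: time.sleep(0.01) or "ok", concurrency=100)
print(f"{len(latencies)} calls, slowest {max(latencies) * 1000:.1f} ms")
```

Replace the stand-in sender with a function that posts to the prediction endpoint to measure a running server.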

PipelineAI Prediction REST API

Use the REST API to POST a JSON document representing the number 2.

MNIST 2

curl -X POST -H "Content-Type: application/json" \
  -d @./models/tensorflow/mnist/data/test_request.json \
  http://localhost:6969/api/v1/model/predict/tensorflow/mnist/v1.3.0 \
  -w "\n\n"

### Expected Output ###
{"outputs": [0.0022526539396494627, 2.63791100074684e-10, 0.4638307988643646, 0.21909376978874207, 3.2985670372909226e-07, 0.29357224702835083, 0.00019597385835368186, 5.230629176367074e-05, 0.020996594801545143, 5.426473762781825e-06]}

### Formatted Output ###
Digit  Confidence
=====  ==========
0      0.0022526539396494627
1      2.63791100074684e-10
2      0.4638307988643646      <-- Prediction
3      0.21909376978874207
4      3.2985670372909226e-07
5      0.29357224702835083 
6      0.00019597385835368186
7      5.230629176367074e-05
8      0.020996594801545143
9      5.426473762781825e-06
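The same POST can be issued from Python with the standard library. The sketch below only constructs the request, so it runs without a server. `build_predict_request` is a hypothetical helper, and the empty payload is a placeholder to be replaced with the contents of ./models/tensorflow/mnist/data/test_request.json.

```python
import json
import urllib.request


def build_predict_request(base_url: str, model_type: str, model_name: str,
                          model_tag: str, payload: dict) -> urllib.request.Request:
    """Build a JSON POST for /api/v1/model/predict/<type>/<name>/<tag>."""
    url = f"{base_url}/api/v1/model/predict/{model_type}/{model_name}/{model_tag}"
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Placeholder payload; substitute the real test request before sending.
request = build_predict_request("http://localhost:6969", "tensorflow", "mnist",
                                "v1.3.0", {})
print(request.get_method(), request.full_url)
# To send it against a running server:
#     with urllib.request.urlopen(request, timeout=10) as response:
#         print(response.read().decode("utf-8"))
```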

View Real-Time Prediction Stream

Re-run the Prediction REST API while watching the following url:

http://localhost:6969/stream/kafka/

Live Stream Predictions

Monitor Real-Time Prediction Metrics

Re-run the Prediction REST API while watching the following dashboard URL:

http://localhost:6969/hystrix-dashboard/monitor/monitor.html?streams=%5B%7B%22name%22%3A%22%22%2C%22stream%22%3A%22http%3A%2F%2Flocalhost%3A6969%2Fhystrix.stream%22%2C%22auth%22%3A%22%22%2C%22delay%22%3A%22%22%7D%5D
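The `streams` query parameter in this URL is a percent-encoded JSON array. If your server runs on a different host or port, the URL can be rebuilt as sketched below; `hystrix_dashboard_url` is a hypothetical helper.

```python
import json
from urllib.parse import quote


def hystrix_dashboard_url(base_url: str) -> str:
    """Build the dashboard URL with the stream list percent-encoded as JSON."""
    streams = [{"name": "", "stream": f"{base_url}/hystrix.stream",
                "auth": "", "delay": ""}]
    # Compact separators and safe="" reproduce the encoding used in the URL above.
    encoded = quote(json.dumps(streams, separators=(",", ":")), safe="")
    return f"{base_url}/hystrix-dashboard/monitor/monitor.html?streams={encoded}"


print(hystrix_dashboard_url("http://localhost:6969"))
```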

Real-Time Throughput and Response Time

Monitor Detailed Prediction Metrics

Re-run the Prediction REST API while watching the following detailed metrics dashboard URL:

http://localhost:3000/

Prediction Dashboard

Username/Password: admin/admin

Set Type to Prometheus.

Set Url to http://localhost:9090.

Set Access to direct.

Click Save & Test.

Click Dashboards -> Import in the upper-left menu drop-down.

Copy and Paste THIS raw json file into the paste JSON box.

Select the Prometheus-based data source that you set up above and click Import.

Create additional PipelineAI Prediction widgets using THIS guide to the Prometheus Syntax.
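For reference, the data source settings entered in the steps above can be written down as a small JSON document. The sketch below only builds and prints that document. The field names are assumed to match Grafana's data source HTTP API and should be verified against your Grafana version; the `name` value is made up.

```python
import json

# Settings taken from the steps above; "name" is a hypothetical display name.
datasource = {
    "name": "prometheus",
    "type": "prometheus",
    "url": "http://localhost:9090",
    "access": "direct",
}

body = json.dumps(datasource, indent=2)
print(body)
```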

Stop Model Server

pipeline server-stop --model-type=tensorflow --model-name=mnist --model-tag=v1.3.0

Click HERE to compare PipelineAI Products.

Drag N' Drop Model Deploy

PipelineAI Drag n' Drop Model Deploy UI

Generate Optimized Model Versions Upon Upload

Automatic Model Optimization and Native Code Generation

Distributed Model Training and Hyper-Parameter Tuning

PipelineAI Advanced Model Training UI

PipelineAI Advanced Model Training UI 2

Continuously Deploy Models to Clusters of PipelineAI Servers

PipelineAI Weavescope Kubernetes Cluster

Compare Both Offline (Batch) and Real-Time Model Performance

PipelineAI Model Comparison

Compare Response Time, Throughput, and Cost-Per-Prediction

PipelineAI Compare Performance and Cost Per Prediction

Shift Live Traffic to Maximize Revenue and Minimize Cost

PipelineAI Traffic Shift Multi-armed Bandit Maximize Revenue Minimize Cost

Continuously Fix Borderline Predictions through Crowd Sourcing

Borderline Prediction Fixing and Crowd Sourcing
