Introduction to Intel® LPOT

The Intel® Low Precision Optimization Tool (Intel® LPOT) is an open-source Python library that delivers a unified low-precision inference interface across multiple Intel-optimized Deep Learning (DL) frameworks on both CPUs and GPUs. It supports automatic accuracy-driven tuning strategies, along with additional objectives such as optimizing for performance, model size, and memory footprint. It also provides easy extension capability for new backends, tuning strategies, metrics, and objectives.

Note

GPU support is under development.

Architecture

Intel® LPOT provides an infrastructure and a workflow designed to improve inference performance and speed up deployment across architectures.

Infrastructure

[Figure: Intel® LPOT infrastructure diagram]

Workflow

[Figure: Intel® LPOT workflow diagram]

Supported Frameworks

Supported Intel-optimized DL frameworks are:

TensorFlow*, including 1.15.0 UP2, 1.15.0 UP1, 2.1.0, 2.2.0, 2.3.0, 2.4.0
PyTorch*, including 1.5.0+cpu, 1.6.0+cpu
Apache* MXNet, including 1.6.0, 1.7.0
ONNX* Runtime, including 1.6.0

Visit the Intel® LPOT website at: https://intel.github.io/lpot.

Installation

Select the installation based on your operating system.

Linux Installation

You can install LPOT in one of three ways: install only the LPOT library from binary, install it from source, or get the Intel-optimized frameworks together with the LPOT library by installing the Intel® oneAPI AI Analytics Toolkit.

Option 1 Install from binary

# install from pip
pip install lpot

# install from conda
conda install lpot -c conda-forge -c intel

Option 2 Install from source

git clone https://github.com/intel/lpot.git
cd lpot
pip install -r requirements.txt
python setup.py install

Option 3 Install from AI Kit

The Intel® LPOT library is released as part of the Intel® oneAPI AI Analytics Toolkit (AI Kit). The AI Kit provides a consolidated package of Intel's latest deep learning and machine learning optimizations in one place for ease of development. Along with LPOT, the AI Kit includes Intel-optimized versions of deep learning frameworks (such as TensorFlow and PyTorch) and high-performing Python libraries to streamline end-to-end data science and AI workflows on Intel architectures.

The AI Kit is distributed through many common channels, including Intel's website, YUM, APT, Anaconda, and more. Select and download the AI Kit distribution package that best suits you, then follow the Get Started Guide for post-installation instructions.

Download AI Kit	AI Kit Get Started Guide

Windows Installation

Prerequisites

The following prerequisites and requirements must be satisfied for a successful installation:

Python version: 3.6, 3.7, or 3.8
Download and install Anaconda.

Create a virtual environment named lpot in Anaconda:

# This example installs Python 3.7; Python 3.6 and 3.8 are also supported.
conda create -n lpot python=3.7
conda activate lpot
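To confirm that the activated environment runs one of the Python versions listed in the prerequisites, a small check like the following can help. This is a minimal sketch: the helper name `is_supported_python` and the `SUPPORTED_VERSIONS` set are illustrative and not part of LPOT.

```python
import sys

# Python (major, minor) versions listed in the prerequisites above.
SUPPORTED_VERSIONS = {(3, 6), (3, 7), (3, 8)}


def is_supported_python(version=None):
    """Return True if the given (or running) interpreter version is listed as supported."""
    # Fall back to the running interpreter when no version is passed in.
    major_minor = tuple((version or sys.version_info)[:2])
    return major_minor in SUPPORTED_VERSIONS


if __name__ == "__main__":
    print("Running Python", ".".join(str(part) for part in sys.version_info[:3]))
    print("Listed as supported:", is_supported_python())
```

Run it inside the activated lpot environment; a result of False suggests the environment was created with a different Python version.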

Installation options

Option 1 Install from binary

# install from pip
pip install lpot

# install from conda
conda install lpot -c conda-forge -c intel

Option 2 Install from source

git clone https://github.com/intel/lpot.git
cd lpot
pip install -r requirements.txt
python setup.py install
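After any of the installation options above, on either operating system, you may want to confirm that the package is visible to the active Python environment. The sketch below uses only the standard library and assumes the import name matches the package name `lpot`; the helper `is_installed` is illustrative.

```python
import importlib.util


def is_installed(package_name):
    """Return True if a top-level package can be located without importing it."""
    return importlib.util.find_spec(package_name) is not None


if __name__ == "__main__":
    # Assumption: the library is importable under the name "lpot".
    if is_installed("lpot"):
        print("lpot is available in this environment.")
    else:
        print("lpot was not found; check that the right environment is active.")
```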

Documentation

Get Started

APIs explains Intel® Low Precision Optimization Tool's API.
Transform introduces how to utilize LPOT's built-in data processing and how to develop a custom data processing method.
Dataset introduces how to utilize LPOT's built-in dataset and how to develop a custom dataset.
Metric introduces how to utilize LPOT's built-in metrics and how to develop a custom metric.
Tutorial provides comprehensive instructions on how to utilize LPOT's features with examples.
Examples are provided to demonstrate the usage of LPOT in different frameworks: TensorFlow, PyTorch, MXNet, and ONNX Runtime.
UX is a web-based system used to simplify LPOT usage.
Intel oneAPI AI Analytics Toolkit Get Started Guide explains the AI Kit components, installation and configuration guides, and instructions for building and running sample apps.
AI and Analytics Samples includes code samples for Intel oneAPI libraries.

Deep Dive

Quantization refers to processes that enable inference and training by performing computations with low-precision data types, such as fixed-point integers. LPOT supports Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT). Note that Dynamic Quantization currently has limited support.
Pruning provides a common method for introducing sparsity in weights and activations.
Benchmarking introduces how to utilize the benchmark interface of LPOT.
Mixed precision introduces how to enable mixed precision, including BF16, INT8, and FP32, on Intel platforms during tuning.
Graph Optimization introduces how to enable graph optimization for FP32 and auto-mixed precision.
TensorBoard provides tensor histograms and execution graphs for tuning debugging purposes.
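To make the idea of computing with fixed-point integers concrete, the following is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python. It illustrates the general technique only; it is not LPOT's implementation, and the function names and sample weights are illustrative assumptions.

```python
from typing import List, Tuple


def quantize_int8(values: List[float]) -> Tuple[List[int], float]:
    """Map floats to signed 8-bit integers using one shared scale (symmetric scheme)."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # The scale maps the largest magnitude onto 127; use 1.0 for all-zero input.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer and clamp to the symmetric INT8 range.
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize_int8(quantized: List[int], scale: float) -> List[float]:
    """Recover approximate float values from the integers and the scale."""
    return [q * scale for q in quantized]


# Illustrative FP32 weights (made-up values).
weights = [0.02, -1.27, 0.635, 1.0]
q_weights, scale = quantize_int8(weights)
restored = dequantize_int8(q_weights, scale)

# The round-trip error per value is bounded by about half of the scale.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print("scale:", scale)
print("quantized:", q_weights)
print("max round-trip error:", max_error)
```

The round-trip error is what a tuning process has to keep small enough that model accuracy stays within the chosen criterion.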

Advanced Topics

Adaptor is the interface between LPOT and a framework. This document describes how to develop an adaptor extension, using ONNX Runtime as an example.
Strategy describes how LPOT automatically optimizes low-precision recipes for deep learning models to meet objectives such as inference performance and memory usage within the expected accuracy criteria. It also explains how to develop a new strategy.
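The general shape of accuracy-driven tuning can be sketched as a loop that tries candidate low-precision configurations until one meets the accuracy criterion. This is a simplified illustration under stated assumptions, not LPOT's strategy code: the `tune` function, the candidate names, and the accuracy numbers are all hypothetical.

```python
from typing import Callable, Dict, Iterable, Optional, Tuple

Config = Dict[str, str]


def tune(
    candidates: Iterable[Config],
    evaluate: Callable[[Config], float],
    baseline_accuracy: float,
    max_relative_loss: float = 0.01,
) -> Optional[Tuple[Config, float]]:
    """Return the first candidate whose relative accuracy loss is within tolerance."""
    for config in candidates:
        accuracy = evaluate(config)
        relative_loss = (baseline_accuracy - accuracy) / baseline_accuracy
        if relative_loss <= max_relative_loss:
            return config, accuracy
    # No candidate met the criterion.
    return None


# Hypothetical candidates, ordered from most to least aggressive.
candidates = [
    {"name": "all_int8"},
    {"name": "keep_first_layer_fp32"},
    {"name": "keep_first_and_last_layer_fp32"},
]

# Made-up accuracies standing in for a real evaluation run.
toy_accuracy = {
    "all_int8": 0.700,
    "keep_first_layer_fp32": 0.762,
    "keep_first_and_last_layer_fp32": 0.764,
}

result = tune(candidates, lambda cfg: toy_accuracy[cfg["name"]], baseline_accuracy=0.765)
print(result)
```

Ordering candidates from most to least aggressive means the loop stops at the first configuration that is accurate enough, which tends to favor the objective (for example, performance) while still honoring the accuracy criterion.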

System Requirements

Intel® Low Precision Optimization Tool supports systems based on Intel 64 architecture or compatible processors, specially optimized for the following CPUs:

Intel Xeon Scalable processors (formerly Skylake, Cascade Lake, and Cooper Lake)
Future Intel Xeon Scalable processors (code name Sapphire Rapids)

Intel® Low Precision Optimization Tool requires installing the pertinent Intel-optimized framework version for TensorFlow, PyTorch, and MXNet.

Validated Hardware/Software Environment

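Platform	OS	Python	Framework	Version
Cascade Lake, Cooper Lake, Skylake	CentOS 7.8, Ubuntu 18.04	3.6, 3.7, 3.8	TensorFlow	2.4.0, 2.3.0, 2.2.0, 2.1.0, 1.15.2, 1.15.0 UP2, 1.15.0 UP1
Cascade Lake, Cooper Lake, Skylake	CentOS 7.8, Ubuntu 18.04	3.6, 3.7, 3.8	PyTorch	1.5.0+cpu, 1.6.0+cpu, IPEX
Cascade Lake, Cooper Lake, Skylake	CentOS 7.8, Ubuntu 18.04	3.6, 3.7, 3.8	MXNet	1.7.0, 1.6.0
Cascade Lake, Cooper Lake, Skylake	CentOS 7.8, Ubuntu 18.04	3.6, 3.7, 3.8	ONNX Runtime	1.6.0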

Validated Models

Intel® Low Precision Optimization Tool provides numerous examples that show minimal accuracy loss together with a substantial performance gain. A full list of quantized models on the various frameworks is available in the Model List.

Framework	Version	Model	Dataset	INT8 Tuning Accuracy	FP32 Accuracy Baseline	Accuracy Ratio [(INT8-FP32)/FP32]	Performance Speedup: Realtime Latency Ratio [FP32/INT8]
tensorflow	2.4.0	resnet50v1.5	ImageNet	76.70%	76.50%	0.26%	3.23x
tensorflow	2.4.0	Resnet101	ImageNet	77.20%	76.40%	1.05%	2.42x
tensorflow	2.4.0	inception_v1	ImageNet	70.10%	69.70%	0.57%	1.88x
tensorflow	2.4.0	inception_v2	ImageNet	74.10%	74.00%	0.14%	1.96x
tensorflow	2.4.0	inception_v3	ImageNet	77.20%	76.70%	0.65%	2.36x
tensorflow	2.4.0	inception_v4	ImageNet	80.00%	80.30%	-0.37%	2.59x
tensorflow	2.4.0	inception_resnet_v2	ImageNet	80.10%	80.40%	-0.37%	1.97x
tensorflow	2.4.0	Mobilenetv1	ImageNet	71.10%	71.00%	0.14%	2.88x
tensorflow	2.4.0	ssd_resnet50_v1	Coco	37.90%	38.00%	-0.26%	2.97x
tensorflow	2.4.0	mask_rcnn_inception_v2	Coco	28.90%	29.10%	-0.69%	2.66x
tensorflow	2.4.0	vgg16	ImageNet	72.50%	70.90%	2.26%	3.75x
tensorflow	2.4.0	vgg19	ImageNet	72.40%	71.00%	1.97%	3.79x

Framework	Version	Model	Dataset	INT8 Tuning Accuracy	FP32 Accuracy Baseline	Accuracy Ratio [(INT8-FP32)/FP32]	Performance Speedup: Realtime Latency Ratio [FP32/INT8]
pytorch	1.5.0+cpu	resnet50	ImageNet	75.96%	76.13%	-0.23%	2.63x
pytorch	1.5.0+cpu	resnext101_32x8d	ImageNet	79.12%	79.31%	-0.24%	2.61x
pytorch	1.6.0a0+24aac32	bert_base_mrpc	MRPC	88.90%	88.73%	0.19%	1.98x
pytorch	1.6.0a0+24aac32	bert_base_cola	COLA	59.06%	58.84%	0.37%	2.19x
pytorch	1.6.0a0+24aac32	bert_base_sts-b	STS-B	88.40%	89.27%	-0.97%	2.28x
pytorch	1.6.0a0+24aac32	bert_base_sst-2	SST-2	91.51%	91.86%	-0.37%	2.30x
pytorch	1.6.0a0+24aac32	bert_base_rte	RTE	69.31%	69.68%	-0.52%	2.15x
pytorch	1.6.0a0+24aac32	bert_large_mrpc	MRPC	87.45%	88.33%	-0.99%	2.73x
pytorch	1.6.0a0+24aac32	bert_large_squad	SQUAD	92.85%	93.05%	-0.21%	2.01x
pytorch	1.6.0a0+24aac32	bert_large_qnli	QNLI	91.20%	91.82%	-0.68%	2.69x
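The two derived columns in the tables above follow the formulas given in their headers. As a worked example, the sketch below recomputes the accuracy ratio for two TensorFlow rows from their published accuracies. The function names are illustrative, and the latency values are hypothetical because the tables report only the ratio. Recomputing from the rounded percentages can differ from a published ratio in the last digit for some rows.

```python
def accuracy_ratio(int8_accuracy: float, fp32_accuracy: float) -> float:
    """Relative accuracy change: (INT8 - FP32) / FP32."""
    return (int8_accuracy - fp32_accuracy) / fp32_accuracy


def latency_ratio(fp32_latency: float, int8_latency: float) -> float:
    """Speedup expressed as FP32 latency divided by INT8 latency."""
    return fp32_latency / int8_latency


# resnet50v1.5 on TensorFlow 2.4.0: INT8 76.70%, FP32 76.50%.
print(f"{accuracy_ratio(76.70, 76.50):.2%}")  # prints 0.26%

# Resnet101 on TensorFlow 2.4.0: INT8 77.20%, FP32 76.40%.
print(f"{accuracy_ratio(77.20, 76.40):.2%}")  # prints 1.05%

# Hypothetical latencies in milliseconds, chosen only to show the formula.
print(f"{latency_ratio(20.0, 10.0):.2f}x")  # prints 2.00x
```

A positive accuracy ratio means the INT8 model scored slightly higher than the FP32 baseline; a latency ratio above 1 means the INT8 model is faster.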
