GPFQ #666
Conversation
src/brevitas_examples/imagenet_classification/ptq/ptq_common.py
src/brevitas_examples/imagenet_classification/ptq/ptq_evaluate.py
We should potentially account for updating float layers (e.g., the last unquantized layer) based on the activation quantization error at their input.
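A minimal sketch of one way this could look, assuming a bias-correction style update in which the float layer's bias absorbs the mean error introduced by quantizing its input activations; the function name and calibration tensors are hypothetical and not part of this PR:

```python
import torch


@torch.no_grad()
def correct_float_layer_bias(layer: torch.nn.Linear,
                             float_inputs: torch.Tensor,
                             quant_inputs: torch.Tensor) -> None:
    """Absorb the mean input-quantization error of a float layer into its bias.

    With quantized inputs the layer computes W @ x_q + b instead of W @ x + b,
    so the output error is W @ (x - x_q); adding its mean to the bias removes
    the systematic part of that error.

    float_inputs / quant_inputs: calibration activations of shape [N, in_features].
    """
    mean_err = (float_inputs - quant_inputs).mean(dim=0)   # [in_features]
    correction = layer.weight @ mean_err                    # [out_features]
    if layer.bias is None:
        layer.bias = torch.nn.Parameter(torch.zeros(layer.out_features))
    layer.bias.add_(correction)
```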
For act_order, we could consider weight x activation magnitude as the ordering criterion.
We need to define an order for quantizing the input channels. When multiplying weight x activation, the input channel is the inner dimension of the matmul, which means that we "lose" that information.
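A minimal sketch of the ordering criterion being discussed, assuming the weight magnitude is reduced over the output-channel dimension and combined with the per-input-channel mean absolute activation before the matmul contracts that dimension away; the function name and reduction choices are hypothetical:

```python
import torch


def act_order_indices(weight: torch.Tensor,
                      calib_acts: torch.Tensor) -> torch.Tensor:
    """Order input channels by a weight x activation magnitude score.

    weight:     [out_channels, in_channels]
    calib_acts: [num_samples, in_channels] calibration activations.

    The matmul contracts the input-channel dimension, so the score must be
    computed per input channel *before* taking the product.
    """
    w_mag = weight.abs().sum(dim=0)        # [in_channels]
    a_mag = calib_acts.abs().mean(dim=0)   # [in_channels]
    score = w_mag * a_mag                  # per-input-channel importance
    return torch.argsort(score, descending=True)
```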
No description provided.