Deprecate QOp Export #834

Open
Giuseppe5 opened this issue Feb 8, 2024 · 4 comments · Fixed by #917

Comments

Giuseppe5 (Collaborator) commented Feb 8, 2024

Although we will keep the interface for layer-wise export handlers, we will be deprecating support for QOp in favour of QCDQ.
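For reference, a rough sketch of what the QCDQ export flow looks like (the function name export_onnx_qcdq follows the Brevitas docs at the time of writing; exact signatures and defaults may differ between versions):

    import torch
    from brevitas.nn import QuantConv2d
    from brevitas.export import export_onnx_qcdq

    # Toy single-layer quantized model using Brevitas defaults.
    model = QuantConv2d(3, 16, kernel_size=3, weight_bit_width=8)
    model.eval()

    # Export with the QCDQ representation: the resulting ONNX graph contains
    # standard float ops wrapped in Quantize/Clip/DeQuantize nodes.
    export_onnx_qcdq(model, args=torch.randn(1, 3, 32, 32), export_path="model_qcdq.onnx")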

Giuseppe5 changed the title from "Deprecatie QOp ONNX Export" to "Deprecate QOp ONNX Export" on Feb 8, 2024
Giuseppe5 changed the title from "Deprecate QOp ONNX Export" to "Deprecate QOp Export" on Feb 12, 2024

Barrot commented Feb 21, 2024

What is the reason for deprecation?

Giuseppe5 (Collaborator, Author) commented

Generally, QCDQ is much easier to use given its flexibility, whereas the ONNX and Torch QOp exports impose several constraints on how a layer's input, weights, and output must be quantized in order to work correctly.

Similarly, QCDQ is also much easier to support and work with compared to QOp.


Barrot commented Feb 22, 2024

Thanks @Giuseppe5

Giuseppe5 linked a pull request on Mar 22, 2024 that will close this issue

prathameshd8 commented Jul 16, 2024

Hi @Giuseppe5,

I have tried both the QCDQ and QOp ONNX exports. QCDQ indeed provides great flexibility for exporting models to ONNX, whereas the QOp export requires one to satisfy a lot of constraints.

However, when performing full-integer inference by generating C code with frameworks such as TVM, QCDQ adds several Quantize and Dequantize nodes to the ONNX graph, so all the computation essentially happens in floating point.

For this case, where you want full-integer inference, QOp worked quite well: the integer tensors are passed on to the next layer if you set return_quant_tensor=True when defining the QuantLayer, and one can see that the generated C code performs the computations on integers as expected.
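A minimal sketch of the kind of setup described above (layer names and keyword arguments follow the Brevitas API as I understand it; the specific bit widths are only placeholders):

    import torch.nn as nn
    from brevitas.nn import QuantIdentity, QuantConv2d, QuantReLU

    # return_quant_tensor=True makes each layer hand an integer-backed
    # QuantTensor (values plus scale, zero-point, and bit width) to the next
    # layer, which is what the QOp export path relies on.
    model = nn.Sequential(
        QuantIdentity(bit_width=8, return_quant_tensor=True),
        QuantConv2d(3, 16, kernel_size=3, weight_bit_width=8, return_quant_tensor=True),
        QuantReLU(bit_width=8, return_quant_tensor=True),
    )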

Since QOp export will be deprecated, is there any way to perform full-integer inference with the QCDQ export?

3 participants