Release v0.10.1
Highlights
- A2Q+ support paper
- A2Q+ examples with CIFAR10 and Super Resolution
- Support for concatenation equalization for weights and activations
- Support for GPFQ + A2Q L1 Norm bound
- Possibility to explicitly export Q node for weights in QCDQ export
- Support for float16 and bfloat16 for QCDQ export
- Support for Dynamic Activation Quantization for ONNX QDQ export
- Support for channel-splitting paper
- (Beta) Better compatibility with Huggingface accelerate and optimum
- (Beta) Improved support and testing for minifloat quantization
What's Changed
- Fix (examples/generative): set weight_bit_width in weight_quant by @Giuseppe5 in #783
- Feat (graph/equalize): improvements for llm equalization by @Giuseppe5 in #784
- [graph] Fix typo in class name by @nickfraser in #765
- Fix (graph/equalize): refactor for act equalization by @Giuseppe5 in #787
- [quant_tensor] Updates
__truediv__
behaviour to match "standard fixed point rules" by @nickfraser in #769 - Feat (export): (b)float16 support for qcdq export by @Giuseppe5 in #776
- Feat (ptq): Adding A2Q Upper Bound clipping to GPFQ by @fabianandresgrob in #734
- Extended equalization by @Giuseppe5 in #778
- Better Bfloat16 support by @Giuseppe5 in #777
- Fix (stats): add return statement in state_dict by @Giuseppe5 in #792
- Fix (equalize): improved cat eq checks by @Giuseppe5 in #793
- Fix (export): add CastMixin by @Giuseppe5 in #794
- Dynamic Act Quant support by @Giuseppe5 in #796
- Fix (examples/quantizers): correct dynamic zero point handling by @Giuseppe5 in #806
- Feat (a2q+): improving accumulator-aware weight quantization by @i-colbert in #797
- Feat (a2q+): adding new super resolution models to brevitas_examples by @i-colbert in #811
- Feat (Channel-Splitting): sets up first skeleton for channel-splitting by @fabianandresgrob in #772
- Feat: support for optimum by @Giuseppe5 in #826
- Fix (tests): adding tests for FloatQuant by @fabianandresgrob in #815
- Fix (export): correct q node export by @Giuseppe5 in #829
- Fix (examples/llm): correct groupwise export by @Giuseppe5 in #832
- Fix (examples/super_res): updating README by @i-colbert in #828
- Fix (examples/export): improved export by @Giuseppe5 in #838
- Fix (graph/equalize): cleanup and device management by @Giuseppe5 in #840
- Feat (examples/a2q): adding CIFAR10 example by @i-colbert in #813
- Fix (export): check for Per Group quantization by @Giuseppe5 in #848
Full Changelog: v0.10.0...v0.10.1