Feat (mx): gptq compatibility and quant tests #1013

Giuseppe5 · 2024-08-28T12:04:49Z

Depends on #1011 and #1012

Added tests for MX and FloatQuant, with some restrictions:

- Neither supports external bias quantization at the moment.
- Compatibility with MHA is limited because of how the various quant options interact with each other.
- MX quantizers need to be paired to account for the possibility of padding (to be revisited post-release).
- MX quantizers are not JIT compatible.
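
For readers unfamiliar with why padding comes up at all in groupwise (MX-style) quantization, here is a minimal pure-Python sketch. It is not Brevitas code: the function names `pad_to_group_size`, `to_groups` and `from_groups` are hypothetical, and plain lists stand in for tensors. The point is only that when a dimension is not a multiple of the group size, something has to pad before grouping and something else has to strip that padding afterwards, and the two steps must agree on the original length.

```python
import math


def pad_to_group_size(values, group_size):
    """Right-pad a flat list with zeros so its length is a multiple of group_size."""
    padded_len = math.ceil(len(values) / group_size) * group_size
    return values + [0.0] * (padded_len - len(values))


def to_groups(values, group_size):
    """Pad, then split into consecutive groups of exactly group_size elements."""
    padded = pad_to_group_size(values, group_size)
    return [padded[i:i + group_size] for i in range(0, len(padded), group_size)]


def from_groups(groups, original_len):
    """Flatten the groups and drop the padding added by to_groups."""
    flat = [v for group in groups for v in group]
    return flat[:original_len]


# Five values with a group size of four: the second group is mostly padding.
weights = [0.5, -1.0, 2.0, 0.25, 3.0]
groups = to_groups(weights, group_size=4)
restored = from_groups(groups, original_len=len(weights))
```

If the un-padding step were skipped, or used a different length, the output shape would no longer match the input, which is the kind of mismatch the pairing restriction is presumably guarding against.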

src/brevitas/core/function_wrapper/shape.py

src/brevitas/quant/solver/parameter.py

nickfraser

Two small comments, otherwise LGTM!

nickfraser · 2024-09-04T12:42:41Z

src/brevitas/proxy/groupwise_int_parameter_quant.py

@@ -14,6 +14,11 @@ def group_dim(self):
    def group_size(self):
        return self.quant_injector.group_size

+    def apply_input_view(self, x):


Worth making a GroupwiseQuantProxyMixin which provides these functions? It looks like a lot is shared between GroupwiseWeightQuantProxyFromInjector & GroupwiseActQuantProxyFromInjector...

After further review, I seem to recall there was some issue with Proxy Mixins (perhaps with dependency injection) that results in the ExportMixin needing to be in a certain location. I'll leave this for now; we can revisit at another time.
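
For context, the kind of mixin being floated could look roughly like the sketch below. This is purely illustrative and was not part of the PR: `GroupwiseQuantProxyMixin` is the name suggested in the comment, while `_StubInjector`, `WeightProxy` and `ActProxy` are hypothetical stand-ins for the real injector and proxy classes. It also assumes `group_dim` and `group_size` are properties that read from `self.quant_injector`, as the diff context suggests, and it leaves out `apply_input_view` because its body is not shown here.

```python
class GroupwiseQuantProxyMixin:
    """Hypothetical mixin; the host class is expected to set self.quant_injector."""

    @property
    def group_dim(self):
        return self.quant_injector.group_dim

    @property
    def group_size(self):
        return self.quant_injector.group_size


class _StubInjector:
    """Stand-in for a quant injector exposing group settings (assumed attributes)."""
    group_dim = 1
    group_size = 32


class WeightProxy(GroupwiseQuantProxyMixin):
    """Stand-in for a groupwise weight quant proxy."""

    def __init__(self):
        self.quant_injector = _StubInjector()


class ActProxy(GroupwiseQuantProxyMixin):
    """Stand-in for a groupwise activation quant proxy."""

    def __init__(self):
        self.quant_injector = _StubInjector()


# Both proxies now share the same accessors without duplicating them.
weight_proxy = WeightProxy()
act_proxy = ActProxy()
```

The ordering concern raised above would show up in the base-class list of the real proxies, which this sketch does not attempt to model.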

nickfraser · 2024-09-04T12:50:40Z

src/brevitas/graph/gpxq.py

@@ -232,7 +238,8 @@ def process_input(self, inp):
        if isinstance(inp, IntQuantTensor):
            if is_quant_enabled and self.quant_metadata is None:
                self.quant_metadata = _CachedIO(inp, metadata_only=True)
-            inp = inp.value
+            if isinstance(inp, QuantTensor):


When could this not be True? I see you've added this check, but it's not obvious to me. Can inp get modified in-place by _CachedIO?

I think the answer to my question is "no", so I've removed that check in 1f0933f.
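
The reasoning behind removing the check can be sketched with stand-in classes. This assumes `IntQuantTensor` is a subclass of `QuantTensor` and that the metadata caching step does not rebind `inp`; the classes and the simplified `process_input` below are hypothetical stand-ins, not the Brevitas implementations.

```python
class QuantTensor:
    """Stand-in base class holding a dequantized value."""

    def __init__(self, value):
        self.value = value


class IntQuantTensor(QuantTensor):
    """Stand-in subclass; every instance is also a QuantTensor."""


def process_input(inp):
    """Simplified version of the branch discussed in the review."""
    if isinstance(inp, IntQuantTensor):
        # Metadata caching would happen here; it reads inp but does not replace it.
        # Given the subclass relationship, this inner check can never fail.
        assert isinstance(inp, QuantTensor)
        inp = inp.value
    return inp


unwrapped = process_input(IntQuantTensor(3))
passthrough = process_input(5)
```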

nickfraser · 2024-09-04T14:14:55Z

Rebased after #1012 was merged. Note, had to resolve some merge conflicts, but I think I made all the right choices about which bits to keep or not...

nickfraser · 2024-09-05T08:44:06Z

The failing tests seem to be a GitHub Actions issue rather than a real issue. I can run the failing test suite (nox -s "tests_brevitas_cpu-3.9(jit_disabled, pytorch_2.1.0)") locally on a Windows machine.

Merging, and will open another issue if this becomes a regular thing.

Giuseppe5 mentioned this pull request Aug 28, 2024

Feat (mx): PTQ MX + Float support #1010

Merged

Giuseppe5 requested a review from nickfraser August 29, 2024 11:38

nickfraser reviewed Aug 29, 2024

src/brevitas/core/function_wrapper/shape.py (outdated, resolved)

nickfraser reviewed Aug 29, 2024

src/brevitas/quant/solver/parameter.py (outdated, resolved)

Giuseppe5 requested a review from nickfraser August 31, 2024 10:30

Giuseppe5 changed the title ~~Feat (mx): adding gptq and quant tests~~ Feat (mx): gptq compatibility and quant tests Aug 31, 2024

Giuseppe5 requested review from nickfraser and removed request for nickfraser August 31, 2024 11:46

nickfraser requested changes Sep 4, 2024

Giuseppe5 added 12 commits September 4, 2024 15:05

GPTQ fix

b166e52

Adding gptq and quant tests

4183234

precommit fix

81945e1

tensor instead of list

34e1332

New tests

34cccbc

precommit

1c3d51b

Naming/depinj cleanup

6ea876c

fix tests and more tests

7177262

Fix tests for MX/Float

9f9e889

shared padding func

568420f

Ignore jit flag

498899f

Fix tests + JIT

4b1377a

nickfraser force-pushed the fix_gptq branch from e97b733 to 4b1377a Compare September 4, 2024 14:13

Fix (graph/gptq): Removed unnecessary QuantTensor check.

1f0933f

nickfraser self-requested a review September 4, 2024 14:29

nickfraser merged commit d4834bd into Xilinx:dev Sep 5, 2024
336 of 337 checks passed
