Enhance: Quant Tensor Test #894
Conversation
Let's create a folder brevitas/quant_tensor for these tests
I'm inclined to leave it as is at the moment as the other tests in this area don't have their own folders and I want to keep this PR nice and simple. If it expands later we can create a folder.
All tests in brevitas/core refer to files in src/brevitas/core (we don't mirror all the subfolders, but that's the general idea).
QuantTensor does not belong there. Moreover, there will soon be more QuantTensor functionality to test, so we might as well go ahead and create a folder.
    MATMUL = 4

# QuantTensor isn't meant to be initialized directly, it'll be invalid if you do
I would say that it could be invalid if you construct it manually (conversely, it is also possible to manually construct a valid QuantTensor if you carefully pick the scale factors, bit_width, values, etc.).
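To make the point concrete, here is a minimal standalone sketch (plain Python, not the Brevitas API; the helper name and the grid check are invented for illustration) of what "valid" means for a manually built quant tensor: every value must sit on the quantization grid implied by its scale, and its integer representation must fit the bit width.

```python
# Hypothetical helper, NOT part of Brevitas: checks whether hand-picked
# values, scale, and bit_width form a coherent (fake-)quantized tensor.
def is_valid_quant(values, scale, bit_width, signed=True):
    qmin = -(2 ** (bit_width - 1)) if signed else 0
    qmax = (2 ** (bit_width - 1)) - 1 if signed else (2 ** bit_width) - 1
    for v in values:
        q = v / scale
        if abs(q - round(q)) > 1e-9:      # value not on the scale grid
            return False
        if not qmin <= round(q) <= qmax:  # integer out of range for bit_width
            return False
    return True

# Carefully picked values are valid; off-grid or out-of-range values are not.
print(is_valid_quant([0.5, -1.0, 2.5], 0.5, 8))  # True
print(is_valid_quant([0.3], 0.5, 8))             # False (not a multiple of 0.5)
print(is_valid_quant([100.0], 0.5, 8))           # False (200 > 127)
```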
# QuantTensor isn't meant to be initialized directly, it'll be invalid if you do
# so you need to create it indirectly via QuantIdentity for example
def to_quant_tensor(input: torch.Tensor) -> QuantTensor:
    mod = QuantIdentity(bit_width=8, quant_type=QuantType.INT, return_quant_tensor=True)
I believe the quant_type arg is not necessary.
        assert False

    # tolerance set to a high value as there is considerable loss of precision
    assert torch.isclose(normal, quant, atol=0.1).all().item()
Is this tolerance still required for all operators, or are some operators more troublesome than others?
I would try to keep a tighter bound where possible, if it's not too much of a headache.
I'll see if I can tighten up the tolerance
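One way to tighten it per operator is with a back-of-envelope bound (my own reasoning, not from this PR): round-to-nearest quantization errs by at most scale/2 per element, so an element-wise add can use atol = scale_a/2 + scale_b/2, while multiplication's error also grows with the operand magnitudes and genuinely needs more slack. A sketch, assuming per-tensor scales and round-to-nearest:

```python
# Worst-case per-element error bounds (illustrative, not Brevitas code).
def add_atol(scale_a, scale_b):
    # |(a + b) - (qa + qb)| <= |a - qa| + |b - qb| <= scale_a/2 + scale_b/2
    return scale_a / 2 + scale_b / 2

def mul_atol(scale_a, scale_b, max_a, max_b):
    # |a*b - qa*qb| <= |a|*|b - qb| + |qb|*|a - qa|, bounded via operand maxima
    return max_a * scale_b / 2 + (max_b + scale_b / 2) * scale_a / 2
```

This suggests the add/sub tests could use a much tighter atol than the multiply and matmul tests, rather than one loose bound for everything.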
        quant = qa * qb
    elif op == Operator.MATMUL:
        normal = a @ b
        # @ matmul operator not implemented for QuantTensor
What's the difference between @ and matmul? Also, in terms of implementation, what would we need to override to implement @?
I don't believe there is a difference, so it's probably something we should create an issue to implement.
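For reference, a generic Python sketch (nothing Brevitas-specific; the class is illustrative): `a @ b` dispatches to the `__matmul__`/`__rmatmul__` dunders, so supporting @ on a tensor wrapper should mostly be a matter of defining `__matmul__` to forward to the existing matmul logic.

```python
# Minimal illustration of the "@" protocol: the operator simply calls
# __matmul__ on the left operand (or __rmatmul__ on the right one).
class MiniTensor:
    def __init__(self, rows):
        self.rows = rows

    def __matmul__(self, other):
        # Forward "@" to a plain nested-loop matmul over the wrapped values.
        cols = list(zip(*other.rows))
        return MiniTensor([[sum(x * y for x, y in zip(row, col)) for col in cols]
                           for row in self.rows])

a = MiniTensor([[1, 2], [3, 4]])
b = MiniTensor([[5, 6], [7, 8]])
print((a @ b).rows)  # [[19, 22], [43, 50]]
```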
    assert torch.isclose(a.transpose(0, 1), b.transpose(0, 1), atol=0.01).all().item()


def test_quant_tensor_view():
View and transpose open up a broader topic: how to handle quant metadata under views and transposes, especially when we are doing per-channel or finer-granularity quantization.
For now, I would add a TODO in both test cases saying that we need to handle the quant metadata and test it.
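To make the metadata concern concrete, here is a standalone sketch (plain Python lists and hypothetical helpers, not Brevitas): with per-channel scales, transposing the values without also moving the axis the scale is attached to changes what the tensor dequantizes to.

```python
# Hypothetical 2-D helpers: scales[i] applies to slice i along `axis`.
def dequant(int_vals, scales, axis):
    return [[v * (scales[i] if axis == 0 else scales[j])
             for j, v in enumerate(row)]
            for i, row in enumerate(int_vals)]

def transpose(m):
    return [list(col) for col in zip(*m)]

q = [[1, 2], [3, 4]]
scales = [0.5, 2.0]  # per-channel: one scale per row (axis 0)

# Transposing the values AND the scale axis preserves the dequantized result...
ok = dequant(transpose(q), scales, axis=1) == transpose(dequant(q, scales, axis=0))
# ...but keeping the scale axis fixed silently scales the wrong channels.
bad = dequant(transpose(q), scales, axis=0) == transpose(dequant(q, scales, axis=0))
print(ok, bad)  # True False
```

The same issue applies to view/reshape, where a channel axis can disappear entirely, which is presumably why this needs its own design discussion.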
    x = torch.randn(4, 4)
    quant_tensor = to_quant_tensor(x)
    normal_tensor = torch.Tensor(x)
    assert torch.allclose(qdq(normal_tensor, quant_tensor), quant_tensor, rtol=0.01)
The difference between the qdq result and the quant tensor is extremely close, but some error is creeping in from the QuantTensor somewhere, so I added a relative tolerance.
    qb = to_quant_tensor(b)

    # to factor in quantisation error
    e_a = a - qa
I didn't use the qdq approach above as that should be covered by the init test; I just need the difference so I can incorporate it into the calculations below.
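The trick can be shown with a toy round-to-grid quantizer (a standalone sketch, not Brevitas): since e_a = a - quant(a), subtracting the error from the reference operands makes the float-side computation operate on exactly the quantized values, so the inputs' quantization error drops out of the comparison.

```python
# Toy round-to-nearest quantizer (illustrative; Brevitas does this via modules).
def quant(x, scale=0.25):
    return round(x / scale) * scale

a, b = 1.07, 2.93
qa, qb = quant(a), quant(b)
e_a, e_b = a - qa, b - qb  # per-operand quantization error

# (a - e_a) is exactly qa, so the "normal" reference matches the quantized
# product without needing extra tolerance for the inputs' quantization error.
print(qa * qb == (a - e_a) * (b - e_b))  # True
```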
        # @ matmul operator not implemented for QuantTensor
        quant = torch.matmul(qa, qb)
        normal = (a - e_a) @ (b - e_b)
    else:
        # unrecognised operator
        assert False

    # tolerance set to a high value as there is considerable loss of precision
I believe this comment is outdated.
Just a small clean-up of comments but otherwise it's ready to go
Issue
Implements #727
Details
Creates a test file for QuantTensor.
Has tests for:
It's not exhaustive but should form a good starting point.