Support for llvm.vector.reduce.* intrinsics #2198

bwlodarcz · 2023-11-06T16:52:34Z

A set of llvm.vector.reduce.* intrinsics doesn't have straight forward operation equivalent on the SPIRV side. The commit needed be split on three sections: Integer, Floating-point and Comparison reductions.
On every category at first we need to extract each value from vector.
For Integer reductions we perform an operation on each pair of vector elements and repeat until there is only one value.
This way emitted code has the least number of dependencies between each operation and maximally utilizes instruction parallelism.
For FP reductions because of non associativity (due to rounding errors) we needed to emit sequential code in which every next operation is based on previous result. This solution is slow but correct. Potential way of improvement Floating-point is to use reassoc flag from -ffast-math.
For Comparison reductions we can utilize similar algorithm like in Integer section with the difference that now we need to emit Comp operation and OpSelect on which result is based like in cpp ternary operation. The rest is the same.

lib/SPIRV/SPIRVWriter.cpp

asudarsa · 2023-11-10T03:37:51Z

lib/SPIRV/SPIRVWriter.cpp

@@ -4023,6 +4024,90 @@ SPIRVValue *LLVMToSPIRVBase::transIntrinsicInst(IntrinsicInst *II,
                             transValue(II->getArgOperand(0), BB),
                             transValue(II->getArgOperand(1), BB), BB);
  }
+  case Intrinsic::vector_reduce_add:


Please add some details in a comment about the implementation here.

Thanks

asudarsa · 2023-11-10T03:51:52Z

test/llvm-intrinsics/llvm-vector-reduce/add.ll

+
+target triple = "spir64-unknown-unknown"
+
+; CHECK-SPIRV-DAG: TypeInt [[#I8TYPE:]] 8


I do appreciate the thoroughness of this test. But I am not sure if it is really necessary to check for all data types here. If the implementation passes/fails for a particular data type, is there a reason why it will have a different result for some other data type?

IMO there is. The major necessity here comes from the fact that translator during translation is a stateful machine. This actually requires from the test to contain at least some kind of combination of types to be sure that at least in some ways this stateful behavior is covered by tests. The second reason is to be 100% sure that the types aren't changing of the implementation behavior. And third (imo. weakest) is that the api calls which are used by implementation are also tested for correctness in that context.

I am not sure about correctness as we are not actually running anything and creating results in the LIT tests. My argument is this. LIT tests that are being added here test specifically the sequence of SPIR-V instructions generated by 'your' implementation and if your implementation breaks one test that checks SPIR-V code generation for one type, it is going to break for all types.

Also, I am ok to have this discussion offline and not block this PR.

Thanks

asudarsa · 2023-11-10T03:56:22Z

test/llvm-intrinsics/llvm-vector-reduce/add.ll

@@ -0,0 +1,552 @@
+; RUN: llvm-as %s -o %t.bc


Given the amount of effort that has already gone into writing this test, I will categorize this comment as 'feel free to ignore' and 'non-blocking'. Do we want to test the reverse translated LLVM code here?

Thanks

IMO. no. These are covered by appropriate instruction tests written in the past - such effort will be redundant when such tests are in place.

Hmm. I do not think we have instruction level tests anywhere. Please feel free to point me to such tests.

Thanks

asudarsa

A few nits. Overall change looks good. If possible, it will be a good idea to verify the functionality using E2E tests though it is not required for this PR to be approved.
Also, for future reference, I see two places where the complexity of the testing can be reduced: (1) I think it's sufficient to test for a single data type. If the implementation failed in one data type, i suspect it's going to fail for all (2) I also think testing for add/and/mul/or/xor can be represented using a single test as the implementation is same for all these ops. Similarly for smax/smin/umax/umin.

Good job on writing the tests.

Thanks

This PR is still in Draft state. Sorry, i jumped the gun by approving. Will wait for it to move out of 'draft' state.

A set of llvm.vector.reduce.* intrinsics doesn't have straight forward operation equivalent on the SPIRV side. The easiest solution to this problem is to use scalar operation on each pair of vector elements and repeat until there is only one value.

MrSidims

Integer add/mul, or/and/xor and float add/mul are LGTM. Tomorrow will take a look at min/max

lib/SPIRV/SPIRVWriter.cpp

bwlodarcz force-pushed the llvm_vector_reduce branch 3 times, most recently from 42175d3 to a89ef46 Compare November 9, 2023 12:34

MrSidims reviewed Nov 9, 2023

View reviewed changes

lib/SPIRV/SPIRVWriter.cpp Outdated Show resolved Hide resolved

asudarsa reviewed Nov 10, 2023

View reviewed changes

lib/SPIRV/SPIRVWriter.cpp Show resolved Hide resolved

asudarsa reviewed Nov 10, 2023

View reviewed changes

lib/SPIRV/SPIRVWriter.cpp Outdated Show resolved Hide resolved

asudarsa reviewed Nov 10, 2023

View reviewed changes

asudarsa previously approved these changes Nov 10, 2023

View reviewed changes

bwlodarcz added 6 commits November 10, 2023 04:55

Finished add test

686feea

Support for llvm.vector.reduce.mul

3467f9d

Bitwise reductions and tests

c668954

Added min and max

555c006

Added fadd

aec34ac

bwlodarcz force-pushed the llvm_vector_reduce branch from a89ef46 to aec34ac Compare November 10, 2023 12:56

bwlodarcz added 3 commits November 10, 2023 06:17

Added FMul

4e030ae

Added fmax* and fmin*

b685987

Renames

908f32a

bwlodarcz force-pushed the llvm_vector_reduce branch from b6a4f10 to 908f32a Compare November 13, 2023 12:03

bwlodarcz marked this pull request as ready for review November 13, 2023 12:58

MrSidims reviewed Nov 13, 2023

View reviewed changes

MrSidims approved these changes Nov 14, 2023

View reviewed changes

lib/SPIRV/SPIRVWriter.cpp Show resolved Hide resolved

MrSidims requested review from asudarsa, svenvh and vmaksimo November 14, 2023 13:23

MrSidims merged commit fe088cd into KhronosGroup:main Nov 14, 2023
9 checks passed

MrSidims mentioned this pull request Jan 11, 2024

InvalidFunctionCall: Unexpected llvm intrinsic: 7> llvm.vector.reduce.or.v4i8 #1631

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for llvm.vector.reduce.* intrinsics #2198

Support for llvm.vector.reduce.* intrinsics #2198

bwlodarcz commented Nov 6, 2023 •

edited

Loading

asudarsa Nov 10, 2023

asudarsa Nov 10, 2023

bwlodarcz Nov 10, 2023

asudarsa Nov 10, 2023

asudarsa Nov 10, 2023

bwlodarcz Nov 10, 2023

asudarsa Nov 10, 2023

asudarsa left a comment

MrSidims left a comment


		target triple = "spir64-unknown-unknown"

		; CHECK-SPIRV-DAG: TypeInt [[#I8TYPE:]] 8

Support for llvm.vector.reduce.* intrinsics #2198

Support for llvm.vector.reduce.* intrinsics #2198

Conversation

bwlodarcz commented Nov 6, 2023 • edited Loading

asudarsa Nov 10, 2023

Choose a reason for hiding this comment

asudarsa Nov 10, 2023

Choose a reason for hiding this comment

bwlodarcz Nov 10, 2023

Choose a reason for hiding this comment

asudarsa Nov 10, 2023

Choose a reason for hiding this comment

asudarsa Nov 10, 2023

Choose a reason for hiding this comment

bwlodarcz Nov 10, 2023

Choose a reason for hiding this comment

asudarsa Nov 10, 2023

Choose a reason for hiding this comment

asudarsa left a comment

Choose a reason for hiding this comment

MrSidims left a comment

Choose a reason for hiding this comment

bwlodarcz commented Nov 6, 2023 •

edited

Loading