-
Notifications
You must be signed in to change notification settings - Fork 44
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Adjust the DPAS instruction order in TritonIntelGPUToLLVM (#2627)
Adjust the order of the generated DPAS instruction to get better DPAS operands locality in the generated LLVM IR. The new logic generates a large set of the DPAS instructions which can reuse the A or B operands across multiple DPAS instructions. Add a new env-var `TRITON_INTEL_AGGRESSIVE_DPAS_REUSE` to generate most aggressive DPAS instruction order for experimental. Will make the aggressive order as default when the IGC scalar backend could perfectly generate the best performance kernel in instruction scheduling. --------- Signed-off-by: Tiotto, Ettore <ettore.tiotto@intel.com> Co-authored-by: Tiotto, Ettore <ettore.tiotto@intel.com>
- Loading branch information
1 parent
290bfa9
commit 29c0ece
Showing
3 changed files
with
71 additions
and
23 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters