Enable the SLPVectorizer on Triton side with no scheduling #2714

chengjunlu · 2024-11-15T03:20:33Z

The IGCVectorizer does not yet support the flash attention kernel.
For now, enabling the SLPVectorizer on the Triton side, with no scheduling within a basic block, gives better performance.

chengjunlu · 2024-11-15T03:22:25Z

third_party/intel/backend/compiler.py

@@ -335,6 +335,9 @@ def make_spv(src, metadata, options):
        else:
            metadata["build_flags"] = ""

+        if os.getenv("TRITON_INTEL_ENABLE_POST_PROCESS_LLIR", "1") == "1":
+            metadata["build_flags"] += " -igc_opts 'DisablePHIScalarization=1'"


Check with the IGC team about how to set this optimization flag through the IGC options instead of an environment variable.
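For reference, the gating in the diff above can be sketched in isolation. This is only an illustration: `add_phi_scalarization_flag` is a hypothetical helper that does not exist in `compiler.py`, and the `environ` parameter is an assumption added so the logic can be exercised without touching the real process environment. The variable name and the option string are taken from the diff.

```python
import os


def add_phi_scalarization_flag(build_flags, environ=None):
    """Hypothetical helper (not in the PR) mirroring the check added to make_spv.

    The IGC option is appended unless TRITON_INTEL_ENABLE_POST_PROCESS_LLIR
    is set to something other than "1"; an unset variable counts as "1".
    """
    if environ is None:
        environ = os.environ
    if environ.get("TRITON_INTEL_ENABLE_POST_PROCESS_LLIR", "1") == "1":
        build_flags += " -igc_opts 'DisablePHIScalarization=1'"
    return build_flags


if __name__ == "__main__":
    # Default (variable unset): the option is appended.
    print(repr(add_phi_scalarization_flag("", {})))
    # Explicitly disabled: the flags are returned unchanged.
    print(repr(add_phi_scalarization_flag("", {"TRITON_INTEL_ENABLE_POST_PROCESS_LLIR": "0"})))
```

Under this sketch the option is on by default, so a user would have to export `TRITON_INTEL_ENABLE_POST_PROCESS_LLIR=0` to turn it off.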

chengjunlu requested review from etiotto and whitneywhtsang November 15, 2024 03:20

chengjunlu changed the title ~~[WIP] Enable the SLPVectorizer on Triton side with no scheduling~~ Enable the SLPVectorizer on Triton side with no scheduling Nov 15, 2024

chengjunlu linked an issue Nov 15, 2024 that may be closed by this pull request

[Performance] Enable the SLPVectorizer to improve flash attention performance #2715

Open

chengjunlu closed this Nov 15, 2024

Enable the SLPVectorizer on Triton side with no scheduling part. Remo…

161f980

…ve the SLPVectorizer once the IGCVectorizer works well.
