[SYCL]Fix SYCL target triple check for NVidia targets. #14505

srividya-sundaram · 2024-07-09T21:26:12Z

-fsycl-targets enables AOT compilations for specified device targets.
For NVidia GPU targets, per oneAPI DPCPP documentation, the valid target triple string is nvptx64-nvidia-cuda
This PR, checks if the target triple string provided for NVidia targets is a valid one.

Example:

clang++ -fsycl -fsycl-targets=nvptx64-nvidia-cuda-invalidEnv syclfile.cpp
clang++: error: SYCL target is invalid: 'nvptx64-nvidia-cuda-invalidEnv'

AlexeySachkov · 2024-07-10T07:14:14Z

clang/lib/Driver/Driver.cpp

+static bool isValidSYCLTriple(llvm::Triple TargetTriple) {
+  // Check for invalid SYCL device triple values for NVidia GPU targets.
+  // nvptx64-nvidia-cuda is the valid triple for NVidia targets.
+  if (TargetTriple.getArchTypeName(TargetTriple.getArch()) == "nvptx64" &&


Suggested change

if (TargetTriple.getArchTypeName(TargetTriple.getArch()) == "nvptx64" &&

if (TargetTriple.isNVPTX() &&

Why do we need to do string comparisons?

AlexeySachkov · 2024-07-10T07:15:03Z

clang/lib/Driver/Driver.cpp

+  // Check for invalid SYCL device triple values for NVidia GPU targets.
+  // nvptx64-nvidia-cuda is the valid triple for NVidia targets.
+  if (TargetTriple.getArchTypeName(TargetTriple.getArch()) == "nvptx64" &&
+      TargetTriple.getVendorTypeName(TargetTriple.getVendor()) == "nvidia" &&


Once again, I would prefer to avoid string comparison. Target.getVendor() == Triple::VendorType::NVIDIA should be enough

AlexeySachkov · 2024-07-10T08:37:59Z

llvm/lib/TargetParser/Triple.cpp

+      .StartsWith("rtems", Triple::RTEMS)
+      .StartsWith("nacl", Triple::NaCl)
+      .StartsWith("aix", Triple::AIX)
+      .Case("cuda", Triple::CUDA)


I'm hesitant to accept this change at intel/llvm. If you can get this accepted to the upstream LLVM, then it is fine.

This change is at least inconsistent with handling of other OSes. For example, clang will still accept nvptx-nvidia-nvclmyown as a valid triple for OpenCL on NVIDIA, even though the spelling contains some extra myown.

My main concern is that no one prohibits using other things from intel/llvm besides SYCL (and I know for sure that folks are using OpenMP for example) and therefore this change could break someone's workflow that we don't know about. Since we don't mark our customizations to the upstream codebase in any way, this change could easily get lost, cause conflicts or be hard to find during debugging.

Hi @AlexeySachkov

Currently, when we pass an invalid target triple value to fsycl-targets for NVidia targets , we get the following error:

fatal error: error in backend: Cannot select: intrinsic %llvm.nvvm.implicit.offset llvm-foreach: clang++: error: clang frontend command failed with exit code 70 (use -v to see invocation)

This error is non-intuitive and does not clearly describe what the problem is.

Currently, in the driver code base, there is no check to verify if the target triple string passed matches the exact value documented - nvptx64-nvidia-cuda
We only check if the ‘architecture’ component has ‘nvptx64’ and return that as a valid string.
Example: nvptx64-badVendor-badOS would still be passed as a valid triple to the toolchain and we end up getting an error.

For OpenMP, they do allow partial/shortened triple strings with empty Vendor or/and OS components. The following triples are all valid. Please see this function
nvptx64
nvptx64-nvidia
nvptx64--cuda

As for this change: .Case("cuda", Triple::CUDA)
When we just check using StartsWith , for an example like this - nvptx64-nvidia-cuda123,
the 123 get parsed as the OS Major version but the triple still gets rejected because cuda123 is not a valid OS component for offloading SYCL to nvptx backend.

I can remove the string comparisons as parseArch() , parseVendor() and parseOS() all use .Case match for Nvidia targets.

I think the best we can do is check for Triple::CUDA for the OS and not try to be strict about it. Sure we would be allowing OS values of cudaBadString, but given how doing so is consistent with other OS settings I'm hard pressed to deviate. If the failure occurs down the line when using cudaBadString, there may be an issue later in the compilation when using the triple that isn't using the known OS check.

This error is non-intuitive and does not clearly describe what the problem is.

I agree, but I still don't think that the suggested customization is right in context of deviation from the community code, further maintenance effort and potential unexpected/unwanted side effects.

For OpenMP, they do allow partial/shortened triple strings with empty Vendor or/and OS components.

We should adopt this in sycl and it will automatically reduce the amount of uses of longer triple that is prone to errors. Moreover, -fsycl-targets will eventually move to --arch that accepts a target name and not a triple, resolving this issue all together.

[SYCL]Fix SYCL target triple check for NVidia targets.

af850b6

srividya-sundaram had a problem deploying to WindowsCILock July 9, 2024 21:27 — with GitHub Actions Error

Remove else block.

7956296

srividya-sundaram had a problem deploying to WindowsCILock July 9, 2024 22:26 — with GitHub Actions Error

Update tests with valid target triple for NVidia targets.

00f95f1

srividya-sundaram had a problem deploying to WindowsCILock July 9, 2024 23:10 — with GitHub Actions Failure

srividya-sundaram temporarily deployed to WindowsCILock July 10, 2024 00:31 — with GitHub Actions Inactive

Update tests.

3976955

srividya-sundaram temporarily deployed to WindowsCILock July 10, 2024 02:05 — with GitHub Actions Inactive

srividya-sundaram temporarily deployed to WindowsCILock July 10, 2024 02:49 — with GitHub Actions Inactive

srividya-sundaram marked this pull request as ready for review July 10, 2024 04:43

srividya-sundaram requested review from a team as code owners July 10, 2024 04:43

AlexeySachkov reviewed Jul 10, 2024

View reviewed changes

srividya-sundaram closed this Jul 30, 2024

srividya-sundaram deleted the fix-nvptx-tt branch July 30, 2024 17:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL]Fix SYCL target triple check for NVidia targets. #14505

[SYCL]Fix SYCL target triple check for NVidia targets. #14505

srividya-sundaram commented Jul 9, 2024 •

edited

Loading

AlexeySachkov Jul 10, 2024

AlexeySachkov Jul 10, 2024

AlexeySachkov Jul 10, 2024

srividya-sundaram Jul 10, 2024

mdtoguchi Jul 11, 2024

AlexeySachkov Jul 12, 2024

	if (TargetTriple.getArchTypeName(TargetTriple.getArch()) == "nvptx64" &&
	if (TargetTriple.isNVPTX() &&

[SYCL]Fix SYCL target triple check for NVidia targets. #14505

[SYCL]Fix SYCL target triple check for NVidia targets. #14505

Conversation

srividya-sundaram commented Jul 9, 2024 • edited Loading

AlexeySachkov Jul 10, 2024

Choose a reason for hiding this comment

AlexeySachkov Jul 10, 2024

Choose a reason for hiding this comment

AlexeySachkov Jul 10, 2024

Choose a reason for hiding this comment

srividya-sundaram Jul 10, 2024

Choose a reason for hiding this comment

mdtoguchi Jul 11, 2024

Choose a reason for hiding this comment

AlexeySachkov Jul 12, 2024

Choose a reason for hiding this comment

srividya-sundaram commented Jul 9, 2024 •

edited

Loading