[UR] Support fetch adapter source individually #12907

kbenzie · 2024-03-05T15:22:42Z

This patch and its counterpart in oneapi-src/unified-runtime#1410 add CMake support for fetching an individual Unified Runtime adapter's source code from a different repo/tag combination using the new `fetch_adapter_source()` CMake function. Only the source is cloned; it is not added to the build directly.

Instead, the path to the adapter source is passed into the Unified Runtime clone described by the `UNIFIED_RUNTIME_REPO` and `UNIFIED_RUNTIME_TAG` CMake variables. This clone is the source of truth for the Unified Runtime API and drives the build of the external adapter source.

Using `fetch_adapter_source()` is optional.
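To make the mechanism above concrete, here is a minimal sketch of how a fetch-only helper could be written with CMake's `FetchContent` module. It is not the code from this patch: the function name `fetch_adapter_source_sketch`, its `name`/`repo`/`tag` parameters, the `UR_ADAPTER_<NAME>_SOURCE_DIR` cache variable, and the `source/adapters/<name>` layout are all assumptions made for illustration.

```cmake
include(FetchContent)

# Hypothetical sketch: clone one adapter's source from its own repo/tag
# without adding it to the build.
function(fetch_adapter_source_sketch name repo tag)
  set(fetch_name ur-${name})
  FetchContent_Declare(${fetch_name}
    GIT_REPOSITORY ${repo}
    GIT_TAG ${tag})

  # Populate only downloads the source; add_subdirectory() is never called,
  # so nothing from this clone is built directly.
  FetchContent_GetProperties(${fetch_name} POPULATED already_fetched)
  if(NOT already_fetched)
    FetchContent_Populate(${fetch_name})
  endif()
  FetchContent_GetProperties(${fetch_name} SOURCE_DIR adapter_repo_dir)

  # Assumed variable name and directory layout: publish the adapter path so
  # the main Unified Runtime clone can build the external adapter source.
  string(TOUPPER ${name} name_upper)
  set(UR_ADAPTER_${name_upper}_SOURCE_DIR
    "${adapter_repo_dir}/source/adapters/${name}"
    CACHE PATH "Path to external '${name}' adapter source" FORCE)
endfunction()

# Example call; the repo URL and tag are placeholders.
fetch_adapter_source_sketch(cuda
  "https://github.com/oneapi-src/unified-runtime.git" main)
```

Under these assumptions, the main Unified Runtime clone (selected by `UNIFIED_RUNTIME_REPO` and `UNIFIED_RUNTIME_TAG`) would read the cache variable and build the adapter from that path, which keeps a single clone in charge of the API definition.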

kbenzie · 2024-03-18T13:29:30Z

@intel/llvm-gatekeepers please merge

Implement `seq_cst` RC11/ptx6.0 memory consistency for CUDA backend. See https://dl.acm.org/doi/pdf/10.1145/3297858.3304043 and https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#memory-consistency-model for full details. Requires sm_70 or above.

With this PR there is now a complete mapping between SYCL memory consistency model capabilities and the official CUDA model, fully exploiting CUDA capabilities when possible on supported arches. This makes the SYCL-CTS atomic_ref tests fully pass for sm_70 on the cuda backend.

Fixes #11208
Depends on #12907

Signed-off-by: JackAKirk <jack.kirk@codeplay.com>

kbenzie force-pushed the benie/ur-per-adapter-tags branch from eef1784 to ec17d9e Compare March 8, 2024 11:32

kbenzie force-pushed the benie/ur-per-adapter-tags branch from ec17d9e to 8eb4dae Compare March 14, 2024 22:46

kbenzie force-pushed the benie/ur-per-adapter-tags branch from 8eb4dae to de7976f Compare March 18, 2024 11:04

kbenzie marked this pull request as ready for review March 18, 2024 11:05

kbenzie requested a review from a team as a code owner March 18, 2024 11:05

aarongreig approved these changes Mar 18, 2024

kbenzie mentioned this pull request Mar 18, 2024

[CUDA][LIBCLC] Implement RC11 seq_cst for PTX6.0 #12516

Merged

steffenlarsen merged commit 7616785 into intel:sycl Mar 18, 2024
11 checks passed

kbenzie mentioned this pull request Apr 15, 2024

sycl-rel_5_2_0: [CUDA][LIBCLC] Implement RC11 seq_cst for PTX6.0 (#12516) #13403

Closed
