Skip to content

Commit

Permalink
[SYCL] Enable checking the result of max_num_work_groups query with e…
Browse files Browse the repository at this point in the history
…xceeded launch limits on more backends (hip and opencl)
  • Loading branch information
GeorgeWeb committed Sep 13, 2024
1 parent 5b9f0f6 commit 15a5ccf
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 11 deletions.
17 changes: 9 additions & 8 deletions sycl/cmake/modules/FetchUnifiedRuntime.cmake
Original file line number Diff line number Diff line change
Expand Up @@ -116,14 +116,15 @@ if(SYCL_UR_USE_FETCH_CONTENT)
CACHE PATH "Path to external '${name}' adapter source dir" FORCE)
endfunction()

set(UNIFIED_RUNTIME_REPO "https://github.com/oneapi-src/unified-runtime.git")
# commit f31160dea6d142014f441bc4ca5e58e48827490e
# Merge: 2bbe9526 64068799
# Author: Piotr Balcer <piotr.balcer@intel.com>
# Date: Thu Sep 12 14:19:48 2024 +0200
# Merge pull request #2083 from kswiecicki/xpti-init-fix
# Fix XPTI initialization bug
set(UNIFIED_RUNTIME_TAG f31160dea6d142014f441bc4ca5e58e48827490e)
# Temporary changed for testing, update once the UR part is merged.
set(UNIFIED_RUNTIME_REPO "https://github.com/GeorgeWeb/unified-runtime.git")
# commit 24a8299efc59c715a1c2dd180692a5e12a12283a
# Merge: eb63d1a2 2fea679d
# Author: Omar Ahmed <omar.ahmed@codeplay.com>
# Date: Wed Sep 11 10:40:59 2024 +0100
# Merge pull request #2078 from callumfare/callum/fix_device_extensions_fpga
# Add workaround for silently supported OpenCL extensions on Intel FPGA
set(UNIFIED_RUNTIME_TAG 6eb27d527791012b575ba443586223e4038058af)

set(UMF_BUILD_EXAMPLES OFF CACHE INTERNAL "EXAMPLES")
# Due to the use of dependentloadflag and no installer for UMF and hwloc we need
Expand Down
6 changes: 3 additions & 3 deletions sycl/test-e2e/Basic/launch_queries/max_num_work_groups.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -161,9 +161,9 @@ int test_max_num_work_groups(sycl::queue &q, const sycl::device &dev) {
// It cannot be possible to launch a kernel successfully with a configuration
// that exceeds the available resources as in the above defined workGroupSize.
// workGroupSize is larger than maxWorkGroupSize, hence maxWGs must equal 0.
// Note: Level-Zero currently always returns a non-zero value. While other
// backends (i.e., OpenCL, HIP) always return 1 in their implementations.
if (dev.get_backend() == sycl::backend::ext_oneapi_cuda) {
// Note: Level-Zero currently always returns a non-zero value.
// TODO: Remove the backend condition once the Level-Zero API issue is fixed.
if (dev.get_backend() != sycl::backend::ext_oneapi_level_zero) {
assert(maxWGs == 0 && "max_num_work_groups query failed");
}

Expand Down

0 comments on commit 15a5ccf

Please sign in to comment.