benie/test adapters #1020
Commits on Aug 18, 2023
-
Merge from 'sycl' to 'sycl-web'
iclsrc committedAug 18, 2023 Configuration menu - View commit details
-
Copy full SHA for 644755c - Browse repository at this point
Copy the full SHA 644755cView commit details
Commits on Aug 21, 2023
-
[UR] Remove Prefixes from CUDA tests
Petr Vesely committedAug 21, 2023 Configuration menu - View commit details
-
Copy full SHA for 5eaf0a2 - Browse repository at this point
Copy the full SHA 5eaf0a2View commit details -
Merge pull request oneapi-src#795 from PatKamin/fuzz-test-generate-ap…
…i-calls-adapters Test parts of API with libFuzzer.
Configuration menu - View commit details
-
Copy full SHA for 318ef5a - Browse repository at this point
Copy the full SHA 318ef5aView commit details -
[UR] Remove Prefixes from HIP tests
Petr Vesely committedAug 21, 2023 Configuration menu - View commit details
-
Copy full SHA for fa89ec4 - Browse repository at this point
Copy the full SHA fa89ec4View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf16571 - Browse repository at this point
Copy the full SHA bf16571View commit details -
Configuration menu - View commit details
-
Copy full SHA for 858e062 - Browse repository at this point
Copy the full SHA 858e062View commit details -
Configuration menu - View commit details
-
Copy full SHA for 31be4f4 - Browse repository at this point
Copy the full SHA 31be4f4View commit details -
Merge pull request oneapi-src#811 from veselypeta/petr/fix-prefixes
[UR] Remove `cuda_`/`hip_` prefixes from tests files.
Configuration menu - View commit details
-
Copy full SHA for 12c0cdf - Browse repository at this point
Copy the full SHA 12c0cdfView commit details -
Merge pull request oneapi-src#813 from pbalcer/adapters-update-v0.7.1
[adapters] update to latest sycl (with 0.7.1 tag)
Configuration menu - View commit details
-
Copy full SHA for f95dfbc - Browse repository at this point
Copy the full SHA f95dfbcView commit details
Commits on Aug 22, 2023
-
[SYCL][Graph] Throw an exception when unsupported features used in a …
…graph (#10789)
This PR contains a set of changes that implement throwing an exception when a feature unsupported by [sycl_ext_oneapi_graph](https://github.com/intel/llvm/blob/sycl/sycl/doc/extensions/proposed/sycl_ext_oneapi_graph.asciidoc) is used. An error is thrown when an application uses the following features with the graph extension:
* An unsupported extension.
* Querying the event returned from graph submission for profiling information.
* Level Zero immediate command-list (see intel/llvm#10467)
* Specialization constants, kernel bundles, or reductions are used in a graph node.

## Authors
Co-authored-by: Pablo Reble <pablo.reble@intel.com>
Co-authored-by: Julian Miller <julian.miller@intel.com>
Co-authored-by: Ben Tracy <ben.tracy@codeplay.com>
Co-authored-by: Ewan Crawford <ewan@codeplay.com>
Co-authored-by: Maxime France-Pillois <maxime.francepillois@codeplay.com>
Configuration menu - View commit details
-
Copy full SHA for fd179fa - Browse repository at this point
Copy the full SHA fd179faView commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedAug 22, 2023 Configuration menu - View commit details
-
Copy full SHA for 98f5fbf - Browse repository at this point
Copy the full SHA 98f5fbfView commit details -
[SYCL][UR][L0] Fix some leftovers from PI-UR porting (#10785)
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Jaime Arteaga authoredAug 22, 2023 Configuration menu - View commit details
-
Copy full SHA for a5ae24d - Browse repository at this point
Copy the full SHA a5ae24dView commit details -
[SYCL][UR][L0] Correctly set OwnNativeHandle on image create (#10919)
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Jaime Arteaga authoredAug 22, 2023 Configuration menu - View commit details
-
Copy full SHA for cc6e8ab - Browse repository at this point
Copy the full SHA cc6e8abView commit details
Commits on Aug 23, 2023
-
Merge from 'sycl' to 'sycl-web'
iclsrc committedAug 23, 2023 Configuration menu - View commit details
-
Copy full SHA for 3b09c80 - Browse repository at this point
Copy the full SHA 3b09c80View commit details
Commits on Aug 24, 2023
-
Add prefetch for HIP USM allocations (#10430)
This change is necessary to work around a delightful bug in either the HIP runtime or the HIP spec. It's discussed at length in github.com/intel/llvm/issues/7252 but for the purposes of this patch, it suffices to say that a call to `hipMemPrefetchAsync` is *required* for correctness in the face of global atomic operations on (*at least*) shared USM allocations.

The architecture of this change is slightly strange at first sight in that we redundantly track allocation information in several places. The context now keeps track of all USM mappings. We require a mapping of pointers to the allocated size, but these allocations aren't pinned to any particular queue or HIP stream. `hipMemPrefetchAsync`, however, requires the associated HIP stream object and the size of the allocation. The stream comes hot-off-the-queue *only* just before a kernel is launched, so we need to defer the prefetch until we have that information. Finally, the kernel itself keeps track of pointer arguments in a more accessible way so we can determine which of the kernel's pointer arguments do, in fact, point to USM allocations.
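The deferred-prefetch bookkeeping described above can be sketched roughly as follows. This is not the adapter's code: `Context`, `Kernel`, `Stream`, `PrefetchCall` and `prefetchUsmArgs` are hypothetical names, and the sketch records the prefetch requests it would make instead of calling `hipMemPrefetchAsync`, so that it builds without a HIP toolchain. It also assumes kernel arguments are base pointers of tracked allocations.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <vector>

// Hypothetical stand-in for a HIP stream handle.
struct Stream {
  int id;
};

// One recorded prefetch request. A real adapter would issue
// hipMemPrefetchAsync here instead of recording the request.
struct PrefetchCall {
  const void *ptr;
  std::size_t size;
  int streamId;
};

// Context-level registry: USM base pointer -> allocation size.
struct Context {
  std::map<const void *, std::size_t> usmAllocs;

  void trackAlloc(const void *p, std::size_t n) {
    assert(n > 0);
    usmAllocs[p] = n;
  }
  void untrackAlloc(const void *p) { usmAllocs.erase(p); }
};

// Kernel-level record of the pointer arguments set so far.
struct Kernel {
  std::vector<const void *> ptrArgs;
  void setArgPointer(const void *p) { ptrArgs.push_back(p); }
};

// Called just before launch, the first point at which the stream is known.
// Only arguments that match a tracked USM allocation are prefetched.
std::vector<PrefetchCall> prefetchUsmArgs(const Context &ctx, const Kernel &k,
                                          Stream s) {
  std::vector<PrefetchCall> calls;
  for (const void *p : k.ptrArgs) {
    auto it = ctx.usmAllocs.find(p);
    if (it != ctx.usmAllocs.end())
      calls.push_back({p, it->second, s.id});
  }
  return calls;
}
```

The lookup by exact base pointer is a simplification; pointers into the middle of an allocation would need a range lookup.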
Configuration menu - View commit details
-
Copy full SHA for f02aa8f - Browse repository at this point
Copy the full SHA f02aa8fView commit details
Commits on Aug 25, 2023
-
[UR][CUDA][HIP] Fix Set Arg Local (#10710)
In the CUDA/HIP adapters `urKernelSetArgValue` was being used to implement both `urKernelSetArgValue` & `urKernelSetArgLocal`. However, if the validation layer is enabled in UR then the path to set local arg is never taken since it includes a check that `pArgValue` is not null.

This PR:
* Implements `urKernelSetArgLocal` for CUDA/HIP adapters
* Changes `pi2ur` to call `urKernelSetArgLocal` when `arg_value` is `nullptr`
* Implements `urKernelSetArgLocal` for L0 adapter - this just calls back into `urKernelSetArgValue`.
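As an illustration of the dispatch this PR describes, here is a minimal sketch under stated assumptions. `FakeKernel`, `fakeSetArgValue`, `fakeSetArgLocal` and `setKernelArg` are hypothetical stand-ins, not the real `pi2ur` or UR signatures; the null check in `fakeSetArgValue` only mimics the validation-layer behaviour mentioned above.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical recorder standing in for an adapter kernel object.
struct FakeKernel {
  std::string lastCall;
  std::size_t lastSize = 0;
};

// Stand-in for the value path. Rejecting a null value mimics the
// validation layer's check on pArgValue.
int fakeSetArgValue(FakeKernel &k, std::size_t size, const void *value) {
  if (value == nullptr)
    return -1;
  k.lastCall = "value";
  k.lastSize = size;
  return 0;
}

// Stand-in for the local-memory path: only a size, no value pointer.
int fakeSetArgLocal(FakeKernel &k, std::size_t size) {
  k.lastCall = "local";
  k.lastSize = size;
  return 0;
}

// pi2ur-style dispatch: a null value means "local memory argument".
int setKernelArg(FakeKernel &k, std::size_t size, const void *value) {
  assert(size > 0 && "argument size must be non-zero");
  if (value == nullptr)
    return fakeSetArgLocal(k, size);
  return fakeSetArgValue(k, size, value);
}
```

Routing on `nullptr` before reaching the value path is what keeps the local-argument case from being rejected by the null check.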
Configuration menu - View commit details
-
Copy full SHA for 503755d - Browse repository at this point
Copy the full SHA 503755dView commit details -
[UR][CUDA][HIP] Fix Set Arg Local (#10710)
In the CUDA/HIP adapters `urKernelSetArgValue` was being used to implement both `urKernelSetArgValue` & `urKernelSetArgLocal`. However, if the validation layer is enabled in UR then the path to set local arg is never taken since it includes a check that `pArgValue` is not null.

This PR:
* Implements `urKernelSetArgLocal` for CUDA/HIP adapters
* Changes `pi2ur` to call `urKernelSetArgLocal` when `arg_value` is `nullptr`
* Implements `urKernelSetArgLocal` for L0 adapter - this just calls back into `urKernelSetArgValue`.
Configuration menu - View commit details
-
Copy full SHA for 4fb0004 - Browse repository at this point
Copy the full SHA 4fb0004View commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedAug 25, 2023 Configuration menu - View commit details
-
Copy full SHA for fcf0903 - Browse repository at this point
Copy the full SHA fcf0903View commit details
Commits on Aug 28, 2023
-
[SYCL][UR][HIP] Check if module needs to be unloaded (#10787)
In the HIP adapter the HIP module is not set until the program is built with `urProgramBuild`, therefore we should check that the module actually needs to be unloaded in `urProgramRelease`. The following will result in a failure, but should still be a valid UR trace:
```cpp
const uint8_t ptxSource[] = "<some ptx>";
ur_program_handle_t prog;
urProgramCreate(context, device, sizeof(ptxSource), ptxSource, nullptr, &prog);
urProgramRelease(prog); // fails when it tries to unload the module.
```
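The guard described in this commit might look roughly like the sketch below. All names (`FakeProgram`, `buildProgram`, `releaseProgram`, `fakeModuleUnload`) are hypothetical, and a counter replaces the real HIP module unload call so the example builds without HIP.

```cpp
#include <cassert>

// Hypothetical program object: the module handle stays null until built.
struct FakeProgram {
  void *module = nullptr;
  int refCount = 1;
};

static int gUnloadCalls = 0; // counts calls to the fake unload
static int gDummyModule = 0; // address used as a fake module handle

// Stand-in for the native module unload; unloading null is an error here.
void fakeModuleUnload(void *m) {
  assert(m != nullptr);
  ++gUnloadCalls;
}

// Stand-in for the build step, which is what sets the module.
void buildProgram(FakeProgram &p) { p.module = &gDummyModule; }

// Returns true once the last reference is dropped. The module is unloaded
// only if a build actually created one.
bool releaseProgram(FakeProgram &p) {
  assert(p.refCount > 0);
  if (--p.refCount > 0)
    return false;
  if (p.module != nullptr) {
    fakeModuleUnload(p.module);
    p.module = nullptr;
  }
  return true;
}
```

With this guard, creating and immediately releasing a program no longer reaches the unload call.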
Configuration menu - View commit details
-
Copy full SHA for 9d5d494 - Browse repository at this point
Copy the full SHA 9d5d494View commit details -
[UR][CUDA][HIP][L0] Cleanup licence header (#10824)
Fix the license headers at the top of each source file in the unified runtime directory. --------- Co-authored-by: Alexey Bader <alexey.bader@intel.com>
Configuration menu - View commit details
-
Copy full SHA for e0c87a5 - Browse repository at this point
Copy the full SHA e0c87a5View commit details -
[UR][CUDA][HIP][L0] Cleanup licence header (#10824)
Fix the license headers at the top of each source file in the unified runtime directory. --------- Co-authored-by: Alexey Bader <alexey.bader@intel.com>
Configuration menu - View commit details
-
Copy full SHA for b445a12 - Browse repository at this point
Copy the full SHA b445a12View commit details -
Configuration menu - View commit details
-
Copy full SHA for 37a2225 - Browse repository at this point
Copy the full SHA 37a2225View commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedAug 28, 2023 Configuration menu - View commit details
-
Copy full SHA for e714348 - Browse repository at this point
Copy the full SHA e714348View commit details
Commits on Aug 29, 2023
-
[cmake] Use separate FetchSource's per adapter
While attempting to enable the Level-Zero and OpenCL adapters at the same time, git will report an error due to the OpenCL tag not existing in the intel/llvm clone used for the Level-Zero adapter. Furthermore, if I wanted to test separate bug fixes for the CUDA and HIP adapters from different sources, this would also not be possible. This patch uses a separate FetchSource for each adapter to allow each adapter's source to be fetched independently of the others and moves those clones into the build directory. This does result in additional git clones when enabling multiple adapters and again in multiple build directories, but avoids conflicting remote URLs or multiple build configurations trampling on each other's checked out tags.
Configuration menu - View commit details
-
Copy full SHA for 789adb0 - Browse repository at this point
Copy the full SHA 789adb0View commit details -
[fuzz test] discover all corpora in 'corpus' dir
and re-name the label to 'fuzz-short' for easier usage.
Configuration menu - View commit details
-
Copy full SHA for b15f186 - Browse repository at this point
Copy the full SHA b15f186View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9f2ea9a - Browse repository at this point
Copy the full SHA 9f2ea9aView commit details -
Merge pull request oneapi-src#831 from kbenzie/benie/adapters-separat…
…e-clones [cmake] Use separate FetchSource's per adapter
Configuration menu - View commit details
-
Copy full SHA for fa4b5cf - Browse repository at this point
Copy the full SHA fa4b5cfView commit details -
[cts] Add adapters ignore lists for conformance tests
This change enables conformance tests on HW with adapters. Tests that currently do not pass are ignored, but in the future, when they are all fixed, tests should pass without using this functionality.
Configuration menu - View commit details
-
Copy full SHA for ddeb5b5 - Browse repository at this point
Copy the full SHA ddeb5b5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 007d939 - Browse repository at this point
Copy the full SHA 007d939View commit details
Commits on Aug 30, 2023
-
Merge pull request oneapi-src#823 from wlemkows/test-test
[cts] Add adapters ignore lists for conformance tests
Configuration menu - View commit details
-
Copy full SHA for 0ab28ff - Browse repository at this point
Copy the full SHA 0ab28ffView commit details -
Merge pull request oneapi-src#830 from lukaszstolarczuk/ext-fuzz-doc
Small updates in fuzz tests
Configuration menu - View commit details
-
Copy full SHA for 5577bae - Browse repository at this point
Copy the full SHA 5577baeView commit details -
[SYCL][HIP] Implemented supported make_* interop functions. (#10526)
This PR adds missing functions in the hip backend to allow for interoperability in programs that create sycl objects from native hip objects. The new function implementations are:
- `make_device`
- `make_queue`
- `make_event`

Note that it would really make sense for intel/llvm#10491 to be merged first because this PR makes the same code change in pi2ur, for a fix that is attributed to #10491.

---------
Signed-off-by: Jack Kirk <jack.kirk@codeplay.com>
Configuration menu - View commit details
-
Copy full SHA for e3d2184 - Browse repository at this point
Copy the full SHA e3d2184View commit details -
`urinfo` is a command-line tool for inspecting the current execution environment:
```console
$ build/bin/urinfo --help
usage: build/bin/urinfo [-h] [-v] [-V]

This tool enumerates Unified Runtime layers, adapters, platforms, and
devices which are currently visible in the local execution environment.

options:
  -h, --help        show this help message and exit
  --version         show version number and exit
  -v, --verbose     print additional information
```
Configuration menu - View commit details
-
Copy full SHA for 3e1cb9d - Browse repository at this point
Copy the full SHA 3e1cb9dView commit details
Commits on Aug 31, 2023
-
Merge pull request oneapi-src#751 from kbenzie/benie/urinfo
[ur] Introduce urinfo tool
Configuration menu - View commit details
-
Copy full SHA for c1bc465 - Browse repository at this point
Copy the full SHA c1bc465View commit details -
[CTS] fix cts device adapter L0 match file
Test urDeviceGetGlobalTimestampTest.SuccessSynchronizedTime is marked as OPT because it is flaky.
Configuration menu - View commit details
-
Copy full SHA for d8d1e40 - Browse repository at this point
Copy the full SHA d8d1e40View commit details -
Run fuzz test with L0 when available
When UR is built with the UR_BUILD_ADAPTER_L0 flag, run fuzz tests with level zero adapter force loaded.
Configuration menu - View commit details
-
Copy full SHA for f658931 - Browse repository at this point
Copy the full SHA f658931View commit details -
Add information on generating new corpus files
Add instructions for generating new short fuzz test scenarios.
Configuration menu - View commit details
-
Copy full SHA for 9bf0014 - Browse repository at this point
Copy the full SHA 9bf0014View commit details -
Merge pull request oneapi-src#836 from wlemkows/device-l0-match-fix
[CTS] fix cts device adapter L0 match file
Configuration menu - View commit details
-
Copy full SHA for 6114e57 - Browse repository at this point
Copy the full SHA 6114e57View commit details -
[SYCL] Optimize urEnqueueEventsWaitWithBarrier for in-order queues (#…
…10995) Don't insert zeCommandListAppendWaitOnEvents/zeCommandListAppendSignalEvent for queue::ext_oneapi_submit_barrier() if the queue is in-order and all events in the waitlist are from the same queue.
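The condition in this commit message can be expressed as a small predicate. This is only a sketch: `FakeQueue`, `FakeEvent` and `canSkipBarrierSync` are hypothetical names, and treating an empty wait list as "all from the same queue" is an assumption of the sketch, not something the commit states.

```cpp
#include <cassert>
#include <vector>

// Hypothetical queue: only the in-order property matters here.
struct FakeQueue {
  bool inOrder;
};

// Hypothetical event that remembers the queue it was submitted to.
struct FakeEvent {
  const FakeQueue *queue;
};

// True when the explicit wait/signal pair is redundant: the queue is
// in-order and every event in the wait list came from that same queue,
// so submission order already provides the ordering.
bool canSkipBarrierSync(const FakeQueue &q,
                        const std::vector<FakeEvent> &waitList) {
  if (!q.inOrder)
    return false;
  for (const FakeEvent &e : waitList) {
    assert(e.queue != nullptr);
    if (e.queue != &q)
      return false;
  }
  return true; // assumption: an empty wait list also qualifies
}
```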
Configuration menu - View commit details
-
Copy full SHA for afa564d - Browse repository at this point
Copy the full SHA afa564dView commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedAug 31, 2023 Configuration menu - View commit details
-
Copy full SHA for 1f5b07b - Browse repository at this point
Copy the full SHA 1f5b07bView commit details
Commits on Sep 1, 2023
-
[ci] fix running cts tests that require device binaries
Creating device binaries silently fails if SYCL isn't available.
Configuration menu - View commit details
-
Copy full SHA for fc6f8db - Browse repository at this point
Copy the full SHA fc6f8dbView commit details -
Merge pull request oneapi-src#839 from PatKamin/fuzztest-readme
Add information on generating new corpus files
Configuration menu - View commit details
-
Copy full SHA for d242d04 - Browse repository at this point
Copy the full SHA d242d04View commit details -
Merge pull request oneapi-src#838 from PatKamin/fuzztest-l0
Run fuzz test with L0 when available
Configuration menu - View commit details
-
Copy full SHA for 17e4bcd - Browse repository at this point
Copy the full SHA 17e4bcdView commit details -
Remove phony nightly CI workflow
Cron scheduled workflows are triggered on the default branch only, which is the 'main' branch.
Configuration menu - View commit details
-
Copy full SHA for 7e342cc - Browse repository at this point
Copy the full SHA 7e342ccView commit details -
Merge pull request oneapi-src#845 from PatKamin/remove-nightly-build
Remove phony nightly CI workflow [adapters branch]
Configuration menu - View commit details
-
Copy full SHA for b7e8594 - Browse repository at this point
Copy the full SHA b7e8594View commit details -
Configuration menu - View commit details
-
Copy full SHA for 17e60d3 - Browse repository at this point
Copy the full SHA 17e60d3View commit details -
Remove now redundant job (w/o HW)
In its current form this job is redundant, since we already run the same builds on the same OS with hardware and test execution.
Configuration menu - View commit details
-
Copy full SHA for 18b85d4 - Browse repository at this point
Copy the full SHA 18b85d4View commit details -
LLVM and SPIRV-LLVM-Translator pulldown (WW35)
LLVM: llvm/llvm-project@ac2d265 SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@8701279
Configuration menu - View commit details
-
Copy full SHA for 475b1aa - Browse repository at this point
Copy the full SHA 475b1aaView commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedSep 1, 2023 Configuration menu - View commit details
-
Copy full SHA for 05575e8 - Browse repository at this point
Copy the full SHA 05575e8View commit details
Commits on Sep 4, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 8ccb7ba - Browse repository at this point
Copy the full SHA 8ccb7baView commit details -
[SYCL][HIP] Add managed memory USM checks for HIP prefetch (#10761)
Adds a `hipDeviceAttributeManagedMemory` attribute check to verify that the HIP device supports managed memory before using the prefetch API, and checks that the pointer to migrate is actually a managed allocation (USM/SVM). If the check fails we return early and set a warning message (with `UR_RESULT_SUCCESS` as return code). As a follow-up, we may need to detect system-allocated memory and either continue with just a warning message or throw a more targeted error message and exit.
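A rough sketch of the early-return behaviour described above, with everything HIP-specific replaced by hypothetical stand-ins (`FakeHipDevice`, `PrefetchOutcome`, `prefetchIfManaged`). In a real adapter the two booleans would come from device-attribute and pointer-attribute queries; here they are plain inputs.

```cpp
#include <cassert>
#include <string>

// Hypothetical result code; only the success value is needed here.
enum class FakeResult { Success };

// Hypothetical device: whether it reports managed-memory support.
struct FakeHipDevice {
  bool supportsManagedMemory;
};

// What the prefetch entry point reports back in this sketch.
struct PrefetchOutcome {
  FakeResult result;
  bool prefetchIssued;
  std::string warning;
};

// Returns success in every case, but skips the prefetch (and sets a
// warning) when the device or the pointer does not qualify.
PrefetchOutcome prefetchIfManaged(const FakeHipDevice &dev, const void *ptr,
                                  bool ptrIsManaged) {
  assert(ptr != nullptr);
  if (!dev.supportsManagedMemory)
    return {FakeResult::Success, false,
            "device does not support managed memory"};
  if (!ptrIsManaged)
    return {FakeResult::Success, false, "pointer is not a managed allocation"};
  return {FakeResult::Success, true, ""};
}
```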
Configuration menu - View commit details
-
Copy full SHA for 68faafa - Browse repository at this point
Copy the full SHA 68faafaView commit details -
Merge pull request oneapi-src#843 from PatKamin/shorten-ci
[adapters] Shorten CI
Configuration menu - View commit details
-
Copy full SHA for 6a2f948 - Browse repository at this point
Copy the full SHA 6a2f948View commit details -
Merge pull request oneapi-src#840 from pbalcer/ci-device-binaries
[ci] fix running cts tests that require device binaries
Configuration menu - View commit details
-
Copy full SHA for 20337b2 - Browse repository at this point
Copy the full SHA 20337b2View commit details
Commits on Sep 5, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 544ded5 - Browse repository at this point
Copy the full SHA 544ded5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 41883a9 - Browse repository at this point
Copy the full SHA 41883a9View commit details -
Configuration menu - View commit details
-
Copy full SHA for aa88398 - Browse repository at this point
Copy the full SHA aa88398View commit details -
Configuration menu - View commit details
-
Copy full SHA for cabcd98 - Browse repository at this point
Copy the full SHA cabcd98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 80f6eb3 - Browse repository at this point
Copy the full SHA 80f6eb3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 60372d3 - Browse repository at this point
Copy the full SHA 60372d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 115abc2 - Browse repository at this point
Copy the full SHA 115abc2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4797c71 - Browse repository at this point
Copy the full SHA 4797c71View commit details -
[SYCL] [UR] Fix for make_queue. (#11046)
Correction to make_queue functionality. During the transition from L0 Plugin to UR Adapter a line of code in the make queue function was inadvertently dropped. This change restores it and also modifies the associated test to do more careful checking.
Configuration menu - View commit details
-
Copy full SHA for c33b13b - Browse repository at this point
Copy the full SHA c33b13bView commit details -
Merge from 'sycl' to 'sycl-web'
iclsrc committedSep 5, 2023 Configuration menu - View commit details
-
Copy full SHA for 502401b - Browse repository at this point
Copy the full SHA 502401bView commit details
Commits on Sep 8, 2023
-
[UR][CUDA][HIP] Replace `ur::assertion` with `UR_CHECK_ERROR` (#10986)
Lots of hip/cu driver API calls were wrapped in `ur::assertion(res == CU_SUCCESS)` etc which:
- Means that any native error messages returned from the affected driver api calls were lost.
- Since these APIs report errors asynchronously, such that they are thrown from the last API call rather than the call which led to the error, previous asynchronous error messages from different APIs to the ones wrapped by the `ur::assertion` could also be lost depending on user code.

These problems are fixed by swapping these assertions with `UR_CHECK_ERROR`. Note that in the future UR may want to adjust `UR_CHECK_ERROR` so that it throws `UR_RESULT_ERROR_ADAPTER_SPECIFIC` etc instead of using `std::cerr` etc to report the error. But I think it makes sense to still use `UR_CHECK_ERROR` to wrap driver API calls because it means that the `__LINE__`, `__FUNCTION__` etc info can be correctly passed to native error reporting.

---------
Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
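To show why a macro wrapper preserves call-site information, here is a minimal sketch of the general technique. It is not the adapters' `UR_CHECK_ERROR`: `SKETCH_CHECK_ERROR`, `checkNativeError`, `fakeDriverCall` and the error value `700` are all made up, and throwing `std::runtime_error` is an assumption chosen to keep the example self-contained.

```cpp
#include <cassert>
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical native result type; 0 means success.
using FakeNativeResult = int;
constexpr FakeNativeResult FAKE_SUCCESS = 0;

// Reports the native code together with the call site it came from.
inline void checkNativeError(FakeNativeResult res, const char *func, int line,
                             const char *file) {
  assert(func != nullptr && file != nullptr);
  if (res == FAKE_SUCCESS)
    return;
  std::ostringstream msg;
  msg << "native error " << res << " in " << func << " at " << file << ":"
      << line;
  throw std::runtime_error(msg.str());
}

// Being a macro, this expands at the call site, so __func__, __LINE__ and
// __FILE__ describe the wrapped driver call rather than the helper.
#define SKETCH_CHECK_ERROR(result)                                             \
  checkNativeError((result), __func__, __LINE__, __FILE__)

// Stand-in for a driver API call that may fail.
FakeNativeResult fakeDriverCall(bool fail) { return fail ? 700 : FAKE_SUCCESS; }

// Wraps one driver call and returns either "ok" or the error report.
std::string runAndReport(bool fail) {
  try {
    SKETCH_CHECK_ERROR(fakeDriverCall(fail));
    return "ok";
  } catch (const std::runtime_error &e) {
    return e.what();
  }
}
```

A plain boolean assertion on `res == SUCCESS` would discard the numeric code, whereas the wrapper above keeps both the code and the location.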
Configuration menu - View commit details
-
Copy full SHA for 8639075 - Browse repository at this point
Copy the full SHA 8639075View commit details
Commits on Sep 12, 2023
-
[SYCL][UR] Implement usm memory pool for hip adapter. (#11065)
Based on #10758 Co-authored-by: Ronan Keryell <ronan@keryell.fr>
Configuration menu - View commit details
-
Copy full SHA for 73527bc - Browse repository at this point
Copy the full SHA 73527bcView commit details -
[SYCL][UR] Remove unused MaxAllocSize member variable and fix unused …
…parameter 'Alignment' warning (#11154) Fixes https://github.com/intel/llvm/actions/runs/6154418223/job/16699759620
Configuration menu - View commit details
-
Copy full SHA for e08d63b - Browse repository at this point
Copy the full SHA e08d63bView commit details -
[SYCL][UR][L0] Use uint8_t instead of uint32_t for 2 cases of urConte…
…xtGetInfo (#10929) Modified the return value for UR_CONTEXT_INFO_USM_FILL2D_SUPPORT and UR_CONTEXT_INFO_USM_MEMCPY_2D_SUPPORT to return uint8_t instead of uint32_t to comply with unified-runtime conformance test. Signed-off-by: Zhang, Winston <winston.zhang@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 14178b0 - Browse repository at this point
Copy the full SHA 14178b0View commit details -
[SYCL] Return Success in USM Mem Free if L0 driver is already release…
…d (#11075) Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 9cf2ff8 - Browse repository at this point
Copy the full SHA 9cf2ff8View commit details
Commits on Sep 14, 2023
-
[SYCL][UR][L0] Replace memory type look-up with UMF tracking (#10807)
This also makes SharedReadOnly allocations tracking obsolete.
Configuration menu - View commit details
-
Copy full SHA for 7ced7ca - Browse repository at this point
Copy the full SHA 7ced7caView commit details -
[SYCL][UR][L0] Fix USMProxyPool unused parameter errors (#11178)
Post-commit error fix for: intel/llvm#10807
Configuration menu - View commit details
-
Copy full SHA for a0a20d8 - Browse repository at this point
Copy the full SHA a0a20d8View commit details
Commits on Sep 15, 2023
-
[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)
Fetch the latest revision of unified runtime.

Notable changes:
* Some command buffer entry-points have been renamed, also 2 additions of membufferFill and USMFill
* urInit/urTearDown have been removed - replaced with loader only versions.
Configuration menu - View commit details
-
Copy full SHA for 630fbb8 - Browse repository at this point
Copy the full SHA 630fbb8View commit details -
[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)
Fetch the latest revision of unified runtime.

Notable changes:
* Some command buffer entry-points have been renamed, also 2 additions of membufferFill and USMFill
* urInit/urTearDown have been removed - replaced with loader only versions.
Configuration menu - View commit details
-
Copy full SHA for 0b56545 - Browse repository at this point
Copy the full SHA 0b56545View commit details
Commits on Sep 18, 2023
-
Merge pull request oneapi-src#610 from kurapov-peter/pakurapo/codegen…
…-example End-to-end code generation example
Configuration menu - View commit details
-
Copy full SHA for 1d706f6 - Browse repository at this point
Copy the full SHA 1d706f6View commit details
Commits on Sep 19, 2023
-
[SYCL] Add esimd as an optional device feature (#10017)
This change adds a new aspect for esimd, `ext_intel_esimd`, and annotates the two fundamental esimd classes, `simd_obj_impl` and `simd_view_impl`, with the `uses_aspect` attribute. `simd_obj_impl` is the base class of `simd` and `simd_mask`, which are the fundamental user-facing classes. `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not a base of `simd_view_impl`, but every `simd_view_impl` requires a reference to a `simd` or `simd_mask` at construction time, so I am not sure if we truly need to annotate `simd_view_impl`, but I added it to be safe.

It also adds a new PI device info query, `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT`, that is used to query at runtime if a device supports ESIMD. For UR-based plugins, we map that to `UR_DEVICE_INFO_ESIMD_SUPPORT`. The implementation simply returns false for cuda, hip and native_cpu. For l0 and opencl, we check that the device is an intel gpu by checking that the device type is gpu and the vendor id is `0x8086`. For ESIMD emulator we simply return true. I would appreciate careful review on the plugin changes in particular, as I am not an expert.

This change also updates the esimd spec to document the new aspect. In a future change, I plan to use the new aspect to remove the requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros that set function attributes, but I am not doing that as part of this PR as it requires more investigation.

---------
Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
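The per-backend rule in this commit message can be summarised in a short sketch. The types (`FakeBackend`, `FakeDeviceType`, `FakeDevice`) and the function `supportsEsimd` are hypothetical; only the vendor id `0x8086` and the per-backend outcomes are taken from the text above.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical enumerations standing in for the plugin/adapter kinds.
enum class FakeBackend { LevelZero, OpenCL, Cuda, Hip, NativeCpu, EsimdEmulator };
enum class FakeDeviceType { Cpu, Gpu, Other };

struct FakeDevice {
  FakeBackend backend;
  FakeDeviceType type;
  std::uint32_t vendorId;
};

constexpr std::uint32_t kIntelVendorId = 0x8086;

// Mirrors the rule in the commit message: false for cuda, hip and
// native_cpu, true for the ESIMD emulator, and "Intel GPU" for l0/opencl.
bool supportsEsimd(const FakeDevice &d) {
  switch (d.backend) {
  case FakeBackend::Cuda:
  case FakeBackend::Hip:
  case FakeBackend::NativeCpu:
    return false;
  case FakeBackend::EsimdEmulator:
    return true;
  case FakeBackend::LevelZero:
  case FakeBackend::OpenCL:
    return d.type == FakeDeviceType::Gpu && d.vendorId == kIntelVendorId;
  }
  assert(false && "unhandled backend");
  return false;
}
```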
Configuration menu - View commit details
-
Copy full SHA for 3b84011 - Browse repository at this point
Copy the full SHA 3b84011View commit details -
[SYCL] Add esimd as an optional device feature (#10017)
This change adds a new aspect for esimd, `ext_intel_esimd`, and annotates the two fundamental esimd classes, `simd_obj_impl` and `simd_view_impl`, with the `uses_aspect` attribute. `simd_obj_impl` is the base class of `simd` and `simd_mask`, which are the fundamental user-facing classes. `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not a base of `simd_view_impl`, but every `simd_view_impl` requires a reference to a `simd` or `simd_mask` at construction time, so I am not sure if we truly need to annotate `simd_view_impl`, but I added it to be safe.

It also adds a new PI device info query, `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT`, that is used to query at runtime if a device supports ESIMD. For UR-based plugins, we map that to `UR_DEVICE_INFO_ESIMD_SUPPORT`. The implementation simply returns false for cuda, hip and native_cpu. For l0 and opencl, we check that the device is an intel gpu by checking that the device type is gpu and the vendor id is `0x8086`. For ESIMD emulator we simply return true. I would appreciate careful review on the plugin changes in particular, as I am not an expert.

This change also updates the esimd spec to document the new aspect. In a future change, I plan to use the new aspect to remove the requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros that set function attributes, but I am not doing that as part of this PR as it requires more investigation.

---------
Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 5c1d65e - Browse repository at this point
Copy the full SHA 5c1d65eView commit details
Commits on Sep 20, 2023
-
Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)
Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for the past few days.
Configuration menu - View commit details
-
Copy full SHA for ee78d5f - Browse repository at this point
Copy the full SHA ee78d5fView commit details -
Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)
Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for the past few days.
Configuration menu - View commit details
-
Copy full SHA for eb86b33 - Browse repository at this point
Copy the full SHA eb86b33View commit details -
[SYCL][Graph] Bug fixes for hanging tests and urCommandBufferAppendKe…
…rnelLaunchExp (#11232)
This patch addresses two issues found in testing:
- Test hangs: This patch serializes the graph submissions in the tests to prevent hangs when submitting multiple graphs.
- Issue adding a kernel to the command buffer in `urCommandBufferAppendKernelLaunchExp`.

---------
Co-authored-by: Maxime France-Pillois <maxime.francepillois@codeplay.com>
Configuration menu - View commit details
-
Copy full SHA for ee3a5de - Browse repository at this point
Copy the full SHA ee3a5deView commit details
Commits on Sep 21, 2023
-
[UR][L0] Make the queue [root-]device appear first in the context (#1…
…1213) Signed-off-by: smaslov <sergey.v.maslov@intel.com>
Configuration menu - View commit details
-
Copy full SHA for a1a57f5 - Browse repository at this point
Copy the full SHA a1a57f5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 10421df - Browse repository at this point
Copy the full SHA 10421dfView commit details -
Configuration menu - View commit details
-
Copy full SHA for f0944fe - Browse repository at this point
Copy the full SHA f0944feView commit details -
Configuration menu - View commit details
-
Copy full SHA for 89c1377 - Browse repository at this point
Copy the full SHA 89c1377View commit details -
[SYCL][HIP] Fix hip prefetch/mem_advise (noop) for rocm < 5.0 (#10958)
The main fix here is to enable prefetch functionality when the HIP backend is built with rocm versions < 5. The change to the prefetch/mem_advise UR hip adapter APIs fixes four e2e tests that fail in test-e2e only for HIP 4.x versions (these failures don't come up in the CI since it tests using rocm 5.x).

The change to Tracing/image_printers.cpp is effectively re-XFailing this test for rocm 4 only. rocm 4 doesn't support `hipCreateSurfaceObject`, which is called by this test. I'm not sure this legacy image functionality (which as I understand it will be replaced by bindless images) is really working for rocm 5 in a meaningful way, since most other legacy image e2e tests are XFAIL for hip. But this test can still be useful for us for ROCM 5 compatibility testing. The XFAIL was recently removed in intel/llvm@745febe, which led to us finding the rocm 4 failure. I guess that the CI at some point switched from testing rocm 4 to testing rocm 5, which meant this test stopped failing in CI.

---------
Signed-off-by: Jack Kirk <jack.kirk@codeplay.com>
Commit cca22fa
-
Commit d9f48c5
Commits on Sep 22, 2023
-
[SYCL][NFC] Make UR_CHECK_ERROR a void return macro (#11100)
`UR_CHECK_ERROR` was designed to return `ur_result_t`, however in practice it was guaranteed to only ever return `UR_RESULT_SUCCESS`, as other paths would either terminate, abort or throw. This in turn leads to poor-quality, error-prone code, as the codebase was littered with:
* statements not checking the return value, which generates a warning depending on the compiler,
* extra checks on the return value that were only ever going to be true.
Some care was required, as the codebase has a habit of accumulating error codes across branches, so depending on the use case the initial value of each `ur_result_t Result` had to be set accordingly (now that `UR_CHECK_ERROR` does not return a value).
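The shape of that change can be sketched as follows. This is a minimal illustration, not the adapters' actual macro: `mock_result_t`, `MOCK_CHECK_ERROR`, `checkErrorImpl`, `nativeCall` and `doWork` are all hypothetical names, and the sketch assumes the throwing path only.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>

// Hypothetical stand-ins; these are not the real UR or driver types.
enum mock_result_t { MOCK_SUCCESS = 0, MOCK_ERROR_INVALID_VALUE = 1 };

// Throws on failure and returns nothing, so there is no return value
// for callers to ignore or to re-check.
inline void checkErrorImpl(mock_result_t Res, const char *File, int Line) {
  if (Res != MOCK_SUCCESS)
    throw std::runtime_error(std::string(File) + ":" + std::to_string(Line) +
                             ": native call failed");
}
#define MOCK_CHECK_ERROR(Call) checkErrorImpl((Call), __FILE__, __LINE__)

// Hypothetical native call that either succeeds or fails.
inline mock_result_t nativeCall(bool Ok) {
  return Ok ? MOCK_SUCCESS : MOCK_ERROR_INVALID_VALUE;
}

// Result must start out as success, because the macro no longer yields
// a value that could be assigned to it.
inline mock_result_t doWork(bool Ok) {
  mock_result_t Result = MOCK_SUCCESS;
  try {
    MOCK_CHECK_ERROR(nativeCall(Ok));
  } catch (const std::runtime_error &) {
    Result = MOCK_ERROR_INVALID_VALUE;
  }
  return Result;
}
```

With this shape, `doWork(true)` yields `MOCK_SUCCESS` and `doWork(false)` yields `MOCK_ERROR_INVALID_VALUE`, without any caller inspecting the macro's result.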
Commit 834e127
-
[UR][L0] Do not rely on CTAD to avoid build warnings/failures (#11255)
Fixes post-commit build fails in https://github.com/intel/llvm/actions/runs/6256835642/job/16988269333 Signed-off-by: smaslov <sergey.v.maslov@intel.com>
Commit 72bc1d4
Commits on Sep 25, 2023
-
Commit f5e0ed5
-
Commit aa80476
-
Merge pull request oneapi-src#889 from aarongreig/aaron/disableGenerateWindowsCherrypick
[ADAPTERS] Cherry-pick "Temporarily disable running check-generated target in windows CI."
Commit 4c6d556
-
[test] improve match file error output
When a test fails a match, CMake prints both the input file and the match file, so that it's easy to see why the failure happened and how to fix it. Unfortunately, CMake was inserting 3 newlines between every line of both input and match files, making things hard to read. This patch fixes this by moving the printing of match and input files to the Python script.
Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
Commit be85a49
Commits on Sep 26, 2023
-
Merge pull request oneapi-src#886 from pbalcer/better-match-out
[test] improve match file error output
Commit d0200e3
-
Commit 442de38
-
Commit 663d9f3
-
Commit b368ca1
-
Commit 15a089a
-
Commit e66fbfa
-
Commit fd33970
-
Remove misplaced statement from urKernelCreate doc
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Commit 179695d
-
Commit b457b11
-
Commit c4c3455
-
Commit 22b9cf9
-
Revert "(NATIVECPU) added #define for device binary target"
This reverts commit c6e2987.
Commit d7fa90a
-
Revert "(NATIVECPU) added #define for device binary target"
This reverts commit afa0525.
Commit 7dad101
-
Commit 30c07c2
-
Commit fb87081
-
Commit bb65b2c
-
[CMake] Omit warning for missing 'generate' command - make it a status
Lack of this command is not a problem for projects using UR and they may use it without 'UR_FORMAT_CPP_STYLE' set.
Commit 33ed352
-
Commit fdf3597
-
[CMake] Make Python3 required dependency
We basically use it everywhere.
Commit 7c3cf17
-
Add UR_DEVICE_INFO_ESIMD_SUPPORT_EXP
This will be used by DPC++ PI to determine if a device supports the DPC++ ESIMD extension. Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
Commit ab1967c
-
Rework change to not use experimental infrastructure
Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
Commit 6d9c29d
-
Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
Commit ddf6ac4
-
[UR] Bump sycl nightly version
[UR] Fix ur info spec
[UR] Bump compiler version in CI
[UR] Extract into directory
[UR] Temporarily disable 20.04
[UR] Disable dpc++ on 20.04
[UR] Fix hip matchers
[UR] Fix cuda match files
[UR] Fix L0 match files
[UR] Tmp print out os info
[UR] Tmp enable verbose logging
[UR] Print out match files
[L0] Remove segfault matchers
[UR] Fix cuda matcher
[UR] Remove temporary fixes
Commit 294a4d6
-
[SYCL][UR][L0] Modify flag for zeInit (#11242)
Based on the existence of NPU driver for L0, GPU flag is now passed to zeInit
Signed-off-by: Zhang, Winston <winston.zhang@intel.com>
Commit 4320ea1
-
Merge pull request oneapi-src#881 from veselypeta/petr/bump_adapters
[ADAPTERS] Bump adapters branch to commit used in SYCL
Commit 88e71c0
-
Commit e2d8d26
-
Commit ffcba56
-
Merge pull request oneapi-src#879 from veselypeta/petr/level_zero
Move L0 Sources from intel/llvm to UR.
Commit 83799e6
-
Commit d4d936f
-
Merge pull request oneapi-src#891 from callumfare/fix_adapter_def_path
Use absolute path for adapter .def files
Commit 886e593
-
Commit b29c580
-
Commit d2edad6
-
Merge pull request oneapi-src#892 from callumfare/fix_l0_adapter_ddi
[L0] Add missing loader interfaces to L0 adapter
Commit ed395be
-
Commit 8a10921
Commits on Sep 27, 2023
-
Merge pull request oneapi-src#895 from callumfare/l0_adapter_image_cpp
[L0] Add missing image.cpp source file to the adapter
Commit c791b8b
-
Commit a33007b
-
Merge pull request oneapi-src#876 from omarahmed1111/followup-review-feedback-on-e2e-codegen-example
Followup review feedback on e2e codegen example
Commit 7180719
-
Merge pull request oneapi-src#894 from veselypeta/petr/fixup-level-zero-licenses
[UR][L0] Cleanup L0 License headers
Commit 48b4062
-
[SYCL][CUDA] Port CUDA plugin to Unified Runtime (#9512)
This moves the CUDA plugin implementation to Unified Runtime; and changes the pi_cuda plugin to use pi2ur to implement PI. The changes to the implementation have been kept to a minimum and should be functionally the same. Documentation and comments have been moved verbatim, other than changing PI references to UR.
This PR is based on top of the Level Zero adapter (#8744) so will only be ready when that is merged.
---------
Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
Co-authored-by: Omar Ahmed <omar.ahmed@codeplay.com>
Co-authored-by: Martin Morrison-Grant <martin.morrisongrant@codeplay.com>
Co-authored-by: Aaron Greig <aaron.greig@codeplay.com>
Commit 8fbd9af
-
[SYCL][CUDA] Fix post merge errors from #9512 (#9872)
Resolves the warnings as errors reported in [post merge](https://github.com/intel/llvm/actions/runs/5266121277/jobs/9519634360) as a result of merging #9512. Additionally move pre-processor guards to resolve unused global variables which would also fail in this build configuration (clang & SYCL_ENABLE_WERROR=ON).
Commit 5f22cfc
-
[SYCL][CUDA][HIP] Block until the event is ready to start profiling. (#9938)
* A call to `hipEventElapsedTime` returns `hipErrorNotReady` when the timestamp has not yet been recorded on one or both events. Calling `hipEventSynchronize` blocks until the event is ready.
* The issue showed itself when profiling the sycl-blas benchmark.
* Enable support for cuda / hip in event_profiling_info.cpp
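The ordering described in this commit can be sketched with a small mock. This is not the HIP API: `MockEvent`, `mockEventSynchronize`, `mockEventElapsedTime` and `elapsedAfterSync` are hypothetical stand-ins that only model the "not ready until recorded" behaviour, and the mock synchronize marks the event as recorded instead of actually blocking.

```cpp
#include <cassert>

// Hypothetical mock of the error codes and event state; not real HIP types.
enum MockError { MockSuccess, MockErrorNotReady };

struct MockEvent {
  bool Recorded = false;    // true once the timestamp has been captured
  float TimestampMs = 0.0f; // timestamp in milliseconds
};

// Stand-in for a blocking synchronize: after it returns, the event is ready.
inline MockError mockEventSynchronize(MockEvent &E) {
  E.Recorded = true;
  return MockSuccess;
}

// Stand-in for an elapsed-time query: fails if either event is not ready.
inline MockError mockEventElapsedTime(float *Ms, const MockEvent &Start,
                                      const MockEvent &Stop) {
  if (!Start.Recorded || !Stop.Recorded)
    return MockErrorNotReady;
  *Ms = Stop.TimestampMs - Start.TimestampMs;
  return MockSuccess;
}

// Synchronize on both events first, then query the elapsed time.
inline MockError elapsedAfterSync(float *Ms, MockEvent &Start,
                                  MockEvent &Stop) {
  mockEventSynchronize(Start);
  mockEventSynchronize(Stop);
  return mockEventElapsedTime(Ms, Start, Stop);
}
```

Querying before synchronizing reports "not ready" in this mock, while `elapsedAfterSync` succeeds for the same pair of events.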
Commit 24d46d8
-
[SYCL][UR] Update Unified Runtime tag to support UR_DEVICE_INFO_IP_VERSION (#9873)
This should have been an obvious update of Unified Runtime tag to support UR_DEVICE_INFO_IP_VERSION, required in intel/llvm#9843 (just tag update, nothing else), but it also brought many API breaks caused by this patch: oneapi-src#536. So the current PR updates our codebase in accordance with changed UR API.
---------
Signed-off-by: Dmitry Vodopyanov <dmitry.vodopyanov@intel.com>
Co-authored-by: Callum Fare <callum@codeplay.com>
Co-authored-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Commit 4f7afd2
-
Commit 2f97119
-
[SYCL][CUDA] Fix broken synchronization of CUstreams (#10031)
This fixes a change in `hasBeenSynchronized` accidentally introduced during the UR port
Commit 6f29e4a
-
Partially reverts back the update on guess local work size function. (#10055)
* Reverts the update to the calculation of threads per block for the 0th dimension when primary ranges are involved. That could cause out-of-range access.
Commit 23999fc
-
[SYCL][UR] Bump UR version and add missing values to pi2ur (#10049)
Includes a large number of missing `pi_result` mappings, so previously reported errors should no longer map to just `PI_ERROR_UNKNOWN`. NFCI for the adapters.
Commit 16d4f24
-
[SYCL][CUDA] Remove size checks from USM allocations (#10034)
These checks are causing issues for very large USM allocations because the `MAX_MEM_ALLOC_SIZE` reported is lower than what CUDA actually supports. We will follow up with an update on the reported `MAX_MEM_ALLOC_SIZE`, but it makes sense to remove the checks either way, as the CUDA allocation functions will return an error if they can't allocate the memory.
Commit 9d1ea51
-
[SYCL][Graph] L0 Backend support for SYCL Graphs (2/4) (#9992)
# Level Zero Backend Support for SYCL Graphs
This is the second patch of a series that adds support for an [experimental command graph extension](intel/llvm#5626).
A snapshot of the complete work can be seen in draft PR #9375, which supports all the specification-defined ways of adding nodes and edges to the graph, including both Explicit and Record & Replay graph construction. The two types of nodes currently implemented are kernel execution and memcpy commands.
See https://github.com/reble/llvm#implementation-status for the status of our total work.
## Scope
This second patch focuses on the required PI/UR support for the experimental command-buffer feature in the Level Zero adapter:
* PI stubs for all adapters to enable compilation, no functionality.
* Command-buffer implementation for the Level Zero UR adapter.
* Stubs for the CUDA UR adapter to enable compilation, no functionality.
## Following Split PRs
Future follow-up PRs with the remainder of our work on the extension will include:
* Hooking up backend to graphs runtime, bugfixes and other feature additions, will add symbols but not break the ABI. (3/4)
* Add end-to-end tests for SYCL Graph extension. (4/4)
* NFC changes - Design doc and codeowner update.
## Authors
Co-authored-by: Pablo Reble <pablo.reble@intel.com>
Co-authored-by: Julian Miller <julian.miller@intel.com>
Co-authored-by: Ben Tracy <ben.tracy@codeplay.com>
Co-authored-by: Ewan Crawford <ewan@codeplay.com>
Co-authored-by: Maxime France-Pillois <maxime.francepillois@codeplay.com>
---------
Co-authored-by: Ewan Crawford <ewan@codeplay.com>
Co-authored-by: Maxime France-Pillois <maxime.francepillois@codeplay.com>
Commit caed4b7
-
[SYCL][UR][CUDA] Fix typo in ur_device_handle_t (#10130)
The destructor should be calling `cuDevicePrimaryCtxRelease`
Commit 5b95881
-
[SYCL][HIP] Ignore unused PI command-buffer parameters (#10192)
Fixes error found in [post-commit CI](https://github.com/intel/llvm/actions/runs/5454766342/jobs/9925392005) after the merge of intel/llvm#9992
```
/__w/llvm/llvm/src/sycl/plugins/hip/pi_hip.cpp:5635:24: error: unused parameter 'sync_point' [-Werror,-Wunused-parameter]
 5635 | pi_ext_sync_point *sync_point) {
      | ^
/__w/llvm/llvm/src/sycl/plugins/hip/pi_hip.cpp:5691:12: error: unused parameter 'dst_row_pitch' [-Werror,-Wunused-parameter]
 5691 | size_t dst_row_pitch, size_t dst_slice_pitch,
      | ^
/__w/llvm/llvm/src/sycl/plugins/hip/pi_hip.cpp:5691:34: error: unused parameter 'dst_slice_pitch' [-Werror,-Wunused-parameter]
 5691 | size_t dst_row_pitch, size_t dst_slice_pitch,
      | ^
3 errors generated.
```
Commit 69b8a12
-
[SYCL][CUDA] Fix cuDeviceGetUuid for older CUDA version (#10205)
This patch re-introduces the fix from intel/llvm#8765 Which seems to have been accidentally dropped by the UR port.
Commit bb24713
-
[SYCL][CUDA] Cached info from CUDA API calls to the kernel and device classes (#10104)
This PR fixes an issue with overhead when calling the same kernel multiple times in a loop. Right now, some calls to CUDA API happen every time the kernel is invoked to query the same information. Those calls were moved to `device` and `kernel` constructors and the info was cached as private members of those classes.
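The caching idea can be sketched as below. `MockDriver` and `MockDevice` are hypothetical names, and the value `1024` is an arbitrary placeholder, not a real device limit; the sketch only shows a query being made once in the constructor and then served from a private member.

```cpp
#include <cassert>

// Hypothetical stand-in for a driver whose queries are comparatively costly.
struct MockDriver {
  int QueryCount = 0; // counts how often the driver is actually asked
  int queryMaxThreads() {
    ++QueryCount;
    return 1024; // placeholder value for illustration only
  }
};

// The device queries the driver once at construction and caches the answer.
class MockDevice {
  int MaxThreads; // cached result of the driver query

public:
  explicit MockDevice(MockDriver &Driver)
      : MaxThreads(Driver.queryMaxThreads()) {}

  // Repeated lookups, e.g. on every kernel launch, never reach the driver.
  int getMaxThreads() const { return MaxThreads; }
};
```

However many times `getMaxThreads()` is called on one `MockDevice`, `QueryCount` on the driver stays at one.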
Commit 4ec1ec9
-
[SYCL][UR][CUDA] Remove unnecessary UR entry point validation (#9969)
These checks are already performed by the UR validation layer
Commit 0cdc873
-
[SYCL][CUDA] Implement sycl_ext_oneapi_peer_access extension (#8303)
This implements the current extension doc from intel/llvm#6104 in the CUDA backend only.
Fixes intel/llvm#7543.
Fixes intel/llvm#6749.
---------
Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
Co-authored-by: Nicolas Miller <nicolas.miller@codeplay.com>
Co-authored-by: JackAKirk <chezjakirk@gmail.com>
Co-authored-by: Steffen Larsen <steffen.larsen@intel.com>
Commit 55f3e87
-
[PI][UR] Report error from cuInit (#10353)
Currently if `cuInit` fails while getting platforms it returns UR_RESULT_SUCCESS. Instead we use `UR_CHECK_ERROR` to return the correct error code.
Commit 1637319
-
[SYCL][CUDA] Remove SYCL dependencies from CUDA UR adapter (#10135)
This was preventing out-of-tree build of the adapter for standalone use with unified runtime. Signed-off-by: Piotr Balcer <piotr.balcer@intel.com>
Commit 3696b20
-
[SYCL][Bindless][2/4] Add experimental implementation of SYCL bindless images extension (#10112)
# Experimental Implementation of SYCL Bindless Images Extension
This commit stands as the second commit of four to make code review easier, implementing revision 4 of the [bindless images extension proposal](intel/llvm#9842).
## Scope
This PR covers changes made to the PI and the UR. This includes
- Extending PI with extension functions
- Updating UR FetchContent commit and implementing [UR bindless images experimental features](https://oneapi-src.github.io/unified-runtime/core/EXP-BINDLESS-IMAGES.html) on the CUDA adaptor
## Following Split PRs
- [3/4] Implement the user-facing SYCL extension
- [4/4] Add tests
## Authors
Co-authored-by: Isaac Ault <isaac.ault@codeplay.com>
Co-authored-by: Hugh Bird <hugh.bird@codeplay.com>
Co-authored-by: Duncan Brawley <duncan.brawley@codeplay.com>
Co-authored-by: Przemek Malon <przemek.malon@codeplay.com>
Co-authored-by: Chedy Najjar <chedy.najjar@codeplay.com>
Co-authored-by: Sean Stirling <sean.stirling@codeplay.com>
Co-authored-by: Peter Zuzek <peter@codeplay.com>
Commit a9ff0ed
-
Commit 0af7d63
-
[SYCL][CUDA] Fix device creation from native handle (#10450)
In CUDA objects are represented as integers rather than opaque handles. This patch fixes a segmentation fault when creating a device handle from a native handle by avoiding dereferencing a pointer which should be treated as an integer.
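A minimal sketch of the distinction, assuming an opaque pointer-sized native handle. `MockNativeHandle`, `toNative` and `fromNative` are hypothetical names rather than the adapter's code; the only fact relied on is that the CUDA driver API's device type is an integer.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical opaque native handle, modelled here as a pointer-sized value.
using MockNativeHandle = void *;
// Stand-in for the CUDA device type, which is an integer.
using MockCuDevice = int;

// Pack the integer device into the handle; no memory is pointed to.
inline MockNativeHandle toNative(MockCuDevice Device) {
  return reinterpret_cast<MockNativeHandle>(
      static_cast<std::uintptr_t>(Device));
}

// Recover the integer by converting the handle's bits back, never by
// dereferencing it: *static_cast<int *>(Handle) would read from an
// address such as 0 or 3 and crash.
inline MockCuDevice fromNative(MockNativeHandle Handle) {
  return static_cast<MockCuDevice>(reinterpret_cast<std::uintptr_t>(Handle));
}
```

Round-tripping a device value through `toNative` and `fromNative` returns the same integer without touching memory.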
Commit 39d46a2
-
[SYCL][Bindless] Fix compiler Werrors (#10464)
- Fixed compiler errors/warnings related to unused and uninitialized variables and parameters. Post-commit fix for PR: intel/llvm#10112
Commit b99c5d2
-
[SYCL][UR] Fix device partition queries (#10355)
After the recent device partition changes in the UR spec (i.e. ur_device_partition_property_t), the level_zero, cuda and hip adapters are returning incorrect values and types for UR_DEVICE_INFO_SUPPORTED_PARTITIONS and UR_DEVICE_INFO_PARTITION_TYPE. This PR fixes these issues with the adapters and updates pi2ur to correctly convert between ur_device_partition_properties_t and pi_device_partition_property.
Commit 67b7b14
-
[SYCL] [L0] Add user APIs to import / release host memory from USM. (#9294)
This change adds a SYCL interface to the Level Zero APIs zexDriverImportExternalPointer and zexDriverReleaseImportedPointer. These functions are used for importing host memory into USM for the duration of data transfer to increase bandwidth.
Commit 3505cb0
-
[CUDA][UR] Cache the max local mem size (#10577)
Cache the max local mem size so that we can call fewer CUDA driver entry points at `urEnqueueKernelLaunch`. This also allows us to query the value set for `SYCL_PI_CUDA_MAX_LOCAL_MEM_SIZE` using `device.get_info<sycl::info::device::local_mem_size>()`
Commit e2bae7b
-
[SYCL][PI] Restore CUDA and HIP plugin ABI (#10518)
During the port to UR the CUDA and HIP PI plugin ABIs were unintentionally changed. There do not appear to be symbol checks for these plugins, unlike the [Level Zero symbol check](https://github.com/intel/llvm/blob/sycl/sycl/test/abi/pi_level_zero_symbol_check.dump) and [OpenCL symbol check](https://github.com/intel/llvm/blob/sycl/sycl/test/abi/pi_opencl_symbol_check.dump). As such, the ABI change went unnoticed until intel/llvm#10490 was opened using the same approach for the OpenCL port, which [failed](https://github.com/intel/llvm/actions/runs/5610646255/job/15200624025?pr=10490) the OpenCL symbol check. This PR restores the expected ABI for the CUDA and HIP plugins and introduces new CUDA and HIP symbol check tests.
Commit 056f1f3
-
[SYCL][UR] Bump UR and implement adapter handles (#10349)
Bump the Unified Runtime commit, and make adapter changes needed for the newly added adapter handles (see oneapi-src#715 for details) This fixes #10066 by providing an implementation of `piPluginGetLastError` in pi2ur.
Commit 6530d39
-
[SYCL][CUDA] Make plugin specific error return an error (#10626)
The `UR_RESULT_ADAPTER_SPECIFIC_ERROR` was not returning an error to the SYCL RT, which meant all errors were treated as warnings and ignored unless `SYCL_RT_WARNING_LEVEL` was set to 2 or higher. This changes things so the adapter specific error is now reported as such, meaning all uses of `UR_RESULT_ADAPTER_SPECIFIC_ERROR` meant as warnings are now caught as errors.
---------
Co-authored-by: Hugh Delaney <hughd94@gmail.com>
Commit d133b55
-
[SYCL][CUDA] Pass device from context in create queue. (#10491)
Recently in the switch to UR `urQueueCreateFromNativeHandle` changed the previous behaviour whereby a queue was created with a device taken as the default device from the context. It changed it so that the queue was created with the device argument instead. Since the sycl runtime always passes a nullptr for the device when programmers call `make_queue(nativeStream, context)`, this broke `make_queue`. This patch reverts to the previous behaviour before the switch from pi cuda to ur cuda. Note that this should also fix `make_queue` for l0 which I also guess was broken due to the asserts meaning that this line was never reached: https://github.com/intel/llvm/blob/sycl/sycl/plugins/unified_runtime/ur/adapters/level_zero/queue.cpp#L574. But I have not tested this. --------- Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
Commit 934be28
-
[SYCL][CUDA] ifdef cuda 11.5 datatypes (#10691)
These new datatypes were introduced in CUDA 11.5 Co-authored-by: Hugh Delaney <hughd94@gmail.com>
Commit 2afff55
-
[SYCL][CUDA] fix compilation of CUDA plugin on Windows with MSVC (#10809)
Compilation failed on Windows with Visual Studio 2019/2022 when enabling the CUDA plugin. This commit fixes compilation by
* adding missing headers
* adding a static_cast of ContextInfoType to fix the following compilation error: sycl\plugins\unified_runtime\ur\adapters\cuda\context.cpp(49): error C2397: conversion from 'ur_context_info_t' to 'uint32_t' requires a narrowing conversion
Commit 0e5aa09
-
[SYCL][UR][CUDA] Fix a few issues in the cuda adapter revealed by UR enqueue CTS. (#10244)
Commit 8ca215d
-
[CUDA][HIP][UR] Fix race condition in adapters (#10826)
event->record was being called before event->start, which resulted in event->record failing silently. This fixes that.
Commit 3b1586e
-
Commit 1bebd84
-
[SYCL][CUDA] Add std::ignore for USM alloc args that are only used in an assert. (#10899)
This should fix the post-commit build fails.
Commit e8cfc2c
-
[UR][CUDA][HIP] Fix Set Arg Local (#10710)
In the CUDA/HIP adapters `urKernelSetArgValue` was being used to implement both `urKernelSetArgValue` & `urKernelSetArgLocal`. However, if the validation layer is enabled in UR then the path to set local arg is never taken since it includes a check that `pArgValue` is not null.
This PR:
* Implements `urKernelSetArgLocal` for CUDA/HIP adapters
* Changes `pi2ur` to call `urKernelSetArgLocal` when `arg_value` is `nullptr`
* Implements `urKernelSetArgLocal` for L0 adapter - this just calls back into `urKernelSetArgValue`.
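The dispatch described in this commit can be sketched with mocks. `MockKernel`, `mockSetArgValue`, `mockSetArgLocal` and `mockPiSetArg` are hypothetical stand-ins, not the UR entry points or the pi2ur code; the null check in `mockSetArgValue` stands in for what the validation layer is described as doing.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <vector>

// Hypothetical kernel that records by-value arguments and local-memory sizes.
struct MockKernel {
  std::map<unsigned, std::vector<unsigned char>> Values;
  std::map<unsigned, std::size_t> LocalSizes;
};

// By-value argument: a null pointer is rejected, as a validation layer would.
inline bool mockSetArgValue(MockKernel &K, unsigned Index, std::size_t Size,
                            const void *Ptr) {
  if (Ptr == nullptr)
    return false;
  const auto *Bytes = static_cast<const unsigned char *>(Ptr);
  K.Values[Index].assign(Bytes, Bytes + Size);
  return true;
}

// Local-memory argument: only a size is needed, there is no value to copy.
inline bool mockSetArgLocal(MockKernel &K, unsigned Index, std::size_t Size) {
  K.LocalSizes[Index] = Size;
  return true;
}

// Translation-layer sketch: a null value pointer means "local argument".
inline bool mockPiSetArg(MockKernel &K, unsigned Index, std::size_t Size,
                         const void *Ptr) {
  return Ptr ? mockSetArgValue(K, Index, Size, Ptr)
             : mockSetArgLocal(K, Index, Size);
}
```

Routing on the null pointer in the translation layer means the by-value path never sees a null argument, so its validation check no longer blocks local arguments.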
Commit cb57a66
-
[UR][CUDA][HIP][L0] Cleanup licence header (#10824)
Fix the license headers at the top of each source file in the unified runtime directory. --------- Co-authored-by: Alexey Bader <alexey.bader@intel.com>
Commit 710e8b5
-
[SYCL][HIP] Implemented supported make_* interop functions. (#10526)
This PR adds missing functions in the hip backend to allow for interoperability in programs that create sycl objects from native hip objects. The new function implementations are:
- `make_device`
- `make_queue`
- `make_event`
Note that it would really make sense for intel/llvm#10491 to be merged first because this PR makes the same code change in pi2ur, for a fix that is attributed to #10491.
---------
Signed-off-by: Jack Kirk <jack.kirk@codeplay.com>
Commit 10e2880
-
[UR][CUDA] Ifdef remaining CUDA 11 datatypes (#10922)
intel/llvm#10691 missed these extra cases. This PR adds the needed ifdefs
Commit 5eafdf5
-
[SYCL][CUDA] Reintroduce removing of size checks from USM allocations (#11023)
Resolve same issue fixed in [PR](intel/llvm#10034)
Co-authored-by: Omar Ahmed <omar.ahmed@codeplay.com>
Commit 7ba3c58
-
[UR][CUDA][HIP] Replace `ur::assertion` with `UR_CHECK_ERROR` (#10986)
Lots of HIP/CUDA driver API calls were wrapped in `ur::assertion(res == CU_SUCCESS)` or similar, which causes two problems:
- Any native error messages returned from the affected driver API calls were lost.
- Since these APIs report errors asynchronously, such that they are thrown from the last API call rather than the call which led to the error, previous asynchronous error messages from APIs other than the ones wrapped by `ur::assertion` could also be lost, depending on user code.

These problems are fixed by swapping these assertions with `UR_CHECK_ERROR`. Note that in the future UR may want to adjust `UR_CHECK_ERROR` so that it throws `UR_RESULT_ERROR_ADAPTER_SPECIFIC` or similar instead of using `std::cerr` to report the error. But I think it makes sense to still use `UR_CHECK_ERROR` to wrap driver API calls because it means that the `__LINE__`, `__FUNCTION__` etc. info can be correctly passed to native error reporting.
---------
Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
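To illustrate why a checking macro preserves more information than a bare assertion, here is a minimal sketch. It does not reproduce the real `UR_CHECK_ERROR`: `FakeDriverResult`, `fakeErrorString`, `checkFakeError` and `FAKE_CHECK_ERROR` are hypothetical names, and throwing `std::runtime_error` is a simplification chosen so the example is easy to test.

```cpp
#include <cassert>
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a native driver result code.
enum FakeDriverResult { FAKE_SUCCESS = 0, FAKE_ERROR_INVALID_VALUE = 1 };

// Hypothetical stand-in for a native "error code to string" query.
const char *fakeErrorString(FakeDriverResult result) {
  return result == FAKE_SUCCESS ? "success" : "invalid value";
}

// Reports the native message together with the call-site location.
void checkFakeError(FakeDriverResult result, const char *function, int line,
                    const char *file) {
  if (result == FAKE_SUCCESS)
    return;
  std::ostringstream message;
  message << "native error: " << fakeErrorString(result) << " in " << function
          << " (" << file << ":" << line << ")";
  throw std::runtime_error(message.str());
}

// The macro is what lets __func__/__LINE__/__FILE__ refer to the call site.
#define FAKE_CHECK_ERROR(result)                                               \
  checkFakeError((result), __func__, __LINE__, __FILE__)

// Hypothetical driver call that can be made to fail.
FakeDriverResult fakeDriverCall(bool succeed) {
  return succeed ? FAKE_SUCCESS : FAKE_ERROR_INVALID_VALUE;
}

// Wraps a driver call and returns either "ok" or the full error report.
std::string runAndReport(bool succeed) {
  try {
    FAKE_CHECK_ERROR(fakeDriverCall(succeed));
    return "ok";
  } catch (const std::runtime_error &error) {
    return error.what();
  }
}
```

A plain `assert(result == FAKE_SUCCESS)` would only say that the comparison failed; the wrapper keeps both the native message and the location of the failing call.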
Commit 02383b0

[SYCL][CUDA][Bindless] Add support for normalized channel types (#11120)
- Support was added for the following image channel types:
  - `unorm_int8`
  - `unorm_int16`
  - `snorm_int8`
  - `snorm_int16`
- Reading these types through `read_image` returns the denormalized floating point data.
- A test was added for these new types.
- Support for the following packed normalized image channel types was removed from the proposal:
  - `unorm_short_565`
  - `unorm_short_555`
  - `unorm_int_101010`
- This was done due to lack of device support. If the need for these types arises in the future, we can revisit support for them.
Commit 2d5a79b

Commit ad82940

[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)
Fetch the latest revision of unified runtime.

Notable changes:
* Some command buffer entry-points have been renamed, and two have been added: membufferFill and USMFill
* `urInit`/`urTearDown` have been removed and replaced with loader-only versions.
Commit c85400d

[SYCL] Add esimd as an optional device feature (#10017)
This change adds a new aspect for esimd, `ext_intel_esimd`, and annotates the two fundamental esimd classes, `simd_obj_impl` and `simd_view_impl`, with the `uses_aspect` attribute. `simd_obj_impl` is the base class of `simd` and `simd_mask`, which are the fundamental user-facing classes. `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not a base of `simd_view_impl`, but every `simd_view_impl` requires a reference to a `simd` or `simd_mask` at construction time, so I am not sure if we truly need to annotate `simd_view_impl`, but I added it to be safe.

It also adds a new PI device info query, `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT`, that is used to query at runtime if a device supports ESIMD. For UR-based plugins, we map that to `UR_DEVICE_INFO_ESIMD_SUPPORT`. The implementation simply returns false for cuda, hip and native_cpu. For l0 and opencl, we check that the device is an Intel GPU by querying that the device type is GPU and the vendor id is `0x8086`. For the ESIMD emulator we simply return true. I would appreciate careful review on the plugin changes in particular, as I am not an expert.

This change also updates the esimd spec to document the new aspect.

In a future change, I plan to use the new aspect to remove the requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros that set function attributes, but I am not doing that as part of this PR as it requires more investigation.
---------
Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
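The per-backend rule described in this message can be sketched as a single predicate. This is only an illustration of the stated logic: `Backend`, `DeviceType`, `DeviceDesc` and `supportsEsimd` are hypothetical names, not the PI or UR types, and the real adapters answer the query through their own device-info paths.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical descriptions of a device; not the PI/UR data structures.
enum class Backend { LevelZero, OpenCL, CUDA, HIP, NativeCPU, EsimdEmulator };
enum class DeviceType { CPU, GPU, Accelerator };

struct DeviceDesc {
  Backend backend;
  DeviceType type;
  std::uint32_t vendorId;
};

// PCI vendor id for Intel, as quoted in the commit message.
constexpr std::uint32_t IntelVendorId = 0x8086;

// Returns whether ESIMD is reported as supported, following the rule above.
bool supportsEsimd(const DeviceDesc &device) {
  switch (device.backend) {
  case Backend::LevelZero:
  case Backend::OpenCL:
    // Supported only on Intel GPUs.
    return device.type == DeviceType::GPU && device.vendorId == IntelVendorId;
  case Backend::EsimdEmulator:
    return true;
  case Backend::CUDA:
  case Backend::HIP:
  case Backend::NativeCPU:
    return false;
  }
  return false; // unreachable for valid enum values
}
```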
Commit d8e85fc

Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)
Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for the past few days.
Commit 905d2f2

[SYCL][CUDA] Multiple fixes for the CUDA adapter (#11121)
- Return correct error code in urContextCreateWithNativeHandle
- Add UR_DEVICE_INFO_VIRTUAL_MEMORY_SUPPORT to urDeviceGetInfo
- Move asserts from urEnqueueMemBufferFill to UR validation layers (commit 2c533e6 on UR repository)
- Make behaviour of urEventSetCallback consistent with other unsupported entrypoints.
- urProgramGetInfo and urQueueGetInfo now return UR_RESULT_ERROR_UNSUPPORTED_ENUMERATION for unsupported cases.
- Add entrypoints for urSamplerGetNativeHandle and urSamplerCreateWithNativeHandle
- Fix UR_USM_ALLOC_INFO_BASE_PTR and UR_USM_ALLOC_INFO_SIZE implementation.
Commit 00761dc

[SYCL][NFC] Make UR_CHECK_ERROR a void return macro (#11100)
`UR_CHECK_ERROR` was designed to return `ur_result_t`; however, in practice it was guaranteed to only ever return `UR_RESULT_SUCCESS`, as other paths would either terminate, abort or throw. This in turn leads to poor-quality, error-prone code, as the codebase was littered with:
* statements not checking the return value, which generate a warning depending on the compiler,
* extra checks on the return value, which were only ever going to be true.

Some care was required, as the codebase has a habit of accumulating error codes across branches, so depending on the use case the initial value of each `ur_result_t Result` had to be set accordingly (now that `UR_CHECK_ERROR` does not return).
Commit bed33f7

Commit 46912f6

Commit 9461c79

Commits on Sep 28, 2023
Commit c55e3d1

Commit a0eb5a6

Merge pull request oneapi-src#897 from callumfare/ci/cancel_old_workflows
CI: Cancel stale workflows when new commits are added to a PR
Commit 1941850

Commit d1f7ebc

Commit 899f831

Commit 2cc31e5

Commit 20297af

Commit fbd14e1

Merge pull request oneapi-src#884 from omarahmed1111/Add-test-workflow-for-codegen-example
Add test workflow for codegen example
Commit e72d570

[CUDA] Update license headers and fix compilation issues (oneapi-src#900)
Commit 00c7edb

Commit 48a03ac

[UR][L0] Propagate errors from `USMAllocationMakeResident`
This change ensures that USM allocation APIs don't return `UR_RESULT_SUCCESS` when an error occurs within `USMAllocationMakeResident`.
Signed-off-by: Michael Aziz <michael.aziz@intel.com>
Commit e5d6a91

Commits on Sep 29, 2023
[UR][Tests] fix match.py on-error printing
Input and match files were printed only on one error. Now they are printed in both places (where this makes sense). This is an extension to commit: be85a49
Commit fedcee6

Commit d6dc4c0

Commit 19597b2

Commits on Oct 2, 2023
Merge pull request oneapi-src#904 from lukaszstolarczuk/fix-match.py
[UR][Tests] Update match scripts
Commit 187633a

[SYCL][CUDA] Implement ext_oneapi_queue_priority (#11296)
The E2E test assumes that the device supports priorities (sm_35 or newer).
Commit 6a0eb7e

Merge pull request oneapi-src#903 from veselypeta/petr/move_hip
[UR][HIP] Move HIP adapter source to Unified Runtime
Commit 1c6fa3c

[UR][L0][CMake] Disable warning for ICX on Windows
On Windows we may build L0 using ICX. This change goes back to checking `WIN32` instead of `MSVC`, which fixes build failures with ICX on Windows.
Commit 9265d33

Merge pull request oneapi-src#917 from jsji/adapters
[UR][L0][CMake] Disable warning for ICX on Windows
Commit f1c17dc

Commits on Oct 3, 2023
Commit ec915f8

Commits on Oct 4, 2023
Merge pull request oneapi-src#923 from pbalcer/l0-win-fix-build
[L0] fix windows cmake scripts
Commit e6343f4

[UR][L0] Copy prebuilt L0 to avoid leaking shared folder path
We allow reusing existing pre-built L0 libraries and headers. However, using `LEVEL_ZERO_LIBRARY` and `LEVEL_ZERO_INCLUDE_DIR` directly may leak the path of a shared folder. This PR introduces two variables, `L0_LIBRARY` and `L0_INCLUDE_DIR`, and copies the L0 loader/headers locally into the build to avoid leaking their location on a shared drive.
Commit 6a2c548

Commits on Oct 5, 2023
Merge pull request oneapi-src#920 from jsji/localcopy
[UR][L0] Copy prebuilt L0 to avoid leaking shared folder path
Commit b38855e

Commits on Oct 9, 2023
Adapter implementations are being moved into the repo, so we should mirror the code ownership of those implementations. This patch adds a CODEOWNERS file which matches the ownership from before the adapter move, to be used as a baseline for future changes to each individual adapter.
Commit 49770f5

[CUDA] Fix queue creation with native handle
The new priority parameter hadn't been reflected here, so the ownership property was being used as the priority and ownership defaulted to `true`, which caused crashes.
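The class of bug described here can be sketched as follows. This is an assumption-laden illustration rather than the CUDA adapter's code: `Queue`, its fields, and both factory functions are hypothetical, and the parameter order is invented to show how a stale call site can silently bind a `bool` to a newly added integer parameter.

```cpp
#include <cassert>

// Hypothetical queue type; field and parameter names are illustrative only.
struct Queue {
  unsigned flags;
  int priority;
  bool backendOwns;

  // `priority` was added in front of `backendOwns`, which keeps its default.
  Queue(unsigned flags, int priority, bool backendOwns = true)
      : flags(flags), priority(priority), backendOwns(backendOwns) {}
};

// Stale call site: written before `priority` existed. The ownership flag now
// binds to `priority` (bool converts to int) and ownership falls back to true.
Queue makeFromNativeStale(unsigned flags, bool isNativeHandleOwned) {
  return Queue(flags, isNativeHandleOwned);
}

// Fixed call site: pass an explicit priority, then the ownership flag.
Queue makeFromNativeFixed(unsigned flags, bool isNativeHandleOwned) {
  const int defaultPriority = 0; // placeholder value for this sketch
  return Queue(flags, defaultPriority, isNativeHandleOwned);
}
```

In the stale version a caller asking for a non-owned handle still gets a queue that believes it owns the native object, which is the kind of mismatch that can lead to a double release.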
Commit b4425bc

Commits on Oct 10, 2023
[HIP] Enable kernel finalization using comgr
For kernel fusion support for HIP, we need to finalize the kernels using comgr. The patch finalizes tagged binaries during buildProgram before handing them over to the HIP runtime.
Signed-off-by: Victor Lomuller <victor@codeplay.com>
Commit 2fd9dea

Commit 55dccfc

[UR][L0] Fix minor code duplication
- Second return has no effect
- Checking the same flag twice has no effect
Commit 6180a78

Merge pull request oneapi-src#909 from kbenzie/benie/codeowners
Add CODEOWNERS file
Commit 0677296

Commit 484cd07

Merge pull request oneapi-src#906 from 0x12CC/l0_usm_error_checking
[UR][L0] Propagate errors from `USMAllocationMakeResident`
Commit 4954850

Merge pull request oneapi-src#915 from al42and/fix-minor-typos-in-levelzero-adapter
[UR][L0] Fix minor code duplication
Commit 0d3fd4b

Commits on Oct 11, 2023
Commit a6f382a

Use CUDA team for HIP adapter too
The HIP team needs some fixing up currently so use the CUDA team for now.
Commit 463e7ce

Commits on Oct 12, 2023
Commit b002e00

Merge pull request oneapi-src#946 from npmiller/adapters-owners
Add code owners for the CUDA and HIP adapters
Commit 775af6e

Commit 3f831b2

Commits on Oct 13, 2023
Merge pull request oneapi-src#955 from kbenzie/benie/codeowners-level-zero
[UR][L0] Add adapter source to codeowners file
Commit a5d102a

Commit 9af962b

Merge pull request oneapi-src#958 from kbenzie/benie/codeowners-hip
Update HIP adapter codeowners to use specific team
Commit ab45620

Commits on Oct 16, 2023
Commit 496d694

[Command-buffer][L0] Reset sync-point events
The L0 events used to implement UR sync-points in a command-buffer aren't reset after the first execution, leaving them in a signalled state before subsequent submissions of the UR command-buffer.

This patch resets the L0 events used as UR command-buffer sync-points to the non-signalled state by appending event reset commands to the end of the user-defined command-list. There is also a reset added for the UR command-buffer wait event that is signalled when the wait list dependencies of `urCommandBufferEnqueueExp` are met.

Additionally, this patch removes the host scope flag from L0 events created by the UR command-buffer, as we never signal/reset them from the host, only from the device.
Commit e8e1415

Commits on Oct 17, 2023
[Command-buffer][L0] Condense signalling
Rather than having a command-list wide barrier and separate signal command, have a single barrier command which signals the UR command-buffer signal event and waits on the events that correspond to sync-points.
Co-authored-by: Pablo Reble <pablo.reble@intel.com>
Commit 6523932

[SYCL][OpenCL] Port OpenCL plugin to Unified Runtime (#10490)
Co-authored-by: Martin Morrison-Grant <martin.morrisongrant@codeplay.com>
Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
Co-authored-by: Callum Fare <callum@codeplay.com>
Co-authored-by: aarongreig <aarongreig01@gmail.com>
Commit 7c755c4

[OpenCL] Follow up changes after moving the adapter
- Update CMake to use local adapter source files
- Update license headers
- Add virtual memory entrypoints to interface_loader
- Add .clang-format
- Update CODEOWNERS file for OpenCL adapter
Commit 1f149fb

Merge pull request oneapi-src#943 from hdelan/re-add-extended-deleters
[HIP] Re add extended deleters
Commit f0de2f4

Commits on Oct 18, 2023
[OpenCL] Allow upstream to manually set the OpenCL target (oneapi-src#969)
- Upstream CI is unable to use find_package() for OpenCL. This commit allows it to set the target manually and skip the call to find_package().
- This commit also changes UR to use the OpenCL::OpenCL target. It provides a simpler alternative to import the OpenCL library without having to use add_library().
Commit 3653e58

Commits on Oct 19, 2023
Commit 657ffde

Set command-buffer experimental feature codeowners
Set [unified-runtime-command-buffer-write](https://github.com/orgs/oneapi-src/teams/unified-runtime-command-buffer-write) as the codeowners of the command-buffer experimental feature, including:
* Adapter implementations
* Specification document
* YML definition
Commit 7d759ee

Commits on Oct 20, 2023
Merge pull request oneapi-src#972 from oneapi-src/revert-906-l0_usm_error_checking
Revert "[UR][L0] Propagate errors from `USMAllocationMakeResident`"
Commit a76e3b1

Merge pull request oneapi-src#973 from Bensuo/ewan/command_buffer_codeowners
Set command-buffer experimental feature codeowners
Commit 614e6d0

Commits on Oct 23, 2023
Commit 9a13afd

[CUDA][EXP] CUDA adapter support for command buffers
- Implement initial subset of command buffer functions for CUDA
- Report command buffer extension as supported from device query
- Some additional methods and refactoring to reuse existing adapter code
Commit 74f42f8

Merge pull request oneapi-src#945 from npmiller/fix-priority
[CUDA] Fix queue creation with native handle
Commit 3a3aae3

Commits on Oct 24, 2023
Commit 86f96f0

Commits on Oct 25, 2023
Merge pull request oneapi-src#940 from Naghasan/victor/kernel-fusion-amd
[UR][HIP] Enable kernel finalization using comgr
Commit cf26de2

CODEOWNERS for Bindless Images
Add `unified-runtime-bindless-images-write` team as code owners for anything related to images.
Commit 33e2494

Commits on Oct 26, 2023
Merge pull request oneapi-src#995 from oneapi-src/peter/bindless-codeowners
CODEOWNERS for Bindless Images
Commit 20d4f49

Merge pull request oneapi-src#965 from Bensuo/ewan/L0_internal_event_fix
[Command-buffer][L0] Reset sync-point events
Commit 7c8617b

Commit fdf2cf8

Commits on Oct 27, 2023
Merge pull request oneapi-src#908 from szadam/test_output
Print all CTS outputs (from gtest) in CI.
Commit 2ab0734

Merge pull request oneapi-src#961 from hdelan/change-unions-to-stdvariant
[HIP][CUDA] Change unions in ur_mem_handle_t_ to stdvariant
Commit 036b9cf

Merge commit '9d82232b9c7e57997325608320e2d89c5fb95187' into merge-some-main-changes-into-adapters
Commit 2f2aebe

Merge pull request oneapi-src#1005 from omarahmed1111/merge-some-main-changes-into-adapters
Bump adapters branch
Commit 2417fcf

Commits on Oct 30, 2023
Commit df79dad

Commit 5fb2f28

Commit b232631

Merge pull request oneapi-src#1015 from oneapi-src/revert-1005-merge-some-main-changes-into-adapters
Revert "Bump adapters branch"
Commit 55d432c

Commits on Oct 31, 2023
Commit 5ecaf66

Merge pull request oneapi-src#932 from Bensuo/cuda-cmd-buffers
[CUDA][EXP] CUDA adapter support for command buffers
Commit be53fb3

[CUDA][HIP] Move adapter specific testing out of CTS
This patch moves the `test-adapter-cuda` and `test-adapter-hip` test suites out of the `test/conformance` directory. The goal of this move is to ensure that all adapters will be tested with the same set and number of conformance tests, making conformance pass rates directly comparable between adapters. Up to now the CUDA and HIP adapters have included additional unit tests of internals which skews pass rates.
Commit f533123