Merge adapters branch into main #1158
Commits on Sep 28, 2023
[UR][L0] Propagate errors from `USMAllocationMakeResident`
This change ensures that USM allocation APIs don't return `UR_RESULT_SUCCESS` when an error occurs within `USMAllocationMakeResident`. Signed-off-by: Michael Aziz <michael.aziz@intel.com>
Commit e5d6a91
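As a rough illustration of the error propagation described for e5d6a91, the sketch below forwards a failed residency step to the caller instead of reporting success. The enum, `makeResidentSketch`, `allocateSketch`, and the `Budget` parameter are hypothetical stand-ins invented for this example, not the Level Zero adapter's actual code.

```cpp
#include <cstddef>
#include <cstdlib>

// Hypothetical stand-ins for UR result codes; the numeric values are arbitrary.
enum ResultSketch {
  RESULT_SUCCESS,
  RESULT_ERROR_OUT_OF_HOST_MEMORY,
  RESULT_ERROR_OUT_OF_DEVICE_MEMORY,
};

// Hypothetical residency step: fails once the request exceeds a simulated budget.
ResultSketch makeResidentSketch(std::size_t Size, std::size_t Budget) {
  return Size <= Budget ? RESULT_SUCCESS : RESULT_ERROR_OUT_OF_DEVICE_MEMORY;
}

// Allocates Size bytes and forwards any residency failure to the caller.
ResultSketch allocateSketch(std::size_t Size, std::size_t Budget, void **Out) {
  *Out = nullptr;
  void *Ptr = std::malloc(Size);
  if (Ptr == nullptr)
    return RESULT_ERROR_OUT_OF_HOST_MEMORY;
  ResultSketch Res = makeResidentSketch(Size, Budget);
  if (Res != RESULT_SUCCESS) {
    std::free(Ptr); // do not hand out memory that could not be made resident
    return Res;     // the failure is reported instead of a success code
  }
  *Out = Ptr;
  return RESULT_SUCCESS;
}
```

Calling `allocateSketch(128, 64, &Ptr)` in this model returns the device-memory error and leaves `Ptr` null, while a request within the budget succeeds.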
Commits on Sep 29, 2023
[UR][Tests] fix match.py on-error printing
Input and match files were printed for only one of the error cases. Now they are printed in both places (where this makes sense). This extends commit be85a49.
Commit fedcee6
Commit d6dc4c0
Commit 19597b2
Commits on Oct 2, 2023
Merge pull request oneapi-src#904 from lukaszstolarczuk/fix-match.py
[UR][Tests] Update match scripts
Commit 187633a
[SYCL][CUDA] Implement ext_oneapi_queue_priority (#11296)
The E2E test assumes that the device supports priorities (sm_35 or newer).
Commit 6a0eb7e
Merge pull request oneapi-src#903 from veselypeta/petr/move_hip
[UR][HIP] Move HIP adapter source to Unified Runtime
Commit 1c6fa3c
[UR][L0][CMake] Disable warning for ICX on Windows
On Windows, L0 may be built with ICX. This change goes back to checking WIN32 instead of MSVC, which fixes build failures with ICX on Windows.
Commit 9265d33
Merge pull request oneapi-src#917 from jsji/adapters
[UR][L0][CMake] Disable warning for ICX on Windows
Commit f1c17dc
Commits on Oct 3, 2023
Commit ec915f8
Commits on Oct 4, 2023
Merge pull request oneapi-src#923 from pbalcer/l0-win-fix-build
[L0] fix windows cmake scripts
Commit e6343f4
[UR][L0] Copy prebuilt L0 to avoid leaking shared folder path
We allow reusing existing prebuilt L0 libraries and headers. However, using LEVEL_ZERO_LIBRARY and LEVEL_ZERO_INCLUDE_DIR directly may leak the path of the shared folder. This PR introduces two variables, L0_LIBRARY and L0_INCLUDE_DIR, and copies the L0 loader and headers into the build to avoid leaking their location on a shared drive.
Commit 6a2c548
Commits on Oct 5, 2023
Merge pull request oneapi-src#920 from jsji/localcopy
[UR][L0] Copy prebuilt L0 to avoid leaking shared folder path
Commit b38855e
Commits on Oct 9, 2023
Adapter implementations are being moved into the repo, so we should mirror the code ownership of those implementations. This patch adds a CODEOWNERS file which matches the ownership from before the adapter move, to be used as a baseline for future changes for each individual adapter.
Commit 49770f5
[CUDA] Fix queue creation with native handle
The new priority parameter hadn't been reflected here, so the ownership property was being used as the priority and ownership defaulted to `true`, which caused crashes.
Commit b4425bc
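The kind of bug described for b4425bc can be reproduced in a few lines. This is a minimal sketch under assumptions: `QueueSketch`, its parameter order, and the default value for the ownership flag are invented here to mirror the commit message, and are not the CUDA adapter's real constructor.

```cpp
// Hypothetical queue type: the priority parameter sits before the ownership
// flag, and the ownership flag has a default value (both are assumptions).
struct QueueSketch {
  int Priority;
  bool OwnsNativeHandle;
  explicit QueueSketch(int Priority, bool OwnsNativeHandle = true)
      : Priority(Priority), OwnsNativeHandle(OwnsNativeHandle) {}
};

// Stale call site, written before the priority parameter existed. The bool
// converts silently to int and lands in the priority slot, while ownership
// falls back to its default of true.
QueueSketch createFromNativeStale(bool OwnsNativeHandle) {
  return QueueSketch(OwnsNativeHandle);
}

// Updated call site: an explicit priority is passed so the flag reaches the
// parameter it was meant for.
QueueSketch createFromNativeFixed(bool OwnsNativeHandle) {
  return QueueSketch(/*Priority=*/0, OwnsNativeHandle);
}
```

With the stale call, a caller asking for a non-owning queue still gets `OwnsNativeHandle == true`, so the queue would later release a handle it does not own. The compiler accepts both calls without a diagnostic, which is why this class of mistake is easy to miss.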
Commits on Oct 10, 2023
[HIP] Enable kernel finalization using comgr
For kernel fusion support for hip, we need to finalize the kernels using comgr. The patch finalizes tagged binaries during buildProgram before handing it over to the hip runtime. Signed-off-by: Victor Lomuller <victor@codeplay.com>
Commit 2fd9dea
Commit 55dccfc
[UR][L0] Fix minor code duplication
- Second return has no effect
- Checking the same flag twice has no effect
Commit 6180a78
Merge pull request oneapi-src#909 from kbenzie/benie/codeowners
Add CODEOWNERS file
Commit 0677296
Commit 484cd07
Merge pull request oneapi-src#906 from 0x12CC/l0_usm_error_checking
[UR][L0] Propagate errors from `USMAllocationMakeResident`
Commit 4954850
Merge pull request oneapi-src#915 from al42and/fix-minor-typos-in-levelzero-adapter
[UR][L0] Fix minor code duplication
Commit 0d3fd4b
Commits on Oct 11, 2023
Commit a6f382a
Use CUDA team for HIP adapter too
The HIP team needs some fixing up currently so use the CUDA team for now.
Commit 463e7ce
Commits on Oct 12, 2023
Commit b002e00
Merge pull request oneapi-src#946 from npmiller/adapters-owners
Add code owners for the CUDA and HIP adapters
Commit 775af6e
Commit 3f831b2
Commits on Oct 13, 2023
Merge pull request oneapi-src#955 from kbenzie/benie/codeowners-level-zero
[UR][L0] Add adapter source to codeowners file
Commit a5d102a
Commit 9af962b
Merge pull request oneapi-src#958 from kbenzie/benie/codeowners-hip
Update HIP adapter codeowners to use specific team
Commit ab45620
Commits on Oct 16, 2023
Commit 496d694
[Command-buffer][L0] Reset sync-point events
The L0 events used to implement UR sync-points in a command-buffer aren't reset after the first execution, leaving them in a signalled state before subsequent submissions of the UR command-buffer. This patch resets the L0 events used as UR command-buffer sync-points to the non-signalled state by appending event reset commands to the end of the user-defined command-list. There is also a reset added for the UR command-buffer wait event that is signalled when the wait list dependencies of `urCommandBufferEnqueueExp` are met. Additionally, this patch stops setting the host scope flag on L0 events created by the UR command-buffer, as we never signal or reset them from the host, only from the device.
Commit e8e1415
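The effect of the reset described for e8e1415 can be modelled without any Level Zero calls. The sketch below is a toy model under assumptions: `EventSketch` and `CommandBufferSketch` are hypothetical types, and a boolean flag stands in for the real L0 event state.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical model of an event: just a signalled flag.
struct EventSketch {
  bool Signalled = false;
};

// Hypothetical command-buffer model with one event per sync-point.
struct CommandBufferSketch {
  std::vector<EventSketch> SyncPoints;
  bool AppendResets; // true models the behaviour described by the patch

  CommandBufferSketch(std::size_t NumSyncPoints, bool AppendResets)
      : SyncPoints(NumSyncPoints), AppendResets(AppendResets) {}

  // Runs the buffer once and returns how many sync-point events were already
  // signalled when the run started (stale state from a previous run).
  std::size_t execute() {
    std::size_t Stale = 0;
    for (const EventSketch &E : SyncPoints)
      Stale += E.Signalled ? 1 : 0;
    for (EventSketch &E : SyncPoints)
      E.Signalled = true; // each command signals its sync-point event
    if (AppendResets)
      for (EventSketch &E : SyncPoints)
        E.Signalled = false; // reset commands at the end of the list
    return Stale;
  }
};
```

Without the resets, the second call to `execute()` starts with every sync-point already signalled, so dependencies would appear satisfied before any work has run. With the resets, every run starts from the non-signalled state.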
Commits on Oct 17, 2023
[Command-buffer][L0] Condense signalling
Rather than having a command-list wide barrier and separate signal command, have a single barrier command which signals the UR command-buffer signal event and waits on the events that correspond to sync-points. Co-authored-by: Pablo Reble <pablo.reble@intel.com>
Commit 6523932
[SYCL][OpenCL] Port OpenCL plugin to Unified Runtime (#10490)
Co-authored-by: Martin Morrison-Grant <martin.morrisongrant@codeplay.com> Co-authored-by: Petr Vesely <petr.vesely@codeplay.com> Co-authored-by: Callum Fare <callum@codeplay.com> Co-authored-by: aarongreig <aarongreig01@gmail.com>
Commit 7c755c4
[OpenCL] Follow up changes after moving the adapter
- Update CMake to use local adapter source files
- Update license headers
- Add virtual memory entrypoints to interface_loader
- Add .clang-format
- Update CODEOWNERS file for OpenCL adapter
Commit 1f149fb
Merge pull request oneapi-src#943 from hdelan/re-add-extended-deleters
[HIP] Re add extended deleters
Commit f0de2f4
Commits on Oct 18, 2023
[OpenCL] Allow upstream to manually set the OpenCL target (oneapi-src#969)
- Upstream CI is unable to use find_package() for OpenCL. This commit allows it to set the target manually and skip the call to find_package().
- This commit also changes UR to use the OpenCL::OpenCL target. It provides a simpler alternative to import the OpenCL library without having to use add_library().
Commit 3653e58
Commits on Oct 19, 2023
Commit 77b705d
Commit 657ffde
Set command-buffer experimental feature codeowners
Set [unified-runtime-command-buffer-write](https://github.com/orgs/oneapi-src/teams/unified-runtime-command-buffer-write) as the codeowners of the command-buffer experimental feature, including:
* Adapter implementations
* Specification document
* YML definition
Commit 7d759ee
Commits on Oct 20, 2023
[OpenCL] Add more mappings from CL error codes to UR error codes.
Also merge urQueueCreate InvalidValueProperties test into InvalidQueueProperties test.
Commit 2792092
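A mapping from CL error codes to UR error codes, as mentioned for 2792092, is typically a single switch. The sketch below is illustrative only: `UrResultSketch` and `mapClErrorSketch` are hypothetical names, the particular pairs chosen are assumptions rather than the adapter's actual table, and the numeric CL values are reproduced from the Khronos `cl.h` header as best understood.

```cpp
// Hypothetical UR-side result type for this sketch.
enum class UrResultSketch {
  Success,
  OutOfResources,
  OutOfHostMemory,
  InvalidValue,
  InvalidQueueProperties,
  Unknown,
};

// OpenCL error values, assumed to match the Khronos cl.h header.
constexpr int ClSuccess = 0;
constexpr int ClOutOfResources = -5;
constexpr int ClOutOfHostMemory = -6;
constexpr int ClInvalidValue = -30;
constexpr int ClInvalidQueueProperties = -35;

// Translates a CL error into the sketch's UR result; anything that has no
// explicit entry falls through to Unknown.
UrResultSketch mapClErrorSketch(int ClError) {
  switch (ClError) {
  case ClSuccess:
    return UrResultSketch::Success;
  case ClOutOfResources:
    return UrResultSketch::OutOfResources;
  case ClOutOfHostMemory:
    return UrResultSketch::OutOfHostMemory;
  case ClInvalidValue:
    return UrResultSketch::InvalidValue;
  case ClInvalidQueueProperties:
    return UrResultSketch::InvalidQueueProperties;
  default:
    return UrResultSketch::Unknown;
  }
}
```

Adding more mappings means adding more cases, which shrinks the set of CL failures that surface to callers as the generic unknown error.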
Commit 16e28e2
Commit 5c8a86b
Merge pull request oneapi-src#972 from oneapi-src/revert-906-l0_usm_error_checking
Revert "[UR][L0] Propagate errors from `USMAllocationMakeResident`"
Commit a76e3b1
Merge pull request oneapi-src#973 from Bensuo/ewan/command_buffer_codeowners
Set command-buffer experimental feature codeowners
Commit 614e6d0
Commits on Oct 23, 2023
Commit 9a13afd
[CUDA][EXP] CUDA adapter support for command buffers
- Implement initial subset of command buffer functions for CUDA
- Report command buffer extension as supported from device query
- Some additional methods and refactoring to reuse existing adapter code
Commit 74f42f8
Merge pull request oneapi-src#945 from npmiller/fix-priority
[CUDA] Fix queue creation with native handle
Commit 3a3aae3
Commit dfc5abe
Commits on Oct 24, 2023
Commit a2623e2
Commit 86f96f0
Commits on Oct 25, 2023
Merge pull request oneapi-src#940 from Naghasan/victor/kernel-fusion-amd
[UR][HIP] Enable kernel finalization using comgr
Commit cf26de2
CODEOWNERS for Bindless Images
Add `unified-runtime-bindless-images-write` team as code owners for anything related to images.
Commit 33e2494
Commits on Oct 26, 2023
Merge pull request oneapi-src#995 from oneapi-src/peter/bindless-codeowners
CODEOWNERS for Bindless Images
Commit 20d4f49
Merge pull request oneapi-src#965 from Bensuo/ewan/L0_internal_event_fix
[Command-buffer][L0] Reset sync-point events
Commit 7c8617b
Commit fdf2cf8
Commit 66763bf
Commit c7f34f3
Commit 0ce1654
Commit c8f5320
Commits on Oct 27, 2023
Merge pull request oneapi-src#908 from szadam/test_output
Print all CTS outputs (from gtest) in CI.
Commit 2ab0734
[UR][L0] Propagate errors from `USMAllocationMakeResident`
This change ensures that USM allocation APIs don't return `UR_RESULT_SUCCESS` when an error occurs within `USMAllocationMakeResident`. Signed-off-by: Michael Aziz <michael.aziz@intel.com>
Commit f2be823
Merge pull request oneapi-src#961 from hdelan/change-unions-to-stdvariant
[HIP][CUDA] Change unions in ur_mem_handle_t_ to stdvariant
Commit 036b9cf
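For readers unfamiliar with the union-to-variant change named in 036b9cf, the general pattern looks like the sketch below (C++17). All type and member names here are hypothetical and chosen for illustration; they are not the fields of the real `ur_mem_handle_t_`.

```cpp
#include <cstddef>
#include <variant>

// Hypothetical payloads for the two kinds of memory object.
struct BufferMemSketch {
  std::size_t SizeInBytes;
};

struct SurfaceMemSketch {
  std::size_t Width;
  std::size_t Height;
  std::size_t BytesPerPixel;
};

// With a raw union, a separate tag has to be kept in sync by hand. A
// std::variant stores the active alternative itself and checks every access.
struct MemHandleSketch {
  std::variant<BufferMemSketch, SurfaceMemSketch> Mem;

  bool isBuffer() const {
    return std::holds_alternative<BufferMemSketch>(Mem);
  }

  std::size_t sizeInBytes() const {
    if (const auto *Buf = std::get_if<BufferMemSketch>(&Mem))
      return Buf->SizeInBytes;
    const auto &Surf = std::get<SurfaceMemSketch>(Mem);
    return Surf.Width * Surf.Height * Surf.BytesPerPixel;
  }
};
```

Reading the wrong alternative through `std::get` throws `std::bad_variant_access` instead of silently reinterpreting the bytes, which is the main safety gain over a union.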
Merge commit '9d82232b9c7e57997325608320e2d89c5fb95187' into merge-some-main-changes-into-adapters
Commit 2f2aebe
Commit 55409e4
Merge pull request oneapi-src#1005 from omarahmed1111/merge-some-main-changes-into-adapters
Bump adapters branch
Commit 2417fcf
Commits on Oct 30, 2023
Commit df79dad
Commit 5fb2f28
Commit 451017f
Commit 7377fc0
Commit 28ff1cb
Commit d1d4f10
Commit b232631
Commit 60c0b81
Merge pull request oneapi-src#1015 from oneapi-src/revert-1005-merge-some-main-changes-into-adapters
Revert "Bump adapters branch"
Commit 55d432c
Commit b42649f
Commit 8e7f785
Signed-off-by: Michael Aziz <michael.aziz@intel.com>
Commit f056f97
Commits on Oct 31, 2023
Commit 5ecaf66
[OpenCL] Implement urEnqueueUSMMemcpy2D and allow large fill patterns.
Normally OpenCL limits fill-type operations to a maximum pattern size of 128 bytes; this patch includes a workaround to extend that.
Commit bfb3dac
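The commit message for bfb3dac does not say how the workaround is built, so the following is only one plausible approach, sketched on host memory: patterns within the limit take the native path, and larger ones are replicated into a staging buffer that is then copied in a single operation. `fillBufferSketch`, `FillPathSketch`, and `MaxNativePatternSize` are hypothetical names.

```cpp
#include <cstddef>
#include <cstring>
#include <vector>

// Limit quoted in the commit message.
constexpr std::size_t MaxNativePatternSize = 128;

enum class FillPathSketch { Native, HostStaging, Rejected };

// Fills Dst with repeated copies of Pattern and reports which path was used.
FillPathSketch fillBufferSketch(std::vector<unsigned char> &Dst,
                                const std::vector<unsigned char> &Pattern) {
  if (Pattern.empty() || Dst.empty() || Dst.size() % Pattern.size() != 0)
    return FillPathSketch::Rejected;

  if (Pattern.size() <= MaxNativePatternSize) {
    // Stand-in for the native fill entry point.
    for (std::size_t I = 0; I < Dst.size(); ++I)
      Dst[I] = Pattern[I % Pattern.size()];
    return FillPathSketch::Native;
  }

  // Assumed workaround: replicate the pattern on the host, then issue one
  // plain copy in place of the fill.
  std::vector<unsigned char> Staging(Dst.size());
  for (std::size_t Offset = 0; Offset < Staging.size();
       Offset += Pattern.size())
    std::memcpy(Staging.data() + Offset, Pattern.data(), Pattern.size());
  std::memcpy(Dst.data(), Staging.data(), Staging.size());
  return FillPathSketch::HostStaging;
}
```

The trade-off in this approach is an extra host allocation the size of the destination, in exchange for supporting arbitrary pattern sizes.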
Merge pull request oneapi-src#932 from Bensuo/cuda-cmd-buffers
[CUDA][EXP] CUDA adapter support for command buffers
Commit be53fb3
Commit 603dcfb
Commit b205652
Commits on Nov 1, 2023
[OpenCL] Make USM functions return UR_RESULT_ERROR_INVALID_USM_SIZE
Also ignore flags in no-op urEnqueueUSMPrefetch hint.
Commit fc34c26
Commit 371e1b8
Commit f2a365c
Commit bc7c0f4
[UR][L0] Add support for zeCommandListHostSynchronize
Instead of creating an event, appending it, and synchronizing on it to wait for all commands in an immediate command list, emit a zeCommandListHostSynchronize. Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Jaime Arteaga committed Nov 1, 2023
Commit 3b26f7b
Commits on Nov 2, 2023
Add TODO for handling other error results
Signed-off-by: Michael Aziz <michael.aziz@intel.com>
Commit fe469d7
Commits on Nov 3, 2023
[UR][L0] Add support for urAdapterGetLastError in L0
Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Jaime Arteaga committed Nov 3, 2023
Commit 143a2e4
[CUDA][HIP] Move adapter specific testing out of CTS
This patch moves the `test-adapter-cuda` and `test-adapter-hip` test suites out of the `test/conformance` directory. The goal of this move is to ensure that all adapters will be tested with the same set and number of conformance tests, making conformance pass rates directly comparable between adapters. Up to now the CUDA and HIP adapters have included additional unit tests of internals which skews pass rates.
Commit 93bdb81
Merge pull request oneapi-src#968 from martygrant/martin/versionCheckSpecializationConstants
[OpenCL] Add version check for urProgramSetSpecializationConstants
Commit ba994bb
[UR][CUDA] Fix compatibility with CUDA 11.x
The code introduced in 74f42f8 uses the signature of cuGraphInstantiate from CUDA 12.x. In CUDA 11.x, this function has different parameters.
Commit 3fd39f3
Commit 7ca370d
Commit 807434f
Merge pull request oneapi-src#1021 from kbenzie/benie/test-adapters
[CUDA][HIP] Move adapter specific testing out of CTS
Commit 7f56cec
Commit 56f6994
Commit c55dc2a
Commit c86b841
Commit 17cb185
Commit d5a4691
Fail codegen example configuration if no dependencies
We would rather fail CMake configuration than silently move on and only find out about the missing example after the build. The codegen example is only built on demand, when UR_BUILD_EXAMPLE_CODEGEN is set.
Commit bdcb662
Commits on Nov 6, 2023
Commit ff7a4d8
Merge pull request oneapi-src#1036 from pbalcer/mako-map-file
automatically generate linker version script for adapter libraries
Commit a62423d
Merge pull request oneapi-src#931 from konradkusiak97/konradkusiak/LocalMemSizeQuery
[UR] [CUDA] Changed the output of querying localMemSize
Commit 0e24ab8
[OpenCL] Fix enum passed for urKernelSetExecInfo's USM_PTRS property
Also return RESULT_SUCCESS for no-op UR_KERNEL_EXEC_INFO_CACHE_CONFIG hint.
Commit c5fbda0
Commit 8125902
Commit 74ad837
Commit 5ad3f0a
Commit 4b1dd79
Commit 42d00cd
Commit 690fef3
Commit 5d88fc7
[OpenCL] Return INVALID_SIZE from GetInfo entry points.
Also includes a few other GetInfo related fixes:
* Add missing device info queries
* Add mapping of CL command type to UR command type
* Correct mapping of UR_QUEUE_INFO_FLAGS
* Add mapping of cl_command_queue_properties to ur_queue_flags_t
* Add mapping of cl_unified_shared_memory_type_intel to ur_usm_type_t
* Add UNSUPPORTED_ENUMERATION path to KernelGeGroupInfo tests.
And a fix related to one of the fixed queries:
* Populate pfnReadHostPipe and pfnWriteHostPipe ddi table entries.
Commit 6a3c63d
Commit 3029501
Merge pull request oneapi-src#1041 from lukaszstolarczuk/fix-cmake-conda
Fail codegen example configuration if no dependencies
Commit 7db941d
Commit 39eec0c
[CI] Clean up after conda init in cmake.yml
When initializing conda, a few extra commands are added to the user's profile. These commands make it impossible to log in to the machine as the test user, since conda may be missing. We can safely remove the whole '.profile' after the job.
Commit 6c832ba
Commits on Nov 7, 2023
Merge pull request oneapi-src#1011 from fabiomestre/fabio/fix_opencl_leak
[OpenCL] Fix memory leak
Commit 612a263
[CTS] Add UR_SYCL_LIBRARY_DIR CMake variable
The CTS already depends on the DPC++ compiler, specified via the `UR_DPCXX` CMake variable, to generate program inputs for the program, kernel, and enqueue test suites. If the DPC++ compiler is not installed on the system, the executables it outputs will likely fail to find the SYCL runtime library when executed, breaking the generation of CTS program inputs. The patch introduces the `UR_SYCL_LIBRARY_DIR` CMake variable, enabling the user to specify the path to the SYCL runtime library for use when generating CTS program inputs.
Commit b01beb7
Merge pull request oneapi-src#1026 from kbenzie/benie/test-conformance-sycl-library-dir
[CTS] Add UR_SYCL_LIBRARY_DIR CMake variable
Commit d70bae4
Commit 5fb8292
Commits on Nov 8, 2023
Merge branch 'adapters' into revert-1015-revert-1005-merge-some-main-changes-into-adapters
Commit 7ba8fec
[OpenCL] Add bounds checking to the Enqueue memory operations.
This allows us to return UR_ERROR_INVALID_SIZE when we should. Extra checks are only performed on a non-success error code. Also adds a missing bounds check to urMemBufferPartition
Configuration menu - View commit details
-
Copy full SHA for f65473d - Browse repository at this point
Copy the full SHA f65473dView commit details -
Merge pull request oneapi-src#1033 from jandres742/getlasterror
[UR][L0] Add support for urAdapterGetLastError in L0
Configuration menu - View commit details
-
Copy full SHA for ba27855 - Browse repository at this point
Copy the full SHA ba27855View commit details -
Merge pull request oneapi-src#1028 from jandres742/listhostsync
[UR][L0] Add support for zeCommandListHostSynchronize
Configuration menu - View commit details
-
Copy full SHA for 62e6d2f - Browse repository at this point
Copy the full SHA 62e6d2fView commit details -
Merge pull request oneapi-src#1022 from 0x12CC/l0_usm_error_checking_2
[UR][L0] Propagate OOM errors from `USMAllocationMakeResident`
Configuration menu - View commit details
-
Copy full SHA for ec7982b - Browse repository at this point
Copy the full SHA ec7982bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 63dfb35 - Browse repository at this point
Copy the full SHA 63dfb35View commit details -
Configuration menu - View commit details
-
Copy full SHA for 877fc5b - Browse repository at this point
Copy the full SHA 877fc5bView commit details -
Merge pull request oneapi-src#1045 from kswiecicki/windows-min-max-fix
[SYCL][UR][L0] Distinguish min/max calls from macros
Configuration menu - View commit details
-
Copy full SHA for 2f44433 - Browse repository at this point
Copy the full SHA 2f44433View commit details
Commits on Nov 9, 2023
-
[SYCL][NATIVECPU] Refactor from PI plugin to UR adapter (#10523)
This PR refactors the current SYCL Native CPU implementation as a Unified Runtime adapter. No new functionality is added. Contents of this PR:
- Native CPU implementation divided into multiple source files at `sycl/plugins/unified_runtime/ur/adapters/native_cpu`
- Native CPU PI plugin is now implemented through the `pi2ur.hpp` interface
- _pi_* classes now have an equivalent ur_* class
- `UR_PLATFORM_BACKEND_NATIVE_CPU` value added to `ur2piPlatformInfoValue`
Configuration menu - View commit details
-
Copy full SHA for 58b72b8 - Browse repository at this point
Copy the full SHA 58b72b8View commit details -
[SYCL][UR] Bump UR and implement adapter handles (#10349)
Bump the Unified Runtime commit, and make the adapter changes needed for the newly added adapter handles (see oneapi-src#715 for details). This fixes #10066 by providing an implementation of `piPluginGetLastError` in pi2ur.
Configuration menu - View commit details
-
Copy full SHA for 7f2f12e - Browse repository at this point
Copy the full SHA 7f2f12eView commit details -
[SYCL][NATIVECPU] Fixed missing changes from latest UR version (#10658)
This PR fixes some errors while building the Native CPU plugin:
- Updated urPlatformGet signature
- Added missing function urAdapterGetLastError
Configuration menu - View commit details
-
Copy full SHA for 5e2e218 - Browse repository at this point
Copy the full SHA 5e2e218View commit details -
[SYCL][NATIVECPU] Support multiple SYCL targets in the same compiler …
…invocation (#10495)
This PR adds support for multiple SYCL targets alongside `native_cpu` in the same compiler invocation (e.g. `clang++ -fsycl -fsycl-targets=native_cpu,spir64 input.cpp`). To implement this we had to make changes to multiple components. Here is a quick overview:
* Driver: changes in the Driver allow it to correctly parse all the targets passed to `-fsycl-targets` (before, we were just looking for `native_cpu` and ignoring the others). The Driver now also calls `sycl-post-link` and `clang-offload-wrapper`, performing a compilation flow more similar to the one used for other targets.
* Sema: since the kernel name needs to be the same for all the SYCL targets, the change to the kernel name in Sema has been removed and replaced with an LLVM pass that is run when lowering the device module (`llvm/lib/SYCLLowerIR/RenameKernelSYCLNativeCPU.cpp`).
* Runtime: the definition of `_pi_program` in the Native CPU plug-in now supports multiple kernels in one program, and the `__SYCL_PI_DEVICE_BINARY_TARGET_NATIVE_CPU` binary type has been added in order to identify kernels compiled for Native CPU.
* clang-offload-wrapper: for Native CPU, the offload wrapper doesn't bundle the device code in the host module, but instead produces an array containing function declarations that are resolved by the linker; see `sycl/doc/design/SYCLNativeCPU.md` for more information.
---------
Co-authored-by: Uwe Dolinsky <uwe@codeplay.com>
Co-authored-by: Alexey Sachkov <alexey.sachkov@intel.com>
Co-authored-by: Steffen Larsen <steffen.larsen@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 28ec448 - Browse repository at this point
Copy the full SHA 28ec448View commit details -
[SYCL][NATIVECPU] Initial support for memory copy operations (#10808)
This PR adds some initial support for memory copy operations to the NativeCPU device. It also fixes global offsets and the query for queue properties in the NativeCPU PI/UR adapter.
Configuration menu - View commit details
-
Copy full SHA for 382b27d - Browse repository at this point
Copy the full SHA 382b27dView commit details -
[SYCL][NATIVECPU] UR adapter updates (#10857)
Removes some unneeded UR_ASSERTs. Adds an initial GlobalTimeStamp implementation for `sycl-ls`.
Configuration menu - View commit details
-
Copy full SHA for 82ed146 - Browse repository at this point
Copy the full SHA 82ed146View commit details -
[SYCL][NFC] Remove executable permission from text files (#10916)
Processed sycl, libclc and libdevice directories.
Configuration menu - View commit details
-
Copy full SHA for 78fc5e7 - Browse repository at this point
Copy the full SHA 78fc5e7View commit details -
[UR][CUDA][HIP][L0] Cleanup licence header (#10824)
Fix the license headers at the top of each source file in the unified runtime directory. --------- Co-authored-by: Alexey Bader <alexey.bader@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 0d302ab - Browse repository at this point
Copy the full SHA 0d302abView commit details -
[SYCL][NATIVECPU] more ur adapter implementations (#10972)
Adding more UR adapter functionality to pass more tests related to buffers and subbuffers, and to enable `sycl-ls --verbose`.
Configuration menu - View commit details
-
Copy full SHA for c077125 - Browse repository at this point
Copy the full SHA c077125View commit details -
[SYCL][NATIVECPU] Added remaining UR interface functions, PI exports …
…and ABI Check (#10883) This PR adds the remaining (not fully implemented) UR interface functions to the NativeCPU/UR adapter. The NativeCPU PI plugin now also exports all PI interface functions and an ABI check was added.
Configuration menu - View commit details
-
Copy full SHA for 27bbfcf - Browse repository at this point
Copy the full SHA 27bbfcfView commit details -
[SYCL][NATIVECPU][LIBCLC] Use libclc for SYCL Native CPU (#10970)
This PR allows linking to libclc when compiling for SYCL Native CPU. Currently only the `x86_64-unknown-linux-gnu` target triple is supported; additional target triples (and possibly a more versatile way of setting them) will come with follow-up PRs. Some useful information for reviewing:
* We start using an `AddrSpaceMap` (set in `TargetInfo.cpp`) because the mangled names emitted by the device compiler need to match the names provided by `libclc`. The `AddrSpaceMap` is taken from the `PTX` target.
* Changes in `Driver` are needed to find and link to `libclc`.
* `libclc/ptx-nvidiacl/libspirv/atomic/loadstore_helpers.ll` has been split into 4 modules, one for each memory ordering constraint. Copies of these modules have been added in `generic` (because some functions in `generic/libspirv/atomic` needed them), and the module split allows specializing the file for targets that may not support some orderings. Currently only a couple of functions for `acquire` and `seq_cst` have been implemented for `generic`; the others will be implemented in a follow-up PR.
* We've added a target in `libclc` for `x86_64-unknown-linux`. This has been done because some math builtins in `generic` have been defined as
```
typedef char vec __attribute__((ext_vector_type(8)));
__attribute__((overloadable)) vec __clc_native_popcount(vec x) __asm("llvm.ctpop" ".v16i" "8");
vec call(vec x) { return __clc_native_popcount(x); }
```
While this approach conveniently allows calling LLVM intrinsics directly, it does not seem to play well with the ABI for `x86_64-unknown-linux`, since it leads to this IR:
```
define dso_local double @call(double noundef %x.coerce) #0 {
entry:
  %0 = bitcast double %x.coerce to <8 x i8>
  %1 = bitcast <8 x i8> %0 to double
  %call = call double @llvm.ctpop.v8i8(double noundef %1) #8
  %2 = bitcast double %call to <8 x i8>
  %3 = bitcast <8 x i8> %2 to double
  ret double %3
}
```
This is invalid because `llvm.ctpop.v8i8` expects a vector of `i8` and not a `double`, leading to failing asserts in the compiler that prevented `libclc` from building. As a temporary workaround we have added empty files that override the files in `generic` when building for `x86_64-unknown-linux`, allowing the build to complete, even though the corresponding builtins will be missing from the library. We are working on a proper solution for this.
---------
Co-authored-by: Uwe Dolinsky <uwe@codeplay.com>
Configuration menu - View commit details
-
Copy full SHA for e1fbecc - Browse repository at this point
Copy the full SHA e1fbeccView commit details -
[SYCL] [NATIVECPU] Implement urKernelSetArgLocal (#11101)
This PR adds support for `local_accessors` by implementing `urKernelSetArgLocal`.
Configuration menu - View commit details
-
Copy full SHA for 6009642 - Browse repository at this point
Copy the full SHA 6009642View commit details -
[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)
Fetch the latest revision of Unified Runtime. Notable changes:
* Some command buffer entry points have been renamed, and two have been added: membufferFill and USMFill.
* urInit/urTearDown have been removed and replaced with loader-only versions.
Configuration menu - View commit details
-
Copy full SHA for 91a74ca - Browse repository at this point
Copy the full SHA 91a74caView commit details -
[SYCL] Add esimd as an optional device feature (#10017)
This change adds a new aspect for esimd, `ext_intel_esimd`, and annotates the two fundamental esimd classes, `simd_obj_impl` and `simd_view_impl`, with the `uses_aspect` attribute. `simd_obj_impl` is the base class of `simd` and `simd_mask`, which are the fundamental user-facing classes. `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not a base of `simd_view_impl`, but every `simd_view_impl` requires a reference to a `simd` or `simd_mask` at construction time, so I am not sure if we truly need to annotate `simd_view_impl`, but I added it to be safe.
It also adds a new PI device info query, `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT`, that is used to query at runtime whether a device supports ESIMD. For UR-based plugins, we map that to `UR_DEVICE_INFO_ESIMD_SUPPORT`. The implementation simply returns false for cuda, hip and native_cpu. For l0 and opencl, we check that the device is an Intel GPU by verifying that the device type is GPU and the vendor ID is `0x8086`. For the ESIMD emulator we simply return true. I would appreciate careful review of the plugin changes in particular, as I am not an expert.
This change also updates the esimd spec to document the new aspect. In a future change, I plan to use the new aspect to remove the requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros that set function attributes, but I am not doing that as part of this PR as it requires more investigation.
---------
Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
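The l0/opencl check described in this commit message (device type is GPU and vendor ID is `0x8086`) could look roughly like the sketch below. `DeviceType`, `DeviceInfo`, and `supportsEsimd` are hypothetical names introduced for illustration; the real adapters obtain these values through their own device queries.

```cpp
#include <cstdint>

// Hypothetical device description (illustrative only, not a UR type).
enum class DeviceType { CPU, GPU, Other };

struct DeviceInfo {
  DeviceType type;
  std::uint32_t vendorId; // PCI vendor ID reported by the backend.
};

// PCI vendor ID for Intel, as quoted in the commit message.
constexpr std::uint32_t kIntelVendorId = 0x8086;

// Reports ESIMD support only for Intel GPUs, mirroring the rule described
// for the l0 and opencl adapters.
bool supportsEsimd(const DeviceInfo &device) {
  return device.type == DeviceType::GPU && device.vendorId == kIntelVendorId;
}
```

Backends that never support ESIMD (cuda, hip, native_cpu in the description above) would simply return false without inspecting the device.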
Configuration menu - View commit details
-
Copy full SHA for b866b01 - Browse repository at this point
Copy the full SHA b866b01View commit details -
Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)
Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for the past few days.
Configuration menu - View commit details
-
Copy full SHA for 833e4e3 - Browse repository at this point
Copy the full SHA 833e4e3View commit details -
[SYCL] [NATIVECPU] Remove dependencies to sycl in UR adapter (#11685)
This PR removes dependencies on the sycl headers/library in the Native CPU UR adapter. `sycl/include/sycl/detail/native_cpu.hpp` has been moved to `sycl/plugins/unified_runtime/ur/adapters/native_cpu/nativecpu_state.hpp`, and the definitions of work item builtins have been moved from that header to the compiler, which now emits them in the `PrepareSYCLNativeCPUPass`.
Configuration menu - View commit details
-
Copy full SHA for 0868c6e - Browse repository at this point
Copy the full SHA 0868c6eView commit details
Commits on Nov 10, 2023
-
Merge pull request oneapi-src#1044 from aarongreig/aaron/clCTSFixMega…
…Branch [OpenCL] Combined CTS fixes
Configuration menu - View commit details
-
Copy full SHA for 192e940 - Browse repository at this point
Copy the full SHA 192e940View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7928a9b - Browse repository at this point
Copy the full SHA 7928a9bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 858938d - Browse repository at this point
Copy the full SHA 858938dView commit details -
Configuration menu - View commit details
-
Copy full SHA for d4e45e1 - Browse repository at this point
Copy the full SHA d4e45e1View commit details -
Configuration menu - View commit details
-
Copy full SHA for e0e55b9 - Browse repository at this point
Copy the full SHA e0e55b9View commit details -
Configuration menu - View commit details
-
Copy full SHA for dd1e816 - Browse repository at this point
Copy the full SHA dd1e816View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0c7b74e - Browse repository at this point
Copy the full SHA 0c7b74eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 716cd8f - Browse repository at this point
Copy the full SHA 716cd8fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 08ca4be - Browse repository at this point
Copy the full SHA 08ca4beView commit details -
Merge commit 'b12b606c35bb7469c1dfdf932e2917238635f02d' into merge-so…
…me-main-changes-into-adapters-second-patch
Configuration menu - View commit details
-
Copy full SHA for 7c7bf22 - Browse repository at this point
Copy the full SHA 7c7bf22View commit details -
Configuration menu - View commit details
-
Copy full SHA for 07ad4f8 - Browse repository at this point
Copy the full SHA 07ad4f8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 02baece - Browse repository at this point
Copy the full SHA 02baeceView commit details -
Configuration menu - View commit details
-
Copy full SHA for 65bfeab - Browse repository at this point
Copy the full SHA 65bfeabView commit details -
Configuration menu - View commit details
-
Copy full SHA for 863d230 - Browse repository at this point
Copy the full SHA 863d230View commit details
Commits on Nov 12, 2023
-
[UR][L0] Add UR_L0_LEAKS_DEBUG key
Use a new environment variable, UR_L0_LEAKS_DEBUG, to check for leaks in the UR L0 adapter, instead of relying on a specific value being set in UR_L0_DEBUG. Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
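A dedicated environment variable such as `UR_L0_LEAKS_DEBUG` can be read with `std::getenv`. The sketch below is an assumption about how such a flag might be parsed (any non-zero integer enables it); the helper names `envFlagEnabled` and `leakChecksEnabled` are hypothetical and the adapter's actual parsing may differ.

```cpp
#include <cstdlib>

// Returns true when the named environment variable is set to a non-zero
// integer. Unset variables and non-numeric values are treated as disabled.
// (Assumed parsing rule, for illustration only.)
bool envFlagEnabled(const char *name) {
  const char *value = std::getenv(name);
  if (value == nullptr)
    return false;
  return std::atoi(value) != 0;
}

// Leak checking gets its own key instead of piggybacking on a particular
// value of a general debug variable.
bool leakChecksEnabled() { return envFlagEnabled("UR_L0_LEAKS_DEBUG"); }
```

With this shape, running a program with `UR_L0_LEAKS_DEBUG=1` set in the environment would turn on leak checks independently of any other debug setting.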
Jaime Arteaga committed Nov 12, 2023 Configuration menu - View commit details
-
Copy full SHA for 3da2133 - Browse repository at this point
Copy the full SHA 3da2133View commit details
Commits on Nov 13, 2023
-
Configuration menu - View commit details
-
Copy full SHA for b22a0ae - Browse repository at this point
Copy the full SHA b22a0aeView commit details
Commits on Nov 14, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 46cf37f - Browse repository at this point
Copy the full SHA 46cf37fView commit details -
[CUDA] Update hint functions to only return warnings
- The UR spec was recently changed to guarantee that hint entry points never return errors. This commit changes the CUDA adapter to conform to this change.
- This commit also changes the type of PointerRangeSize, which was causing stack corruption.
Configuration menu - View commit details
-
Copy full SHA for 8945db4 - Browse repository at this point
Copy the full SHA 8945db4View commit details -
[CUDA] Add support for binary type query
CUDA does not make a distinction between binary types (it treats PTX and binaries using the same entry points). However, for UR, by definition:
* urProgramCompile should set the binary type to UR_PROGRAM_BINARY_TYPE_COMPILED_OBJECT.
* urProgramBuild / urProgramLink should set it to UR_PROGRAM_BINARY_TYPE_EXECUTABLE.
* urProgramCreateWithBinary should set the binary type to UR_PROGRAM_BINARY_TYPE_COMPILED_OBJECT.
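The mapping listed above amounts to recording a binary type on the program object each time one of these entry points runs. A minimal sketch, using a hypothetical `Program` struct and `BinaryType` enum in place of the adapter's real types:

```cpp
// Hypothetical mirror of the UR binary types named in the commit message.
enum class BinaryType { None, CompiledObject, Executable };

// Hypothetical program object; the real adapter tracks far more state.
struct Program {
  BinaryType binaryType = BinaryType::None;
};

// Each helper models the bookkeeping one entry point would perform.
void onCreateWithBinary(Program &p) { p.binaryType = BinaryType::CompiledObject; }
void onCompile(Program &p) { p.binaryType = BinaryType::CompiledObject; }
void onBuild(Program &p) { p.binaryType = BinaryType::Executable; }
void onLink(Program &p) { p.binaryType = BinaryType::Executable; }
```

A binary type query then only has to return the stored value, even though the CUDA backend itself draws no such distinction.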
Configuration menu - View commit details
-
Copy full SHA for 7ea615a - Browse repository at this point
Copy the full SHA 7ea615aView commit details -
Configuration menu - View commit details
-
Copy full SHA for d164792 - Browse repository at this point
Copy the full SHA d164792View commit details -
[NATIVECPU] Add missing DDI table entries for BindlessImagesExp, Comm…
…andBufferExp, PhysicalMem, USMExp, VirtualMem.
Configuration menu - View commit details
-
Copy full SHA for f94550b - Browse repository at this point
Copy the full SHA f94550bView commit details -
Configuration menu - View commit details
-
Copy full SHA for a42adf2 - Browse repository at this point
Copy the full SHA a42adf2View commit details -
[OpenCL] Address coverity issue with dodgy struct-to-array casts
Also fix a similar issue Coverity had with CreateProgramWithBinary, which was passing array pointers straight through to the CL entry point; these parameters are now placed in local const arrays.
Configuration menu - View commit details
-
Copy full SHA for 4cf02cb - Browse repository at this point
Copy the full SHA 4cf02cbView commit details -
Merge pull request oneapi-src#1059 from martygrant/martin/moveNativeC…
…PUAdapterToUR [NATIVECPU] Move Native CPU adapter to UR.
Configuration menu - View commit details
-
Copy full SHA for 659d3f4 - Browse repository at this point
Copy the full SHA 659d3f4View commit details -
Configuration menu - View commit details
-
Copy full SHA for d86aada - Browse repository at this point
Copy the full SHA d86aadaView commit details -
Configuration menu - View commit details
-
Copy full SHA for dad6534 - Browse repository at this point
Copy the full SHA dad6534View commit details
Commits on Nov 15, 2023
-
Merge pull request oneapi-src#979 from lukaszstolarczuk/fix-conda-wor…
…kflow [CI] Clean up after conda init in cmake.yml
Configuration menu - View commit details
-
Copy full SHA for 9fc8230 - Browse repository at this point
Copy the full SHA 9fc8230View commit details -
Merge pull request oneapi-src#1077 from fabiomestre/fabio/combines_fi…
…xes_cuda_hip [CUDA][HIP] Combined CTS Fixes
Configuration menu - View commit details
-
Copy full SHA for 534071e - Browse repository at this point
Copy the full SHA 534071eView commit details -
Merge branch 'merge-some-main-changes-into-adapters-second-patch' int…
…o merge-some-main-changes-into-adapters-third-patch
Configuration menu - View commit details
-
Copy full SHA for 4a9468f - Browse repository at this point
Copy the full SHA 4a9468fView commit details
Commits on Nov 16, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 461d7b7 - Browse repository at this point
Copy the full SHA 461d7b7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2652a77 - Browse repository at this point
Copy the full SHA 2652a77View commit details -
[UR][L0] Add support for passing device list to urProgramBuild
piProgramBuild receives a list of devices, while urProgramBuild does not. This produces a series of issues when a UR program needs to be created for a specific device. So define a new API, called urProgramBuildExp, to pass this list. Authored-by: jaime.a.arteaga.molina@intel.com
Configuration menu - View commit details
-
Copy full SHA for c583ecc - Browse repository at this point
Copy the full SHA c583eccView commit details -
[UR][L0] Add multi-device-compile experimental feature
Expand upon the introduction of `urProgramBuildExp` and include `urProgramCompileExp` and `urProgramLinkExp` which include a device-list in place of a context. These more closely align with the PI/OpenCL analogues but only to introduce device-lists, not all extant arguments from those entry-points. This patch also moves the `urProgramBuildExp` definition into an experimental feature file and introduces a brief document containing motivation.
Configuration menu - View commit details
-
Copy full SHA for 92e608f - Browse repository at this point
Copy the full SHA 92e608fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1bb0543 - Browse repository at this point
Copy the full SHA 1bb0543View commit details -
[UR][L0] Add support for passing device list to urProgramBuild/Link/C…
…ompile piProgramBuild receives a list of devices, while urProgramBuild does not. This produces a series of issues when a UR program needs to be created for a specific device. So define a new API, called urProgramBuildExp to pass this list. Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com> Co-authored-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
Configuration menu - View commit details
-
Copy full SHA for a42a607 - Browse repository at this point
Copy the full SHA a42a607View commit details -
[L0] Hook up multi-device compile entry points
Add `urProgramBuildExp`, `urProgramCompileExp`, and `urProgramLinkExp` to the loader `ur_program_exp_dditable_t`. Also add `"ur_exp_multi_device_compile"` to the list of extensions supported by the L0 adapter, which enables the SYCL RT to query support.
Configuration menu - View commit details
-
Copy full SHA for 26b9829 - Browse repository at this point
Copy the full SHA 26b9829View commit details -
[L0] Fix urProgramLinkExp argument order
Align the `urProgramLinkExp` spec and implementation argument orders to fix Windows link error.
Configuration menu - View commit details
-
Copy full SHA for 4b5e559 - Browse repository at this point
Copy the full SHA 4b5e559View commit details
Commits on Nov 17, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 686b3d7 - Browse repository at this point
Copy the full SHA 686b3d7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 21e7c46 - Browse repository at this point
Copy the full SHA 21e7c46View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2300531 - Browse repository at this point
Copy the full SHA 2300531View commit details -
Fix urProgramCompileExp, urProgramBuildExp, and urProgramLinkExp defi…
…nition to match spec Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
Configuration menu - View commit details
-
Copy full SHA for 0790bf8 - Browse repository at this point
Copy the full SHA 0790bf8View commit details
Commits on Nov 20, 2023
-
Merge pull request oneapi-src#1048 from callumfare/opencl_fix_urMemBu…
…fferCreate_leak [OpenCL] Fix memory leak and coverity issue with struct-to-array casts
Configuration menu - View commit details
-
Copy full SHA for 04799e7 - Browse repository at this point
Copy the full SHA 04799e7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5ef979b - Browse repository at this point
Copy the full SHA 5ef979bView commit details -
Configuration menu - View commit details
-
Copy full SHA for ca374bc - Browse repository at this point
Copy the full SHA ca374bcView commit details -
Configuration menu - View commit details
-
Copy full SHA for bffa67d - Browse repository at this point
Copy the full SHA bffa67dView commit details
Commits on Nov 21, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 413de31 - Browse repository at this point
Copy the full SHA 413de31View commit details -
Merge pull request oneapi-src#934 from nrspruit/fixprogrambuild_updated
[UR][L0] Add support for passing device list to urProgramBuild/Link/Compile
Configuration menu - View commit details
-
Copy full SHA for ce152a6 - Browse repository at this point
Copy the full SHA ce152a6View commit details -
[CUDA][HIP][OpenCL][NATIVECPU] Fix multi-device compile
Ensure that all adapters have the correct signatures for the multi-device compile experimental feature entry points and that the entry points exist even when returning `UR_RESULT_ERROR_UNSUPPORTED_FEATURE`.
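The idea of keeping an entry point present even when the feature is unsupported can be illustrated with a stub in a dispatch table. Everything below is hypothetical (the `Result` enum, the simplified `BuildExpFn` signature, and `ProgramExpTable` are not the real UR types); it only shows why a stub is safer than a null slot.

```cpp
// Hypothetical result codes standing in for the UR ones.
enum class Result { Success, UnsupportedFeature };

// Simplified, made-up signature for an experimental build entry point.
using BuildExpFn = Result (*)(unsigned numDevices);

// Hypothetical dispatch table slot; callers invoke through this pointer.
struct ProgramExpTable {
  BuildExpFn buildExp = nullptr;
};

// Stub for adapters that do not implement the feature: it exists with the
// right signature and reports the feature as unsupported.
Result buildExpUnsupported(unsigned /*numDevices*/) {
  return Result::UnsupportedFeature;
}

// Fills the table so that every slot is callable.
void populateTable(ProgramExpTable &table) {
  table.buildExp = buildExpUnsupported;
}
```

A caller can then invoke the entry point unconditionally and branch on the returned code, rather than checking for a missing function first.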
Configuration menu - View commit details
-
Copy full SHA for e001b98 - Browse repository at this point
Copy the full SHA e001b98View commit details -
Configuration menu - View commit details
-
Copy full SHA for 841a287 - Browse repository at this point
Copy the full SHA 841a287View commit details -
Configuration menu - View commit details
-
Copy full SHA for e2f9eb0 - Browse repository at this point
Copy the full SHA e2f9eb0View commit details -
Configuration menu - View commit details
-
Copy full SHA for e413c6c - Browse repository at this point
Copy the full SHA e413c6cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 2b77f79 - Browse repository at this point
Copy the full SHA 2b77f79View commit details -
Configuration menu - View commit details
-
Copy full SHA for 18d333f - Browse repository at this point
Copy the full SHA 18d333fView commit details -
Merge pull request oneapi-src#1100 from kbenzie/benie/fixup-multi-dev…
…ice-compile [CUDA][HIP][OpenCL][NATIVECPU] Fix multi-device compile
Configuration menu - View commit details
-
Copy full SHA for 71957e8 - Browse repository at this point
Copy the full SHA 71957e8View commit details
Commits on Nov 22, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 82d4cef - Browse repository at this point
Copy the full SHA 82d4cefView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c8e5c4 - Browse repository at this point
Copy the full SHA 9c8e5c4View commit details -
Merge pull request oneapi-src#1053 from jandres742/url0leakkey
[UR][L0] Add UR_L0_LEAKS_DEBUG key
Configuration menu - View commit details
-
Copy full SHA for 31b654f - Browse repository at this point
Copy the full SHA 31b654fView commit details -
Configuration menu - View commit details
-
Copy full SHA for b7731bc - Browse repository at this point
Copy the full SHA b7731bcView commit details -
Merge pull request oneapi-src#966 from martygrant/martin/openclComman…
…dBuffers [OpenCL] Add Command Buffer extension to OpenCL adapter.
Configuration menu - View commit details
-
Copy full SHA for 109ed46 - Browse repository at this point
Copy the full SHA 109ed46View commit details
Commits on Nov 23, 2023
-
Configuration menu - View commit details
-
Copy full SHA for d0319b2 - Browse repository at this point
Copy the full SHA d0319b2View commit details
Commits on Nov 24, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 1b9fdc6 - Browse repository at this point
Copy the full SHA 1b9fdc6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8262de6 - Browse repository at this point
Copy the full SHA 8262de6View commit details -
Merge pull request oneapi-src#936 from hdelan/revert-hip-prefetch
[HIP] Revert add prefetch for USM hip allocations a6b8fa66b537753415d24076f…
Configuration menu - View commit details
-
Copy full SHA for 28cf40f - Browse repository at this point
Copy the full SHA 28cf40fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 712d791 - Browse repository at this point
Copy the full SHA 712d791View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7074cf9 - Browse repository at this point
Copy the full SHA 7074cf9View commit details -
Merge pull request oneapi-src#1122 from kbenzie/benie/cmake-bump-v0.9.0
[CMake] Bump version to v0.9.0
Configuration menu - View commit details
-
Copy full SHA for 1cc70f2 - Browse repository at this point
Copy the full SHA 1cc70f2View commit details -
Merge pull request oneapi-src#1037 from al42and/fix-cuda-11-8-compat
[UR][CUDA] Fix compatibility with CUDA 11.x
Configuration menu - View commit details
-
Copy full SHA for e197941 - Browse repository at this point
Copy the full SHA e197941View commit details -
Merge pull request oneapi-src#1097 from PietroGhg/pietro/refcounting
[NATIVECPU] Implement correct reference counting for Native CPU adapter
Configuration menu - View commit details
-
Copy full SHA for b4e665f - Browse repository at this point
Copy the full SHA b4e665fView commit details -
Merge pull request oneapi-src#1106 from wenju-he/fix-native_cpu-build…
…-warnings [NATIVECPU] Fix build errors when -Werror=on
Configuration menu - View commit details
-
Copy full SHA for 433b952 - Browse repository at this point
Copy the full SHA 433b952View commit details -
Merge pull request oneapi-src#1039 from omarahmed1111/eliminate-usage…
…-of-regex-in-opencl [UR][OPENCL] Eliminate usage of regex in opencl
Configuration menu - View commit details
-
Copy full SHA for 0250c64 - Browse repository at this point
Copy the full SHA 0250c64View commit details -
[OpenCL] Fetch headers/icd-loader by default
Introduces the following CMake options:
* `UR_OPENCL_INCLUDE_DIR` - directory containing the OpenCL Headers
* `UR_OPENCL_ICD_LOADER_LIBRARY` - path of the OpenCL ICD Loader library
In the event that `UR_OPENCL_INCLUDE_DIR` is not specified, clone [KhronosGroup/OpenCL-Headers](https://github.com/KhronosGroup/OpenCL-Headers.git). In the event that `UR_OPENCL_ICD_LOADER_LIBRARY` is not specified, first inspect the system with `find_package(OpenCL 3.0)` and use that if found. Otherwise, clone [KhronosGroup/OpenCL-ICD-Loader](https://github.com/KhronosGroup/OpenCL-ICD-Loader.git).
Configuration menu - View commit details
-
Copy full SHA for 9f4b0a6 - Browse repository at this point
Copy the full SHA 9f4b0a6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8a05674 - Browse repository at this point
Copy the full SHA 8a05674View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0c4585f - Browse repository at this point
Copy the full SHA 0c4585fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6502de1 - Browse repository at this point
Copy the full SHA 6502de1View commit details -
Merge pull request oneapi-src#993 from fabiomestre/fabio/opencl_ci
[CI] Add support for OpenCL CI on CPU
Configuration menu - View commit details
-
Copy full SHA for 41228d3 - Browse repository at this point
Copy the full SHA 41228d3View commit details
Commits on Nov 27, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 79457b2 - Browse repository at this point
Copy the full SHA 79457b2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1c85750 - Browse repository at this point
Copy the full SHA 1c85750View commit details -
Configuration menu - View commit details
-
Copy full SHA for f6c476d - Browse repository at this point
Copy the full SHA f6c476dView commit details
Commits on Nov 28, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 2a11e49 - Browse repository at this point
Copy the full SHA 2a11e49View commit details -
Merge pull request oneapi-src#1088 from hdelan/ifdef-hip-5.6
[HIP] Ifdef entry point introduced in HIP 5.6
Configuration menu - View commit details
-
Copy full SHA for 5e914c5 - Browse repository at this point
Copy the full SHA 5e914c5View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3e1f163 - Browse repository at this point
Copy the full SHA 3e1f163View commit details
Commits on Nov 29, 2023
Commit a32be71
Commits on Nov 30, 2023
Commit e770fdd
Merge pull request oneapi-src#1141 from kbenzie/benie/disable-windows-l0-clang-cl
[GHA] Disable Windows L0 clang-cl job
Commit a99d1db
Commit a92a989
Commit 4cffc1d
Commit f2ca7a9
Commit 681e607
Merge pull request oneapi-src#1050 from szadam/adapters_ci_info
Add system info in CI workflows run on self-hosted runners
Commit bd76c51
Merge pull request oneapi-src#1072 from omarahmed1111/merge-some-main-changes-into-adapters-third-patch
Merge main into adapters branch
Commit 47af3ee
Commits on Dec 1, 2023
Merge pull request oneapi-src#1102 from hdelan/adapter-batch1
[HIP] Adapter PR batch
Commit fe5bc76
Merge pull request oneapi-src#1126 from martygrant/martin/buildAllAdapters
CMake Flag for building all adapters
Commit 53a429f
Commits on Dec 4, 2023
Commit a3bf168
Merge pull request oneapi-src#1150 from kswiecicki/adapters-val-adapter-refcount-fix
[UR] Fix adapter refcount tracking
Commit 670b30f
Merge pull request oneapi-src#1108 from AllanZyne/review/yang/urAdapterGet
[UR] Fix urAdapterGet ignores NumEntries
Commit fe2735a
Allow fixtures derived from KernelTest to defer compilation.
Separating the device code compilation from the rest of the setup lets a test query device or platform properties to decide whether it should skip without potentially hitting a compilation error because the kernel being built as part of the setup uses an unsupported feature.
Commit 512b158
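The deferred-compilation idea described in this commit can be illustrated with a minimal sketch. Everything named here (`FakeDevice`, `DeferredKernelFixture`, `RunTest`, `TestOutcome`) is hypothetical and is not taken from the Unified Runtime test suite; the sketch only assumes that setup and compilation are separate steps, so a test can query the device and skip before any kernel is built.

```cpp
// Hypothetical stand-in for a device or platform capability query.
struct FakeDevice {
    bool supports_feature;
};

// Hypothetical fixture: SetUp() prepares state only, Build() compiles on demand.
class DeferredKernelFixture {
  public:
    explicit DeferredKernelFixture(FakeDevice device) : device_(device) {}

    // Cheap setup: no device code is compiled here.
    void SetUp() { set_up_ = true; }

    // Explicit compile step, called only by tests that decided not to skip.
    // Returns false to model a compilation error on an unsupported device.
    bool Build() {
        if (!set_up_ || !device_.supports_feature) {
            return false;
        }
        built_ = true;
        return true;
    }

    bool IsBuilt() const { return built_; }
    const FakeDevice &Device() const { return device_; }

  private:
    FakeDevice device_;
    bool set_up_ = false;
    bool built_ = false;
};

enum class TestOutcome { Skipped, Passed, Failed };

// Test body: query first, skip if unsupported, and build only afterwards.
inline TestOutcome RunTest(DeferredKernelFixture &fixture) {
    fixture.SetUp();
    if (!fixture.Device().supports_feature) {
        return TestOutcome::Skipped;
    }
    return fixture.Build() ? TestOutcome::Passed : TestOutcome::Failed;
}
```

With this split, a test on a device lacking the feature returns `Skipped` without ever reaching `Build()`, which is the failure mode the commit message aims to avoid.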
[CTS] Update ctest_parser.py script
Update `scripts/ctest_parser.py` to work with the adapter branch changes.
Commit 3c568c3
Commit 5dc6e46
Commit f0e0be2
Merge pull request oneapi-src#1153 from aarongreig/aaron/deferBuildKernelFixtures
Allow fixtures derived from KernelTest to defer compilation.
Commit 9c6e151
Add bounds checking for enqueue operations to the validation layer.
This is accomplished with the various size queries for buffers, images and USM allocations. Since not all adapters implement these queries, the bounds checking is not yet comprehensive on all platforms.
Commit 4b7f70f
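As a rough illustration of the kind of check this commit describes, the sketch below validates an offset and length against a queried allocation size. The names `CheckResult`, `AllocationInfo` and `CheckBounds` are hypothetical and do not come from the validation layer; the `size_known` flag is an assumption that models adapters where the size query is not implemented.

```cpp
#include <cstddef>

// Hypothetical result codes for this sketch only.
enum class CheckResult { Ok, OutOfBounds, SizeUnknown };

// Models the outcome of a size query that some adapters may not implement.
struct AllocationInfo {
    bool size_known;
    std::size_t size;
};

// Checks that [offset, offset + length) lies inside the allocation.
inline CheckResult CheckBounds(const AllocationInfo &alloc, std::size_t offset,
                               std::size_t length) {
    if (!alloc.size_known) {
        // Without a size query the access cannot be validated.
        return CheckResult::SizeUnknown;
    }
    // Overflow-safe form of: offset + length <= alloc.size
    if (offset > alloc.size || length > alloc.size - offset) {
        return CheckResult::OutOfBounds;
    }
    return CheckResult::Ok;
}
```

Comparing `length` against `alloc.size - offset` rather than computing `offset + length` keeps the check correct even when the sum would wrap around.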
Merge pull request oneapi-src#1154 from kbenzie/benie/update-ctest_parser
[CTS] Update ctest_parser.py script
Commit 9b1fb4e
Commits on Dec 5, 2023
Merge pull request oneapi-src#999 from hdelan/hip-adapter-multi-dev-ctx
[HIP] Hip adapter multi dev ctx
Commit 67e7da3
Merge pull request oneapi-src#1093 from aarongreig/aaron/addBoundsCheckingValidation
Add bounds checking for enqueue operations to the validation layer.
Commit 9e3bb2f