Merge adapters branch into main #1158

Closed
wants to merge 614 commits into from
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Sep 28, 2023

  1. [UR][L0] Propagate errors from USMAllocationMakeResident

    This change ensures that USM allocation APIs don't return
    `UR_RESULT_SUCCESS` when an error occurs within
    `USMAllocationMakeResident`.
    
    Signed-off-by: Michael Aziz <michael.aziz@intel.com>
    0x12CC committed Sep 28, 2023
    e5d6a91
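
The change described above, returning a helper's failure instead of unconditionally reporting success, can be sketched as follows. This is a minimal illustration and not the adapter's code: `Result`, `makeResident`, `allocIgnoringErrors`, and `allocPropagatingErrors` are hypothetical stand-ins for `ur_result_t`, `USMAllocationMakeResident`, and the USM allocation entry points.

```cpp
#include <cstddef>
#include <cstdlib>

// Hypothetical result type standing in for ur_result_t.
enum class Result { Success, OutOfResources };

// Hypothetical stand-in for USMAllocationMakeResident: fails on request.
Result makeResident(void *ptr, bool simulateFailure) {
  (void)ptr;
  return simulateFailure ? Result::OutOfResources : Result::Success;
}

// Before: the helper's result is dropped, so the caller always sees Success.
Result allocIgnoringErrors(std::size_t size, bool simulateFailure, void **out) {
  *out = std::malloc(size);
  makeResident(*out, simulateFailure); // result discarded
  return Result::Success;
}

// After: a failure in the helper is returned to the caller, and the
// allocation is released so that it does not leak.
Result allocPropagatingErrors(std::size_t size, bool simulateFailure, void **out) {
  *out = std::malloc(size);
  Result res = makeResident(*out, simulateFailure);
  if (res != Result::Success) {
    std::free(*out);
    *out = nullptr;
    return res;
  }
  return Result::Success;
}
```

With a simulated failure, the first variant still reports success while the second returns the helper's error and hands back a null pointer.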

Commits on Sep 29, 2023

  1. [UR][Tests] fix match.py on-error printing

    Input and match files were printed for only one of the error cases.
    Now they are printed in both places (where this makes sense).
    
    This is an extension to commit: be85a49
    lukaszstolarczuk committed Sep 29, 2023
    fedcee6
  2. d6dc4c0
  3. 19597b2

Commits on Oct 2, 2023

  1. Merge pull request oneapi-src#904 from lukaszstolarczuk/fix-match.py

    [UR][Tests] Update match scripts
    pbalcer authored Oct 2, 2023
    187633a
  2. [SYCL][CUDA] Implement ext_oneapi_queue_priority (#11296)

    The E2E test assumes that the device supports priorities (sm_35 or
    newer).
    al42and authored and fabiomestre committed Oct 2, 2023
    6a0eb7e
  3. Merge pull request oneapi-src#903 from veselypeta/petr/move_hip

    [UR][HIP] Move HIP adapter sourcer to Unified Runtime
    veselypeta authored Oct 2, 2023
    1c6fa3c
  4. [UR][L0][CMake] Disable warning for ICX on Windows

    On Windows we may build L0 using ICX; this reverts changes to
    check WIN32 instead of MSVC, to fix build failures with ICX on Windows.
    jsji committed Oct 2, 2023
    9265d33
  5. Merge pull request oneapi-src#917 from jsji/adapters

    [UR][L0][CMake] Disable warning for ICX on Windows
    igchor authored Oct 2, 2023
    f1c17dc

Commits on Oct 3, 2023

  1. ec915f8

Commits on Oct 4, 2023

  1. Merge pull request oneapi-src#923 from pbalcer/l0-win-fix-build

    [L0] fix windows cmake scripts
    pbalcer authored Oct 4, 2023
    e6343f4
  2. [UR][L0] Copy prebuilt L0 to avoid leaking shared folder path

    We allow reusing existing pre-built L0 Libs and Headers.
    However, using LEVEL_ZERO_LIBRARY and LEVEL_ZERO_INCLUDE_DIR directly
    may leak the path of shared folder.
    
    This PR introduces two variables, L0_LIBRARY and L0_INCLUDE_DIR,
    and copies the L0 loader/headers locally to the build to avoid leaking their
    location on a shared drive.
    jsji committed Oct 4, 2023
    6a2c548

Commits on Oct 5, 2023

  1. Merge pull request oneapi-src#920 from jsji/localcopy

    [UR][L0] Copy prebuilt L0 to avoid leaking shared folder path
    pbalcer authored Oct 5, 2023
    b38855e

Commits on Oct 9, 2023

  1. Add CODEOWNERS file

    Adapter implementations are being moved into the repo, so we should mirror
    the code ownership of those implementations. This patch adds a
    CODEOWNERS file which matches the pre-adapter-move ownership, to be used
    as a baseline for future changes for each individual adapter.
    kbenzie committed Oct 9, 2023
    49770f5
  2. [CUDA] Fix queue creation with native handle

    The new priority parameter hadn't been reflected here, so the ownership
    property was being used as the priority and defaulted to `true`, which caused
    crashes.
    npmiller committed Oct 9, 2023
    b4425bc

Commits on Oct 10, 2023

  1. [HIP] Enable kernel finalization using comgr

    For kernel fusion support for HIP, we need to finalize the kernels using comgr.
    The patch finalizes tagged binaries during buildProgram
    before handing them over to the HIP runtime.
    
    Signed-off-by: Victor Lomuller <victor@codeplay.com>
    Naghasan committed Oct 10, 2023
    2fd9dea
  2. Re add extended deleters

    hdelan committed Oct 10, 2023
    55dccfc
  3. [UR][L0] Fix minor code duplication

    - Second return has no effect
    - Checking the same flag twice has no effect
    al42and committed Oct 10, 2023
    6180a78
  4. Merge pull request oneapi-src#909 from kbenzie/benie/codeowners

    Add CODEOWNERS file
    kbenzie authored Oct 10, 2023
    0677296
  5. 484cd07
  6. Merge pull request oneapi-src#906 from 0x12CC/l0_usm_error_checking

    [UR][L0] Propagate errors from `USMAllocationMakeResident`
    kbenzie authored Oct 10, 2023
    4954850
  7. Merge pull request oneapi-src#915 from al42and/fix-minor-typos-in-levelzero-adapter
    
    [UR][L0] Fix minor code duplication
    pbalcer authored Oct 10, 2023
    0d3fd4b

Commits on Oct 11, 2023

  1. a6f382a
  2. Use CUDA team for HIP adapter too

    The HIP team needs some fixing up currently so use the CUDA team for
    now.
    npmiller committed Oct 11, 2023
    463e7ce

Commits on Oct 12, 2023

  1. Change xfail list

    hdelan committed Oct 12, 2023
    b002e00
  2. Merge pull request oneapi-src#946 from npmiller/adapters-owners

    Add code owners for the CUDA and HIP adapters
    kbenzie authored Oct 12, 2023
    775af6e
  3. 3f831b2

Commits on Oct 13, 2023

  1. Merge pull request oneapi-src#955 from kbenzie/benie/codeowners-level-zero
    
    [UR][L0] Add adapter source to codeowners file
    kbenzie authored Oct 13, 2023
    a5d102a
  2. 9af962b
  3. Merge pull request oneapi-src#958 from kbenzie/benie/codeowners-hip

    Update HIP adapter codeowners to use specific team
    kbenzie authored Oct 13, 2023
    ab45620

Commits on Oct 16, 2023

  1. 496d694
  2. [Command-buffer][L0] Reset sync-point events

    The L0 events used to implement UR sync-points in
    a command-buffer aren't reset after the first execution,
    leaving them in a signalled state before subsequent submissions
    of the UR command-buffer.
    
    This patch resets the L0 events used as UR command-buffer sync-points
    to the non-signalled state, by appending event reset commands to the
    end of the user defined command-list. There is also a reset added for
    the UR command-buffer wait event that is signalled when the wait list
    dependencies of `urCommandBufferEnqueueExp` are met.
    
    Additionally this patch removes the host scope flag from being set on L0 events
    created by the UR command-buffer, as we never signal/reset them from host, only
    device.
    EwanC committed Oct 16, 2023
    e8e1415

Commits on Oct 17, 2023

  1. [Command-buffer][L0] Condense signalling

    Rather than having a command-list wide barrier and separate
    signal command, have a single barrier command which signals
    the UR command-buffer signal event and waits on the
    events that correspond to sync-points.
    
    Co-authored-by: Pablo Reble <pablo.reble@intel.com>
    EwanC and reble committed Oct 17, 2023
    6523932
  2. [SYCL][OpenCL] Port OpenCL plugin to Unified Runtime (#10490)

    Co-authored-by: Martin Morrison-Grant <martin.morrisongrant@codeplay.com>
    Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
    Co-authored-by: Callum Fare <callum@codeplay.com>
    Co-authored-by: aarongreig <aarongreig01@gmail.com>
    5 people committed Oct 17, 2023
    7c755c4
  3. [OpenCL] Follow up changes after moving the adapter

    - Update Cmake to use local adapter source files
    - Update license headers
    - Add virtual memory entrypoints to interface_loader
    - Add .clang-format
    - Update CODEOWNERS file for OpenCL adapter
    fabiomestre committed Oct 17, 2023
    1f149fb
  4. Merge pull request oneapi-src#943 from hdelan/re-add-extended-deleters

    [HIP] Re add extended deleters
    kbenzie authored Oct 17, 2023
    f0de2f4

Commits on Oct 18, 2023

  1. [OpenCL] Allow upstream to manually set the OpenCL target (oneapi-src#969)
    
    - Upstream CI is unable to use find_package() for OpenCL. This
    commit allows it to set the target manually and skip the call to
    find_package().
    - This commit also changes UR to use the OpenCL::OpenCL target.
    It provides a simpler alternative to import the opencl library
    without having to use add_library().
    fabiomestre authored Oct 18, 2023
    3653e58

Commits on Oct 19, 2023

  1. 77b705d
  2. 657ffde
  3. Set command-buffer experimental feature codeowners

    Set [unified-runtime-command-buffer-write](https://github.com/orgs/oneapi-src/teams/unified-runtime-command-buffer-write) as the codeowners of the command-buffer experimental feature.
    Including:
    * Adapter implementations
    * Specification document
    * YML definition
    EwanC committed Oct 19, 2023
    7d759ee

Commits on Oct 20, 2023

  1. [OpenCL] Add more mappings from CL error codes to UR error codes.

    Also merge urQueueCreate InvalidValueProperties test into
    InvalidQueueProperties test.
    aarongreig committed Oct 20, 2023
    2792092
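
The kind of error-code translation this commit refers to can be sketched as a single mapping function. The commit message does not list which mappings were added, so the cases below are illustrative only. `UrResult` and `mapClError` are hypothetical names standing in for `ur_result_t` and the adapter's helper. The numeric constants are the values the OpenCL headers define for the corresponding `CL_*` error codes.

```cpp
#include <cstdint>

// Numeric values of a few OpenCL error codes, as defined in CL/cl.h.
constexpr std::int32_t kClSuccess = 0;
constexpr std::int32_t kClOutOfHostMemory = -6;
constexpr std::int32_t kClInvalidValue = -30;
constexpr std::int32_t kClInvalidQueueProperties = -35;

// Hypothetical result enum standing in for ur_result_t.
enum class UrResult {
  Success,
  OutOfHostMemory,
  InvalidValue,
  InvalidQueueProperties,
  Unknown
};

// Translate a native OpenCL error code into the runtime's own result type.
// Codes without an explicit case fall back to a generic "unknown" error.
UrResult mapClError(std::int32_t clError) {
  switch (clError) {
  case kClSuccess:
    return UrResult::Success;
  case kClOutOfHostMemory:
    return UrResult::OutOfHostMemory;
  case kClInvalidValue:
    return UrResult::InvalidValue;
  case kClInvalidQueueProperties:
    return UrResult::InvalidQueueProperties;
  default:
    return UrResult::Unknown;
  }
}
```

Keeping the translation in one place means each additional mapping is a single new `case`, and callers never see raw OpenCL codes.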
  2. 16e28e2
  3. 5c8a86b
  4. Merge pull request oneapi-src#972 from oneapi-src/revert-906-l0_usm_error_checking
    
    Revert "[UR][L0] Propagate errors from `USMAllocationMakeResident`"
    kbenzie authored Oct 20, 2023
    a76e3b1
  5. Merge pull request oneapi-src#973 from Bensuo/ewan/command_buffer_codeowners
    
    Set command-buffer experimental feature codeowners
    kbenzie authored Oct 20, 2023
    614e6d0

Commits on Oct 23, 2023

  1. 9a13afd
  2. [CUDA][EXP] CUDA adapter support for command buffers

    - Implement initial subset of command buffer functions for CUDA
    - Report command buffer extension as supported from device query
    - Some additional methods and refactoring to reuse existing adapter code
    Bensuo committed Oct 23, 2023
    74f42f8
  3. Merge pull request oneapi-src#945 from npmiller/fix-priority

    [CUDA] Fix queue creation with native handle
    kbenzie authored Oct 23, 2023
    3a3aae3
  4. dfc5abe

Commits on Oct 24, 2023

  1. a2623e2
  2. 86f96f0

Commits on Oct 25, 2023

  1. Merge pull request oneapi-src#940 from Naghasan/victor/kernel-fusion-amd

    [UR][HIP] Enable kernel finalization using comgr
    kbenzie authored Oct 25, 2023
    cf26de2
  2. CODEOWNERS for Bindless Images

    Add `unified-runtime-bindless-images-write` team
    as code owners for anything related to images.
    ProGTX committed Oct 25, 2023
    33e2494

Commits on Oct 26, 2023

  1. Merge pull request oneapi-src#995 from oneapi-src/peter/bindless-codeowners
    
    CODEOWNERS for Bindless Images
    kbenzie authored Oct 26, 2023
    20d4f49
  2. Merge pull request oneapi-src#965 from Bensuo/ewan/L0_internal_event_fix

    [Command-buffer][L0] Reset sync-point events
    kbenzie authored Oct 26, 2023
    7c8617b
  3. fdf2cf8
  4. 66763bf
  5. c7f34f3
  6. 0ce1654
  7. c8f5320

Commits on Oct 27, 2023

  1. Merge pull request oneapi-src#908 from szadam/test_output

    Print all CTS outputs (from gtest) in CI.
    pbalcer authored Oct 27, 2023
    2ab0734
  2. [UR][L0] Propagate errors from USMAllocationMakeResident

    This change ensures that USM allocation APIs don't return
    `UR_RESULT_SUCCESS` when an error occurs within
    `USMAllocationMakeResident`.
    
    Signed-off-by: Michael Aziz <michael.aziz@intel.com>
    0x12CC committed Oct 27, 2023
    f2be823
  3. Merge pull request oneapi-src#961 from hdelan/change-unions-to-stdvariant
    
    [HIP][CUDA] Change unions in ur_mem_handle_t_ to stdvariant
    kbenzie authored Oct 27, 2023
    036b9cf
  4. Merge commit '9d82232b9c7e57997325608320e2d89c5fb95187' into merge-some-main-changes-into-adapters

    omarahmed1111 committed Oct 27, 2023
    2f2aebe
  5. [OpenCL] Fix memory leak

    fabiomestre committed Oct 27, 2023
    55409e4
  6. Merge pull request oneapi-src#1005 from omarahmed1111/merge-some-main-changes-into-adapters
    
    Bump adapters branch
    omarahmed1111 authored Oct 27, 2023
    2417fcf

Commits on Oct 30, 2023

  1. df79dad
  2. Use std::get<BufferMem>

    This is required after oneapi-src#961
    EwanC committed Oct 30, 2023
    5fb2f28
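
Once a union has been replaced by `std::variant`, the active member has to be reached through `std::get`, which is what this commit title points at. A minimal sketch, assuming a simplified memory object: only the name `BufferMem` comes from the commit, while `SurfaceMem`, `MemObject`, and `bufferSize` are hypothetical.

```cpp
#include <cstddef>
#include <variant>

// Hypothetical payload types; only the name BufferMem appears in the commit.
struct BufferMem {
  std::size_t size;
};
struct SurfaceMem {
  int width;
  int height;
};

// Simplified memory object holding exactly one of the two payloads.
struct MemObject {
  std::variant<BufferMem, SurfaceMem> mem;

  bool isBuffer() const { return std::holds_alternative<BufferMem>(mem); }
};

// Unlike reading the wrong member of a union, std::get checks the active
// alternative and throws std::bad_variant_access on a mismatch.
std::size_t bufferSize(const MemObject &obj) {
  return std::get<BufferMem>(obj.mem).size;
}
```

Callers that cannot guarantee the active alternative can test `isBuffer()` first, or use `std::get_if`, which returns a null pointer instead of throwing.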
  3. 451017f
  4. 7377fc0
  5. 28ff1cb
  6. d1d4f10
  7. b232631
  8. 60c0b81
  9. Merge pull request oneapi-src#1015 from oneapi-src/revert-1005-merge-some-main-changes-into-adapters
    
    Revert "Bump adapters branch"
    kbenzie authored Oct 30, 2023
    55d432c
  10. b42649f
  11. 8e7f785
  12. Fix error propagation

    Signed-off-by: Michael Aziz <michael.aziz@intel.com>
    0x12CC committed Oct 30, 2023
    f056f97

Commits on Oct 31, 2023

  1. 5ecaf66
  2. [OpenCL] Implement urEnqueueUSMMemcpy2D and allow large fill patterns.

    Normally OpenCL limits fill type operations to a max pattern size of
    128; this patch includes a workaround to extend that.
    aarongreig committed Oct 31, 2023
    bfb3dac
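
The commit message does not say how the workaround is implemented. One possible approach, shown here purely as an assumption, is to expand the pattern into a host staging buffer and then issue an ordinary copy in place of the native fill. `expandPattern` is a hypothetical helper, and the copy to the device is left out.

```cpp
#include <cstddef>
#include <cstring>
#include <vector>

// Build a host staging buffer of `size` bytes by repeating `pattern`.
// Returns an empty buffer if the request is malformed (zero-sized pattern,
// or a size that is not a whole number of patterns).
std::vector<unsigned char> expandPattern(const void *pattern,
                                         std::size_t patternSize,
                                         std::size_t size) {
  if (patternSize == 0 || size % patternSize != 0) {
    return {};
  }
  std::vector<unsigned char> staging(size);
  for (std::size_t offset = 0; offset < size; offset += patternSize) {
    std::memcpy(staging.data() + offset, pattern, patternSize);
  }
  return staging;
}
```

The staging buffer could then be handed to a regular memcpy-style enqueue, which has no pattern-size limit, at the cost of extra host memory and a host-to-device transfer.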
  3. Merge pull request oneapi-src#932 from Bensuo/cuda-cmd-buffers

    [CUDA][EXP] CUDA adapter support for command buffers
    kbenzie authored Oct 31, 2023
    be53fb3
  4. Address feedback

    aarongreig committed Oct 31, 2023
    603dcfb
  5. b205652

Commits on Nov 1, 2023

  1. [OpenCL] Make USM functions return UR_RESULT_ERROR_INVALID_USM_SIZE

    Also ignore flags in no-op urEnqueueUSMPrefetch hint.
    aarongreig committed Nov 1, 2023
    fc34c26
  2. 371e1b8
  3. f2a365c
  4. Fix result checks

    Signed-off-by: Michael Aziz <michael.aziz@intel.com>
    0x12CC committed Nov 1, 2023
    bc7c0f4
  5. [UR][L0] Add support for zeCommandListHostSynchronize

    Instead of creating an event, appending it, and synchronizing on it to
    wait for all commands in an immediate command list, emit a
    zeCommandListHostSynchronize.
    
    Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    Jaime Arteaga committed Nov 1, 2023
    3b26f7b

Commits on Nov 2, 2023

  1. Add TODO for handling other error results

    Signed-off-by: Michael Aziz <michael.aziz@intel.com>
    0x12CC committed Nov 2, 2023
    fe469d7

Commits on Nov 3, 2023

  1. [UR][L0] Add support for urAdapterGetLastError in L0

    Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    Jaime Arteaga committed Nov 3, 2023
    143a2e4
  2. [CUDA][HIP] Move adapter specific testing out of CTS

    This patch moves the `test-adapter-cuda` and `test-adapter-hip` test
    suites out of the `test/conformance` directory. The goal of this move is
    to ensure that all adapters will be tested with the same set and number
    of conformance tests, making conformance pass rates directly comparable
    between adapters. Up to now the CUDA and HIP adapters have included
    additional unit tests of internals which skews pass rates.
    kbenzie committed Nov 3, 2023
    93bdb81
  3. Merge pull request oneapi-src#968 from martygrant/martin/versionCheckSpecializationConstants
    
    [OpenCL] Add version check for urProgramSetSpecializationConstants
    kbenzie authored Nov 3, 2023
    ba994bb
  4. [UR][CUDA] Fix compatibility with CUDA 11.x

    The code introduced in 74f42f8 uses the
    signature of cuGraphInstantiate from CUDA 12.x. In CUDA 11.x, this
    function has different parameters.
    al42and committed Nov 3, 2023
    3fd39f3
  5. 7ca370d
  6. 807434f
  7. Merge pull request oneapi-src#1021 from kbenzie/benie/test-adapters

    [CUDA][HIP] Move adapter specific testing out of CTS
    kbenzie authored Nov 3, 2023
    7f56cec
  8. 56f6994
  9. Resolved conflicts

    konradkusiak97 committed Nov 3, 2023
    c55dc2a
  10. c86b841
  11. 17cb185
  12. d5a4691
  13. Fail codegen example configuration if no dependcies

    We would rather fail CMake configuration than silently move on and
    only find out about the missing example after the build. The codegen example
    is only built "on demand", when UR_BUILD_EXAMPLE_CODEGEN is set.
    lukaszstolarczuk committed Nov 3, 2023
    bdcb662

Commits on Nov 6, 2023

  1. ff7a4d8
  2. Merge pull request oneapi-src#1036 from pbalcer/mako-map-file

    automatically generate linker version script for adapter libraries
    pbalcer authored Nov 6, 2023
    a62423d
  3. Merge pull request oneapi-src#931 from konradkusiak97/konradkusiak/LocalMemSizeQuery
    
    [UR] [CUDA] Changed the output of querying localMemSize
    kbenzie authored Nov 6, 2023
    0e24ab8
  4. [OpenCL] Fix enum passed for urKernelSetExecInfo's USM_PTRS property

    Also return RESULT_SUCCESS for no-op UR_KERNEL_EXEC_INFO_CACHE_CONFIG
    hint.
    aarongreig committed Nov 6, 2023
    c5fbda0
  5. 8125902
  6. 74ad837
  7. 5ad3f0a
  8. 4b1dd79
  9. 42d00cd
  10. 690fef3
  11. 5d88fc7
  12. [OpenCL] Return INVALID_SIZE from GetInfo entry points.

    Also includes a few other GetInfo related fixes:
    * Add missing device info queries
    * Add mapping of CL command type to UR command type
    * Correct mapping of UR_QUEUE_INFO_FLAGS
    * Add mapping of cl_command_queue_properties to ur_queue_flags_t
    * Add mapping of cl_unified_shared_memory_type_intel to ur_usm_type_t
    * Add UNSUPPORTED_ENUMERATION path to KernelGetGroupInfo tests.
    
    And a fix related to one of the fixed queries:
    * Populate pfnReadHostPipe and pfnWriteHostPipe ddi table entries.
    aarongreig committed Nov 6, 2023
    6a3c63d
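
The size check behind an "invalid size" result from a GetInfo-style entry point can be sketched as below. This is an illustrative pattern under stated assumptions, not the adapter's implementation: `InfoResult` and `getInfo` are hypothetical, and the parameter names merely follow the usual `propSize` / `pPropValue` / `pPropSizeRet` convention.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical result type standing in for ur_result_t.
enum class InfoResult { Success, InvalidSize };

// Copy a property value of `valueSize` bytes into the caller's buffer.
// - If pPropValue is provided but propSize is too small, report InvalidSize
//   and leave the buffer untouched.
// - If pPropSizeRet is provided, always report the required size on success.
InfoResult getInfo(const void *value, std::size_t valueSize,
                   std::size_t propSize, void *pPropValue,
                   std::size_t *pPropSizeRet) {
  if (pPropValue != nullptr) {
    if (propSize < valueSize) {
      return InfoResult::InvalidSize;
    }
    std::memcpy(pPropValue, value, valueSize);
  }
  if (pPropSizeRet != nullptr) {
    *pPropSizeRet = valueSize;
  }
  return InfoResult::Success;
}
```

A caller would typically query the size first with a null `pPropValue`, allocate that many bytes, and then call again to fetch the value.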
  13. 3029501
  14. Merge pull request oneapi-src#1041 from lukaszstolarczuk/fix-cmake-conda

    Fail codegen example configuration if no dependcies
    pbalcer authored Nov 6, 2023
    7db941d
  15. 39eec0c
  16. [CI] Clean up after conda init in cmake.yml

    When initializing conda, a few extra commands are added to the user's profile.
    These commands make it impossible to log in to the machine as the test user,
    since conda may be missing.
    We can safely remove the whole '.profile' after the job.
    lukaszstolarczuk committed Nov 6, 2023
    6c832ba

Commits on Nov 7, 2023

  1. Merge pull request oneapi-src#1011 from fabiomestre/fabio/fix_opencl_…

    …leak
    
    [OpenCL] Fix memory leak
    kbenzie authored Nov 7, 2023
    Configuration menu
    Copy the full SHA
    612a263 View commit details
    Browse the repository at this point in the history
  2. [CTS] Add UR_SYCL_LIBRARY_DIR CMake variable

    The CTS is already dependent on the DPC++ compiler to generate program
    inputs for the program, kernel, and enqueue test suites specified via
    the `UR_DPCXX` CMake variable. If the DPC++ compiler is not installed on
    the system the executables it outputs will likely fail to find the SYCL
    runtime library when executed, breaking the generation of CTS program
    inputs. The patch introduces the `UR_SYCL_LIBRARY_DIR` CMake variable
    enabling the user to specify the path to the SYCL runtime library for
    use when generating CTS program inputs.
    kbenzie committed Nov 7, 2023
    Configuration menu
    Copy the full SHA
    b01beb7 View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#1026 from kbenzie/benie/test-conformanc…

    …e-sycl-library-dir
    
    [CTS] Add UR_SYCL_LIBRARY_DIR CMake variable
    kbenzie authored Nov 7, 2023
    Configuration menu
    Copy the full SHA
    d70bae4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5fb8292 View commit details
    Browse the repository at this point in the history

Commits on Nov 8, 2023

  1. Configuration menu
    Copy the full SHA
    7ba8fec View commit details
    Browse the repository at this point in the history
  2. [OpenCL] Add bounds checking to the Enqueue memory operations.

    This allows us to return UR_ERROR_INVALID_SIZE when we should. Extra
    checks are only performed on a non-success error code.
    
    Also adds a missing bounds check to urMemBufferPartition
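
    The "check only after a failure" idea above can be sketched as follows.
    This is a minimal illustration, not the adapter's code: `Result`,
    `checkBounds`, and `mapNativeError` are hypothetical names, and the real
    adapter works with `ur_result_t` values and OpenCL error codes.

    ```cpp
    #include <cstddef>

    // Hypothetical result codes standing in for the adapter's real ones.
    enum class Result { Success, InvalidSize, OtherError };

    // Reports InvalidSize when [offset, offset + size) does not fit inside a
    // buffer of bufferSize bytes. The comparison is arranged so that
    // offset + size is never computed and cannot overflow.
    inline Result checkBounds(std::size_t bufferSize, std::size_t offset,
                              std::size_t size) {
      if (offset > bufferSize || size > bufferSize - offset)
        return Result::InvalidSize;
      return Result::Success;
    }

    // The bounds check only runs when the native call has already failed, so
    // the successful path pays no extra cost.
    inline Result mapNativeError(bool nativeCallSucceeded, std::size_t bufferSize,
                                 std::size_t offset, std::size_t size) {
      if (nativeCallSucceeded)
        return Result::Success;
      if (checkBounds(bufferSize, offset, size) == Result::InvalidSize)
        return Result::InvalidSize;
      return Result::OtherError;
    }
    ```

    Deferring the check this way keeps the common path unchanged and only
    refines the error code once something has already gone wrong.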
    aarongreig committed Nov 8, 2023
    Configuration menu
    Copy the full SHA
    f65473d View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#1033 from jandres742/getlasterror

    [UR][L0] Add support for urAdapterGetLastError in L0
    kbenzie authored Nov 8, 2023
    Configuration menu
    Copy the full SHA
    ba27855 View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#1028 from jandres742/listhostsync

    [UR][L0] Add support for zeCommandListHostSynchronize
    kbenzie authored Nov 8, 2023
    Configuration menu
    Copy the full SHA
    62e6d2f View commit details
    Browse the repository at this point in the history
  5. Merge pull request oneapi-src#1022 from 0x12CC/l0_usm_error_checking_2

    [UR][L0] Propagate OOM errors from `USMAllocationMakeResident`
    kbenzie authored Nov 8, 2023
    Configuration menu
    Copy the full SHA
    ec7982b View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    63dfb35 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    877fc5b View commit details
    Browse the repository at this point in the history
  8. Merge pull request oneapi-src#1045 from kswiecicki/windows-min-max-fix

    [SYCL][UR][L0] Distinguish min/max calls from macros
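
    One common way to keep `std::min`/`std::max` calls from being rewritten
    by the function-like `min`/`max` macros that `<windows.h>` defines is to
    parenthesize the function name. Whether this PR uses exactly that
    technique is an assumption; the sketch below simulates the macros locally
    and `clampToRange` is a hypothetical helper.

    ```cpp
    #include <algorithm>

    // Simulate the function-like macros that <windows.h> defines when
    // NOMINMAX is not set.
    #define min(a, b) (((a) < (b)) ? (a) : (b))
    #define max(a, b) (((a) > (b)) ? (a) : (b))

    // A function-like macro only expands when its name is directly followed
    // by '('. In (std::min)(x, y) the name is followed by ')', so the
    // standard library function is called instead of the macro.
    inline int clampToRange(int value, int low, int high) {
      return (std::max)(low, (std::min)(value, high));
    }

    // Remove the simulated macros so they do not leak into later code.
    #undef min
    #undef max
    ```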
    igchor authored Nov 8, 2023
    Configuration menu
    Copy the full SHA
    2f44433 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2023

  1. [SYCL][NATIVECPU] Refactor from PI plugin to UR adapter (#10523)

    This PR refactors the current SYCL Native CPU Implementation as a
    Unified Runtime adapter.
    No new functionalities are added.
    
    Contents of this PR:
    - Native CPU implementation divided in multiple source files at
    `sycl/plugins/unified_runtime/ur/adapters/native_cpu`
    - Native CPU PI plugin is now implemented through the `pi2ur.hpp`
    interface
    - _pi_* classes now have an equivalent ur_* class
    - `UR_PLATFORM_BACKEND_NATIVE_CPU` value added to
    `ur2piPlatformInfoValue`
    Alcpz authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    58b72b8 View commit details
    Browse the repository at this point in the history
  2. [SYCL][UR] Bump UR and implement adapter handles (#10349)

    Bump the Unified Runtime commit, and make adapter changes needed for the
    newly added adapter handles (see
    oneapi-src#715 for details)
    
    This fixes #10066 by providing an implementation of
    `piPluginGetLastError` in pi2ur.
    callumfare authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    7f2f12e View commit details
    Browse the repository at this point in the history
  3. [SYCL][NATIVECPU] Fixed missing changes from latest UR version (#10658)

    This PR fixes some errors while building the Native CPU plugin:
    
    - Updated urPlatformGet signature
    - Added missing function urAdapterGetLastError
    Alcpz authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    5e2e218 View commit details
    Browse the repository at this point in the history
  4. [SYCL][NATIVECPU] Support multiple SYCL targets in the same compiler …

    …invocation (#10495)
    
    This PR adds support for multiple SYCL targets alongside `native_cpu` in
    the same compiler invocation (e.g. `clang++ -fsycl
    -fsycl-targets=native_cpu,spir64 input.cpp`). In order to implement this
    we had to make changes to multiple components, here is a quick overview:
    * Driver: changes in the Driver allow it to correctly parse all the targets
    passed to `-fsycl-targets` (before we were just looking for
    `native_cpu`, ignoring the others). The Driver now also calls
    `sycl-post-link` and `clang-offload-wrapper`, performing a compilation
    flow more similar to the one used for other targets.
    * Sema: since the kernel name needs to be the same for all the SYCL
    targets, the change to the kernel name in Sema has been removed, and
    replaced with an LLVM Pass that gets run when lowering the device module
    (`llvm/lib/SYCLLowerIR/RenameKernelSYCLNativeCPU.cpp`).
    * Runtime: The definition for `_pi_program` in the Native CPU Plug-In
    now supports multiple kernels in one program, and the
    `__SYCL_PI_DEVICE_BINARY_TARGET_NATIVE_CPU` binary type has been added
    in order to identify kernels compiled for Native CPU.
    * clang-offload-wrapper: for Native CPU, the offload-wrapper doesn't
    bundle the device code in the host module, but instead produces an array
    containing function declarations that are resolved by the linker, see
    `sycl/doc/design/SYCLNativeCPU.md` for more information.
    
    ---------
    
    Co-authored-by: Uwe Dolinsky <uwe@codeplay.com>
    Co-authored-by: Alexey Sachkov <alexey.sachkov@intel.com>
    Co-authored-by: Steffen Larsen <steffen.larsen@intel.com>
    4 people authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    28ec448 View commit details
    Browse the repository at this point in the history
  5. [SYCL][NATIVECPU] Initial support for memory copy operations (#10808)

    This PR adds some initial support for memory copy operations to the
    NativeCPU device.
    It also fixes global offsets and the query for queue properties in the
    NativeCPU PI/UR adapter.
    uwedolinsky authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    382b27d View commit details
    Browse the repository at this point in the history
  6. [SYCL][NATIVECPU] UR adapter updates (#10857)

    Removes some unneeded UR_ASSERTS
    Added initial GlobalTimeStamp implementation for `sycl-ls`.
    uwedolinsky authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    82ed146 View commit details
    Browse the repository at this point in the history
  7. [SYCL][NFC] Remove executable permission from text files (#10916)

    Processed sycl, libclc and libdevice directories.
    bader authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    78fc5e7 View commit details
    Browse the repository at this point in the history
  8. [UR][CUDA][HIP][L0] Cleanup licence header (#10824)

    Fix the license headers at the top of each source file in the unified
    runtime directory.
    
    ---------
    
    Co-authored-by: Alexey Bader <alexey.bader@intel.com>
    2 people authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    0d302ab View commit details
    Browse the repository at this point in the history
  9. [SYCL][NATIVECPU] more ur adapter implementations (#10972)

    Adding more UR adapter functionality to pass more tests related to
    buffers and subbuffers, and to enable `sycl-ls --verbose`.
    uwedolinsky authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    c077125 View commit details
    Browse the repository at this point in the history
  10. [SYCL][NATIVECPU] Added remaining UR interface functions, PI exports …

    …and ABI Check (#10883)
    
    This PR adds the remaining (not fully implemented) UR interface
    functions to the NativeCPU/UR adapter.
    The NativeCPU PI plugin now also exports all PI interface functions and
    an ABI check was added.
    uwedolinsky authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    27bbfcf View commit details
    Browse the repository at this point in the history
  11. [SYCL][NATIVECPU][LIBCLC] Use libclc for SYCL Native CPU (#10970)

    This PR allows linking to libclc when compiling for SYCL Native CPU.
    Currently only the `x86_64-unknown-linux-gnu` target triple is
    supported, additional target triples (and possibly a more versatile way
    of setting them) will come with follow up PRs.
    Some useful information for reviewing:
    * We start using an `AddrSpaceMap` (set in `TargetInfo.cpp`) because the
    mangled names emitted by the device compiler need to match with the
    names provided by `libclc`. The AddressSpaceMap is taken from the `PTX`
    Target.
    * Changes in `Driver` are needed to find and link to `libclc`.
    * `libclc/ptx-nvidiacl/libspirv/atomic/loadstore_helpers.ll` has been
    split into 4 modules, one for each memory ordering constraint. Copies of
    these modules have been added in `generic` (because some functions in
    `generic/libspirv/atomic` needed them), and the module split allows to
    specialize the file for targets that may not support some orderings.
    Currently only a couple of functions for `acquire` and `seq_cst` have
    been implemented for `generic`, but the others will be implemented in a
    follow up PR.
    * We've added a target in `libclc` for `x86_64-unknown-linux`. This has
    been done because some math builtins in `generic` have been defined as
    ```
    typedef char vec __attribute__((ext_vector_type(8)));
    __attribute__((overloadable)) vec __clc_native_popcount(vec x) __asm("llvm.ctpop" ".v16i" "8");
    
    vec call(vec x) {
      return __clc_native_popcount(x);
    }
    ```
    While this approach conveniently allows calling LLVM intrinsics
    directly, it does not seem to play well with the ABI for
    `x86_64-unknown-linux`, since it leads to this IR:
    ```
    define dso_local double @call(double noundef %x.coerce) #0 {
    entry:
      %0 = bitcast double %x.coerce to <8 x i8>
      %1 = bitcast <8 x i8> %0 to double
      %call = call double @llvm.ctpop.v8i8(double noundef %1) #8
      %2 = bitcast double %call to <8 x i8>
      %3 = bitcast <8 x i8> %2 to double
      ret double %3
    }
    ```
    Which is invalid because `llvm.ctpop.v8i8` expects a vector of `i8` and
    not a `double`, leading to failing asserts in the compiler that
    prevented `libclc` from building.
    
    As a temporary workaround we have added empty files that override the
    files in `generic` when building for `x86_64-unknown-linux`, allowing
    the build to complete, even though the corresponding builtins will be
    missing from the library. We are working on a proper solution for this.
    
    ---------
    
    Co-authored-by: Uwe Dolinsky <uwe@codeplay.com>
    2 people authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    e1fbecc View commit details
    Browse the repository at this point in the history
  12. [SYCL] [NATIVECPU] Implement urKernelSetArgLocal (#11101)

    This PR adds support for `local_accessors` by implementing
    `urKernelSetArgLocal`.
    PietroGhg authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    6009642 View commit details
    Browse the repository at this point in the history
  13. [PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)

    Fetch the latest revision of unified runtime:
    
    Notable changes
    * Some command buffer entry-points have been renamed, also 2 additions
    of membufferFill and USMFill
    * urInit/urTearDown have been removed and replaced with loader-only
    versions.
    veselypeta authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    91a74ca View commit details
    Browse the repository at this point in the history
  14. [SYCL] Add esimd as an optional device feature (#10017)

    This change adds a new aspect for esimd, `ext_intel_esimd`, and
    annotates the two fundamental esimd classes, `simd_obj_impl` and
    `simd_view_impl` with the `uses_aspect` attribute.
    
    `simd_obj_impl` is the base class of `simd` and `simd_mask` which are
    the fundamental user-facing classes.
    `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not
    a base of `simd_view_impl`, but every `simd_view_impl` requires a
    reference to a `simd` or `simd_mask` at construction time, so I am not
    sure if we truly need to annotate `simd_view_impl`, but I added it to be
    safe.
    
    It also adds a new PI device info query,
    `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT` that is used to query at
    runtime if a device supports ESIMD. For UR-based plugins, we map that to
    `UR_DEVICE_INFO_ESIMD_SUPPORT`.
    
    The implementation simply returns false for cuda, hip and native_cpu.
    For l0 and opencl, we check that the device is an Intel GPU by verifying
    that the device type is GPU and the vendor ID is `0x8086`.
    For ESIMD emulator we simply return true.
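
    The l0/opencl check described above amounts to a two-part predicate. A
    minimal sketch, assuming a hypothetical `DeviceType` enum and
    `supportsEsimd` helper (the real adapters query these values through
    their native device-info APIs):

    ```cpp
    #include <cstdint>

    // Hypothetical device classification; the adapters use their own enums.
    enum class DeviceType { CPU, GPU, Accelerator };

    // PCI vendor ID assigned to Intel, as quoted in the commit message.
    constexpr std::uint32_t kIntelVendorId = 0x8086;

    // ESIMD is reported as supported only for Intel GPUs.
    inline bool supportsEsimd(DeviceType type, std::uint32_t vendorId) {
      return type == DeviceType::GPU && vendorId == kIntelVendorId;
    }
    ```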
    
    I would appreciate careful review on the plugin changes in particular,
    as I am not an expert.
    
    This change also updates the esimd spec to document the new aspect.
    
    In a future change, I plan to use the new aspect to remove the
    requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros
    that set function attributes, but I am not doing that as part of this PR
    as it requires more investigation.
    
    ---------
    
    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    b866b01 View commit details
    Browse the repository at this point in the history
  15. Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)

    Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for
    the past few days.
    veselypeta authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    833e4e3 View commit details
    Browse the repository at this point in the history
  16. [SYCL] [NATIVECPU] Remove dependencies to sycl in UR adapter (#11685)

    This PR removes dependencies on the sycl headers/library in the Native
    CPU UR adapter.
    `sycl/include/sycl/detail/native_cpu.hpp` has been moved to
    `sycl/plugins/unified_runtime/ur/adapters/native_cpu/nativecpu_state.hpp`,
    and the definitions of work item builtins have been moved from that
    header to the compiler, which now emits them in the
    `PrepareSYCLNativeCPUPass`.
    PietroGhg authored and martygrant committed Nov 9, 2023
    Configuration menu
    Copy the full SHA
    0868c6e View commit details
    Browse the repository at this point in the history

Commits on Nov 10, 2023

  1. Merge pull request oneapi-src#1044 from aarongreig/aaron/clCTSFixMega…

    …Branch
    
    [OpenCL] Combined CTS fixes
    kbenzie authored Nov 10, 2023
    Configuration menu
    Copy the full SHA
    192e940 View commit details
    Browse the repository at this point in the history
  2. [HIP] Define all UR entry points

    veselypeta authored and kbenzie committed Nov 10, 2023
    Configuration menu
    Copy the full SHA
    7928a9b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    858938d View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    d4e45e1 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    e0e55b9 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    dd1e816 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    0c7b74e View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    716cd8f View commit details
    Browse the repository at this point in the history
  9. [HIP] Update CTS match files

    kbenzie committed Nov 10, 2023
    Configuration menu
    Copy the full SHA
    08ca4be View commit details
    Browse the repository at this point in the history
  10. Merge commit 'b12b606c35bb7469c1dfdf932e2917238635f02d' into merge-so…

    …me-main-changes-into-adapters-second-patch
    omarahmed1111 committed Nov 10, 2023
    Configuration menu
    Copy the full SHA
    7c7bf22 View commit details
    Browse the repository at this point in the history
  11. Fix some merge problems

    omarahmed1111 committed Nov 10, 2023
    Configuration menu
    Copy the full SHA
    07ad4f8 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    02baece View commit details
    Browse the repository at this point in the history
  13. Configuration menu
    Copy the full SHA
    65bfeab View commit details
    Browse the repository at this point in the history
  14. Add more merge fixes

    omarahmed1111 committed Nov 10, 2023
    Configuration menu
    Copy the full SHA
    863d230 View commit details
    Browse the repository at this point in the history

Commits on Nov 12, 2023

  1. [UR][L0] Add UR_L0_LEAKS_DEBUG key

    Use a new environment variable, UR_L0_LEAKS_DEBUG, to check
    for leaks in the UR L0 adapter, instead of relying on a specific
    value being set in UR_L0_DEBUG.
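
    A dedicated on/off environment variable of this kind can be read with a
    small helper. This is only a sketch: `envFlagEnabled` is a hypothetical
    name, and treating any non-zero integer as "enabled" is an assumption
    about how the value is parsed, not something the commit message states.

    ```cpp
    #include <cstdlib>

    // Returns true when the named environment variable is set and parses to
    // a non-zero integer. An unset, non-numeric, or "0" value counts as
    // disabled.
    inline bool envFlagEnabled(const char *name) {
      const char *value = std::getenv(name);
      if (value == nullptr)
        return false;
      return std::atoi(value) != 0;
    }
    ```

    A caller would then write something like
    `envFlagEnabled("UR_L0_LEAKS_DEBUG")` instead of testing a bit in a
    shared debug mask.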
    
    Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    Jaime Arteaga committed Nov 12, 2023
    Configuration menu
    Copy the full SHA
    3da2133 View commit details
    Browse the repository at this point in the history

Commits on Nov 13, 2023

  1. Configuration menu
    Copy the full SHA
    b22a0ae View commit details
    Browse the repository at this point in the history

Commits on Nov 14, 2023

  1. Configuration menu
    Copy the full SHA
    46cf37f View commit details
    Browse the repository at this point in the history
  2. [CUDA] Update hint functions to only return warnings

    - The UR spec was recently changed to guarantee that hint entry points
    never return errors. This commit changes the CUDA adapter to be
    conformant with this change.
    - This commit also changes the type of PointerRangeSize which
     was causing a stack corruption.
    fabiomestre committed Nov 14, 2023
    Configuration menu
    Copy the full SHA
    8945db4 View commit details
    Browse the repository at this point in the history
  3. [CUDA] Add support for binary type query

    CUDA does not make a distinction between binaryTypes (it treats PTX and binaries using the same entrypoints).
    
    However, for UR, by definition:
    
    * urProgramCompile should set the binary type to UR_PROGRAM_BINARY_TYPE_COMPILED_OBJECT
    * urProgramBuild / urProgramLink should set it to UR_PROGRAM_BINARY_TYPE_EXECUTABLE.
    * urProgramCreateWithBinary should set the binary type to UR_PROGRAM_BINARY_TYPE_COMPILED_OBJECT
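
    Since CUDA itself does not track this, the adapter has to record the
    binary type on its own program object. A minimal sketch of that
    bookkeeping, following the three rules listed above; `BinaryType`,
    `Program`, and the `on...` functions are hypothetical stand-ins for the
    adapter's real types and entry points:

    ```cpp
    // Hypothetical mirror of the UR binary type values named above.
    enum class BinaryType { None, CompiledObject, Executable };

    // Hypothetical program object that remembers how it was last produced.
    struct Program {
      BinaryType binaryType = BinaryType::None;
    };

    // Each entry point records the type that the list above assigns to it.
    inline void onCreateWithBinary(Program &p) {
      p.binaryType = BinaryType::CompiledObject;
    }
    inline void onCompile(Program &p) {
      p.binaryType = BinaryType::CompiledObject;
    }
    inline void onBuild(Program &p) { p.binaryType = BinaryType::Executable; }
    inline void onLink(Program &p) { p.binaryType = BinaryType::Executable; }
    ```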
    fabiomestre committed Nov 14, 2023
    Configuration menu
    Copy the full SHA
    7ea615a View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    d164792 View commit details
    Browse the repository at this point in the history
  5. [NATIVECPU] Add missing DDI table entries for BindlessImagesExp, Comm…

    …andBufferExp, PhysicalMem, USMExp, VirtualMem.
    martygrant committed Nov 14, 2023
    Configuration menu
    Copy the full SHA
    f94550b View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    a42adf2 View commit details
    Browse the repository at this point in the history
  7. [OpenCL] Address coverity issue with dodgy struct-to-array casts

    Also fix a similar issue coverity had with CreateProgramWithBinary which
    was passing array pointers straight through to the cl entry point by
    putting these params in local const arrays.
    aarongreig authored and callumfare committed Nov 14, 2023
    Configuration menu
    Copy the full SHA
    4cf02cb View commit details
    Browse the repository at this point in the history
  8. Merge pull request oneapi-src#1059 from martygrant/martin/moveNativeC…

    …PUAdapterToUR
    
    [NATIVECPU] Move Native CPU adapter to UR.
    kbenzie authored Nov 14, 2023
    Configuration menu
    Copy the full SHA
    659d3f4 View commit details
    Browse the repository at this point in the history
  9. Add more merge fixes

    omarahmed1111 committed Nov 14, 2023
    Configuration menu
    Copy the full SHA
    d86aada View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    dad6534 View commit details
    Browse the repository at this point in the history

Commits on Nov 15, 2023

  1. Merge pull request oneapi-src#979 from lukaszstolarczuk/fix-conda-wor…

    …kflow
    
    [CI] Clean up after conda init in cmake.yml
    lukaszstolarczuk authored Nov 15, 2023
    Configuration menu
    Copy the full SHA
    9fc8230 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#1077 from fabiomestre/fabio/combines_fi…

    …xes_cuda_hip
    
    [CUDA][HIP] Combined CTS Fixes
    kbenzie authored Nov 15, 2023
    Configuration menu
    Copy the full SHA
    534071e View commit details
    Browse the repository at this point in the history
  3. Merge branch 'merge-some-main-changes-into-adapters-second-patch' int…

    …o merge-some-main-changes-into-adapters-third-patch
    omarahmed1111 committed Nov 15, 2023
    Configuration menu
    Copy the full SHA
    4a9468f View commit details
    Browse the repository at this point in the history

Commits on Nov 16, 2023

  1. Configuration menu
    Copy the full SHA
    461d7b7 View commit details
    Browse the repository at this point in the history
  2. Ifdef HIP 5.6 entry point

    hdelan committed Nov 16, 2023
    Configuration menu
    Copy the full SHA
    2652a77 View commit details
    Browse the repository at this point in the history
  3. [UR][L0] Add support for passing device list to urProgramBuild

    piProgramBuild receives a list of devices, while urProgramBuild
    does not. This produces a series of issues when a UR program
    needs to be created for a specific device.
    
    So define a new API, called urProgramBuildExp to pass this
    list.
    
    Authored-by: jaime.a.arteaga.molina@intel.com
    igchor authored and kbenzie committed Nov 16, 2023
    Configuration menu
    Copy the full SHA
    c583ecc View commit details
    Browse the repository at this point in the history
  4. [UR][L0] Add multi-device-compile experimental feature

    Expand upon the introduction of `urProgramBuildExp` and include
    `urProgramCompileExp` and `urProgramLinkExp` which include a device-list
    in place of a context. These more closely align with the PI/OpenCL
    analogues but only to introduce device-lists, not all extant arguments
    from those entry-points. This patch also moves the `urProgramBuildExp`
    definition into an experimental feature file and introduces a brief
    document containing motivation.
    kbenzie committed Nov 16, 2023
    Configuration menu
    Copy the full SHA
    92e608f View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    1bb0543 View commit details
    Browse the repository at this point in the history
  6. [UR][L0] Add support for passing device list to urProgramBuild/Link/C…

    …ompile
    
    piProgramBuild receives a list of devices, while urProgramBuild does
    not. This produces a series of issues when a UR program needs to be
    created for a specific device.
    
    So define a new API, called urProgramBuildExp to pass this list.
    
    Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
    Co-authored-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    2 people authored and kbenzie committed Nov 16, 2023
    Configuration menu
    Copy the full SHA
    a42a607 View commit details
    Browse the repository at this point in the history
  7. [L0] Hook up multi-device compile entry points

    Add the `urProgramBuildExp`, `urProgramCompileExp`, and
    `urProgramLinkExp` to the loader `ur_program_exp_dditable_t`. Also add
    `"ur_exp_multi_device_compile"` to the list of extensions supported by
    the L0 adapter, which enables the SYCL RT to query support.
    kbenzie committed Nov 16, 2023
    Configuration menu
    Copy the full SHA
    26b9829 View commit details
    Browse the repository at this point in the history
  8. [L0] Fix urProgramLinkExp argument order

    Align the `urProgramLinkExp` spec and implementation argument orders to
    fix Windows link error.
    kbenzie committed Nov 16, 2023
    Configuration menu
    Copy the full SHA
    4b5e559 View commit details
    Browse the repository at this point in the history

Commits on Nov 17, 2023

  1. Change to HIP_VERSION macro

    hdelan committed Nov 17, 2023
    Configuration menu
    Copy the full SHA
    686b3d7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    21e7c46 View commit details
    Browse the repository at this point in the history
  3. Fix formatting.

    martygrant committed Nov 17, 2023
    Configuration menu
    Copy the full SHA
    2300531 View commit details
    Browse the repository at this point in the history
  4. Fix urProgramCompileExp, urProgramBuildExp, and urProgramLinkExp defi…

    …nition to match spec
    
    Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
    nrspruit committed Nov 17, 2023
    Configuration menu
    Copy the full SHA
    0790bf8 View commit details
    Browse the repository at this point in the history

Commits on Nov 20, 2023

  1. Merge pull request oneapi-src#1048 from callumfare/opencl_fix_urMemBu…

    …fferCreate_leak
    
    [OpenCL] Fix memory leak and coverity issue with struct-to-array casts
    kbenzie authored Nov 20, 2023
    Configuration menu
    Copy the full SHA
    04799e7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    5ef979b View commit details
    Browse the repository at this point in the history
  3. use common helper

    PietroGhg committed Nov 20, 2023
    Configuration menu
    Copy the full SHA
    ca374bc View commit details
    Browse the repository at this point in the history
  4. Remove mutex

    PietroGhg committed Nov 20, 2023
    Configuration menu
    Copy the full SHA
    bffa67d View commit details
    Browse the repository at this point in the history

Commits on Nov 21, 2023

  1. Formatting

    PietroGhg committed Nov 21, 2023
    Configuration menu
    Copy the full SHA
    413de31 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#934 from nrspruit/fixprogrambuild_updated

    [UR][L0] Add support for passing device list to urProgramBuild/Link/Compile
    kbenzie authored Nov 21, 2023
    Configuration menu
    Copy the full SHA
    ce152a6 View commit details
    Browse the repository at this point in the history
  3. [CUDA][HIP][OpenCL][NATIVECPU] Fix multi-device compile

    Ensure that all adapters have the correct signatures for the
    multi-device compile experimental feature entry points and that the
    entry points exist even when returning
    `UR_RESULT_ERROR_UNSUPPORTED_FEATURE`.
    kbenzie committed Nov 21, 2023
    Configuration menu
    Copy the full SHA
    e001b98 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    841a287 View commit details
    Browse the repository at this point in the history
  5. Add missing symbols

    hdelan committed Nov 21, 2023
    Configuration menu
    Copy the full SHA
    e2f9eb0 View commit details
    Browse the repository at this point in the history
  6. Make return UR_RESULT_SUCCESS

    hdelan committed Nov 21, 2023
    Configuration menu
    Copy the full SHA
    e413c6c View commit details
    Browse the repository at this point in the history
  7. Don't reinstate old context

    hdelan committed Nov 21, 2023
    Configuration menu
    Copy the full SHA
    2b77f79 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    18d333f View commit details
    Browse the repository at this point in the history
  9. Merge pull request oneapi-src#1100 from kbenzie/benie/fixup-multi-dev…

    …ice-compile
    
    [CUDA][HIP][OpenCL][NATIVECPU] Fix multi-device compile
    kbenzie authored Nov 21, 2023
    Configuration menu
    Copy the full SHA
    71957e8 View commit details
    Browse the repository at this point in the history

Commits on Nov 22, 2023

  1. Configuration menu
    Copy the full SHA
    82d4cef View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9c8e5c4 View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#1053 from jandres742/url0leakkey

    [UR][L0] Add UR_L0_LEAKS_DEBUG key
    kbenzie authored Nov 22, 2023
    Configuration menu
    Copy the full SHA
    31b654f View commit details
    Browse the repository at this point in the history
  4. atomic incr/decr-compare

    PietroGhg committed Nov 22, 2023
    Configuration menu
    Copy the full SHA
    b7731bc View commit details
    Browse the repository at this point in the history
  5. Merge pull request oneapi-src#966 from martygrant/martin/openclComman…

    …dBuffers
    
    [OpenCL] Add Command Buffer extension to OpenCL adapter.
    kbenzie authored Nov 22, 2023
    Configuration menu
    Copy the full SHA
    109ed46 View commit details
    Browse the repository at this point in the history

Commits on Nov 23, 2023

  1. d0319b2

Commits on Nov 24, 2023

  1. update template file

    AllanZyne committed Nov 24, 2023
    1b9fdc6
  2. 8262de6
  3. Merge pull request oneapi-src#936 from hdelan/revert-hip-prefetch

    [HIP] Revert add prefetch for USM hip allocations a6b8fa66b537753415d24076f…
    kbenzie authored Nov 24, 2023
    28cf40f
  4. Set version to v0.8.0

    kbenzie committed Nov 24, 2023
    712d791
  5. 7074cf9
  6. Merge pull request oneapi-src#1122 from kbenzie/benie/cmake-bump-v0.9.0

    [CMake] Bump version to v0.9.0
    kbenzie authored Nov 24, 2023
    1cc70f2
  7. Merge pull request oneapi-src#1037 from al42and/fix-cuda-11-8-compat

    [UR][CUDA] Fix compatibility with CUDA 11.x
    kbenzie authored Nov 24, 2023
    e197941
  8. Merge pull request oneapi-src#1097 from PietroGhg/pietro/refcounting

    [NATIVECPU] Implement correct reference counting for Native CPU adapter
    kbenzie authored Nov 24, 2023
    b4e665f
  9. Merge pull request oneapi-src#1106 from wenju-he/fix-native_cpu-build-warnings
    
    [NATIVECPU] Fix build errors when -Werror=on
    kbenzie authored Nov 24, 2023
    433b952
  10. Merge pull request oneapi-src#1039 from omarahmed1111/eliminate-usage-of-regex-in-opencl
    
    [UR][OPENCL] Eliminate usage of regex in opencl
    kbenzie authored Nov 24, 2023
    0250c64
  11. [OpenCL] Fetch headers/icd-loader by default

    Introduces the following CMake options:
    
    * `UR_OPENCL_INCLUDE_DIR` - directory containing the OpenCL Headers
    * `UR_OPENCL_ICD_LOADER_LIBRARY` - path of the OpenCL ICD Loader library
    
    In the event that `UR_OPENCL_INCLUDE_DIR` is not specified, clone
    [KhronosGroup/OpenCL-Headers](https://github.com/KhronosGroup/OpenCL-Headers.git).
    
    In the event that `UR_OPENCL_ICD_LOADER_LIBRARY` is not specified, first
    inspect the system with `find_package(OpenCL 3.0)` and use that if
    found. Otherwise, clone
    [KhronosGroup/OpenCL-ICD-Loader](https://github.com/KhronosGroup/OpenCL-ICD-Loader.git).
    kbenzie committed Nov 24, 2023
    9f4b0a6
  12. [CI] Add support for OpenCL CI on CPU

    fabiomestre authored and kbenzie committed Nov 24, 2023
    8a05674
  13. Fix windows build

    fabiomestre authored and kbenzie committed Nov 24, 2023
    0c4585f
  14. Fix formatting

    fabiomestre authored and kbenzie committed Nov 24, 2023
    6502de1
  15. Merge pull request oneapi-src#993 from fabiomestre/fabio/opencl_ci

    [CI] Add support for OpenCL CI on CPU
    kbenzie authored Nov 24, 2023
    41228d3
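
Note on the `[OpenCL] Fetch headers/icd-loader by default` commit above: the fallback order it describes can be modelled in a few lines. This is only an illustrative C++ sketch of the decision logic, not the project's CMake code. `Source`, `OpenCLDeps` and `resolveOpenCLDeps` are hypothetical names, the two optional arguments stand in for `UR_OPENCL_INCLUDE_DIR` and `UR_OPENCL_ICD_LOADER_LIBRARY`, and `system_has_opencl_3` stands in for the result of `find_package(OpenCL 3.0)`.

```cpp
#include <optional>
#include <string>

// Hypothetical model of where each OpenCL dependency comes from.
enum class Source { UserProvided, SystemPackage, FetchedFromGitHub };

struct OpenCLDeps {
    Source headers;
    Source icd_loader;
};

// Mirrors the order described in the commit message:
//  - headers: use the user-supplied directory, otherwise clone OpenCL-Headers;
//  - ICD loader: use the user-supplied library, otherwise a system OpenCL 3.0
//    package if one was found, otherwise clone OpenCL-ICD-Loader.
inline OpenCLDeps
resolveOpenCLDeps(const std::optional<std::string> &include_dir,
                  const std::optional<std::string> &icd_loader_library,
                  bool system_has_opencl_3) {
    OpenCLDeps deps{};
    deps.headers =
        include_dir ? Source::UserProvided : Source::FetchedFromGitHub;
    if (icd_loader_library) {
        deps.icd_loader = Source::UserProvided;
    } else if (system_has_opencl_3) {
        deps.icd_loader = Source::SystemPackage;
    } else {
        deps.icd_loader = Source::FetchedFromGitHub;
    }
    return deps;
}
```

With neither option set and no suitable system package, both dependencies resolve to `FetchedFromGitHub`, which is the "by default" behaviour named in the commit title.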

Commits on Nov 27, 2023

  1. 79457b2
  2. Make work for buffer case

    hdelan committed Nov 27, 2023
    1c85750
  3. Throw from std::visit

    hdelan committed Nov 27, 2023
    f6c476d

Commits on Nov 28, 2023

  1. 2a11e49
  2. Merge pull request oneapi-src#1088 from hdelan/ifdef-hip-5.6

    [HIP] Ifdef entry point introduced in HIP 5.6
    kbenzie authored Nov 28, 2023
    5e914c5
  3. 3e1f163

Commits on Nov 29, 2023

  1. a32be71

Commits on Nov 30, 2023

  1. e770fdd
  2. Merge pull request oneapi-src#1141 from kbenzie/benie/disable-windows-l0-clang-cl
    
    [GHA] Disable Windows L0 clang-cl job
    kbenzie authored Nov 30, 2023
    a99d1db
  3. a92a989
  4. 4cffc1d
  5. f2ca7a9
  6. 681e607
  7. Merge pull request oneapi-src#1050 from szadam/adapters_ci_info

    Add system info in CI workflows run on self-hosted runners
    pbalcer authored Nov 30, 2023
    bd76c51
  8. Merge pull request oneapi-src#1072 from omarahmed1111/merge-some-main-changes-into-adapters-third-patch
    
    Merge main into adapters branch
    omarahmed1111 authored Nov 30, 2023
    47af3ee

Commits on Dec 1, 2023

  1. Merge pull request oneapi-src#1102 from hdelan/adapter-batch1

    [HIP] Adapter PR batch
    kbenzie authored Dec 1, 2023
    fe5bc76
  2. Merge pull request oneapi-src#1126 from martygrant/martin/buildAllAdapters
    
    CMake Flag for building all adapters
    martygrant authored Dec 1, 2023
    53a429f

Commits on Dec 4, 2023

  1. a3bf168
  2. Merge pull request oneapi-src#1150 from kswiecicki/adapters-val-adapter-refcount-fix
    
    [UR] Fix adapter refcount tracking
    pbalcer authored Dec 4, 2023
    670b30f
  3. Merge pull request oneapi-src#1108 from AllanZyne/review/yang/urAdapterGet
    
    [UR] Fix urAdapterGet ignores NumEntries
    pbalcer authored Dec 4, 2023
    fe2735a
  4. Allow fixtures derived from KernelTest to defer compilation.

    Separating the device code compilation from the rest of the setup lets
    a test query device or platform properties to decide whether it should
    skip without potentially hitting a compilation error because the kernel
    being built as part of the setup uses an unsupported feature.
    aarongreig committed Dec 4, 2023
    512b158
  5. [CTS] Update ctest_parser.py script

    Update the `scripts/ctest_parser.py` to work with the adapter branch
    changes.
    kbenzie committed Dec 4, 2023
    3c568c3
  6. HIP adapter multi dev ctx

    hdelan authored and kbenzie committed Dec 4, 2023
    5dc6e46
  7. f0e0be2
  8. Merge pull request oneapi-src#1153 from aarongreig/aaron/deferBuildKernelFixtures
    
    Allow fixtures derived from KernelTest to defer compilation.
    aarongreig authored Dec 4, 2023
    9c6e151
  9. Add bounds checking for enqueue operations to the validation layer.

    Bounds are checked using the size queries for buffers, images and USM
    allocations. Since not all adapters implement these queries, the
    bounds checking is not yet comprehensive on all platforms.
    aarongreig committed Dec 4, 2023
    4b7f70f
  10. Merge pull request oneapi-src#1154 from kbenzie/benie/update-ctest_parser
    
    [CTS] Update ctest_parser.py script
    kbenzie authored Dec 4, 2023
    9b1fb4e
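
Note on the `Allow fixtures derived from KernelTest to defer compilation.` commit above: the idea of separating device-code compilation from the rest of the setup can be sketched as follows. This is a minimal, framework-free illustration and not the project's actual `KernelTest` fixture. `FakeDevice`, `DeferredKernelFixture` and `runFeatureTest` are hypothetical names, and the thrown exception merely stands in for a compilation error.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical stand-in for a device; a real test would query the adapter.
struct FakeDevice {
    bool supports_feature;
};

// Hypothetical fixture: setup and device-code compilation are separate steps.
class DeferredKernelFixture {
  public:
    explicit DeferredKernelFixture(FakeDevice device) : device_(device) {}

    // General setup; deliberately does not compile any device code.
    void SetUp() { ready_ = true; }

    // Explicit compilation step. Throws if the kernel needs a feature the
    // device lacks, mimicking a compilation error.
    void Build() {
        assert(ready_ && "SetUp() must run before Build()");
        if (!device_.supports_feature) {
            throw std::runtime_error("kernel uses an unsupported feature");
        }
        built_ = true;
    }

    bool built() const { return built_; }

  private:
    FakeDevice device_;
    bool ready_ = false;
    bool built_ = false;
};

// Returns true if the test body ran, false if the test was skipped.
inline bool runFeatureTest(FakeDevice device) {
    DeferredKernelFixture fixture(device);
    fixture.SetUp();
    if (!device.supports_feature) {
        return false; // skip before any compilation is attempted
    }
    fixture.Build();
    return fixture.built();
}
```

Because `Build()` is only called after the property check, a device without the feature is skipped cleanly instead of failing during setup.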

Commits on Dec 5, 2023

  1. Merge pull request oneapi-src#999 from hdelan/hip-adapter-multi-dev-ctx

    [HIP] Hip adapter multi dev ctx
    kbenzie authored Dec 5, 2023
    67e7da3
  2. Merge pull request oneapi-src#1093 from aarongreig/aaron/addBoundsCheckingValidation
    
    Add bounds checking for enqueue operations to the validation layer.
    aarongreig authored Dec 5, 2023
    9e3bb2f
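
Note on the `Add bounds checking for enqueue operations to the validation layer.` change above: a bounds check built on a size query might look roughly like the sketch below. It is an assumption-laden illustration, not the validation layer's code. `CheckResult` and `checkBounds` are hypothetical names, and an empty `allocation_size` models an adapter that does not implement the size query.

```cpp
#include <cstddef>
#include <optional>

// Hypothetical result type; the real layer reports its own error codes.
enum class CheckResult { Ok, OutOfBounds, Unchecked };

// Checks that [offset, offset + size) lies inside an allocation.
// allocation_size is empty when the adapter cannot report a size, in which
// case the access cannot be validated.
inline CheckResult checkBounds(std::optional<std::size_t> allocation_size,
                               std::size_t offset, std::size_t size) {
    if (!allocation_size) {
        return CheckResult::Unchecked;
    }
    // Compare against the remaining space rather than computing
    // offset + size, which could overflow.
    if (offset > *allocation_size || size > *allocation_size - offset) {
        return CheckResult::OutOfBounds;
    }
    return CheckResult::Ok;
}
```

The `Unchecked` outcome corresponds to the caveat in the commit message that coverage depends on which adapters implement the size queries.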