Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

benie/test adapters #1020

Closed
wants to merge 425 commits into from
Closed
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Aug 18, 2023

  1. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Aug 18, 2023
    Configuration menu
    Copy the full SHA
    644755c View commit details
    Browse the repository at this point in the history

Commits on Aug 21, 2023

  1. [UR] Remove Prefixes from CUDA tests

    Petr Vesely committed Aug 21, 2023
    Configuration menu
    Copy the full SHA
    5eaf0a2 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#795 from PatKamin/fuzz-test-generate-ap…

    …i-calls-adapters
    
    Test parts of API with libFuzzer.
    pbalcer authored Aug 21, 2023
    Configuration menu
    Copy the full SHA
    318ef5a View commit details
    Browse the repository at this point in the history
  3. [UR] Remove Prefixes from HIP tests

    Petr Vesely committed Aug 21, 2023
    Configuration menu
    Copy the full SHA
    fa89ec4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    bf16571 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    858e062 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    31be4f4 View commit details
    Browse the repository at this point in the history
  7. Merge pull request oneapi-src#811 from veselypeta/petr/fix-prefixes

    [UR] Remove `cuda_`/`hip_` prefixes from tests files.
    veselypeta authored Aug 21, 2023
    Configuration menu
    Copy the full SHA
    12c0cdf View commit details
    Browse the repository at this point in the history
  8. Merge pull request oneapi-src#813 from pbalcer/adapters-update-v0.7.1

    [adapters] update to latest sycl (with 0.7.1 tag)
    pbalcer authored Aug 21, 2023
    Configuration menu
    Copy the full SHA
    f95dfbc View commit details
    Browse the repository at this point in the history

Commits on Aug 22, 2023

  1. [SYCL][Graph] Throw an exception when unsupported features used in a …

    …graph (#10789)
    
    This PR contains a set of changes that implement throwing an exception
    when a feature unsupported by
    
    [sycl_ext_oneapi_graph](https://github.com/intel/llvm/blob/sycl/sycl/doc/extensions/proposed/sycl_ext_oneapi_graph.asciidoc)
    is used.
    
    An error is thrown when an application uses the following features with
    the graph extension:
    * An unsupported extension.
    * Querying the event returned from graph submission for profiling
    information.
    * Level Zero immediate command-list (see
    intel/llvm#10467)
    * Specialization constants, kernel bundles, or reductions are used in a
    graph node.
    
    ## Authors
    
    Co-authored-by: Pablo Reble <pablo.reble@intel.com>
    Co-authored-by: Julian Miller <julian.miller@intel.com>
    Co-authored-by: Ben Tracy <ben.tracy@codeplay.com>
    Co-authored-by: Ewan Crawford <ewan@codeplay.com>
    Co-authored-by: Maxime France-Pillois
    <maxime.francepillois@codeplay.com>
    EwanC authored Aug 22, 2023
    Configuration menu
    Copy the full SHA
    fd179fa View commit details
    Browse the repository at this point in the history
  2. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Aug 22, 2023
    Configuration menu
    Copy the full SHA
    98f5fbf View commit details
    Browse the repository at this point in the history
  3. [SYCL][UR][L0] Fix some leftovers from PI-UR porting (#10785)

    Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    Jaime Arteaga authored Aug 22, 2023
    Configuration menu
    Copy the full SHA
    a5ae24d View commit details
    Browse the repository at this point in the history
  4. [SYCL][UR][L0] Correctly set OwnNativeHandle on image create (#10919)

    Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    Jaime Arteaga authored Aug 22, 2023
    Configuration menu
    Copy the full SHA
    cc6e8ab View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2023

  1. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Aug 23, 2023
    Configuration menu
    Copy the full SHA
    3b09c80 View commit details
    Browse the repository at this point in the history

Commits on Aug 24, 2023

  1. Add prefetch for HIP USM allocations (#10430)

    This change is necessary to workaround a delightful bug in either HIP
    runtime, or the HIP spec.
    
    It's discussed at length in github.com/intel/llvm/issues/7252 but for
    the purposes of this patch, it suffices to say that a call to
    `hipMemPrefetchAsync` is *required* for correctness in the face of
    global atomic operations on (*at least*) shared USM allocations.
    
    The architecture of this change is slightly strange on first sight in
    that we reduntantly track allocation information in several places. The
    context now keeps track of all USM mappings. We require a mapping of
    pointers to the allocated size, but these allocations aren't pinned to
    any particular queue or HIP stream.
    The `hipMemPrefetchAsync`, however, requires the associated HIP stream
    object, and the size of the allocation. The stream comes
    hot-off-the-queue *only* just before a kernel is launched, so we need to
    defer the prefetch until we have that information.
    
    Finally, the kernel itself keeps track of pointer arguments in a more
    accessible way so we can determine which of the kernel's pointer
    arguments do, in-fact, point to USM allocations.
    ldrumm authored Aug 24, 2023
    Configuration menu
    Copy the full SHA
    f02aa8f View commit details
    Browse the repository at this point in the history

Commits on Aug 25, 2023

  1. [UR][CUDA][HIP] Fix Set Arg Local (#10710)

    In the CUDA/HIP adapters `urKernelSetArgValue` was being used to
    implement both `urKernelSetArgValue` & `urKernelSetArgLocal`. However,
    if the validation layer is enabled in UR then the path to set local arg
    is never taken since it includes a check that `pArgValue` is not null.
    
    This PR:
     * Implements `urKernelSetArgLocal` for CUDA/HIP adapters
    * Changes `pi2ur` to call `urKernelSetArgLocal` when `arg_value` is
    `nullptr`
    * Implements `urKernelSetArgLocal` for L0 adapter - this just calls back
    into `urKernelSetArgValue`.
    veselypeta authored Aug 25, 2023
    Configuration menu
    Copy the full SHA
    503755d View commit details
    Browse the repository at this point in the history
  2. [UR][CUDA][HIP] Fix Set Arg Local (#10710)

    In the CUDA/HIP adapters `urKernelSetArgValue` was being used to
    implement both `urKernelSetArgValue` & `urKernelSetArgLocal`. However,
    if the validation layer is enabled in UR then the path to set local arg
    is never taken since it includes a check that `pArgValue` is not null.
    
    This PR:
     * Implements `urKernelSetArgLocal` for CUDA/HIP adapters
    * Changes `pi2ur` to call `urKernelSetArgLocal` when `arg_value` is
    `nullptr`
    * Implements `urKernelSetArgLocal` for L0 adapter - this just calls back
    into `urKernelSetArgValue`.
    veselypeta authored Aug 25, 2023
    Configuration menu
    Copy the full SHA
    4fb0004 View commit details
    Browse the repository at this point in the history
  3. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Aug 25, 2023
    Configuration menu
    Copy the full SHA
    fcf0903 View commit details
    Browse the repository at this point in the history

Commits on Aug 28, 2023

  1. [SYCL][UR][HIP] Check if module needs to be unloaded (#10787)

    In the HIP adapter the HIP module is not set until the program is built
    with `urProgramBuild`, therefore we should check that the module
    actually needs to be unloaded in `urProgramRelease`.
    
    The following will result in a failure, but should still be valid UR
    trace:
    
    ```cpp
    uint8_t *source = "<some ptx>";
    ur_program_handle_t prog;
    urProgramCreate(context, device, sizeof(source), ptxSource, nullptr, &prog);
    urProgramRelease(prog); // fails when it tries to unload the module.
    ```
    veselypeta authored Aug 28, 2023
    Configuration menu
    Copy the full SHA
    9d5d494 View commit details
    Browse the repository at this point in the history
  2. [UR][CUDA][HIP][L0] Cleanup licence header (#10824)

    Fix the license headers at the top of each source file in the unified
    runtime directory.
    
    ---------
    
    Co-authored-by: Alexey Bader <alexey.bader@intel.com>
    veselypeta and bader authored Aug 28, 2023
    Configuration menu
    Copy the full SHA
    e0c87a5 View commit details
    Browse the repository at this point in the history
  3. [UR][CUDA][HIP][L0] Cleanup licence header (#10824)

    Fix the license headers at the top of each source file in the unified
    runtime directory.
    
    ---------
    
    Co-authored-by: Alexey Bader <alexey.bader@intel.com>
    veselypeta and bader authored Aug 28, 2023
    Configuration menu
    Copy the full SHA
    b445a12 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    37a2225 View commit details
    Browse the repository at this point in the history
  5. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Aug 28, 2023
    Configuration menu
    Copy the full SHA
    e714348 View commit details
    Browse the repository at this point in the history

Commits on Aug 29, 2023

  1. [cmake] Use separate FetchSource's per adapter

    While attempting to enable the Level-Zero and OpenCL adapters at the
    same time git will report an error due to the OpenCL tag not existing in
    the intel/llvm clone use for the Level-Zero adapter. Furthermore, if I
    wanted to test separate bug fixes for the CUDA and HIP adapters from
    different sources, this would also not be possible.
    
    This patch uses a separate FetchSource for each adapter to allow each
    adapters source to be fetched independently of the others and moves
    those clones into the build directory. This does result in additional
    git clones when enabling multiple adapters and again in multiple build
    directories but avoids conflicting remote URL's or multiple build
    configurations trampling on each others checked out tags.
    kbenzie committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    789adb0 View commit details
    Browse the repository at this point in the history
  2. [fuzz test] discover all corpora in 'corpus' dir

    and re-name the label to 'fuzz-short' for easier usage.
    lukaszstolarczuk committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    b15f186 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9f2ea9a View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#831 from kbenzie/benie/adapters-separat…

    …e-clones
    
    [cmake] Use separate FetchSource's per adapter
    kbenzie authored Aug 29, 2023
    Configuration menu
    Copy the full SHA
    fa4b5cf View commit details
    Browse the repository at this point in the history
  5. [cts] Add adapters ignore lists for conformance tests

    This change enables conformance tests on HW with adapters.
    Tests that currently do not pass are ignored, but in the future,
    when they are all fixed, tests should pass without using this functionality.
    wlemkows committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    ddeb5b5 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    007d939 View commit details
    Browse the repository at this point in the history

Commits on Aug 30, 2023

  1. Merge pull request oneapi-src#823 from wlemkows/test-test

    [cts] Add adapters ignore lists for conformance tests
    pbalcer authored Aug 30, 2023
    Configuration menu
    Copy the full SHA
    0ab28ff View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#830 from lukaszstolarczuk/ext-fuzz-doc

    Small updates in fuzz tests
    pbalcer authored Aug 30, 2023
    Configuration menu
    Copy the full SHA
    5577bae View commit details
    Browse the repository at this point in the history
  3. [SYCL][HIP] Implemented supported make_* interop functions. (#10526)

    This PR adds missing functions in the hip backend to allow for
    interoperability in programs that create sycl objects from native hip
    objects. The new function implementations are:
    
    - `make_device`
    - `make_queue`
    - `make_event`
    
    Note that it would really make sense for
    intel/llvm#10491 to be merged first because this
    PR makes the same code change in pi2ur, for a fix that is attributed to
    #10491.
    
    ---------
    
    Signed-off-by: Jack Kirk <jack.kirk@codeplay.com>
    JackAKirk authored Aug 30, 2023
    Configuration menu
    Copy the full SHA
    e3d2184 View commit details
    Browse the repository at this point in the history
  4. [ur] Introduce urinfo tool

    `urinfo` is a command-line tool for inspecting the current execution
    environment:
    
    ```console
    $ build/bin/urinfo --help
    usage: build/bin/urinfo [-h] [-v] [-V]
    
    This tool enumerates Unified Runtime layers, adapters, platforms, and
    devices which are currently visible in the local execution environment.
    
    options:
      -h, --help            show this help message and exit
      --version             show version number and exit
      -v, --verbose         print additional information
    ```
    kbenzie committed Aug 30, 2023
    Configuration menu
    Copy the full SHA
    3e1cb9d View commit details
    Browse the repository at this point in the history

Commits on Aug 31, 2023

  1. Merge pull request oneapi-src#751 from kbenzie/benie/urinfo

    [ur] Introduce urinfo tool
    kbenzie authored Aug 31, 2023
    Configuration menu
    Copy the full SHA
    c1bc465 View commit details
    Browse the repository at this point in the history
  2. [CTS] fix cts device adapter L0 match file

    Test urDeviceGetGlobalTimestampTest.SuccessSynchronizedTime
    is marked as OPT because is flaky.
    wlemkows committed Aug 31, 2023
    Configuration menu
    Copy the full SHA
    d8d1e40 View commit details
    Browse the repository at this point in the history
  3. Run fuzz test with L0 when available

    When UR is built with the UR_BUILD_ADAPTER_L0 flag, run fuzz tests
    with level zero adapter force loaded.
    PatKamin committed Aug 31, 2023
    Configuration menu
    Copy the full SHA
    f658931 View commit details
    Browse the repository at this point in the history
  4. Add information on generating new corpus files

    Add instructions for generating new short fuzz test scenarios.
    PatKamin committed Aug 31, 2023
    Configuration menu
    Copy the full SHA
    9bf0014 View commit details
    Browse the repository at this point in the history
  5. Merge pull request oneapi-src#836 from wlemkows/device-l0-match-fix

    [CTS] fix cts device adapter L0 match file
    pbalcer authored Aug 31, 2023
    Configuration menu
    Copy the full SHA
    6114e57 View commit details
    Browse the repository at this point in the history
  6. [SYCL] Optimize urEnqueueEventsWaitWithBarrier for in-order queues (#…

    …10995)
    
    Don't insert
    zeCommandListAppendWaitOnEvents/zeCommandListAppendSignalEvent for
    queue::ext_oneapi_submit_barrier() if we have in-order queue and all
    events in the waitlist are from the same queue.
    againull authored Aug 31, 2023
    Configuration menu
    Copy the full SHA
    afa564d View commit details
    Browse the repository at this point in the history
  7. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Aug 31, 2023
    Configuration menu
    Copy the full SHA
    1f5b07b View commit details
    Browse the repository at this point in the history

Commits on Sep 1, 2023

  1. [ci] fix running cts tests that require device binaries

    Creating device binaries silently fails if SYCL isn't available.
    pbalcer committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    fc6f8db View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#839 from PatKamin/fuzztest-readme

    Add information on generating new corpus files
    pbalcer authored Sep 1, 2023
    Configuration menu
    Copy the full SHA
    d242d04 View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#838 from PatKamin/fuzztest-l0

    Run fuzz test with L0 when available
    pbalcer authored Sep 1, 2023
    Configuration menu
    Copy the full SHA
    17e4bcd View commit details
    Browse the repository at this point in the history
  4. Remove phony nightly CI workflow

    Cron scheduled workflows are triggered on the default branch only,
    which is the 'main' branch.
    PatKamin committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    7e342cc View commit details
    Browse the repository at this point in the history
  5. Merge pull request oneapi-src#845 from PatKamin/remove-nightly-build

    Remove phony nightly CI workflow [adapters branch]
    pbalcer authored Sep 1, 2023
    Configuration menu
    Copy the full SHA
    b7e8594 View commit details
    Browse the repository at this point in the history
  6. Speed up CI jobs

    PatKamin committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    17e60d3 View commit details
    Browse the repository at this point in the history
  7. Remove now redundant job (w/o HW)

    In current form this job is redundant, since we run the same builds
    on the same OS, but with hardware and tests' execution.
    lukaszstolarczuk authored and PatKamin committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    18b85d4 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    475b1aa View commit details
    Browse the repository at this point in the history
  9. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Sep 1, 2023
    Configuration menu
    Copy the full SHA
    05575e8 View commit details
    Browse the repository at this point in the history

Commits on Sep 4, 2023

  1. Configuration menu
    Copy the full SHA
    8ccb7ba View commit details
    Browse the repository at this point in the history
  2. [SYCL][HIP] Add managed memory USM checks for HIP prefetch (#10761)

    Adds `hipDeviceAttributeManagedMemory` attribute check to verify the HIP
    device supports managed memory in order to use the prefetch API as well
    as checking the pointer to migrate is actually a managed allocation
    (USM/SVM).
    
    If the check fails we return early and set a warning message (with
    `UR_RESULT_SUCCESS` as return code).
    
    As a follow-up. we may need to detect system-allocated memory and either
    continue with just a warning message or throw a more targeted error
    message and exit.
    GeorgeWeb authored Sep 4, 2023
    Configuration menu
    Copy the full SHA
    68faafa View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#843 from PatKamin/shorten-ci

    [adapters] Shorten CI
    pbalcer authored Sep 4, 2023
    Configuration menu
    Copy the full SHA
    6a2f948 View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#840 from pbalcer/ci-device-binaries

    [ci] fix running cts tests that require device binaries
    pbalcer authored Sep 4, 2023
    Configuration menu
    Copy the full SHA
    20337b2 View commit details
    Browse the repository at this point in the history

Commits on Sep 5, 2023

  1. Add codegen example

    kurapov-peter committed Sep 5, 2023
    Configuration menu
    Copy the full SHA
    544ded5 View commit details
    Browse the repository at this point in the history
  2. Add conda dependencies

    kurapov-peter committed Sep 5, 2023
    Configuration menu
    Copy the full SHA
    41883a9 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    aa88398 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    cabcd98 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    80f6eb3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    60372d3 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    115abc2 View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    4797c71 View commit details
    Browse the repository at this point in the history
  9. [SYCL] [UR] Fix for make_queue. (#11046)

    Correction to make_queue functionality. During the transition from L0
    Plugin to UR Adapter a line of code in the make queue function was
    inadvertenty dropped. This change restores it and also modifies the
    associated test to do more careful checking.
    rdeodhar authored Sep 5, 2023
    Configuration menu
    Copy the full SHA
    c33b13b View commit details
    Browse the repository at this point in the history
  10. Merge from 'sycl' to 'sycl-web'

    iclsrc committed Sep 5, 2023
    Configuration menu
    Copy the full SHA
    502401b View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2023

  1. [UR][CUDA][HIP] Replace ur::assertion with UR_CHECK_ERROR (#10986)

    Lots of hip/cu driver API calls were wrapped in `ur::assertion(res ==
    CU_SUCCESS)` etc which:
    
    - Means that any native error messages returned from the affected driver
    api calls were lost.
    - Since these APIs report errors asynchronously, such that they are
    thrown from the last API call rather than the call which led to the
    error, previous asynchronous error messages from different APIs to the
    ones wrapped by the `ur::assertion` could also be lost depending on user
    code.
    
    These problems are fixed by swapping these assertions with
    `UR_CHECK_ERROR`.
    
    Note that in the future UR may want to adjust `UR_CHECK_ERROR` so that
    it throws `UR_RESULT_ERROR_ADAPTER_SPECIFIC` etc instead of using
    `std::cerr` etc to report the error etc. But I think it makes sense to
    still use `UR_CHECK_ERROR` to wrap driver API calls because it means
    that the __LINE__, __FUNCTION__ etc info can be correctly passed to
    native error reporting.
    
    ---------
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk authored Sep 8, 2023
    Configuration menu
    Copy the full SHA
    8639075 View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2023

  1. [SYCL][UR] Implement usm memory pool for hip adapter. (#11065)

    Based on #10758
    
    Co-authored-by: Ronan Keryell <ronan@keryell.fr>
    martygrant and keryell authored Sep 12, 2023
    Configuration menu
    Copy the full SHA
    73527bc View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e08d63b View commit details
    Browse the repository at this point in the history
  3. [SYCL][UR][L0] Use uint8_t instead of uint32_t for 2 cases of urConte…

    …xtGetInfo (#10929)
    
    Modified the return value for UR_CONTEXT_INFO_USM_FILL2D_SUPPORT and
    UR_CONTEXT_INFO_USM_MEMCPY_2D_SUPPORT to return uint8_t instead of
    uint32_t to comply with unified-runtime conformance test.
    
    Signed-off-by: Zhang, Winston <winston.zhang@intel.com>
    winstonzhang-intel authored Sep 12, 2023
    Configuration menu
    Copy the full SHA
    14178b0 View commit details
    Browse the repository at this point in the history
  4. [SYCL] Return Success in USM Mem Free if L0 driver is already release…

    …d (#11075)
    
    Signed-off-by: Spruit, Neil R <neil.r.spruit@intel.com>
    nrspruit authored Sep 12, 2023
    Configuration menu
    Copy the full SHA
    9cf2ff8 View commit details
    Browse the repository at this point in the history

Commits on Sep 14, 2023

  1. [SYCL][UR][L0] Replace memory type look-up with UMF tracking (#10807)

    This also makes SharedReadOnly allocations tracking obsolete.
    kswiecicki authored Sep 14, 2023
    Configuration menu
    Copy the full SHA
    7ced7ca View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a0a20d8 View commit details
    Browse the repository at this point in the history

Commits on Sep 15, 2023

  1. [PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)

    Fetch the latest revision of unified runtime:
    
    Notable changes
    * Some command buffer entry-points have been renamed, also 2 additions
    of membufferFill and USMFill
    * UrInit/urTearDown have been removed - replaced with loader only
    versions.
    veselypeta authored Sep 15, 2023
    Configuration menu
    Copy the full SHA
    630fbb8 View commit details
    Browse the repository at this point in the history
  2. [PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)

    Fetch the latest revision of unified runtime:
    
    Notable changes
    * Some command buffer entry-points have been renamed, also 2 additions
    of membufferFill and USMFill
    * UrInit/urTearDown have been removed - replaced with loader only
    versions.
    veselypeta authored Sep 15, 2023
    Configuration menu
    Copy the full SHA
    0b56545 View commit details
    Browse the repository at this point in the history

Commits on Sep 18, 2023

  1. Merge pull request oneapi-src#610 from kurapov-peter/pakurapo/codegen…

    …-example
    
    End-to-end code generation example
    kbenzie authored Sep 18, 2023
    Configuration menu
    Copy the full SHA
    1d706f6 View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2023

  1. [SYCL] Add esimd as an optional device feature (#10017)

    This change adds a new aspect for esimd, `ext_intel_esimd`, and
    annotates the two fundamental esimd classes, `simd_obj_impl` and
    `simd_view_impl` with the `uses_aspect` attribute.
    
    `simd_obj_impl` is the base class of `simd` and `simd_mask` which are
    the fundamental user-facing classes.
    `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not
    a base of `simd_view_impl`, but every `simd_view_impl` requires a
    reference to a `simd` or `simd_mask` at construction time, so I am not
    sure if we truly need to annotate `simd_view_impl`, but I added it to be
    safe.
    
    It also adds a new PI device info query,
    `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT` that is used to query at
    runtime if a device supports ESIMD. For UR-based plugins, we map that to
    `UR_DEVICE_INFO_ESIMD_SUPPOR`.
    
    The implementation simply returns false for cuda, hip and native_cpu.
    For l0 and opencl, we check that the device is an intel gpu by querying
    the device type is gpu and the vendor id is `0x8086`.
    For ESIMD emulator we simply return true.
    
    I would appreciate careful review on the plugin changes in particular,
    as I am not an expert.
    
    This change also updates the esimd spec to document the new aspect.
    
    In a future change, I plan to use the new aspect to remove the
    requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros
    that set function attributes, but I am not doing that as part of this PR
    as it requires more investigation.
    
    ---------
    
    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored Sep 19, 2023
    Configuration menu
    Copy the full SHA
    3b84011 View commit details
    Browse the repository at this point in the history
  2. [SYCL] Add esimd as an optional device feature (#10017)

    This change adds a new aspect for esimd, `ext_intel_esimd`, and
    annotates the two fundamental esimd classes, `simd_obj_impl` and
    `simd_view_impl` with the `uses_aspect` attribute.
    
    `simd_obj_impl` is the base class of `simd` and `simd_mask` which are
    the fundamental user-facing classes.
    `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not
    a base of `simd_view_impl`, but every `simd_view_impl` requires a
    reference to a `simd` or `simd_mask` at construction time, so I am not
    sure if we truly need to annotate `simd_view_impl`, but I added it to be
    safe.
    
    It also adds a new PI device info query,
    `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT` that is used to query at
    runtime if a device supports ESIMD. For UR-based plugins, we map that to
    `UR_DEVICE_INFO_ESIMD_SUPPOR`.
    
    The implementation simply returns false for cuda, hip and native_cpu.
    For l0 and opencl, we check that the device is an intel gpu by querying
    the device type is gpu and the vendor id is `0x8086`.
    For ESIMD emulator we simply return true.
    
    I would appreciate careful review on the plugin changes in particular,
    as I am not an expert.
    
    This change also updates the esimd spec to document the new aspect.
    
    In a future change, I plan to use the new aspect to remove the
    requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros
    that set function attributes, but I am not doing that as part of this PR
    as it requires more investigation.
    
    ---------
    
    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored Sep 19, 2023
    Configuration menu
    Copy the full SHA
    5c1d65e View commit details
    Browse the repository at this point in the history

Commits on Sep 20, 2023

  1. Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)

    Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for
    the past few days.
    aelovikov-intel authored Sep 20, 2023
    Configuration menu
    Copy the full SHA
    ee78d5f View commit details
    Browse the repository at this point in the history
  2. Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)

    Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for
    the past few days.
    aelovikov-intel authored Sep 20, 2023
    Configuration menu
    Copy the full SHA
    eb86b33 View commit details
    Browse the repository at this point in the history
  3. [SYCL][Graph] Bug fixes for hanging tests and urCommandBufferAppendKe…

    …rnelLaunchExp (#11232)
    
    This patch targets to address two issues found in the testing:
    - Test hangs: This patch serializes the graph submissions in the tests
    to prevent hangs when submitting multiple graphs.
    - Issue adding a kernel to the command buffer in
    `urCommandBufferAppendKernelLaunchExp`.
    
    ---------
    
    Co-authored-by: Maxime France-Pillois <maxime.francepillois@codeplay.com>
    julianmi and mfrancepillois authored Sep 20, 2023
    Configuration menu
    Copy the full SHA
    ee3a5de View commit details
    Browse the repository at this point in the history

Commits on Sep 21, 2023

  1. [UR][L0] Make the queue [root-]device appear first in the context (#1…

    …1213)
    
    Signed-off-by: smaslov <sergey.v.maslov@intel.com>
    smaslov-intel authored Sep 21, 2023
    Configuration menu
    Copy the full SHA
    a1a57f5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    10421df View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    f0944fe View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    89c1377 View commit details
    Browse the repository at this point in the history
  5. [SYCL][HIP] Fix hip prefetch/mem_advise (noop) for rocm < 5.0 (#10958)

    The main fix here is to enable prefetch functionality when the HIP
    backend is built with rocm versions < 5.
    The change to the prefix/mem_advise UR hip adapter APIs fixes four
    e2e-tests that fail test-e2e only for HIP 4.x versions (These failures
    don't come up in the CI since it tests using rocm 5.x).
    
    The change to Tracing/image_printers.cpp is effectively re-XFailing this
    test for rocm 4 only. rocm 4 doesn't support `hipCreateSurfaceObject`
    which is called by this test. I'm not sure this legacy image
    functionality (Which as I understand it will be replaced by bindless
    images) is really working for rocm 5 in a meaningful way, since most
    other legacy image e2e tests are XFAIL for hip. But this test can still
    be useful for us for ROCM 5 compatibility testing. The XFAIL was
    recently removed here
    intel/llvm@745febe
    which led to us finding the rocm 4 fail. I guess that the CI at some
    point switched from testing rocm 4 to testing rocm 5, which meant this
    test stopped failing in CI.
    
    ---------
    
    Signed-off-by: Jack Kirk <jack.kirk@codeplay.com>
    JackAKirk authored Sep 21, 2023
    Configuration menu
    Copy the full SHA
    cca22fa View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    d9f48c5 View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2023

  1. [SYCL][NFC] Make UR_CHECK_ERROR a void return macro (#11100)

    `UR_CHECK_ERROR` was designed to return `ur_result_t`, however in
    practice it was guaranteed to only ever return `UR_RESULT_SUCCESS`, as
    other paths would either terminate, abort or throw.
    
    This in turns leads to poor quality/error prone code, as the codebase
    was littered with:
    * statements not checking the return value - depending on the compiler
    generating a warning,
    * extra check on the return which was only ever going to be true.
    
    Some care was required, as the codebase has a habit of accumulating err
    codes across branches, so depending on the use case the initial value of
    `ur_result_t Result`s had to be set accordingly (now that
    `UR_CHECK_ERROR` does not return).
    jchlanda authored Sep 22, 2023
    Configuration menu
    Copy the full SHA
    834e127 View commit details
    Browse the repository at this point in the history
  2. [UR][L0] Do not rely on CTAD to avoid build warnings/failures (#11255)

    Fixes post-commit build fails in
    https://github.com/intel/llvm/actions/runs/6256835642/job/16988269333
    
    Signed-off-by: smaslov <sergey.v.maslov@intel.com>
    smaslov-intel authored and veselypeta committed Sep 22, 2023
    Configuration menu
    Copy the full SHA
    72bc1d4 View commit details
    Browse the repository at this point in the history

Commits on Sep 25, 2023

  1. Configuration menu
    Copy the full SHA
    f5e0ed5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    aa80476 View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#889 from aarongreig/aaron/disableGenera…

    …teWindowsCherrypick
    
    [ADAPTERS] Cherry-pick "Temporarily disable running check-generated target in windows CI."
    aarongreig authored Sep 25, 2023
    Configuration menu
    Copy the full SHA
    4c6d556 View commit details
    Browse the repository at this point in the history
  4. [test] improve match file error output

    When a test fails a match, cmakes print both the
    input file and the match file, so that it's easy to see
    why the failure happened and how to fix it. Unfortunately,
    cmake was inserting 3 newlines between every line of both
    input and match files, making things hard to read.
    
    This patch fixes this by moving match and input files printing
    to the python script.
    
    Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
    pbalcer and Petr Vesely committed Sep 25, 2023
    Configuration menu
    Copy the full SHA
    be85a49 View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2023

  1. Merge pull request oneapi-src#886 from pbalcer/better-match-out

    [test] improve match file error output
    pbalcer authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    d0200e3 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    442de38 View commit details
    Browse the repository at this point in the history
  3. Use autogenerated tools

    hdelan authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    663d9f3 View commit details
    Browse the repository at this point in the history
  4. Add anonymous namespace

    hdelan authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    b368ca1 View commit details
    Browse the repository at this point in the history
  5. Undo code changes and add fix in cmake

    hdelan authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    15a089a View commit details
    Browse the repository at this point in the history
  6. Add required

    hdelan authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    e66fbfa View commit details
    Browse the repository at this point in the history
  7. Only add flag if using pthreads

    hdelan authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    fd33970 View commit details
    Browse the repository at this point in the history
  8. Remove misplaced statement from urKernelCreate doc

    Signed-off-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    Jaime Arteaga authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    179695d View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    b457b11 View commit details
    Browse the repository at this point in the history
  10. Configuration menu
    Copy the full SHA
    c4c3455 View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    22b9cf9 View commit details
    Browse the repository at this point in the history
  12. Revert "(NATIVECPU) added #define for device binary target"

    This reverts commit c6e2987.
    uwedolinsky authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    d7fa90a View commit details
    Browse the repository at this point in the history
  13. Revert "(NATIVECPU) added #define for device binary target"

    This reverts commit afa0525.
    uwedolinsky authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    7dad101 View commit details
    Browse the repository at this point in the history
  14. (NATIVECPU) added Native CPU macro

    uwedolinsky authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    30c07c2 View commit details
    Browse the repository at this point in the history
  15. Configuration menu
    Copy the full SHA
    fb87081 View commit details
    Browse the repository at this point in the history
  16. Configuration menu
    Copy the full SHA
    bb65b2c View commit details
    Browse the repository at this point in the history
  17. [CMake] Omit warning for missing 'generate' command - make it a status

    Lack of this command is not a problem for projects using UR
    and they may use it without 'UR_FORMAT_CPP_STYLE' set.
    lukaszstolarczuk authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    33ed352 View commit details
    Browse the repository at this point in the history
  18. Configuration menu
    Copy the full SHA
    fdf3597 View commit details
    Browse the repository at this point in the history
  19. [CMake] Make Python3 required dependency

    we basically use it everywhere.
    lukaszstolarczuk authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    7c3cf17 View commit details
    Browse the repository at this point in the history
  20. Add UR_DEVICE_INFO_ESIMD_SUPPORT_EXP

    This will be used by DPC++ PI to determine if a device supports the DPC++ ESIMD extension.
    
    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    ab1967c View commit details
    Browse the repository at this point in the history
  21. Rework change to not use experimental infrastructure

    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    6d9c29d View commit details
    Browse the repository at this point in the history
  22. Formatting and cleanup

    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    ddf6ac4 View commit details
    Browse the repository at this point in the history
  23. [UR] Bump sycl nightly version

    [UR] Fix ur info spec
    
    [UR] Bump compiler version in CI
    
    [UR] Extract into directory
    
    [UR] Temporarily disable 20.04
    
    [UR] Disable dpc++ on 20.04
    
    [UR] Fix hip matchers
    
    [UR] Fix cuda match files
    
    [UR] Fix L0 match files
    
    [UR] Tmp print out os info
    
    [UR] Tmp enable verbose logging
    
    [UR] Print out match files
    
    [L0] Remove segfault matchers
    
    [UR] Fix cuda matcher
    
    [UR] Remove temporary fixes
    veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    294a4d6 View commit details
    Browse the repository at this point in the history
  24. [SYCL][UR][L0] Modify flag for zeInit (#11242)

    Based on the existence of NPU driver for L0, GPU flag is now passed to
    zeInit
    
    Signed-off-by: Zhang, Winston <winston.zhang@intel.com>
    winstonzhang-intel authored and veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    4320ea1 View commit details
    Browse the repository at this point in the history
  25. Merge pull request oneapi-src#881 from veselypeta/petr/bump_adapters

    [ADAPTERS] Bump adapters branch to commit used in SYCL
    veselypeta authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    88e71c0 View commit details
    Browse the repository at this point in the history
  26. Configuration menu
    Copy the full SHA
    e2d8d26 View commit details
    Browse the repository at this point in the history
  27. [UR] Fix formatting

    veselypeta committed Sep 26, 2023
    Configuration menu
    Copy the full SHA
    ffcba56 View commit details
    Browse the repository at this point in the history
  28. Merge pull request oneapi-src#879 from veselypeta/petr/level_zero

    Move L0 Sources from intel/llvm to UR.
    veselypeta authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    83799e6 View commit details
    Browse the repository at this point in the history
  29. Configuration menu
    Copy the full SHA
    d4d936f View commit details
    Browse the repository at this point in the history
  30. Merge pull request oneapi-src#891 from callumfare/fix_adapter_def_path

    Use absolute path for adapter .def files
    callumfare authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    886e593 View commit details
    Browse the repository at this point in the history
  31. Configuration menu
    Copy the full SHA
    b29c580 View commit details
    Browse the repository at this point in the history
  32. Configuration menu
    Copy the full SHA
    d2edad6 View commit details
    Browse the repository at this point in the history
  33. Merge pull request oneapi-src#892 from callumfare/fix_l0_adapter_ddi

    [L0] Add missing loader interfaces to L0 adapter
    callumfare authored Sep 26, 2023
    Configuration menu
    Copy the full SHA
    ed395be View commit details
    Browse the repository at this point in the history
  34. Configuration menu
    Copy the full SHA
    8a10921 View commit details
    Browse the repository at this point in the history

Commits on Sep 27, 2023

  1. Merge pull request oneapi-src#895 from callumfare/l0_adapter_image_cpp

    [L0] Add missing image.cpp source file to the adapter
    callumfare authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    c791b8b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a33007b View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#876 from omarahmed1111/followup-review-…

    …feedback-on-e2e-codegen-example
    
    Followup review feedback on e2e codegen example
    omarahmed1111 authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    7180719 View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#894 from veselypeta/petr/fixup-level-ze…

    …ro-licenses
    
    [UR][L0] Cleanup L0 License headers
    veselypeta authored Sep 27, 2023
    Configuration menu
    Copy the full SHA
    48b4062 View commit details
    Browse the repository at this point in the history
  5. [SYCL][CUDA] Port CUDA plugin to Unified Runtime (#9512)

    This moves the CUDA plugin implementation to Unified Runtime; and
    changes the pi_cuda plugin to use pi2ur to implement PI. The changes to
    the implementation have been kept to a minimum and should be
    functionally the same. Documentation and comments have been moved
    verbatim, other than changing PI references to UR.
    
    This PR is based on top of the Level Zero adapter (#8744) so will only
    be ready when that is merged.
    
    ---------
    
    Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
    Co-authored-by: Omar Ahmed <omar.ahmed@codeplay.com>
    Co-authored-by: Martin Morrison-Grant <martin.morrisongrant@codeplay.com>
    Co-authored-by: Aaron Greig <aaron.greig@codeplay.com>
    5 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    8fbd9af View commit details
    Browse the repository at this point in the history
  6. [SYCL][CUDA] Fix post merge errors from #9512 (#9872)

    Resolves the warnings as errors reported in [post
    merge](https://github.com/intel/llvm/actions/runs/5266121277/jobs/9519634360)
    as a result of merging #9512. Additionally move pre-processor guards to
    resolve unused global variables which would also fail in this build
    configuration (clang & SYCL_ENABLE_WERROR=ON).
    kbenzie authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    5f22cfc View commit details
    Browse the repository at this point in the history
  7. [SYCL][CUDA][HIP] Block until the event is ready to start profiling. …

    …(#9938)
    
    * Call to `hipEventElapsedTime` return `hipErrorNotReady` when the
    timestamp has not yet been `recorded` on one or both events. Calling
    `hipEventSynchronize` block until the event is ready.
    * The issue showed itself when profiling sycl-blas benchmark.
    * Enable support for cuda / hip in event_profiling_info.cpp
    mmoadeli authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    24d46d8 View commit details
    Browse the repository at this point in the history
  8. [SYCL][UR] Update Unified Runtime tag to support UR_DEVICE_INFO_IP_VE…

    …RSION (#9873)
    
    This should have been an obvious update of Unified Runtime tag to
    support UR_DEVICE_INFO_IP_VERSION, required in
    intel/llvm#9843 (just tag update, nothing else),
    but it also brought many API breaks caused by this patch:
    oneapi-src#536. So the current
    PR updates our codebase in accordance with changed UR API.
    
    ---------
    
    Signed-off-by: Dmitry Vodopyanov <dmitry.vodopyanov@intel.com>
    Co-authored-by: Callum Fare <callum@codeplay.com>
    Co-authored-by: Jaime Arteaga <jaime.a.arteaga.molina@intel.com>
    3 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    4f7afd2 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    2f97119 View commit details
    Browse the repository at this point in the history
  10. [SYCL][CUDA] Fix broken synchronization of CUstreams (#10031)

    This fixes a change in `hasBeenSynchronized` accidentally introduced
    during the UR port
    callumfare authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    6f29e4a View commit details
    Browse the repository at this point in the history
  11. Partially reverts back the update on guess local work size function. …

    …(#10055)
    
    * Reverts back the update to calculation of threads per block for 0th
    dimension when primary ranges are involved. That could cause out of
    range access.
    mmoadeli authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    23999fc View commit details
    Browse the repository at this point in the history
  12. [SYCL][UR] Bump UR version and add missing values to pi2ur (#10049)

    Includes a large number of missing `pi_result` mappings, so previously
    reported errors should no longer map to just `PI_ERROR_UNKNOWN`. NFCI
    for the adapters.
    callumfare authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    16d4f24 View commit details
    Browse the repository at this point in the history
  13. [SYCL][CUDA] Remove size checks from USM allocations (#10034)

    These checks are causing issues for very large USM allocations because
    the `MAX_MEM_ALLOC_SIZE` reported is lower than what CUDA actually
    supports.
    
    We will follow up with an update on the reported `MAX_MEM_ALLOC_SIZE`,
    but it makes sense to remove the checks either way, as the CUDA
    allocation functions will return an error if they can't allocate the
    memory.
    npmiller authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    9d1ea51 View commit details
    Browse the repository at this point in the history
  14. [SYCL][Graph] L0 Backend support for SYCL Graphs (2/4) (#9992)

    # Level Zero Backend Support for SYCL Graphs
    This is the second patch of a series that adds support for an
    [experimental command graph
    extension](intel/llvm#5626)
    
    A snapshot of the complete work can be seen in draft PR #9375 which has
    support all the specification defined ways of
    adding nodes and edges to the graph, including both Explicit and Record
    & Replay graph construction. The two types of nodes currently
    implemented are kernel execution and memcpy commands.
    
    See https://github.com/reble/llvm#implementation-status for the status
    of our total work.
    
    ## Scope
    This second patch focuses on the required PI/UR support for the
    experimental command-buffer feature in the Level Zero adapter:
    * PI stubs for all adapters to enable compilation, no functionality.
    * Command-buffer implementation for the Level Zero UR adapter.
    * Stubs for the CUDA UR adapter to enable compilation, no functionality.
    
    ## Following Split PRs
    Future follow-up PRs with the remainder of our work on the extension
    will include:
    * Hooking up backend to graphs runtime, bugfixes and other feature
    additions, will add symbols but not break the ABI. (3/4)
    * Add end-to-end tests for SYCL Graph extension. (4/4)
    * NFC changes - Design doc and codeowner update.
    
    ## Authors
    Co-authored-by: Pablo Reble <pablo.reble@intel.com>
    Co-authored-by: Julian Miller <julian.miller@intel.com>
    Co-authored-by: Ben Tracy <ben.tracy@codeplay.com>
    Co-authored-by: Ewan Crawford <ewan@codeplay.com>
    Co-authored-by: Maxime France-Pillois
    <maxime.francepillois@codeplay.com>
    
    ---------
    
    Co-authored-by: Ewan Crawford <ewan@codeplay.com>
    Co-authored-by: Maxime France-Pillois <maxime.francepillois@codeplay.com>
    3 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    caed4b7 View commit details
    Browse the repository at this point in the history
  15. [SYCL][UR][CUDA] Fix typo in ur_device_handle_t (#10130)

    The destructor should be calling `cuDevicePrimaryCtxRelease`
    hdelan authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    5b95881 View commit details
    Browse the repository at this point in the history
  16. [SYCL][HIP] Ignore unused PI command-buffer parameters (#10192)

    Fixes error found in [post-commit
    CI](https://github.com/intel/llvm/actions/runs/5454766342/jobs/9925392005)
    after the merge of intel/llvm#9992
    
    ```
     /__w/llvm/llvm/src/sycl/plugins/hip/pi_hip.cpp:5635:24: error: unused parameter 'sync_point' [-Werror,-Wunused-parameter]
     5635 |     pi_ext_sync_point *sync_point) {
          |                        ^
    /__w/llvm/llvm/src/sycl/plugins/hip/pi_hip.cpp:5691:12: error: unused parameter 'dst_row_pitch' [-Werror,-Wunused-parameter]
     5691 |     size_t dst_row_pitch, size_t dst_slice_pitch,
          |            ^
    /__w/llvm/llvm/src/sycl/plugins/hip/pi_hip.cpp:5691:34: error: unused parameter 'dst_slice_pitch' [-Werror,-Wunused-parameter]
     5691 |     size_t dst_row_pitch, size_t dst_slice_pitch,
          |                                  ^
    3 errors generated.
    ```
    EwanC authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    69b8a12 View commit details
    Browse the repository at this point in the history
  17. [SYCL][CUDA] Fix cuDeviceGetUuid for older CUDA version (#10205)

    This patch re-introduces the fix from
    intel/llvm#8765
    
    Which seems to have been accidentally dropped by the UR port.
    npmiller authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    bb24713 View commit details
    Browse the repository at this point in the history
  18. [SYCL][CUDA] Cached info from CUDA API calls to the kernel and device…

    … classes (#10104)
    
    This PR fixes an issue with overhead when calling the same kernel
    multiple times in a loop.
    
    Right now, some calls to CUDA API happen every time the kernel is
    invoked to query the same information. Those calls were moved to
    `device` and `kernel` constructors and the info was cached as private
    members of those classes.
    konradkusiak97 authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    4ec1ec9 View commit details
    Browse the repository at this point in the history
  19. [SYCL][UR][CUDA] Remove unnecessary UR entry point validation (#9969)

    These checks are already performed by the UR validation layer
    callumfare authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    0cdc873 View commit details
    Browse the repository at this point in the history
  20. [SYCL][CUDA] Implement sycl_ext_oneapi_peer_access extension (#8303)

    This implements the current extension doc from
    intel/llvm#6104 in the CUDA backend only.
    
    Fixes intel/llvm#7543.
    Fixes intel/llvm#6749.
    
    ---------
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    Co-authored-by: Nicolas Miller <nicolas.miller@codeplay.com>
    Co-authored-by: JackAKirk <chezjakirk@gmail.com>
    Co-authored-by: Steffen Larsen <steffen.larsen@intel.com>
    4 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    55f3e87 View commit details
    Browse the repository at this point in the history
  21. [PI][UR] Report error from cuInit (#10353)

    Currently if `cuInit` fails while getting platforms it returns
    UR_RESULT_SUCCESS. Instead we use `UR_CHECK_ERROR` to return the correct
    error code.
    veselypeta authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    1637319 View commit details
    Browse the repository at this point in the history
  22. [SYCL][CUDA] Remove SYCL dependencies from CUDA UR adapter (#10135)

    This was preventing out-of-tree build of the adapter for standalone use
    with unified runtime.
    
    Signed-off-by: Piotr Balcer <piotr.balcer@intel.com>
    pbalcer authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    3696b20 View commit details
    Browse the repository at this point in the history
  23. [SYCL][Bindless][2/4] Add experimental implementation of SYCL bindles…

    …s images extension (#10112)
    
    # Experimental Implementation of SYCL Bindless Images Extension
    
    This commit stands as the second commit of four to make code review
    easier, implementing revision 4 of the [bindless images extension
    proposal](intel/llvm#9842).
    
    ## Scope
    
    This PR covers changes made to the PI and the UR. This includes
    
    - Extending PI with extension functions
    - Updating UR FetchContent commit and implementing [UR bindless images
    experimental
    features](https://oneapi-src.github.io/unified-runtime/core/EXP-BINDLESS-IMAGES.html)
    on the CUDA adaptor
    
    ## Following Split PRs
    
    - [3/4] Implement the user-facing SYCL extension
    - [4/4] Add tests
    
    ## Authors
    
    Co-authored-by: Isaac Ault <isaac.ault@codeplay.com>
    Co-authored-by: Hugh Bird <hugh.bird@codeplay.com>
    Co-authored-by: Duncan Brawley <duncan.brawley@codeplay.com>
    Co-authored-by: Przemek Malon <przemek.malon@codeplay.com>
    Co-authored-by: Chedy Najjar <chedy.najjar@codeplay.com>
    Co-authored-by: Sean Stirling <sean.stirling@codeplay.com>
    Co-authored-by: Peter Zuzek <peter@codeplay.com>
    7 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    a9ff0ed View commit details
    Browse the repository at this point in the history
  24. Configuration menu
    Copy the full SHA
    0af7d63 View commit details
    Browse the repository at this point in the history
  25. [SYCL][CUDA] Fix device creation from native handle (#10450)

    In CUDA objects are represented as integers rather than opaque handles.
    This patch fixes a segmentation fault when creating a device handle from
    a native handle by avoiding dereferencing a pointer which should be
    treated as an integer.
    kbenzie authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    39d46a2 View commit details
    Browse the repository at this point in the history
  26. [SYCL][Bindless] Fix compiler Werrors (#10464)

    - Fixed compiler errors/warnings related to unused and uninitialized
    variables and parameters.
    
    Post-commit fix for PR: intel/llvm#10112
    przemektmalon authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    b99c5d2 View commit details
    Browse the repository at this point in the history
  27. [SYCL][UR] Fix device partition queries (#10355)

    After the recent device partition changes in the UR spec (i.e.
    ur_device_partition_property_t), level_zero, cuda and hip adapters are
    returning incorrect values and types for
    UR_DEVICE_INFO_SUPPORTED_PARTITIONS and UR_DEVICE_INFO_PARTITION_TYPE.
    
    This PR fixes this issues with the adapters and updates pi2ur to
    correctly convert between ur_device_partition_properties_t and
    pi_device_partition_property.
    fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    67b7b14 View commit details
    Browse the repository at this point in the history
  28. [SYCL] [L0] Add user APIs to import / release host memory from USM. (…

    …#9294)
    
    This change adds a SYCL interface to the Level Zero APIs
    zexDriverImportExternalPointer and zexDriverReleaseImportedPointer.
    These functions are used for importing host memory into USM for the
    duration of data transfer to increase bandwidth.
    rdeodhar authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    3505cb0 View commit details
    Browse the repository at this point in the history
  29. [CUDA][UR] Cache the max local mem size (#10577)

    Cache the max local mem size so that we can call less CUDA driver entry
    points at `urEnqueueKernelLaunch`. Also allows us to query the value set
    for `SYCL_PI_CUDA_MAX_LOCAL_MEM_SIZE` using
    `device.get_info<sycl::info::device::local_mem_size>()`
    hdelan authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    e2bae7b View commit details
    Browse the repository at this point in the history
  30. [SYCL][PI] Restore CUDA and HIP plugin ABI (#10518)

    During the port to UR the CUDA and HIP PI plugin ABI's were
    unintentionally changed. There does not appear to be symbol checks for
    these plugins, unlike the [Level Zero symbol
    check](https://github.com/intel/llvm/blob/sycl/sycl/test/abi/pi_level_zero_symbol_check.dump)
    and [OpenCL symbol
    check](https://github.com/intel/llvm/blob/sycl/sycl/test/abi/pi_opencl_symbol_check.dump).
    As such, the ABI change went unnoticed until
    intel/llvm#10490 was opened using the same
    approach for the OpenCL port, which
    [failed](https://github.com/intel/llvm/actions/runs/5610646255/job/15200624025?pr=10490)
    the OpenCL symbol check.
    
    This PR restores the expected ABI for the CUDA and HIP plugins and
    introduces new CUDA and HIP symbol check tests.
    kbenzie authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    056f1f3 View commit details
    Browse the repository at this point in the history
  31. [SYCL][UR] Bump UR and implement adapter handles (#10349)

    Bump the Unified Runtime commit, and make adapter changes needed for the
    newly added adapter handles (see
    oneapi-src#715 for details)
    
    This fixes #10066 by providing an implementation of
    `piPluginGetLastError` in pi2ur.
    callumfare authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    6530d39 View commit details
    Browse the repository at this point in the history
  32. [SYCL][CUDA] Make plugin specific error return an error (#10626)

    The `UR_RESULT_ADAPTER_SPECIFIC_ERROR` was not returning an error to the
    SYCL RT which meant all errors were treated as warnings and ignored
    unless `SYCL_RT_WARNING_LEVEL` is set to geq 2. This changes things so
    the adapter specific error is now reported as such, meaning all uses
    `UR_RESULT_ADAPTER_SPECIFIC_ERROR` meant as warnings are now caught as
    errors.
    
    ---------
    
    Co-authored-by: Hugh Delaney <hughd94@gmail.com>
    2 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    d133b55 View commit details
    Browse the repository at this point in the history
  33. [SYCL][CUDA] Pass device from context in create queue. (#10491)

    Recently in the switch to UR `urQueueCreateFromNativeHandle` changed the
    previous behaviour whereby a queue was created with a device taken as
    the default device from the context. It changed it so that the queue was
    created with the device argument instead. Since the sycl runtime always
    passes a nullptr for the device when programmers call
    `make_queue(nativeStream, context)`, this broke `make_queue`. This patch
    reverts to the previous behaviour before the switch from pi cuda to ur
    cuda.
    
    Note that this should also fix `make_queue` for l0 which I also guess
    was broken due to the asserts meaning that this line was never reached:
    https://github.com/intel/llvm/blob/sycl/sycl/plugins/unified_runtime/ur/adapters/level_zero/queue.cpp#L574.
    But I have not tested this.
    
    ---------
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    934be28 View commit details
    Browse the repository at this point in the history
  34. [SYCL][CUDA] ifdef cuda 11.5 datatypes (#10691)

    These new datatypes were introduced in CUDA 11.5
    
    Co-authored-by: Hugh Delaney <hughd94@gmail.com>
    2 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    2afff55 View commit details
    Browse the repository at this point in the history
  35. [SYCL][CUDA] fix compilation of CUDA plugin on Windows with MSVC (#10…

    …809)
    
    Compilation failed on Windows with Visual Studio 2019/2022 when enabling
    the CUDA plugin. This commit fixes compilation by
    * adding missing headers
    * adding a static_cast of ContextInfoType to fix the following
    compilation error:
    sycl\plugins\unified_runtime\ur\adapters\cuda\context.cpp(49): error
    C2397: conversion from 'ur_context_info_t' to 'uint32_t' requires a
    narrowing conversion
    freibold authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    0e5aa09 View commit details
    Browse the repository at this point in the history
  36. [SYCL][UR][CUDA] Fix a few issues in the cuda adapter revealed by UR …

    …enqueue CTS. (#10244)
    aarongreig authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    8ca215d View commit details
    Browse the repository at this point in the history
  37. [CUDA][HIP][UR] Fix race condition in adapters (#10826)

    event->record was being called before event->start, which resulted in
    event->record failing silently. This fixes that.
    hdelan authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    3b1586e View commit details
    Browse the repository at this point in the history
  38. Configuration menu
    Copy the full SHA
    1bebd84 View commit details
    Browse the repository at this point in the history
  39. [SYCL][CUDA] Add std::ignore for USM alloc args that are only used in…

    … an assert. (#10899)
    
    This should fix the post-commit build fails.
    aarongreig authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    e8cfc2c View commit details
    Browse the repository at this point in the history
  40. [UR][CUDA][HIP] Fix Set Arg Local (#10710)

    In the CUDA/HIP adapters `urKernelSetArgValue` was being used to
    implement both `urKernelSetArgValue` & `urKernelSetArgLocal`. However,
    if the validation layer is enabled in UR then the path to set local arg
    is never taken since it includes a check that `pArgValue` is not null.
    
    This PR:
     * Implements `urKernelSetArgLocal` for CUDA/HIP adapters
    * Changes `pi2ur` to call `urKernelSetArgLocal` when `arg_value` is
    `nullptr`
    * Implements `urKernelSetArgLocal` for L0 adapter - this just calls back
    into `urKernelSetArgValue`.
    veselypeta authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    cb57a66 View commit details
    Browse the repository at this point in the history
  41. [UR][CUDA][HIP][L0] Cleanup licence header (#10824)

    Fix the license headers at the top of each source file in the unified
    runtime directory.
    
    ---------
    
    Co-authored-by: Alexey Bader <alexey.bader@intel.com>
    2 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    710e8b5 View commit details
    Browse the repository at this point in the history
  42. [SYCL][HIP] Implemented supported make_* interop functions. (#10526)

    This PR adds missing functions in the hip backend to allow for
    interoperability in programs that create sycl objects from native hip
    objects. The new function implementations are:
    
    - `make_device`
    - `make_queue`
    - `make_event`
    
    Note that it would really make sense for
    intel/llvm#10491 to be merged first because this
    PR makes the same code change in pi2ur, for a fix that is attributed to
    #10491.
    
    ---------
    
    Signed-off-by: Jack Kirk <jack.kirk@codeplay.com>
    JackAKirk authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    10e2880 View commit details
    Browse the repository at this point in the history
  43. [UR][CUDA] Ifdef remaining CUDA 11 datatypes (#10922)

    intel/llvm#10691 missed these extra cases. This
    PR adds the needed ifdefs
    hdelan authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    5eafdf5 View commit details
    Browse the repository at this point in the history
  44. [SYCL][CUDA] Reintroduce removing of size checks from USM allocations…

    … (#11023)
    
    Resolve same issue fixed in
    [PR](intel/llvm#10034)
    
    Co-authored-by: Omar Ahmed <omar.ahmed@codeplay.com>
    2 people authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    7ba3c58 View commit details
    Browse the repository at this point in the history
  45. [UR][CUDA][HIP] Replace ur::assertion with UR_CHECK_ERROR (#10986)

    Lots of hip/cu driver API calls were wrapped in `ur::assertion(res ==
    CU_SUCCESS)` etc which:
    
    - Means that any native error messages returned from the affected driver
    api calls were lost.
    - Since these APIs report errors asynchronously, such that they are
    thrown from the last API call rather than the call which led to the
    error, previous asynchronous error messages from different APIs to the
    ones wrapped by the `ur::assertion` could also be lost depending on user
    code.
    
    These problems are fixed by swapping these assertions with
    `UR_CHECK_ERROR`.
    
    Note that in the future UR may want to adjust `UR_CHECK_ERROR` so that
    it throws `UR_RESULT_ERROR_ADAPTER_SPECIFIC` etc instead of using
    `std::cerr` etc to report the error etc. But I think it makes sense to
    still use `UR_CHECK_ERROR` to wrap driver API calls because it means
    that the __LINE__, __FUNCTION__ etc info can be correctly passed to
    native error reporting.
    
    ---------
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    02383b0 View commit details
    Browse the repository at this point in the history
  46. [SYCL][CUDA][Bindless] Add support for normalized channel types (#11120)

    - Support was added for the following image channel types:
      - `unorm_int8`
      - `unorm_int16`
      - `snorm_int8`
      - `snorm_int16`
    
    - Reading these types through `read_image` returns the denormalized
    floating point data.
    
    - A test was added for these new types.
    
    - Support for the following packed normalized image channel types was
    removed from the proposal:
      - `unorm_short_565`
      - `unorm_short_555`
      - `unorm_int_101010`
    
    - This was done due to lack of device support. If the need for these
    types arises in the future, we can revisit support for these types.
    przemektmalon authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    2d5a79b View commit details
    Browse the repository at this point in the history
  47. Configuration menu
    Copy the full SHA
    ad82940 View commit details
    Browse the repository at this point in the history
  48. [PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes. (#11155)

    Fetch the latest revision of unified runtime:
    
    Notable changes
    * Some command buffer entry-points have been renamed, also 2 additions
    of membufferFill and USMFill
    * UrInit/urTearDown have been removed - replaced with loader only
    versions.
    veselypeta authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    c85400d View commit details
    Browse the repository at this point in the history
  49. [SYCL] Add esimd as an optional device feature (#10017)

    This change adds a new aspect for esimd, `ext_intel_esimd`, and
    annotates the two fundamental esimd classes, `simd_obj_impl` and
    `simd_view_impl` with the `uses_aspect` attribute.
    
    `simd_obj_impl` is the base class of `simd` and `simd_mask` which are
    the fundamental user-facing classes.
    `simd_view_impl` is the base of only `simd_view`. `simd_obj_impl` is not
    a base of `simd_view_impl`, but every `simd_view_impl` requires a
    reference to a `simd` or `simd_mask` at construction time, so I am not
    sure if we truly need to annotate `simd_view_impl`, but I added it to be
    safe.
    
    It also adds a new PI device info query,
    `PI_EXT_INTEL_DEVICE_INFO_ESIMD_SUPPORT` that is used to query at
    runtime if a device supports ESIMD. For UR-based plugins, we map that to
    `UR_DEVICE_INFO_ESIMD_SUPPOR`.
    
    The implementation simply returns false for cuda, hip and native_cpu.
    For l0 and opencl, we check that the device is an intel gpu by querying
    the device type is gpu and the vendor id is `0x8086`.
    For ESIMD emulator we simply return true.
    
    I would appreciate careful review on the plugin changes in particular,
    as I am not an expert.
    
    This change also updates the esimd spec to document the new aspect.
    
    In a future change, I plan to use the new aspect to remove the
    requirement for the `SYCL_ESIMD_FUNCTION` and `SYCL_ESIMD_KERNEL` macros
    that set function attributes, but I am not doing that as part of this PR
    as it requires more investigation.
    
    ---------
    
    Signed-off-by: Sarnie, Nick <nick.sarnie@intel.com>
    sarnex authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    d8e85fc View commit details
    Browse the repository at this point in the history
  50. Revert "[PI][UR][HIP][CUDA][L0][NCPU] Pull in latest UR fixes." (#11227)

    Reverts intel/llvm#11155. I'm almost sure this is what broke our CI for
    the past few days.
    aelovikov-intel authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    905d2f2 View commit details
    Browse the repository at this point in the history
  51. [SYCL][CUDA] Multiple fixes for the CUDA adapter (#11121)

    - Return correct error code in urContextCreateWithNativeHandle
    - Add UR_DEVICE_INFO_VIRTUAL_MEMORY_SUPPORT to urDeviceGetInfo
    - Move asserts from urEnqueueMemBufferFill to UR validation layers
    (commit 2c533e6 on UR repository)
    - Make behaviour of urEventSetCallback consistent with other unsupported
    entrypoints.
    - urProgramGetInfo and urQueueGetInfo now return
    UR_RESULT_ERROR_UNSUPPORTED_ENUMERATION for unsupported cases.
    - Add entrypoints for urSamplerGetNativeHandle and
    urSamplerCreateWithNativeHandle
    - Fix UR_USM_ALLOC_INFO_BASE_PTR and UR_USM_ALLOC_INFO_SIZE
    implementation.
    fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    00761dc View commit details
    Browse the repository at this point in the history
  52. [SYCL][NFC] Make UR_CHECK_ERROR a void return macro (#11100)

    `UR_CHECK_ERROR` was designed to return `ur_result_t`, however in
    practice it was guaranteed to only ever return `UR_RESULT_SUCCESS`, as
    other paths would either terminate, abort or throw.
    
    This in turns leads to poor quality/error prone code, as the codebase
    was littered with:
    * statements not checking the return value - depending on the compiler
    generating a warning,
    * extra check on the return which was only ever going to be true.
    
    Some care was required, as the codebase has a habit of accumulating err
    codes across branches, so depending on the use case the initial value of
    `ur_result_t Result`s had to be set accordingly (now that
    `UR_CHECK_ERROR` does not return).
    jchlanda authored and fabiomestre committed Sep 27, 2023
    Configuration menu
    Copy the full SHA
    bed33f7 View commit details
    Browse the repository at this point in the history
  53. Configuration menu
    Copy the full SHA
    46912f6 View commit details
    Browse the repository at this point in the history
  54. Configuration menu
    Copy the full SHA
    9461c79 View commit details
    Browse the repository at this point in the history

Commits on Sep 28, 2023

  1. Configuration menu
    Copy the full SHA
    c55e3d1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a0eb5a6 View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#897 from callumfare/ci/cancel_old_workf…

    …lows
    
    CI: Cancel stale workflows when new commits are added to a PR
    callumfare authored Sep 28, 2023
    Configuration menu
    Copy the full SHA
    1941850 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    d1f7ebc View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    899f831 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    2cc31e5 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    20297af View commit details
    Browse the repository at this point in the history
  8. [UR] Fix include path

    veselypeta committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    fbd14e1 View commit details
    Browse the repository at this point in the history
  9. Merge pull request oneapi-src#884 from omarahmed1111/Add-test-workflo…

    …w-for-codegen-example
    
    Add test workflow for codegen example
    omarahmed1111 authored Sep 28, 2023
    Configuration menu
    Copy the full SHA
    e72d570 View commit details
    Browse the repository at this point in the history
  10. [CUDA] Update license headers and fix compilation issues (oneapi-src#900

    )
    
    - Update license headers for the CUDA adapter
    - Add virtual memory entrypoints to ur_interface_loader.cpp
    - Add missing source files to the CUDA adapter cmake
    - Fix compilation error in image.cpp (uninitialized variable)
    fabiomestre authored Sep 28, 2023
    Configuration menu
    Copy the full SHA
    00c7edb View commit details
    Browse the repository at this point in the history
  11. Configuration menu
    Copy the full SHA
    48a03ac View commit details
    Browse the repository at this point in the history
  12. [UR][L0] Propagate errors from USMAllocationMakeResident

    This change ensures that USM allocation APIs don't return
    `UR_RESULT_SUCCESS` when an error occurs within
    `USMAllocationMakeResident`.
    
    Signed-off-by: Michael Aziz <michael.aziz@intel.com>
    0x12CC committed Sep 28, 2023
    Configuration menu
    Copy the full SHA
    e5d6a91 View commit details
    Browse the repository at this point in the history

Commits on Sep 29, 2023

  1. [UR][Tests] fix match.py on-error printing

    Input and match files were printed only on one error.
    Now, it will be printed for both places (where this makes sense).
    
    This is an extension to commit: be85a49
    lukaszstolarczuk committed Sep 29, 2023
    Configuration menu
    Copy the full SHA
    fedcee6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d6dc4c0 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    19597b2 View commit details
    Browse the repository at this point in the history

Commits on Oct 2, 2023

  1. Merge pull request oneapi-src#904 from lukaszstolarczuk/fix-match.py

    [UR][Tests] Update match scripts
    pbalcer authored Oct 2, 2023
    Configuration menu
    Copy the full SHA
    187633a View commit details
    Browse the repository at this point in the history
  2. [SYCL][CUDA] Implement ext_oneapi_queue_priority (#11296)

    The E2E test assumes that the device supports priorities (sm_35 or
    newer).
    al42and authored and fabiomestre committed Oct 2, 2023
    Configuration menu
    Copy the full SHA
    6a0eb7e View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#903 from veselypeta/petr/move_hip

    [UR][HIP] Move HIP adapter sourcer to Unified Runtime
    veselypeta authored Oct 2, 2023
    Configuration menu
    Copy the full SHA
    1c6fa3c View commit details
    Browse the repository at this point in the history
  4. [UR][L0][CMake] Disable warning for ICX on Windows

    On Windows we may build L0 using ICX , this is to revert changes to
    check WIN32 instead of MSVC, to fix build failures with ICX on Windows.
    jsji committed Oct 2, 2023
    Configuration menu
    Copy the full SHA
    9265d33 View commit details
    Browse the repository at this point in the history
  5. Merge pull request oneapi-src#917 from jsji/adapters

    [UR][L0][CMake] Disable warning for ICX on Windows
    igchor authored Oct 2, 2023
    Configuration menu
    Copy the full SHA
    f1c17dc View commit details
    Browse the repository at this point in the history

Commits on Oct 3, 2023

  1. Configuration menu
    Copy the full SHA
    ec915f8 View commit details
    Browse the repository at this point in the history

Commits on Oct 4, 2023

  1. Merge pull request oneapi-src#923 from pbalcer/l0-win-fix-build

    [L0] fix windows cmake scripts
    pbalcer authored Oct 4, 2023
    Configuration menu
    Copy the full SHA
    e6343f4 View commit details
    Browse the repository at this point in the history
  2. [UR][L0] Copy prebuilt L0 to avoid leaking shared folder path

    We allow reusing existing pre-built L0 Libs and Headers.
    However, using LEVEL_ZERO_LIBRARY and LEVEL_ZERO_INCLUDE_DIR directly
    may leak the path of shared folder.
    
    This PR introduces two L0_LIBRARY and L0_INCLUDE_DIR ,
    and copy L0 loader/headers locally to the build to avoid leaking their
    location on a shared drive.
    jsji committed Oct 4, 2023
    Configuration menu
    Copy the full SHA
    6a2c548 View commit details
    Browse the repository at this point in the history

Commits on Oct 5, 2023

  1. Merge pull request oneapi-src#920 from jsji/localcopy

    [UR][L0] Copy prebuilt L0 to avoid leaking shared folder path
    pbalcer authored Oct 5, 2023
    Configuration menu
    Copy the full SHA
    b38855e View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2023

  1. Add CODEOWNERS file

    Adapter implementations are being moved into the repo we should mirror
    the code ownership of those implementations. This patch adds a
    CODEOWNERS file which matches the pre adapter move ownership to be used
    as a baseline for future changes for each individual adapter.
    kbenzie committed Oct 9, 2023
    Configuration menu
    Copy the full SHA
    49770f5 View commit details
    Browse the repository at this point in the history
  2. [CUDA] Fix queue creation with native handle

    The new priority parameter hadn't been reflected here so the ownership
    property was being used as priority and default to `true` which caused
    crashes.
    npmiller committed Oct 9, 2023
    Configuration menu
    Copy the full SHA
    b4425bc View commit details
    Browse the repository at this point in the history

Commits on Oct 10, 2023

  1. [HIP] Enable kernel finalization using comgr

    For kernel fusion support for hip, we need to finalize the kernels using comgr.
    The patch finalizes tagged binaries during buildProgram
    before handing it over to the hip runtime.
    
    Signed-off-by: Victor Lomuller <victor@codeplay.com>
    Naghasan committed Oct 10, 2023
    Configuration menu
    Copy the full SHA
    2fd9dea View commit details
    Browse the repository at this point in the history
  2. Re add extended deleters

    hdelan committed Oct 10, 2023
    Configuration menu
    Copy the full SHA
    55dccfc View commit details
    Browse the repository at this point in the history
  3. [UR][L0] Fix minor code duplication

    - Second return has no effect
    - Checking the same flag twice has no effect
    al42and committed Oct 10, 2023
    Configuration menu
    Copy the full SHA
    6180a78 View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#909 from kbenzie/benie/codeowners

    Add CODEOWNERS file
    kbenzie authored Oct 10, 2023
    Configuration menu
    Copy the full SHA
    0677296 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    484cd07 View commit details
    Browse the repository at this point in the history
  6. Merge pull request oneapi-src#906 from 0x12CC/l0_usm_error_checking

    [UR][L0] Propagate errors from `USMAllocationMakeResident`
    kbenzie authored Oct 10, 2023
    Configuration menu
    Copy the full SHA
    4954850 View commit details
    Browse the repository at this point in the history
  7. Merge pull request oneapi-src#915 from al42and/fix-minor-typos-in-lev…

    …elzero-adapter
    
    [UR][L0] Fix minor code duplication
    pbalcer authored Oct 10, 2023
    Configuration menu
    Copy the full SHA
    0d3fd4b View commit details
    Browse the repository at this point in the history

Commits on Oct 11, 2023

  1. Configuration menu
    Copy the full SHA
    a6f382a View commit details
    Browse the repository at this point in the history
  2. Use CUDA team for HIP adapter too

    The HIP team needs some fixing up currently so use the CUDA team for
    now.
    npmiller committed Oct 11, 2023
    Configuration menu
    Copy the full SHA
    463e7ce View commit details
    Browse the repository at this point in the history

Commits on Oct 12, 2023

  1. Change xfail list

    hdelan committed Oct 12, 2023
    Configuration menu
    Copy the full SHA
    b002e00 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#946 from npmiller/adapters-owners

    Add code owners for the CUDA and HIP adapters
    kbenzie authored Oct 12, 2023
    Configuration menu
    Copy the full SHA
    775af6e View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    3f831b2 View commit details
    Browse the repository at this point in the history

Commits on Oct 13, 2023

  1. Merge pull request oneapi-src#955 from kbenzie/benie/codeowners-level…

    …-zero
    
    [UR][L0] Add adapter source to codeowners file
    kbenzie authored Oct 13, 2023
    Configuration menu
    Copy the full SHA
    a5d102a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9af962b View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#958 from kbenzie/benie/codeowners-hip

    Update HIP adapter codeowners to use specific team
    kbenzie authored Oct 13, 2023
    Configuration menu
    Copy the full SHA
    ab45620 View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2023

  1. Configuration menu
    Copy the full SHA
    496d694 View commit details
    Browse the repository at this point in the history
  2. [Command-buffer][L0] Reset sync-point events

    The L0 events used to implement UR sync-points in
    a command-buffer aren't reset after the first execution.
    Leaving them in a signalled state before subsequent submissions
    of the UR command-buffer.
    
    This patch resets the L0 events used as UR command-buffer sync-points
    to the non-signalled state, by appending event reset commands to the
    end of the user defined command-list. There is also a reset added for
    the UR command-buffer wait event that is signalled when the wait list
    dependencies of `urCommandBufferEnqueueExp` are met.
    
    Additionally this patch removes the host scope flag from being set on L0 events
    created by the UR command-buffer, as we never signal/reset them from host, only
    device.
    EwanC committed Oct 16, 2023
    Configuration menu
    Copy the full SHA
    e8e1415 View commit details
    Browse the repository at this point in the history

Commits on Oct 17, 2023

  1. [Command-buffer][L0] Condense signalling

    Rather than having a command-list wide barrier and separate
    signal command, have a single barrier command which signals
    the UR command-buffer signal event and waits on the
    events that correspond to sync-points.
    
    Co-authored-by: Pablo Reble <pablo.reble@intel.com>
    EwanC and reble committed Oct 17, 2023
    Configuration menu
    Copy the full SHA
    6523932 View commit details
    Browse the repository at this point in the history
  2. [SYCL][OpenCL] Port OpenCL plugin to Unified Runtime (#10490)

    Co-authored-by: Martin Morrison-Grant <martin.morrisongrant@codeplay.com>
    Co-authored-by: Petr Vesely <petr.vesely@codeplay.com>
    Co-authored-by: Callum Fare <callum@codeplay.com>
    Co-authored-by: aarongreig <aarongreig01@gmail.com>
    5 people committed Oct 17, 2023
    Configuration menu
    Copy the full SHA
    7c755c4 View commit details
    Browse the repository at this point in the history
  3. [OpenCL] Follow up changes after moving the adapter

    - Update Cmake to use local adapter source files
    - Update license headers
    - Add virtual memory entrypoints to interface_loader
    - Add .clang-format
    - Update CODEOWNERS file for OpenCL adapter
    fabiomestre committed Oct 17, 2023
    Configuration menu
    Copy the full SHA
    1f149fb View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#943 from hdelan/re-add-extended-deleters

    [HIP] Re add extended deleters
    kbenzie authored Oct 17, 2023
    Configuration menu
    Copy the full SHA
    f0de2f4 View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2023

  1. [OpenCL] Allow upstream to manually set the OpenCL target (oneapi-src…

    …#969)
    
    - Upstream CI is unable to use find_package() for OpenCL. This
    commit allows it to set the target manually and skip the call to
    find_package().
    - This commit also changes UR to use the OpenCL::OpenCL target.
    It provides a simpler alternative to import the opencl library
    without having to use add_library().
    fabiomestre authored Oct 18, 2023
    Configuration menu
    Copy the full SHA
    3653e58 View commit details
    Browse the repository at this point in the history

Commits on Oct 19, 2023

  1. Configuration menu
    Copy the full SHA
    657ffde View commit details
    Browse the repository at this point in the history
  2. Set command-buffer experimental feature codeowners

    Set [unified-runtime-command-buffer-write](https://github.com/orgs/oneapi-src/teams/unified-runtime-command-buffer-write] as the codeowners of the command-buffer experimental feature.
    Including:
    * Adapter implementations
    * Specification document
    * YML definition
    EwanC committed Oct 19, 2023
    Configuration menu
    Copy the full SHA
    7d759ee View commit details
    Browse the repository at this point in the history

Commits on Oct 20, 2023

  1. Merge pull request oneapi-src#972 from oneapi-src/revert-906-l0_usm_e…

    …rror_checking
    
    Revert "[UR][L0] Propagate errors from `USMAllocationMakeResident`"
    kbenzie authored Oct 20, 2023
    Configuration menu
    Copy the full SHA
    a76e3b1 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#973 from Bensuo/ewan/command_buffer_cod…

    …eowners
    
    Set command-buffer experimental feature codeowners
    kbenzie authored Oct 20, 2023
    Configuration menu
    Copy the full SHA
    614e6d0 View commit details
    Browse the repository at this point in the history

Commits on Oct 23, 2023

  1. Configuration menu
    Copy the full SHA
    9a13afd View commit details
    Browse the repository at this point in the history
  2. [CUDA][EXP] CUDA adapter support for command buffers

    - Implement initial subset of command buffer functions for CUDA
    - Report command buffer extension as supported from device query
    - Some additional methods and refactoring to reuse existing adapter code
    Bensuo committed Oct 23, 2023
    Configuration menu
    Copy the full SHA
    74f42f8 View commit details
    Browse the repository at this point in the history
  3. Merge pull request oneapi-src#945 from npmiller/fix-priority

    [CUDA] Fix queue creation with native handle
    kbenzie authored Oct 23, 2023
    Configuration menu
    Copy the full SHA
    3a3aae3 View commit details
    Browse the repository at this point in the history

Commits on Oct 24, 2023

  1. Configuration menu
    Copy the full SHA
    86f96f0 View commit details
    Browse the repository at this point in the history

Commits on Oct 25, 2023

  1. Merge pull request oneapi-src#940 from Naghasan/victor/kernel-fusion-amd

    [UR][HIP] Enable kernel finalization using comgr
    kbenzie authored Oct 25, 2023
    Configuration menu
    Copy the full SHA
    cf26de2 View commit details
    Browse the repository at this point in the history
  2. CODEOWNERS for Bindless Images

    Add `unified-runtime-bindless-images-write` team
    as code owners for anything related to images.
    ProGTX committed Oct 25, 2023
    Configuration menu
    Copy the full SHA
    33e2494 View commit details
    Browse the repository at this point in the history

Commits on Oct 26, 2023

  1. Merge pull request oneapi-src#995 from oneapi-src/peter/bindless-code…

    …owners
    
    CODEOWNERS for Bindless Images
    kbenzie authored Oct 26, 2023
    Configuration menu
    Copy the full SHA
    20d4f49 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#965 from Bensuo/ewan/L0_internal_event_fix

    [Command-buffer][L0] Reset sync-point events
    kbenzie authored Oct 26, 2023
    Configuration menu
    Copy the full SHA
    7c8617b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    fdf2cf8 View commit details
    Browse the repository at this point in the history

Commits on Oct 27, 2023

  1. Merge pull request oneapi-src#908 from szadam/test_output

    Print all CTS outputs (from gtest) in CI.
    pbalcer authored Oct 27, 2023
    Configuration menu
    Copy the full SHA
    2ab0734 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#961 from hdelan/change-unions-to-stdvar…

    …iant
    
    [HIP][CUDA] Change unions in ur_mem_handle_t_ to stdvariant
    kbenzie authored Oct 27, 2023
    Configuration menu
    Copy the full SHA
    036b9cf View commit details
    Browse the repository at this point in the history
  3. Merge commit '9d82232b9c7e57997325608320e2d89c5fb95187' into merge-so…

    …me-main-changes-into-adapters
    omarahmed1111 committed Oct 27, 2023
    Configuration menu
    Copy the full SHA
    2f2aebe View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#1005 from omarahmed1111/merge-some-main…

    …-changes-into-adapters
    
    Bump adapters branch
    omarahmed1111 authored Oct 27, 2023
    Configuration menu
    Copy the full SHA
    2417fcf View commit details
    Browse the repository at this point in the history

Commits on Oct 30, 2023

  1. Configuration menu
    Copy the full SHA
    df79dad View commit details
    Browse the repository at this point in the history
  2. Use std::get<BufferMem>

    This is required after oneapi-src#961
    EwanC committed Oct 30, 2023
    Configuration menu
    Copy the full SHA
    5fb2f28 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    b232631 View commit details
    Browse the repository at this point in the history
  4. Merge pull request oneapi-src#1015 from oneapi-src/revert-1005-merge-…

    …some-main-changes-into-adapters
    
    Revert "Bump adapters branch"
    kbenzie authored Oct 30, 2023
    Configuration menu
    Copy the full SHA
    55d432c View commit details
    Browse the repository at this point in the history

Commits on Oct 31, 2023

  1. Configuration menu
    Copy the full SHA
    5ecaf66 View commit details
    Browse the repository at this point in the history
  2. Merge pull request oneapi-src#932 from Bensuo/cuda-cmd-buffers

    [CUDA][EXP] CUDA adapter support for command buffers
    kbenzie authored Oct 31, 2023
    Configuration menu
    Copy the full SHA
    be53fb3 View commit details
    Browse the repository at this point in the history
  3. [CUDA][HIP] Move adapter specific testing out of CTS

    This patch moves the `test-adapter-cuda` and `test-adapter-hip` test
    suites out of the `test/conformance` directory. The goal of this move is
    to ensure that all adapters will be tested with the same set and number
    of conformance tests, making conformance pass rates directly comparable
    between adapters. Up to now the CUDA and HIP adapters have included
    additional unit tests of internals which skews pass rates.
    kbenzie committed Oct 31, 2023
    Configuration menu
    Copy the full SHA
    f533123 View commit details
    Browse the repository at this point in the history