Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sycl-rel_5_2_0: [UR][L0] Fix Native Host memory usage on device with copy back sync (#13014) #13430

Closed
wants to merge 7 commits into from

Commits on Apr 15, 2024

  1. [UR][L0] Support for urUsmP2PPeerAccessGetInfoExp to query p2p access… (

    intel#12983)
    
    … info
    
    pre-commit PR for
    oneapi-src/unified-runtime#1429
    
    ---------
    
    Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
    Co-authored-by: Kenneth Benzie (Benie) <k.benzie@codeplay.com>
    nrspruit and kbenzie committed Apr 15, 2024
    Configuration menu
    Copy the full SHA
    22e9785 View commit details
    Browse the repository at this point in the history
  2. [CUDA][LIBCLC] Implement RC11 seq_cst for PTX6.0 (intel#12516)

    Implement `seq_cst` RC11/ptx6.0 memory consistency for CUDA backend.
    
    See https://dl.acm.org/doi/pdf/10.1145/3297858.3304043 and
    https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#memory-consistency-model
    for full details. Requires sm_70 or above. With this PR there is now a
    complete mapping between SYCL memory consistency model capabilities and
    the official CUDA model, fully exploiting CUDA capabilities when
    possible on supported arches.
    
    This makes the SYCL-CTS atomic_ref tests fully pass for sm_70 on the
    cuda backend.
    
    Fixes intel#11208
    
    Depends on intel#12907
    
    ---------
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk authored and kbenzie committed Apr 15, 2024
    Configuration menu
    Copy the full SHA
    fa53fea View commit details
    Browse the repository at this point in the history
  3. [UR] Add urProgramGetGlobalVariablePointer entrypoint (intel#12496)

    Co-authored-by: Kenneth Benzie (Benie) <k.benzie@codeplay.com>
    fabiomestre and kbenzie committed Apr 15, 2024
    Configuration menu
    Copy the full SHA
    0326cdc View commit details
    Browse the repository at this point in the history
  4. [SYCL][Graph][UR] Update UR to support updating kernel commands in co…

    …mmand buffers for L0 (intel#12897)
    againull authored and kbenzie committed Apr 15, 2024
    Configuration menu
    Copy the full SHA
    a486c12 View commit details
    Browse the repository at this point in the history
  5. [UR] CI for UR PR refactor-guess-local-worksize (intel#12663)

    oneapi-src/unified-runtime#1326
    
    ---------
    
    Co-authored-by: Kenneth Benzie (Benie) <k.benzie@codeplay.com>
    hdelan and kbenzie committed Apr 15, 2024
    Configuration menu
    Copy the full SHA
    0838aba View commit details
    Browse the repository at this point in the history

Commits on Apr 16, 2024

  1. [SYCL][Graph][HIP] Set minimum ROCm version for graphs (intel#13035)

    Tests UR PR oneapi-src/unified-runtime#1447 that
    only reports support for UR command-buffers on ROCm 5.5.1 and later to
    work around HIP driver bugs related to HIP-Graph in earlier version.
    
    This requirement is also explicitly mentioned in the design doc.
    EwanC authored and kbenzie committed Apr 16, 2024
    Configuration menu
    Copy the full SHA
    257ac92 View commit details
    Browse the repository at this point in the history
  2. [UR][L0] Fix Native Host memory usage on device with copy back sync (i…

    …ntel#13014)
    
    pre-commit PR for
    oneapi-src/unified-runtime#1439
    
    ---------
    
    Signed-off-by: Neil R. Spruit <neil.r.spruit@intel.com>
    Co-authored-by: Kenneth Benzie (Benie) <k.benzie@codeplay.com>
    nrspruit and kbenzie committed Apr 16, 2024
    Configuration menu
    Copy the full SHA
    42919a9 View commit details
    Browse the repository at this point in the history