Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[lapack][blas][cuda] Update host task impl to use enqueue_native_command #572

Merged
merged 11 commits into from
Oct 8, 2024

Commits on Sep 18, 2024

  1. Implemented cusolver native_command.

    See SYCL_EXT_ONEAPI_ENQUEUE_NATIVE_COMMAND for details.
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Sep 18, 2024
    Configuration menu
    Copy the full SHA
    e19072c View commit details
    Browse the repository at this point in the history

Commits on Sep 19, 2024

  1. Impl native command for cublas

    See SYCL_EXT_ONEAPI_ENQUEUE_NATIVE_COMMAND extension document for
    details.
    
    Generalize helpers funcs and use them for blas l1, l2, l3, batch
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    44867dc View commit details
    Browse the repository at this point in the history
  2. Update lapack tests to manage queue syncs

    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    a28cd4d View commit details
    Browse the repository at this point in the history
  3. Update name of function

    cublas_native_named_func
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Sep 19, 2024
    Configuration menu
    Copy the full SHA
    b33b9e3 View commit details
    Browse the repository at this point in the history

Commits on Sep 20, 2024

  1. Use cublas_native_named_func more

    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Sep 20, 2024
    Configuration menu
    Copy the full SHA
    8fba319 View commit details
    Browse the repository at this point in the history

Commits on Oct 1, 2024

  1. Remove dep check

    this dep check is overzealous because it enforces that a dependent event cannot be submitted to run on the native device queue but not
    completed before a later event it is dependent upon has also been marked
    running on the device. This is not part of the sycl spec and
    unnecessarily slows down execution.
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Oct 1, 2024
    Configuration menu
    Copy the full SHA
    af4d1fd View commit details
    Browse the repository at this point in the history

Commits on Oct 2, 2024

  1. Improve synchronous impl for scratchpad

    These funcs are async in the cusolver backend.
    
    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    c082d4a View commit details
    Browse the repository at this point in the history
  2. Revert changes to lapack tests

    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Oct 2, 2024
    Configuration menu
    Copy the full SHA
    ba20653 View commit details
    Browse the repository at this point in the history

Commits on Oct 4, 2024

  1. Fix format.

    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Oct 4, 2024
    Configuration menu
    Copy the full SHA
    61c9a53 View commit details
    Browse the repository at this point in the history

Commits on Oct 7, 2024

  1. Revert "Fix format."

    This reverts commit 61c9a53.
    JackAKirk committed Oct 7, 2024
    Configuration menu
    Copy the full SHA
    27b251f View commit details
    Browse the repository at this point in the history
  2. Fix format try 2.

    Signed-off-by: JackAKirk <jack.kirk@codeplay.com>
    JackAKirk committed Oct 7, 2024
    Configuration menu
    Copy the full SHA
    94dcc7e View commit details
    Browse the repository at this point in the history