feat: expose function for listening to policy violations on a specific GPU group #73

Merged
merged 1 commit into from
Sep 10, 2024

Commits on Sep 9, 2024

  1. feat: expose function for listening to policy violations on a specific GPU group
    
    == Motivation ==
    
    Enable finer-grained tracking of GPU policy violations.
    
    == Details ==
    
    The go-dcgm library currently exposes a way to listen for policy violations across
    all GPUs. While this is useful, it does not tell users exactly which GPUs are
    experiencing issues. Ideally, users could also listen for policy violations on
    specific groups, which could be created on a per-GPU basis. Users would then
    know when a specific GPU is experiencing issues.
    
    This change exposes a new function, ListenForPolicyViolationsForGroup, which takes a
    GroupHandle passed by the user and listens for policy violations on that group. It
    also modifies ListenForPolicyViolations to call this new function with the group
    that contains all GPUs, so there is no net change in behavior.
    sanjams2 committed Sep 9, 2024
    f7d7ffc