[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CUDA graph support #1256

Closed

mfrancepillois
Contributor

The CUDA backend does not handle buffer fills larger than 32 bits.
The implemented workaround is to add several nodes that each perform a 32-bit buffer fill op.
In the previous implementation, all of these nodes took all previous node(s) as predecessors.
In this version, the first node takes all previous node(s) as predecessors, and each subsequent node takes the node added just before it as its predecessor. This ensures that the whole fill op is complete when the last node in the sequence completes.
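
The chaining described above can be sketched with a toy model. This is not the adapter's code: `ToyGraph`, `addNode`, and `addChunkedFill` are hypothetical names used only for illustration, and the graph is a plain list of predecessor indices rather than a real CUDA graph.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a CUDA graph: preds[i] lists node i's predecessors.
struct ToyGraph {
  std::vector<std::vector<std::size_t>> preds;

  // Adds a node that depends on `deps` and returns its index.
  std::size_t addNode(const std::vector<std::size_t> &deps) {
    preds.push_back(deps);
    return preds.size() - 1;
  }
};

// Models one large fill split into `numChunks` 32-bit fill nodes.
// The first node depends on all of `deps`; every later node depends only on
// the node added just before it. Returns the index of the last node.
std::size_t addChunkedFill(ToyGraph &graph,
                           const std::vector<std::size_t> &deps,
                           std::size_t numChunks) {
  assert(numChunks > 0);
  std::size_t last = graph.addNode(deps);
  for (std::size_t i = 1; i < numChunks; ++i) {
    last = graph.addNode({last});
  }
  return last;
}
```

Because the nodes form a chain, anything that depends on the returned last node also transitively depends on every earlier chunk and on the original predecessors, so a single sync point covers the whole fill.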

@mfrancepillois mfrancepillois requested a review from a team as a code owner January 16, 2024 16:36
@EwanC EwanC added the ready to merge Added to PR's which are ready to merge label Jan 17, 2024
@mfrancepillois
Contributor Author

Linked DPC++ PR: intel/llvm#12405

@mfrancepillois
Contributor Author

PR #1319 includes this bugfix and other improvements for the buffer fill op in CUDA (Command-Buffer).
Consequently, this PR can be closed.

3 participants