[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CUDA graph support #1256

mfrancepillois · 2024-01-16T16:36:06Z

The CUDA backend does not handle buffer fill larger than 32 bits.
The implemented workaround is to add several node that perform 32 bits buffer fill ops.
In the previous implementation, all these nodes took all previous node(s) as predecessors.
In this version, the first node takes all previous node(s) as predecessors, then the subsequent node takes the newly added node as a predecessor which ensures that the whole fill ops is completed when the last node in this sequence is completed.

…DA graph support The CUDA backend does not handle buffer fill larger than 32 bits. The implemented workaround is to add several node that perform 32 bits buffer fill ops. In the previous implementation, all these nodes took all previous node(s) as predecessors. In this version, the first node takes all previous node(s) as predecessors, then the subsequent node takes the newly added node as a predecessor which ensures that the whole fill ops is completed when the last node in this sequence is completed.

mfrancepillois · 2024-01-17T10:49:35Z

Linked DPC++ PR: intel/llvm#12405

mfrancepillois · 2024-02-05T12:36:46Z

PR #1319 includes this bugfix and other improvements for the buffer fill op in CUDA (Command-Buffer).
Consequently, this PR can be closed.

mfrancepillois requested a review from a team as a code owner January 16, 2024 16:36

EwanC approved these changes Jan 17, 2024

View reviewed changes

EwanC added the ready to merge Added to PR's which are ready to merge label Jan 17, 2024

Bensuo approved these changes Jan 17, 2024

View reviewed changes

mfrancepillois closed this Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CUDA graph support #1256

[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CUDA graph support #1256

mfrancepillois commented Jan 16, 2024

mfrancepillois commented Jan 17, 2024

mfrancepillois commented Feb 5, 2024

[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CUDA graph support #1256

[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CUDA graph support #1256

Conversation

mfrancepillois commented Jan 16, 2024

mfrancepillois commented Jan 17, 2024

mfrancepillois commented Feb 5, 2024