Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[EXP][CMDBUF] Fix dependency handling for large buffer fill ops in CU…
…DA graph support The CUDA backend does not handle buffer fill larger than 32 bits. The implemented workaround is to add several node that perform 32 bits buffer fill ops. In the previous implementation, all these nodes took all previous node(s) as predecessors. In this version, the first node takes all previous node(s) as predecessors, then the subsequent node takes the newly added node as a predecessor which ensures that the whole fill ops is completed when the last node in this sequence is completed.
- Loading branch information