[CUDA] Fix synchronization issue in urEnqueueMemImageCopy
For 1D images, urEnqueueMemImageCopy was using cuMemcpyAtoA, which does not have an asynchronous version and takes no stream argument. As a result, when the copy happens between two arrays in device memory, the call is asynchronous and might complete after the event returned by urEnqueueMemImageCopy finishes. This commit fixes the issue by using cuMemcpy2DAsync to copy 1D images, with the height set to 1.
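
The "height set to 1" idea can be illustrated with a minimal sketch. This is not the adapter's actual code: `Copy2DDesc` is a hypothetical stand-in that mirrors only a few field names of the driver API's `CUDA_MEMCPY2D` so the example compiles without the CUDA toolkit, `make1DCopyAs2D` is a hypothetical helper, and the assumption that offsets and width arrive in pixels is ours.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-in for a subset of CUDA_MEMCPY2D fields (assumption:
// used only so this sketch builds without <cuda.h>).
struct Copy2DDesc {
  std::size_t srcXInBytes;
  std::size_t srcY;
  std::size_t dstXInBytes;
  std::size_t dstY;
  std::size_t WidthInBytes;
  std::size_t Height;
};

// Hypothetical helper: describe a 1D image copy as a 2D copy with one row.
// Offsets and width are assumed to be given in pixels.
inline Copy2DDesc make1DCopyAs2D(std::size_t srcOffsetPixels,
                                 std::size_t dstOffsetPixels,
                                 std::size_t widthPixels,
                                 std::size_t bytesPerPixel) {
  assert(bytesPerPixel > 0 && "pixel size must be non-zero");
  Copy2DDesc d{};
  d.srcXInBytes = srcOffsetPixels * bytesPerPixel; // byte offset in source row
  d.srcY = 0;                                      // a 1D image has a single row
  d.dstXInBytes = dstOffsetPixels * bytesPerPixel; // byte offset in destination row
  d.dstY = 0;
  d.WidthInBytes = widthPixels * bytesPerPixel;    // bytes copied per row
  d.Height = 1;                                    // one row: the 1D case
  return d;
}

// With the real driver API, the same values would go into a CUDA_MEMCPY2D
// whose source and destination memory types are CU_MEMORYTYPE_ARRAY, and the
// descriptor would be passed to cuMemcpy2DAsync(&desc, stream) so that the
// copy is ordered on the given stream.
```

Because the copy is issued on a stream, an event recorded on that stream afterwards completes only once the copy has finished, which is the ordering the commit message says was missing with cuMemcpyAtoA.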
1 parent ce152a6, commit c72458c
Showing 1 changed file with 15 additions and 19 deletions.