-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[UR] [CUDA] Changed the output of querying localMemSize #931
[UR] [CUDA] Changed the output of querying localMemSize #931
Conversation
Corresponding PR in intel/llvm: intel/llvm#11454 |
LGTM |
b19e090
to
9cbd2ef
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. I guess this should be documented somewhere though.
Yes, I'll adjust the documentation once this is in |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@konradkusiak97 please fix the conflict in source/adapters/cuda/enqueue.cpp
692f2d7
to
66287b1
Compare
ac43166
to
d5a4691
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM will wait for the checks to pass before merging.
With this patch when the user asks for the amount of
localMemSize
, they receive the maximum amount available on the SM. Setting the maximum limit with the env variable still remains an option.