[NATIVECPU] Initial threadpool implementation for Native CPU #1478
@@ -160,15 +163,22 @@ UR_APIEXPORT ur_result_t UR_APICALL urDeviceGetInfo(ur_device_handle_t hDevice,
case UR_DEVICE_INFO_PREFERRED_VECTOR_WIDTH_FLOAT:
case UR_DEVICE_INFO_PREFERRED_VECTOR_WIDTH_DOUBLE:
case UR_DEVICE_INFO_PREFERRED_VECTOR_WIDTH_HALF:
// TODO: How can we query vector width in a platform
// independent way?
As part of future PRs we could have cpu device descriptors with the option to override the individual device properties (below) with environment variables.
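One way the suggestion above could look is sketched below. This is only an illustration, not code from this PR: the `CpuDeviceDescriptor` struct, the `envOverride` helper, and the `EXAMPLE_NATIVECPU_VECTOR_WIDTH_*` variable names are all hypothetical, and the default width of 1 is a placeholder rather than a value taken from the adapter.

```cpp
#include <cctype>
#include <cerrno>
#include <cstdint>
#include <cstdlib>
#include <limits>

// Hypothetical helper: returns the unsigned decimal value of the environment
// variable Name, or Default when it is unset, empty, malformed, or too large.
inline uint32_t envOverride(const char *Name, uint32_t Default) {
  const char *Raw = std::getenv(Name);
  // Require a leading digit so that signs and whitespace are rejected.
  if (!Raw || !std::isdigit(static_cast<unsigned char>(*Raw)))
    return Default;
  char *End = nullptr;
  errno = 0;
  unsigned long Value = std::strtoul(Raw, &End, 10);
  if (errno != 0 || *End != '\0' ||
      Value > std::numeric_limits<uint32_t>::max())
    return Default;
  return static_cast<uint32_t>(Value);
}

// Hypothetical descriptor holding per-device properties. The defaults are
// placeholders; each one can be overridden through an environment variable.
struct CpuDeviceDescriptor {
  uint32_t PreferredVectorWidthFloat = 1;
  uint32_t PreferredVectorWidthDouble = 1;
  uint32_t PreferredVectorWidthHalf = 1;

  // Applies the (assumed) environment variable overrides in place.
  void applyEnvOverrides() {
    PreferredVectorWidthFloat = envOverride(
        "EXAMPLE_NATIVECPU_VECTOR_WIDTH_FLOAT", PreferredVectorWidthFloat);
    PreferredVectorWidthDouble = envOverride(
        "EXAMPLE_NATIVECPU_VECTOR_WIDTH_DOUBLE", PreferredVectorWidthDouble);
    PreferredVectorWidthHalf = envOverride(
        "EXAMPLE_NATIVECPU_VECTOR_WIDTH_HALF", PreferredVectorWidthHalf);
  }
};
```

With something like this, the `urDeviceGetInfo` cases in the hunk could return the descriptor fields instead of hard-coded values, and invalid overrides would silently fall back to the defaults.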
Looks good to me as the first iteration of the threadpool integration 👍
This PR introduces a simple threadpool implementation that is used to execute work groups in parallel.
DPC++ PR: intel/llvm#13176
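For readers unfamiliar with the idea, a minimal sketch of a threadpool that runs work groups in parallel follows. It is not the implementation from this PR: `SimpleThreadPool`, `runWorkGroups`, and the one-task-per-work-group scheduling are assumptions made for illustration, using only the C++ standard library.

```cpp
#include <condition_variable>
#include <cstddef>
#include <functional>
#include <mutex>
#include <queue>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical fixed-size pool: workers pull tasks from a shared queue.
class SimpleThreadPool {
public:
  explicit SimpleThreadPool(std::size_t NumThreads) {
    if (NumThreads == 0)
      NumThreads = 1;
    for (std::size_t I = 0; I < NumThreads; ++I)
      Workers.emplace_back([this] { workerLoop(); });
  }

  SimpleThreadPool(const SimpleThreadPool &) = delete;
  SimpleThreadPool &operator=(const SimpleThreadPool &) = delete;

  // Drains the remaining tasks, then joins all workers.
  ~SimpleThreadPool() {
    {
      std::lock_guard<std::mutex> Lock(Mutex);
      Stopping = true;
    }
    WorkAvailable.notify_all();
    for (std::thread &W : Workers)
      W.join();
  }

  // Enqueues a task and wakes one worker.
  void submit(std::function<void()> Task) {
    {
      std::lock_guard<std::mutex> Lock(Mutex);
      Tasks.push(std::move(Task));
      ++Pending;
    }
    WorkAvailable.notify_one();
  }

  // Blocks until every submitted task has finished.
  void wait() {
    std::unique_lock<std::mutex> Lock(Mutex);
    AllDone.wait(Lock, [this] { return Pending == 0; });
  }

private:
  void workerLoop() {
    for (;;) {
      std::function<void()> Task;
      {
        std::unique_lock<std::mutex> Lock(Mutex);
        WorkAvailable.wait(Lock,
                           [this] { return Stopping || !Tasks.empty(); });
        if (Tasks.empty())
          return; // Stopping was requested and the queue is drained.
        Task = std::move(Tasks.front());
        Tasks.pop();
      }
      Task(); // Run outside the lock so other workers can proceed.
      {
        std::lock_guard<std::mutex> Lock(Mutex);
        if (--Pending == 0)
          AllDone.notify_all();
      }
    }
  }

  std::mutex Mutex;
  std::condition_variable WorkAvailable;
  std::condition_variable AllDone;
  std::queue<std::function<void()>> Tasks;
  std::size_t Pending = 0;
  bool Stopping = false;
  std::vector<std::thread> Workers;
};

// Hypothetical driver: runs Kernel once per work-group id in [0, NumGroups)
// and returns only after all work groups have completed.
inline void runWorkGroups(SimpleThreadPool &Pool, std::size_t NumGroups,
                          const std::function<void(std::size_t)> &Kernel) {
  for (std::size_t Group = 0; Group < NumGroups; ++Group)
    Pool.submit([&Kernel, Group] { Kernel(Group); });
  Pool.wait();
}
```

Submitting one task per work group keeps the sketch short; a real adapter might instead batch several work groups per task to reduce queue contention.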