-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(shortfin-sd) Program initialization and logging improvements #444
Conversation
Best found throughput number (11/6) is at .89 samples per second per gpu. More progress to come. |
New best:
Repro:
in another shell:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some nits.
shortfin/dev_me.py
Outdated
@@ -196,7 +197,8 @@ def configure_mode(env_info: EnvInfo, args): | |||
env_vars["SHORTFIN_IREE_SOURCE_DIR"] = env_info.iree_dir | |||
if env_info.clang_exe: | |||
env_vars["CC"] = env_info.clang_exe | |||
env_vars["CXX"] = f"{env_info.clang_exe}++" | |||
clangxx_exe = "clang++-".join(env_info.clang_exe.split("clang-")) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Does this work for unadorned "clang" commands? Maybe drop this file from the patch and we do it separately?
@@ -34,7 +34,7 @@ def __init__(self, device="local-task", device_ids=None): | |||
raise ValueError(f"Device id {did} could not be parsed.") | |||
else: | |||
selected = available | |||
sb = sf.amdgpu.SystemBuilder(amdgpu_visible_devices=";".join(selected)) | |||
sb = sf.SystemBuilder(system_type="amdgpu") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
You can just create the SystemBuilder once:
sb = sf.SystemBuilder(system_type="amdgpu")
sb.visible_devices = get_selected_devices(sb)
self.ls = sb.create_system()
(i.e. you can set properties between construction of the SystemBuilder and create_system())
Note that the property takes a normal list of strings so no need to join.
It also looks like you dropped the visible_devices thing in this patch.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah, that makes more sense. thanks.
@@ -23,6 +23,7 @@ AMDGPUSystemBuilder::AMDGPUSystemBuilder(iree_allocator_t host_allocator, | |||
iree_hal_hip_device_params_initialize(&default_device_params_); | |||
InitializeDefaultSettings(); | |||
config_options().ValidateUndef(); | |||
default_device_params_.async_allocations = false; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can drop this now.
307100b
to
01d2691
Compare
No description provided.