[Core] Support Torch profiler in Habana Worker #357

Merged

mswiniarsk merged 2 commits into habana_main from dev/mswiniarski/support_torch_profiler

Oct 4, 2024

mswiniarsk commented Oct 3, 2024

This PR enables profiling of execution on HPU through the VLLM_TORCH_PROFILER_DIR environment variable, similar to how it is done for GPU.
Profiling can be controlled in two ways:

1. Asynchronously, by posting requests to the server:
a) to start collecting a profile:
curl -X POST http://localhost:8080/start_profile
b) to stop collecting a profile:
curl -X POST http://localhost:8080/stop_profile
2. In a script, by instructing the LLM object to start and stop profiling:

from vllm import LLM, SamplingParams
llm = LLM(...)
llm.start_profile()
llm.stop_profile()
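
To make sure the stop call always pairs with the start call, the two methods can be wrapped in a small context manager. This is only a sketch: `profiled` is a hypothetical helper, not part of vLLM or of this PR, and it assumes nothing beyond the `start_profile()` and `stop_profile()` methods shown above.

```python
from contextlib import contextmanager


@contextmanager
def profiled(llm):
    """Run the enclosed block between llm.start_profile() and llm.stop_profile().

    `llm` is any object exposing start_profile() and stop_profile(),
    such as the LLM object from the snippet above (hypothetical helper).
    """
    llm.start_profile()
    try:
        yield llm
    finally:
        # Stop profiling even if the enclosed code raises,
        # so profiling is not left running by accident.
        llm.stop_profile()
```

With an `llm` created as above, usage would look like `with profiled(llm): llm.generate(...)`, where the arguments to `generate` are whatever the script already passes.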

mswiniarsk added 2 commits on October 3, 2024 12:18

Support Torch profiler in Habana Worker (af7cfc6)

Fix formatting (9cd3fc9)

szutenberg approved these changes

mswiniarsk merged commit d8ba780 into habana_main

19 checks passed

mswiniarsk deleted the dev/mswiniarski/support_torch_profiler branch

October 4, 2024 08:33
