Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Compile warmup take too long #201

Open
Zjq9409 opened this issue Aug 26, 2024 · 1 comment
Open

[Feature]: Compile warmup take too long #201

Zjq9409 opened this issue Aug 26, 2024 · 1 comment
Labels
intel Issues or PRs submitted by Intel

Comments

@Zjq9409
Copy link

Zjq9409 commented Aug 26, 2024

🚀 The feature, motivation and pitch

INFO 08-26 07:31:47 habana_model_runner.py:1192] [Warmup][Prompt][1/56] batch_size:64 seq_len:1024 free_mem:13.93 GiB
INFO 08-26 07:32:25 habana_model_runner.py:1192] [Warmup][Prompt][2/56] batch_size:64 seq_len:896 free_mem:13.86 GiB
INFO 08-26 07:32:59 habana_model_runner.py:1192] [Warmup][Prompt][3/56] batch_size:64 seq_len:768 free_mem:13.86 GiB
INFO 08-26 07:33:27 habana_model_runner.py:1192] [Warmup][Prompt][4/56] batch_size:64 seq_len:640 free_mem:13.86 GiB
INFO 08-26 07:33:52 habana_model_runner.py:1192] [Warmup][Prompt][5/56] batch_size:32 seq_len:1024 free_mem:13.86 GiB
INFO 08-26 07:34:08 habana_model_runner.py:1192] [Warmup][Prompt][6/56] batch_size:64 seq_len:512 free_mem:13.86 GiB
INFO 08-26 07:34:25 habana_model_runner.py:1192] [Warmup][Prompt][7/56] batch_size:32 seq_len:896 free_mem:13.86 GiB
INFO 08-26 07:34:40 habana_model_runner.py:1192] [Warmup][Prompt][8/56] batch_size:32 seq_len:768 free_mem:13.86 GiB

Could it be compiled with multiple threads, if the graph compilation use CPU? During my compilation process, CPU utilization is very low.
image

Alternatives

No response

Additional context

No response

@mswiniarsk
Copy link

@Zjq9409 yes, we are planning to decrease warmup time in several directions:

  • decrease compilation time, so that single graph takes less time
  • more sophisticated way of selecting buckets, so that we are not compiling graphs that are not truly needed
  • parallel compilation

@HabanaAI HabanaAI deleted a comment Aug 26, 2024
@HabanaAI HabanaAI deleted a comment Aug 26, 2024
@HabanaAI HabanaAI deleted a comment Aug 26, 2024
@kzawora-intel kzawora-intel added the intel Issues or PRs submitted by Intel label Aug 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
intel Issues or PRs submitted by Intel
Projects
None yet
Development

No branches or pull requests

3 participants