Add fake HPU mode to Habana components #180
Conversation
Overall the idea is great, but it introduces lots of conditionals into our code (`if not is_fake_hpu()`).
I think it would be great if we could apply monkey patching here, similar to the GPU Migration Toolkit: https://docs.habana.ai/en/latest/PyTorch/PyTorch_Model_Porting/GPU_Migration_Toolkit/GPU_Migration_Toolkit.html
In this case we could override all "hpu" modules with "pass" (do nothing) or "cpu", and then limit the changes to our main HPU-specific modules. It would also ease future development, since there would be no need to add `is_fake_hpu()` every time.
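To make the suggestion concrete, here is a minimal sketch of what such patching could look like. None of these names come from this PR or from the GPU Migration Toolkit; it only illustrates the pattern of replacing "hpu" entry points with CPU or no-op equivalents in one place:

```python
import types

import torch


def apply_fake_hpu_patches():
    """Hypothetical one-time setup: install CPU/no-op stand-ins for HPU APIs
    so the rest of the code can keep calling them unchanged."""
    # Fake torch.hpu namespace: report one device, make synchronization a no-op.
    torch.hpu = types.SimpleNamespace(
        is_available=lambda: True,
        device_count=lambda: 1,
        current_device=lambda: 0,
        synchronize=lambda: None,
    )

    # Redirect tensor placement: .to("hpu") silently becomes .to("cpu").
    _orig_to = torch.Tensor.to

    def _to(self, *args, **kwargs):
        args = tuple("cpu" if arg == "hpu" else arg for arg in args)
        if kwargs.get("device") == "hpu":
            kwargs["device"] = "cpu"
        return _orig_to(self, *args, **kwargs)

    torch.Tensor.to = _to
```

The patching would run once at startup, and only when fake HPU mode is requested, so individual modules would not need their own `is_fake_hpu()` branches.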
jobs:
  cputest:
    runs-on: ubuntu-latest
Wouldn't it be safer to use a hardcoded Ubuntu version instead of ubuntu-latest?
      - habana_main
  pull_request:
    branches:
      - habana_main
What do you think about also adding habana_next? Just temporarily, for as long as we maintain the two branches.
          VLLM_TARGET_DEVICE=hpu python setup.py develop
      - name: cpu-test
        run: |
          VLLM_SKIP_WARMUP=true VLLM_PROMPT_SEQ_BUCKET_MAX=128 python examples/offline_inference_fakehpu.py
Running with warmup would be an additional bonus validation, don't you think? It would probably be better to limit the number of buckets so that warmup does not take that much time, rather than disabling it.
@@ -100,6 +100,7 @@ def forward(
        kv_cache: torch.Tensor,
        attn_metadata: AttentionMetadata,
    ) -> torch.Tensor:
        # import pdb; pdb.set_trace()
I guess this comment is not needed
@@ -126,6 +131,11 @@ def determine_num_available_blocks(self) -> Tuple[int, int]:

        # Execute a forward pass with dummy inputs to profile the memory usage
        # of the model.
        if is_fake_hpu():
            # self.model_runner.profile_run()
Please remove the commented-out code.
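As a sketch of what that cleanup could look like (not the actual code from this PR), the condition could simply be inverted so profiling only runs on real hardware and no commented-out call is left behind:

```python
# Execute a forward pass with dummy inputs to profile the memory usage
# of the model. On fake HPU there is no device memory to profile, so skip it.
if not is_fake_hpu():
    self.model_runner.profile_run()
```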
irrelevant