Lora Mask based on lora index #348
Conversation
vllm/worker/habana_model_runner.py (outdated)
end_pos = start_pos + self.lora_config.max_lora_rank
lora_mask[i, start_pos:end_pos] = ones
lora_mask = lora_mask.to('hpu')
lora_logits_mask = lora_mask
Could you explain why the logits mask now points to the mask? Before, it was left as None for the decode phase.
Is it because line 1906 can now reference an object that is None? If so, I would rather add a check there that lora_logits_mask is not None instead of changing what lora_logits_mask holds.
@michalkuligowski For the decode phase, lora_mask and logits_mask are the same. Previously we were setting it in execute_model; it was just moved into create_lora_mask for cleaner code.
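The decode-phase behavior described in this reply can be sketched roughly as follows. This is a pure-Python illustration with a hypothetical signature; the real habana_model_runner.py builds torch tensors and moves them to HPU.

```python
def create_lora_mask(batch_size, max_loras, max_lora_rank, lora_indices):
    """Decode-phase sketch: one token per sequence, so the mask has one
    row per sequence. Each active adapter occupies a contiguous slice of
    max_lora_rank columns, starting at its slot (lora_index)."""
    width = max_loras * max_lora_rank
    lora_mask = [[0] * width for _ in range(batch_size)]
    for i, lora_index in enumerate(lora_indices):
        if lora_index is None:  # sequence has no LoRA adapter
            continue
        start_pos = lora_index * max_lora_rank
        end_pos = start_pos + max_lora_rank
        for col in range(start_pos, end_pos):
            lora_mask[i][col] = 1
    # For decode, the logits mask aliases the lora mask; previously this
    # aliasing happened later, in execute_model.
    lora_logits_mask = lora_mask
    return lora_mask, lora_logits_mask
```

Returning both names from one function makes the decode-phase identity explicit instead of relying on a later assignment in execute_model.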
Force-pushed cdee227 to c2572b9.
Please also check that these changes do not break the long LoRA context test. We may need to port them to the long-lora-context branch to verify (maybe Sanju/Ruheena can do this check).
Force-pushed c2572b9 to cafde9c.
Force-pushed cafde9c to 9d62244.
Looks good to me.
Changes the filling of the LoRA mask from lora_id to lora_index. This is needed to ensure that the mask does not fail when a lora_id is greater than max_loras.
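To illustrate the summary above: a lora_id is a global adapter id that can grow past max_loras as adapters are swapped over the server's lifetime, while a lora_index is the adapter's slot in the currently active set, so it is always below max_loras. A minimal sketch of the difference, using a hypothetical helper in pure Python (the real code indexes torch tensors):

```python
def fill_mask_row(max_loras, max_lora_rank, lora_index):
    """Fill the slice of a mask row belonging to the adapter at slot
    lora_index. The row has max_loras * max_lora_rank columns, so a
    slot-based index always fits; a raw lora_id >= max_loras would
    point past the end of the row."""
    row = [0] * (max_loras * max_lora_rank)
    start_pos = lora_index * max_lora_rank
    # Guard that mirrors why indexing by lora_id can fail:
    assert start_pos + max_lora_rank <= len(row), "slot out of range"
    row[start_pos:start_pos + max_lora_rank] = [1] * max_lora_rank
    return row
```

With max_loras=2 and rank 4, slot 1 fills columns 4..7; passing a lora_id-like value such as 5 trips the range guard, which is exactly the failure the switch to lora_index avoids.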