Multimodal support for Llava #545
kshetrajna12 started this conversation in Feature requests
Replies: 2 comments
-
I think we would need to integrate with transformers via a logit processor; we are currently considering this option.
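For anyone unfamiliar with the term: a logit processor is a callable that rewrites the next-token scores before a token is picked. It only touches the model's output scores, so in principle it should not depend on how the model consumes image inputs. A rough sketch of the idea in plain Python follows, with lists standing in for tensors. `AllowedTokensProcessor` and `greedy_pick` are hypothetical names for illustration, not part of Outlines or transformers; the only thing borrowed from transformers is the `(input_ids, scores)` call shape.

```python
import math
from typing import List, Set


class AllowedTokensProcessor:
    """Hypothetical processor that keeps only an allowed set of token ids."""

    def __init__(self, allowed_token_ids: Set[int]):
        self.allowed_token_ids = allowed_token_ids

    def __call__(self, input_ids: List[int], scores: List[float]) -> List[float]:
        # Same (input_ids, scores) call shape as a transformers logits
        # processor, but on plain lists for a single sequence (assumption:
        # real processors work on batched tensors instead).
        return [
            score if token_id in self.allowed_token_ids else -math.inf
            for token_id, score in enumerate(scores)
        ]


def greedy_pick(scores: List[float]) -> int:
    # Index of the highest score, i.e. greedy decoding for one step.
    return max(range(len(scores)), key=scores.__getitem__)


processor = AllowedTokensProcessor({1, 3})
raw_scores = [2.5, 0.1, 3.0, 1.2]      # toy vocabulary of four tokens
masked = processor([0], raw_scores)    # tokens 0 and 2 are masked out
next_token = greedy_pick(masked)       # best remaining allowed token
```

Without the processor, greedy decoding would pick token 2; with it, the choice is restricted to the allowed ids.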
-
@kshetrajna12 please follow #787. I expect Outlines will have multimodal support soon.
-
Is there a way to get multimodal support so that Llava works?
Specifically https://github.com/huggingface/transformers/blob/f4f57f9dfa68948a383c352a900d588f63f6290a/src/transformers/models/llava/modeling_llava.py#L237
Currently fails due to