Multimodal support for Llava #545

kshetrajna12 · 2024-01-17T02:05:38Z

kshetrajna12
Jan 17, 2024

Is there a way to get multi modal support to get Llava working ?

Specifically https://github.com/huggingface/transformers/blob/f4f57f9dfa68948a383c352a900d588f63f6290a/src/transformers/models/llava/modeling_llava.py#L237

Currently fails due to

ValueError: Unrecognized configuration class <class 'transformers.models.llava.configuration_llava.LlavaConfig'> for this kind of AutoModel: AutoModelForCausalLM.
Model type should be one of BartConfig, BertConfig, BertGenerationConfig, BigBirdConfig, BigBirdPegasusConfig, BioGptConfig, BlenderbotConfig, BlenderbotSmallConfig, BloomConfig, CamembertConfig, LlamaConfig, CodeGenConfig, CpmAntConfig, CTRLConfig, Data2VecTextConfig, ElectraConfig, ErnieConfig, FalconConfig, FuyuConfig, GitConfig, GPT2Config, GPT2Config, GPTBigCodeConfig, GPTNeoConfig, GPTNeoXConfig, GPTNeoXJapaneseConfig, GPTJConfig, LlamaConfig, MarianConfig, MBartConfig, MegaConfig, MegatronBertConfig, MistralConfig, MixtralConfig, MptConfig, MusicgenConfig, MvpConfig, OpenLlamaConfig, OpenAIGPTConfig, OPTConfig, PegasusConfig, PersimmonConfig, PhiConfig, PLBartConfig, ProphetNetConfig, QDQBertConfig, ReformerConfig, RemBertConfig, RobertaConfig, RobertaPreLayerNormConfig, RoCBertConfig, RoFormerConfig, RwkvConfig, Speech2Text2Config, TransfoXLConfig, TrOCRConfig, WhisperConfig, XGLMConfig, XLMConfig, XLMProphetNetConfig, XLMRobertaConfig, XLMRobertaXLConfig, XLNetConfig, XmodConfig.

rlouf · 2024-01-18T22:54:59Z

rlouf
Jan 18, 2024
Maintainer

I think we would need to integrate with transformers via a logit processor; we are currently considering this option.

0 replies

lapp0 · 2024-06-13T02:42:44Z

lapp0
Jun 13, 2024

@kshetrajna12 please follow #787

I will expect outlines will have multimodal support soon.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multimodal support for Llava #545

{{title}}

Replies: 2 comments

{{title}}

{{title}}

Select a reply

Multimodal support for Llava #545

kshetrajna12 Jan 17, 2024

Replies: 2 comments

rlouf Jan 18, 2024 Maintainer

lapp0 Jun 13, 2024

kshetrajna12
Jan 17, 2024

rlouf
Jan 18, 2024
Maintainer

lapp0
Jun 13, 2024