Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for LLaVA model #482

Open
youssefadr opened this issue Oct 10, 2023 · 7 comments
Open

Add support for LLaVA model #482

youssefadr opened this issue Oct 10, 2023 · 7 comments

Comments

@youssefadr
Copy link

youssefadr commented Oct 10, 2023

🚀 The feature, motivation and pitch

LLaVA seems to be currently a strong open-source competitor to GPT4-V, it doesn't seem to be supported by the library. Do you plan on adding it? If yes, is there something I could contribute with to help?

Alternatives

No response

Additional context

No response

@ebsmothers
Copy link
Contributor

Hi @youssefadr, thanks for opening this issue. LLaVA is definitely something we're interested in adding and we would be happy to have you contribute. Is there a specific portion of the model you're especially interested in helping out with?

@youssefadr
Copy link
Author

Thanks for your answer @ebsmothers, I would like to add the model to torchmultimodal/models first.

@ebsmothers
Copy link
Contributor

That sounds reasonable to me. We already have CLIP visual encoders in the library here, so feel free to reuse those. Then the bulk of the work for the model should be to add the LLM. A couple pointers to help with that: TransformerDecoderLayer, RMSNorm. We also have an open PR for rotary positional embeddings (#450) that might be useful. Let me know if this makes sense, happy to provide more details as needed.

@youssefadr
Copy link
Author

Nice ! I'll come back to you with more questions later, not sure I'll start working on it this week.

@theadamsabra
Copy link

@youssefadr have you worked on this to any capacity? i'm interested in picking this up if not

@ebsmothers
Copy link
Contributor

@theadamsabra if not, you are more than welcome to take it up

@theadamsabra
Copy link

@ebsmothers thanks! If I don't get a response by tomorrow I'll just pick it up myself

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants