Llama 3.2 11B Currently Only Supports Single Image #1281

Jack-Khuu · 2024-10-08T03:37:55Z

🐛 Describe the bug

Currently, Llama 3.2 11B only supports a single optional image prompt in torchchat. The base torchtune model backing Llama3.2 11B should* be capable of supporting multiturn with:

Multiple Simultaneous Images
Replacing the previous image

This Issue acts as a tracker for the development of these 2 extensions to Llama 3.2 11B functionality

E.g. Via OpenAI API/Browser you can currently provide text prompts similar to LLama3.1 8B, but you are unable to replace the image once one is provided

*Should being the operative word as it may require additional changes to the torchtune repo

Versions

NA

Jack-Khuu added enhancement New feature or request Known Gaps These are known Gaps/Issues/Bug items in torchchat Llama 3.2- Multimodal Issues related to Multimodal of Llama3.2 labels Oct 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Llama 3.2 11B Currently Only Supports Single Image #1281

Llama 3.2 11B Currently Only Supports Single Image #1281

Jack-Khuu commented Oct 8, 2024 •

edited

Loading

Llama 3.2 11B Currently Only Supports Single Image #1281

Llama 3.2 11B Currently Only Supports Single Image #1281

Comments

Jack-Khuu commented Oct 8, 2024 • edited Loading

🐛 Describe the bug

Versions

Jack-Khuu commented Oct 8, 2024 •

edited

Loading