Llama 3.2 11B Currently Only Supports Single Image #1281
Labels
enhancement
New feature or request
Known Gaps
These are known Gaps/Issues/Bug items in torchchat
Llama 3.2- Multimodal
Issues related to Multimodal of Llama3.2
🐛 Describe the bug
Currently, Llama 3.2 11B only supports a single optional image prompt in torchchat. The base torchtune model backing Llama3.2 11B should* be capable of supporting multiturn with:
This Issue acts as a tracker for the development of these 2 extensions to Llama 3.2 11B functionality
E.g. Via OpenAI API/Browser you can currently provide text prompts similar to LLama3.1 8B, but you are unable to replace the image once one is provided
*Should being the operative word as it may require additional changes to the torchtune repo
Versions
NA
The text was updated successfully, but these errors were encountered: