This tutorial showcases a multimodal RAG system powered by Milvus, the Visualized BGE model, and GPT-4o. Users upload an image and edit a text instruction; BGE's composed retrieval model processes both to search for candidate images. GPT-4o then acts as a reranker, selecting the most suitable image and explaining the rationale behind its choice. The result is an intuitive image search experience that uses Milvus for efficient retrieval, the BGE model for image processing and matching, and GPT-4o for reranking.
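The retrieve-then-rerank flow described above can be sketched in plain Python. This is only an illustrative outline, not the repository's code: `toy_encode` is a hypothetical stand-in for the Visualized BGE encoder, the in-memory cosine search stands in for a Milvus vector search, and `toy_rerank` stands in for the GPT-4o reranking call. All names, vectors, and file names below are assumptions made for the example.

```python
import math
from typing import Callable

Vector = list[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query_vec: Vector, index: dict[str, Vector], top_k: int) -> list[str]:
    """Stand-in for a vector database search: rank stored images by similarity."""
    ranked = sorted(index, key=lambda name: cosine(query_vec, index[name]), reverse=True)
    return ranked[:top_k]


def search_and_rerank(
    query_image: Vector,
    instruction: str,
    encode: Callable[[Vector, str], Vector],
    index: dict[str, Vector],
    rerank: Callable[[list[str], str], tuple[str, str]],
    top_k: int = 2,
) -> tuple[str, str, list[str]]:
    """Encode image + instruction, retrieve candidates, then let a reranker choose."""
    query_vec = encode(query_image, instruction)
    candidates = retrieve(query_vec, index, top_k)
    best, reason = rerank(candidates, instruction)
    return best, reason, candidates


def toy_encode(image: Vector, instruction: str) -> Vector:
    """Hypothetical encoder: nudges the image vector along a 'color' axis."""
    vec = list(image)
    if "red" in instruction:
        vec[0] += 1.0
    if "blue" in instruction:
        vec[1] += 1.0
    return vec


def toy_rerank(candidates: list[str], instruction: str) -> tuple[str, str]:
    """Hypothetical reranker: prefers the file name matching the most words."""
    words = instruction.split()
    best = max(candidates, key=lambda name: sum(w in name for w in words))
    return best, f"'{best}' matches the most words in the instruction '{instruction}'"


# Toy 2-D "embeddings" for three images (made-up values for illustration).
toy_index = {
    "red_phone.jpg": [1.0, 0.1],
    "blue_phone.jpg": [0.1, 1.0],
    "red_case.jpg": [0.9, 0.5],
}

best, reason, candidates = search_and_rerank(
    [0.2, 0.2], "red case", toy_encode, toy_index, toy_rerank
)
print(candidates)
print(best, "-", reason)
```

In this toy run the similarity search ranks `red_phone.jpg` first, and the reranker then prefers `red_case.jpg` because it matches the instruction more closely. That split of responsibilities mirrors the one described above: a fast vector search narrows the pool, and a slower but more capable model makes the final choice and states its reason.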
rrfsantos/Multimodal-RAG-with-Milvus
Multimodal RAG powered by Milvus, Visualized BGE model, and GPT-4o.