Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

This repository contains a notebook with a multimodal system using images as frames from YouTube videos, LlamaIndex framework, Qdrant as a vector database, and Gemini as embedding and llm model.

Main Steps

Data Ingestion: Load videos and metadata from a YouTube playlist
Indexing: MultiModalVectorStoreIndex from LlamaIndex
Embedding and Model: Gemini
Vector Store: Qdrant with 2 collections (text and images)
Query Retrieval: Top recipe and frame images

Feel free to ⭐ and clone this repo 😉

Tech Stack

Project Description

For detailed project descriptions, refer to this Medium article.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
youtube_recipes_multimodal.ipynb		youtube_recipes_multimodal.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

Tech Stack

Project Description

About

Languages

benitomartin/multimodal-youtube-recipes

Folders and files

Latest commit

History

Repository files navigation

Multimodal Recipe Recommender using Qdrant, LlamaIndex, and Google Gemini 🚀

Tech Stack

Project Description

About

Topics

Resources

Stars

Watchers

Forks

Languages