Skip to content

Latest commit

 

History

History
38 lines (24 loc) · 1.82 KB

README.md

File metadata and controls

38 lines (24 loc) · 1.82 KB

BorusanAuto-EmbedStorage 🚗💡📚

Welcome to the BorusanAuto-EmbedStorage repository, the hub for innovative PDF processing and data embedding techniques developed during the Borusan AutoHackathon.

About the Project 📈

This project involves an automated process of extracting text from PDFs, generating embeddings using Azure OpenAI text-embedding-ada-002model, and efficiently storing these embeddings in a Qdrant database for advanced search and retrieval.

BorusanOto 🚗 - Embeddings Workflow Diagram 🌟
image
Qdrant Collection Snapshot 📘
Qdrant Collection Snapshot

Features 🌟

  • PDF Text Extraction: Convert PDF documents into manageable text chunks.
  • Embedding Generation: Utilize Azure and OpenAI models for embedding generation.
  • Qdrant Integration: Seamlessly store and manage embeddings in Qdrant collections.

Getting Started 🚀

To begin using this repository, clone the repo and follow the setup and cell run instructions in the notebook.

Contributing 🤝

Contributions to enhance the project are welcome. Please read the contribution guidelines for more information.

License 📄

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments 🙌

A big thank you to Borusan for hosting the AutoHackathon and providing an opportunity to innovate in the automotive and AI space.


For more information on the Borusan AutoHackathon, visit Borusan AutoHackathon Details.