Skip to content

Automating text extraction from PDFs and transforming content into searchable embeddings with Qdrant for advanced data retrieval. πŸ’‘πŸ“š

License

Notifications You must be signed in to change notification settings

nanelimon-organization/automate-embedding-storage

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

12 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

BorusanAuto-EmbedStorage πŸš—πŸ’‘πŸ“š

Welcome to the BorusanAuto-EmbedStorage repository, the hub for innovative PDF processing and data embedding techniques developed during the Borusan AutoHackathon.

About the Project πŸ“ˆ

This project involves an automated process of extracting text from PDFs, generating embeddings using Azure OpenAI text-embedding-ada-002model, and efficiently storing these embeddings in a Qdrant database for advanced search and retrieval.

BorusanOto πŸš— - Embeddings Workflow Diagram 🌟
image
Qdrant Collection Snapshot πŸ“˜
Qdrant Collection Snapshot

Features 🌟

  • PDF Text Extraction: Convert PDF documents into manageable text chunks.
  • Embedding Generation: Utilize Azure and OpenAI models for embedding generation.
  • Qdrant Integration: Seamlessly store and manage embeddings in Qdrant collections.

Getting Started πŸš€

To begin using this repository, clone the repo and follow the setup and cell run instructions in the notebook.

Contributing 🀝

Contributions to enhance the project are welcome. Please read the contribution guidelines for more information.

License πŸ“„

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments πŸ™Œ

A big thank you to Borusan for hosting the AutoHackathon and providing an opportunity to innovate in the automotive and AI space.


For more information on the Borusan AutoHackathon, visit Borusan AutoHackathon Details.

About

Automating text extraction from PDFs and transforming content into searchable embeddings with Qdrant for advanced data retrieval. πŸ’‘πŸ“š

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published