How to best store large documents #38389
-
As part of a RAG project, I want to store the output of each step of the ingestion pipeline to avoid redoing a lot of work if I want to swap a component. PDFs are in S3, and chunks & embeddings are in a Milvus collection. But I'm not sure where I should store the Text representation. I only need to do the following with the text:
Could I store the full text in a field in my main collection ? Or should I create a new collection ? Or should I store it outside of Milvus altogether ? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
The max length of each text is 65535(bytes) for milvus Varchar field, currently milvus doesn't have the ability to store super long-length strings or binaries. You can use some other db to store the long-length strings, such as mongodb. |
Beta Was this translation helpful? Give feedback.
The max length of each text is 65535(bytes) for milvus Varchar field, currently milvus doesn't have the ability to store super long-length strings or binaries. You can use some other db to store the long-length strings, such as mongodb.