How much write QPS can a standalone instance hold? #35881
Unanswered
pycui
asked this question in
Q&A and General discussion
Replies: 3 comments 2 replies
-
|
Beta Was this translation helpful? Give feedback.
0 replies
-
Unfortunately in my crash loop, there is no log perserved, e.g.
Pick any non-running milvus pod, see
|
Beta Was this translation helpful? Give feedback.
0 replies
-
you need to get the latest log when the node is brought up. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Can't seem find much relevant and up-to-date data on this. Some numbers assumption here if helpful to answer: Say the standalone instance has no CPU limit and can use up to 128 cores and 500G memory. Say the write request is 1024 dimension with 8K additional text data. Batch 5000 items in a request so each write is close to 64M limit but never exceeded. Is there a way to estimate the write QPS capacity?
Also, what is the behavior if write QPS is larger than capacity? Ideally server should reject excessive requests and it should never fail the milvus instance itself, but I'm seeing in my test that if sending write for too high and too long the Milvus server eventually enters a crashloop, and even write has stopped it seems taking a long time to recover (still crashes sporadically). I'm wondering if this is expected.
Beta Was this translation helpful? Give feedback.
All reactions