Skip to content

benchmarks? #160

Answered by BBC-Esq
BBC-Esq asked this question in Q&A
Mar 18, 2024 · 4 comments · 4 replies
Discussion options

You must be logged in to vote

Here's the graph just showing the large models, plus instructor-xl, same settings as above. You can see that anything other than a batch size of 1 actually decreases performance for instructor-xl but not the others. However, these are all NON float16. As I mentioned above, you can get away with instructor-xl on a batch size of 2 for significant improvement, granted, my test was on an RTX 4090...

And here's the same comparison but VRAM usage...Again, why is sentence-transformers setting a default batch size of 32?

Replies: 4 comments 4 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@michaelfeil
Comment options

@BBC-Esq
Comment options

Comment options

You must be logged in to vote
2 replies
@bash99
Comment options

@michaelfeil
Comment options

Answer selected by BBC-Esq
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants
Converted from issue

This discussion was converted from issue #158 on March 18, 2024 23:48.