Server capacity & resources #259

Open
aslam-t opened this issue Aug 7, 2024 · 2 comments
Comments

@aslam-t
aslam-t commented Aug 7, 2024

Question (Not an Issue)

Hi team,

I have a question rather than an issue. Please bear with me, as I have no knowledge of Vosk internals.
I am hosting a Docker instance of vosk-server. My primary use case is as a transcriber in a conferencing system. It works fine in my initial testing with a few users, but I need clarity on the following before I can confidently release:

  1. Which resource is most important for vosk-server: CPU, RAM, GPU, etc.?
  2. How many instances are enough to handle 1000 concurrent sessions?
  3. Is there a pointer to this information, if it already exists somewhere?

I am only transcribing English for now, no other languages, in case that matters.

Looking forward to any help.

Thanks

@nshmyrev
Contributor

nshmyrev commented Aug 8, 2024

> Which resource is most important for vosk-server CPU, RAM or GPU etc.

Memory bandwidth and model size are the most restrictive factors.

> How many instances are enough to handle 1000 concurrent sessions?

10 CPU servers with 48 cores each, or 5 GPU servers with RTX 4090.
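
As a rough sanity check, the figures above work out to about two concurrent sessions per CPU core, 100 per CPU server, and 200 per GPU server. The sketch below only does that arithmetic. The helper `servers_needed` is a made-up name for illustration, and the linear extrapolation is an assumption: real capacity depends on the model size, the hardware, and the memory bandwidth mentioned above.

```python
import math

# Figures quoted in this thread: 1000 concurrent sessions on
# 10 CPU servers of 48 cores each, or on 5 GPU servers with RTX 4090.
SESSIONS = 1000
CPU_SERVERS, CORES_PER_SERVER = 10, 48
GPU_SERVERS = 5

sessions_per_core = SESSIONS / (CPU_SERVERS * CORES_PER_SERVER)
sessions_per_cpu_server = SESSIONS / CPU_SERVERS
sessions_per_gpu_server = SESSIONS / GPU_SERVERS


def servers_needed(target_sessions: int, sessions_per_server: float) -> int:
    """Hypothetical helper: scale linearly and round up to whole servers."""
    return math.ceil(target_sessions / sessions_per_server)


print(f"{sessions_per_core:.2f} sessions per CPU core")   # 2.08
print(servers_needed(250, sessions_per_cpu_server))       # 3
print(servers_needed(250, sessions_per_gpu_server))       # 2
```

Treat the result as a starting point for load testing, not as a guarantee.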

@zhq1

zhq1 commented Aug 28, 2024

Is there any way to optimize CPU usage? For example, by skipping recognition of silent audio packets?
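
One way to try this idea, not confirmed by the maintainers in this thread, is a client-side energy gate that drops quiet chunks before they are sent to the server. The sketch below assumes 16-bit little-endian mono PCM. The names `rms`, `is_silent`, and `voiced_chunks` and the threshold of 500 are illustrative assumptions, not part of vosk-server. Dropping audio entirely may also affect the recognizer's endpointing and timestamps, so this would need testing.

```python
import array
import math
import struct
import sys


def rms(chunk: bytes) -> float:
    """Root-mean-square amplitude of 16-bit little-endian mono PCM."""
    samples = array.array("h")
    samples.frombytes(chunk[: len(chunk) - len(chunk) % 2])  # drop odd byte
    if sys.byteorder == "big":
        samples.byteswap()  # array uses native order; PCM here is little-endian
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def is_silent(chunk: bytes, threshold: float = 500.0) -> bool:
    """Assumed threshold; tune it against real conference audio."""
    return rms(chunk) < threshold


def voiced_chunks(chunks, threshold: float = 500.0):
    """Yield only the chunks whose energy reaches the threshold."""
    for chunk in chunks:
        if not is_silent(chunk, threshold):
            yield chunk


# Synthetic demo: 1600 samples of digital silence vs. a loud constant signal.
silence = bytes(3200)
speech_like = struct.pack("<1600h", *([8000] * 1600))
kept = list(voiced_chunks([silence, speech_like, silence]))
print(len(kept))  # 1
```

A plain energy threshold is the simplest possible gate; a proper voice activity detector would be more robust to background noise.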
