Skip to content

Slow Inference Speeds on Cluster GPUs #833

Answered by jmdelahanty
jmdelahanty asked this question in Help!
Discussion options

You must be logged in to vote

To simplify the info in the discussion here, two things really helped us achieve speed ups from 5-10 FPS to over 400FPS:

  1. Correct drivers: SLEAP/other GPU dependent software doesn't seem to crash as much as struggle if the non-optimal drivers are being referenced in your linux terminal.
  2. Separating out CPU and GPU workloads as @talmo suggested. This helps enormously with people's workflows. On A40s it appears that doing both CPU/GPU tasks in the same sleap-track call is sufficiently fast, but each step is about 2x as fast if they are separated out.

Replies: 2 comments 23 replies

Comment options

You must be logged in to vote
23 replies
@jmdelahanty
Comment options

@jmdelahanty
Comment options

@boadecea25
Comment options

@jmdelahanty
Comment options

@boadecea25
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by jmdelahanty
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Help!
Labels
None yet
4 participants