This repository has been archived by the owner on Jun 10, 2024. It is now read-only.

feat: add Cupy samples #518

Open · wants to merge 9 commits into master

Conversation

royinx (Contributor) commented on Aug 15, 2023

hello world,
this PR does the following:

  • added a sample for CuPy + TensorRT
  • added a build config and test case for CI/CD
  • fixed an issue in SampleTensorRTResnet.py where the first batch's output was not synchronized

please review

@royinx force-pushed the master branch 3 times, most recently from fbddfa3 to 0b1f36b on August 15, 2023 at 19:04
Comment on lines +16 to +22
def get_cupy() -> str:
    CUDA_VERSION = os.environ.get("CUDA_VERSION", None)
    if CUDA_VERSION >= "11.2":  # CUDA 11.2+ uses one wheel per major version, e.g. cupy-cuda11x
        cupy_pack = f"cupy-cuda{CUDA_VERSION[:2]}x"
    else:
        cupy_pack = f"cupy-cuda{CUDA_VERSION[:4].replace('.','')}"
    return cupy_pack

Collaborator

This fails if the env variable is not set. Can we add a check so that, in that case, we either don't install any cupy at all or just install the latest version?
Also, you could do a fall-through check: env variable -> nvcc subprocess -> nvidia-smi.
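
A minimal sketch of such a fall-through check (the helper name, regexes, and error handling here are assumptions for illustration, not part of this PR):

import os
import re
import subprocess
from typing import Optional

def detect_cuda_version() -> Optional[str]:
    # 1. Environment variable (set by the nvidia/cuda base images).
    version = os.environ.get("CUDA_VERSION")
    if version:
        return version
    # 2. nvcc, if a CUDA toolkit is present in the image.
    try:
        out = subprocess.run(["nvcc", "--version"],
                             capture_output=True, text=True, check=True).stdout
        match = re.search(r"release (\d+\.\d+)", out)
        if match:
            return match.group(1)
    except (FileNotFoundError, subprocess.CalledProcessError):
        pass
    # 3. nvidia-smi, as a last resort (reports the driver's CUDA version, not the toolkit's).
    try:
        out = subprocess.run(["nvidia-smi"],
                             capture_output=True, text=True, check=True).stdout
        match = re.search(r"CUDA Version:\s*(\d+\.\d+)", out)
        if match:
            return match.group(1)
    except (FileNotFoundError, subprocess.CalledProcessError):
        pass
    return None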

royinx (Contributor, Author) commented on Aug 16, 2023

I considered the env var, nvcc, and nvidia-smi approaches, and each of them runs into issues:

  1. the env var may not be set
  2. nvcc is not available in runtime images, e.g. nvidia/cuda:11.7.1-runtime-ubuntu22.04
  3. nvidia-smi always shows the host driver's CUDA version (e.g. 12.2) even while the Docker container is using 11.7,
     and in some cases I have seen there is no nvidia-smi at all.

Could I instead list all CUDA versions under the /usr/local/cuda* directories?
I have no idea about CUDA on Windows, though.
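
For illustration, a minimal sketch of that /usr/local/cuda* listing on Linux (the glob pattern and parsing are assumptions):

import glob
import os

def list_local_cuda_versions() -> list:
    # Versioned toolkit installs usually live at /usr/local/cuda-<major>.<minor> on Linux.
    versions = []
    for path in glob.glob("/usr/local/cuda-*"):
        name = os.path.basename(path)          # e.g. "cuda-11.7"
        version = name.replace("cuda-", "", 1)
        if version and version[0].isdigit():
            versions.append(version)
    return sorted(versions)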

Collaborator

Yeah, as usual Windows is the annoying part. This is why I thought you could do a fall-through approach: try one method after the other, and if nothing works, just assume a version?

royinx (Contributor, Author)

For Linux, I check /usr/local/cuda-* to extract the version.
For Windows, I implemented the fall-through approach (nvcc > nvidia-smi).
This approach should handle most of the cases.

There are still some potential issues with the fall-through approach:

  • nvcc may not be available.
  • utility is not listed in NVIDIA_DRIVER_CAPABILITIES=compute,video (which also conflicts with Dockerfile.tensorrt)
  • the CUDA version reported by nvidia-smi may not match the CUDA that is actually installed.

Since the cupy packages conflict with each other, assuming a version is not a good idea. I would rather not install it at all if the CUDA version cannot be determined reliably; see the sketch below.
If the complexity keeps growing, I think we should just let the user install it manually.
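
A minimal sketch of that "skip if unknown" behaviour, assuming a detect_cuda_version() helper like the fall-through sketch earlier in this thread (the wheel names follow CuPy's published packages):

def get_cupy():
    # Returns the cupy wheel name for the detected CUDA version, or None to skip installation.
    version = detect_cuda_version()  # hypothetical helper from the fall-through sketch above
    if version is None:
        # CUDA version could not be determined reliably; let the user install cupy manually.
        return None
    major, minor = (int(x) for x in version.split(".")[:2])
    if (major, minor) >= (11, 2):
        return f"cupy-cuda{major}x"      # e.g. cupy-cuda11x, cupy-cuda12x
    return f"cupy-cuda{major}{minor}"    # e.g. cupy-cuda102, cupy-cuda111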
