Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

On Orin platform,cuDLA inference can block TensorRT inference on the GPU, resulting in the GPU being completely idle. #3804

Closed
SolDogLi opened this issue Apr 18, 2024 · 4 comments
Assignees
Labels
triaged Issue has been triaged by maintainers

Comments

@SolDogLi
Copy link

Description

I am using two threads and two CUDA streams to perform inference on two models using cuDLA and GPU TensorRT.
However, I have observed that when performing inference on one model using cuDLA, it blocks the inference on the other model using GPU TensorRT, resulting in the GPU being completely idle.
img_v3_02a2_171fe0b8-df89-447c-8dd0-087354e8138g
Is it not possible to run cuDLA and GPU inference simultaneously?

Environment

TensorRT Version: 8.5

NVIDIA GPU: orin

NVIDIA Driver Version:

CUDA Version: 11.2

CUDNN Version:

Relevant Files

nsys report:https://drive.google.com/file/d/1I1iqgpOwb_FlDpX0Nxbaip7R_osdSDN3/view?usp=drive_link

Steps To Reproduce

mCuDLACtx->submitDLATask(mStream); thread A and stream1 submitDLATask

trt_context->enqueueV2(buffers, stream, nullptr); thread B and stream2 tensorRT enqueueV2

@lix19937
Copy link

NOTE: drive os 6060 has dla bug(dla stability /timeout), you can update to 6080.

@zerollzeng
Copy link
Collaborator

Please try the latest DOS release, and if you are our Auto customer, I think you can file nvbugs directly to NV? Or you have a Tier-1?

@zerollzeng
Copy link
Collaborator

Because DLA runtime is not in the scope of TensorRT, so their's nothing we can do in TRT side.

@zerollzeng zerollzeng self-assigned this Apr 25, 2024
@zerollzeng zerollzeng added the triaged Issue has been triaged by maintainers label Apr 25, 2024
@ttyio
Copy link
Collaborator

ttyio commented Jul 2, 2024

closing since no activity for more than 3 weeks, pls reopen if you still have question, thanks all!

@ttyio ttyio closed this as completed Jul 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triaged Issue has been triaged by maintainers
Projects
None yet
Development

No branches or pull requests

4 participants