As the number of CPU cores decreases, the BLS mode processing time increases #7373

callmezhangchenchenokay · 2024-06-25T13:50:24Z

Description
BLS mode calls a TensorRT backend model hundreds of times, and the processing time increases as the number of CPU cores decreases
Triton Information
nvcr.io/nvidia/tritonserver:24.05-py3

To Reproduce
My BLS code looks like this: model.py in BLS calls t2s_sdec, platform: "tensorrt_plan"

model transformation

/usr/src/tensorrt/bin/trtexec --onnx=nahida_t2s_encoder_sim.onnx \
--shapes=ref_seq:1x40,text_seq:1x100,ref_bert:40x1024,text_bert:100x1024,ssl_content:1x768x350  \
--minShapes=ref_seq:1x1,text_seq:1x1,ref_bert:1x1024,text_bert:1x1024,ssl_content:1x768x240 \
--optShapes=ref_seq:1x40,text_seq:1x100,ref_bert:40x1024,text_bert:100x1024,ssl_content:1x768x350 \
--maxShapes=ref_seq:1x500,text_seq:1x500,ref_bert:500x1024,text_bert:500x1024,ssl_content:1x768x500 \
--saveEngine=nahida_t2s_encoder_sim.engine

As the for loop increases, the input gradually becomes larger
Set in t2s sdec/config.json parameters: {key: "FORCE_CPU_ONLY_INPUT_TENSORS" value: {string_value:"no"}}
When the number of CPU cores is 100, 387 times , the totaltime is 2s, the other time is 300ms
When the number of CPU cores is 24, 387 times, the totaltime is 5s, the other time is 600ms

The change in number of CPU cores is set when docker is started --cpuset-cpus=0-23
There is no interference from other processes
Expected behavior
I hope the decrease in the number of CPU cores will not affect the overall process time

CallmeZhangChenchen mentioned this issue Jun 26, 2024

In BLS mode, does the data go to the CPU #7364

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

As the number of CPU cores decreases, the BLS mode processing time increases #7373

As the number of CPU cores decreases, the BLS mode processing time increases #7373

callmezhangchenchenokay commented Jun 25, 2024 •

edited

Loading

As the number of CPU cores decreases, the BLS mode processing time increases #7373

As the number of CPU cores decreases, the BLS mode processing time increases #7373

Comments

callmezhangchenchenokay commented Jun 25, 2024 • edited Loading

callmezhangchenchenokay commented Jun 25, 2024 •

edited

Loading