
Core dump when calling getDeviceMemorySizeForProfileV2 #4302

Open
tp-nan opened this issue Dec 26, 2024 · 4 comments
tp-nan commented Dec 26, 2024

Description

The process core dumps when calling getDeviceMemorySizeForProfileV2. When using getDeviceMemorySizeForProfile instead, everything works fine.

Environment

TensorRT Version: 10.7

NVIDIA GPU: 3080TI

NVIDIA Driver Version: 530.41.03

CUDA Version: 11.8 in docker / 12.1 for system

Steps To Reproduce

The engine is a ResNet-18 with two optimization profiles (1->4).
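A minimal sketch of the reported call pattern, since the thread does not include the actual repro code. The engine path `resnet18.engine` is a hypothetical placeholder for a serialized ResNet-18 engine built with two optimization profiles; running this requires TensorRT 10.x and an NVIDIA GPU.

```cpp
// Hedged repro sketch (assumed, not the reporter's exact code):
// deserialize an engine and query per-profile device memory with both
// the deprecated API and the V2 API that reportedly core-dumps.
#include <NvInfer.h>
#include <fstream>
#include <iostream>
#include <iterator>
#include <memory>
#include <vector>

class Logger : public nvinfer1::ILogger {
    void log(Severity severity, const char* msg) noexcept override {
        if (severity <= Severity::kWARNING) std::cerr << msg << "\n";
    }
};

int main() {
    Logger logger;

    // Hypothetical path to a serialized ResNet-18 engine with two profiles.
    std::ifstream file("resnet18.engine", std::ios::binary);
    std::vector<char> blob((std::istreambuf_iterator<char>(file)),
                           std::istreambuf_iterator<char>());

    auto runtime = std::unique_ptr<nvinfer1::IRuntime>(
        nvinfer1::createInferRuntime(logger));
    auto engine = std::unique_ptr<nvinfer1::ICudaEngine>(
        runtime->deserializeCudaEngine(blob.data(), blob.size()));

    for (int32_t p = 0; p < engine->getNbOptimizationProfiles(); ++p) {
        // Deprecated variant: works for the reporter.
        int64_t legacy = engine->getDeviceMemorySizeForProfile(p);
        // V2 variant: reported to core-dump on TensorRT 10.7.
        int64_t v2 = engine->getDeviceMemorySizeForProfileV2(p);
        std::cout << "profile " << p << ": legacy=" << legacy
                  << " v2=" << v2 << "\n";
    }
    return 0;
}
```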

@lix19937

Do you set BuilderFlag::kWEIGHT_STREAMING during engine building?


tp-nan commented Jan 6, 2025

> Do you set BuilderFlag::kWEIGHT_STREAMING during engine building?

No


lix19937 commented Jan 6, 2025

So you need to add it.


tp-nan commented Jan 6, 2025

> So you need to add it.

Thank you for the information. If it is designed this way, I would suggest adding relevant documentation. Since weight streaming requires strongly typed models, many models cannot use it.
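For reference, the suggested configuration can be sketched as follows. This is an assumed build setup, not the thread's actual build script: in TensorRT 10.x, BuilderFlag::kWEIGHT_STREAMING requires the network to be created as strongly typed via NetworkDefinitionCreationFlag::kSTRONGLY_TYPED, which is the constraint the last comment objects to.

```cpp
// Hedged build-config sketch (assumed): enable weight streaming on a
// strongly typed network. Network population and ONNX parsing are omitted.
#include <NvInfer.h>
#include <cstdint>
#include <memory>

void buildWithWeightStreaming(nvinfer1::ILogger& logger) {
    auto builder = std::unique_ptr<nvinfer1::IBuilder>(
        nvinfer1::createInferBuilder(logger));

    // Weight streaming requires a strongly typed network definition.
    auto network = std::unique_ptr<nvinfer1::INetworkDefinition>(
        builder->createNetworkV2(
            1U << static_cast<uint32_t>(
                nvinfer1::NetworkDefinitionCreationFlag::kSTRONGLY_TYPED)));

    auto config = std::unique_ptr<nvinfer1::IBuilderConfig>(
        builder->createBuilderConfig());
    // The flag suggested in this thread.
    config->setFlag(nvinfer1::BuilderFlag::kWEIGHT_STREAMING);

    // ... populate the network (e.g. via the ONNX parser), add optimization
    // profiles, then call builder->buildSerializedNetwork(*network, *config).
}
```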
