quantization process takes too long #35
Comments
Hi, I encountered an OOM problem when running step 3 on a single L40S GPU. How did you solve it?
Maybe it's because my Flux model is smaller (only 8B parameters) and my L40S GPU has 45 GB of memory, which is enough.
Thanks for the reply. I also use PixArt-Sigma (800M parameters) and encountered OOM on a 45 GB L40S GPU too. I use the original config YAML; should I modify it?
There is a batch-size setting in the config file; maybe the batch size is too large for your device.
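For anyone hitting the same OOM, here is a minimal sketch of shrinking the calibration batch size before launching PTQ. The config path and the `calib_data.batch_size` key are assumptions for illustration, not this repo's actual schema, so adapt them to whatever batch-size field your config actually exposes.

```python
# Hypothetical example: halve the calibration batch size in the PTQ config
# to reduce peak GPU memory. The path and key names below are assumptions.
import yaml

cfg_path = "configs/quant/ptq_example.yaml"  # placeholder path
with open(cfg_path) as f:
    cfg = yaml.safe_load(f)

old = cfg["calib_data"]["batch_size"]        # assumed key; check your own config
cfg["calib_data"]["batch_size"] = max(1, old // 2)

with open(cfg_path, "w") as f:
    yaml.safe_dump(cfg, f)
print(f"calib batch size: {old} -> {cfg['calib_data']['batch_size']}")
```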
Thanks for your advice. I run SDXL-Turbo on my GPUs, but it takes many hours. Waiting for parallel acceleration too :)
Hello @yyfcc17, I didn't see where we can set the GPU number in the config files. May I ask how you set this?
It's in
During the PTQ process, it only uses 1 GPU, although I have set 4 GPUs in the config file (I have 4 GPUs installed).
It takes almost 52 hours to quantize my 8B Flux model (Step 3 in your README) using an L40S GPU; is this normal?
Can we accelerate the PTQ process by quantizing blocks in parallel on different GPUs? (A rough sketch of that idea follows below.)
Thanks.
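A minimal sketch of the per-block parallelization idea asked about above: assign chunks of transformer blocks to different GPUs and run PTQ on each chunk in its own process. The functions `quantize_block` and `parallel_ptq` are hypothetical placeholders, not this repo's API, and the sketch assumes calibration inputs for each block have already been cached so blocks can be processed independently.

```python
import torch.multiprocessing as mp

def quantize_block(block, device):
    """Placeholder for per-block PTQ: move the block to `device`,
    run calibration / weight quantization there, then return it on CPU."""
    block = block.to(device)
    # ... collect activation statistics, fit quantization parameters, etc.
    return block.cpu()

def _worker(rank, indexed_blocks, out_queue):
    device = f"cuda:{rank}"
    out_queue.put([(i, quantize_block(b, device)) for i, b in indexed_blocks])

def parallel_ptq(blocks, num_gpus):
    # Round-robin the blocks across GPUs so each process handles ~len(blocks)/num_gpus.
    chunks = [[] for _ in range(num_gpus)]
    for i, b in enumerate(blocks):
        chunks[i % num_gpus].append((i, b))

    ctx = mp.get_context("spawn")  # spawn is required for CUDA in subprocesses
    out_queue = ctx.Queue()
    procs = [ctx.Process(target=_worker, args=(r, chunks[r], out_queue))
             for r in range(num_gpus)]
    for p in procs:
        p.start()

    quantized = {}
    for _ in procs:  # drain the queue before joining to avoid deadlocks
        for i, b in out_queue.get():
            quantized[i] = b
    for p in procs:
        p.join()
    return [quantized[i] for i in sorted(quantized)]
```

One caveat: block-wise PTQ usually calibrates block i on the outputs of the already-quantized blocks before it, so fully independent per-block processing can change the result; a common compromise is to cache full-precision activations once and calibrate every block against those. Also, with the spawn context these functions must live in an importable module and the call to `parallel_ptq` must sit under an `if __name__ == "__main__":` guard.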