Feat (gpfq): separate float and quant forward pass for speedup #955

fabianandresgrob · 2024-05-16T15:36:01Z

Speeding up GPFQ with separate forward passes for quantized and float input. I avoided offloading the float input to disc and instead saved them under an attribute for GPFQ and simply moved them off the GPU when collected.

fabianandresgrob requested a review from Giuseppe5 May 16, 2024 15:36

fabianandresgrob marked this pull request as ready for review May 31, 2024 11:27

fabianandresgrob force-pushed the gpfq_offload_float_acts branch 2 times, most recently from bece0c6 to 9d886c0 Compare June 3, 2024 10:12

fabianandresgrob added 3 commits June 18, 2024 10:00

Feat (gpfq): separate float and quant forward pass for speedup

a7b9b01

Feat (gpfq): adding example code

1e05f98

Feat (GPFQ): offload float input to disc

bbc025c

fabianandresgrob force-pushed the gpfq_offload_float_acts branch from a5e4405 to bbc025c Compare June 18, 2024 11:48

Fix (GPFQ): change offloading to use torch

9f6ade9

fabianandresgrob force-pushed the gpfq_offload_float_acts branch from af9c229 to 9f6ade9 Compare June 20, 2024 15:18

Giuseppe5 closed this Oct 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat (gpfq): separate float and quant forward pass for speedup #955

Feat (gpfq): separate float and quant forward pass for speedup #955

fabianandresgrob commented May 16, 2024

Feat (gpfq): separate float and quant forward pass for speedup #955

Feat (gpfq): separate float and quant forward pass for speedup #955

Conversation

fabianandresgrob commented May 16, 2024