is QAT still on the roadmap for keras3? #20319
-
Is QAT still on the roadmap? (#18930 from Feb has it as a "near future" release.) I have been porting code to Keras 3 (as a way to move more to JAX) and have options for post-training quantisation, but I expect a non-trivial benefit from QAT in a number of projects. I can see a path forward by partially porting pieces of https://www.tensorflow.org/model_optimization/api_docs/python/tfmot and/or https://github.com/google/aqt, but will hold back if a QAT API is imminent? (Additionally, I might have some bandwidth to help if there are community contribution options?)
-
Bump. Though I can jump through hoops to get tfmot working (ish), the fact that it is Keras 2 compatible only is a pain :/ If Keras 3 QAT is pending I can hold out; otherwise I will have to consider dropping back to Keras 2 :( :( :(
-
QAT is still on the roadmap but the work hasn't started yet, as we've had to prioritize other things... We can definitely re-prioritize it, though. Most of the infra is already here via the work @james77777778 did for float8 training.
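For reference, that existing quantization machinery is exposed through `Model.quantize`. A minimal sketch, assuming a recent Keras 3 release where the `"int8"` and `"float8"` modes are available:

```python
import keras

# Build a small model so the quantizable layers have weights.
model = keras.Sequential([keras.layers.Dense(10)])
model.build((None, 32))

# Existing post-training / float8 quantization entry point;
# QAT would build on the same underlying infra.
model.quantize("float8")  # or "int8"
```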
-
Great! If there are any pointers you can give me @james77777778, I'd be happy to help out; I should have a bit of bandwidth for this over the next few months.
I'm happy to help with this but unfortunately I don't have enough bandwidth right now.
@matpalm I suggest starting by implementing these ops in a backend-agnostic way, as they are necessary for QAT:
tf.quantization.fake_quant_with_min_max_args
tf.quantization.fake_quant_with_min_max_vars
tf.quantization.fake_quant_with_min_max_vars_per_channel
You should be able to implement them using ops.custom_gradient; a minimal sketch follows.
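To make that concrete, here is a sketch of the first of the three ops, written against keras.ops only so it runs on any backend. The function name mirrors the TF op; the default min/max range and num_bits are illustrative choices, and unlike the TF op this version skips nudging (min, max) so that zero is exactly representable. The grad signature follows the keras.ops.custom_gradient docs, which pass upstream positionally on some backends and as a keyword on others.

```python
from keras import ops


def fake_quant_with_min_max_args(x, min_val=-6.0, max_val=6.0, num_bits=8):
    """Fake-quantize `x` to `num_bits` with a straight-through estimator.

    Sketch only: does not nudge (min_val, max_val) so that zero is
    exactly representable, which the TF op does.
    """

    @ops.custom_gradient
    def _fake_quant(x):
        quant_max = 2**num_bits - 1
        scale = (max_val - min_val) / quant_max
        # Forward pass: clip, snap to the integer grid, dequantize.
        clipped = ops.clip(x, min_val, max_val)
        quantized = ops.round((clipped - min_val) / scale)
        dequantized = quantized * scale + min_val

        def grad(*args, upstream=None):
            if upstream is None:
                (upstream,) = args
            # Straight-through estimator: gradients flow unchanged inside
            # the clipping range and are zeroed outside it.
            in_range = ops.logical_and(
                ops.greater_equal(x, min_val), ops.less_equal(x, max_val)
            )
            return upstream * ops.cast(in_range, x.dtype)

        return dequantized, grad

    return _fake_quant(x)
```

The `_vars` and `_vars_per_channel` variants should follow the same pattern, with min/max as tensors (per-channel in the latter case) and gradients returned for them as well.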