Implement tool for saved Keras model files inspection, diff and patching #19768

pmasousa · 2024-05-28T12:48:24Z

It consists of three sub features, that allow:
Visualization of the contents of .keras and .weights.h5 files in a notebook, where you can further expand the contents up until the weights of the file. Diff of a model compared to a reference model, being presented side by side with the differences highlighted. Patching a model, where it is possible to change a layer's name and changing a specified weight of a model.

Please let us know if this is what you had in mind.

closes #19705

It consists of three sub features, that allow: Visualization of the contents of .keras and .weights.h5 files, where you can further expand the contents up until the weights of the file. Diff of a model compared to a reference model, being presented side by side with the differences highlighted. Patching a model, where it is possible to change a layer's name and changing a specified weight of a model. Co-authored-by: Pedro Curto <pedro.a.curto@tecnico.ulisboa.pt>

fchollet · 2024-05-28T15:27:52Z

Thanks for the PR! Do you have a Colab notebook that demos the new features?

pmasousa · 2024-05-29T10:55:46Z

Yes, here

I couldn't implement the changes in the saved weights file, only in a .keras file.
If you could explain how to do it I would appreciate it.

fchollet · 2024-05-29T17:57:01Z

Great work! Here are some things we should do.

For every display output, there should be a shell mode (plain text with text color tags) and a notebook mode (html). We route to one or the other based on whether we detect we are in a notebook or not.
The diff isn't super useful. What we should do is:
- Compare the model structure; for instance if a layer is absent from one model but present in the other, we should highlight that. We should display the names of those layers, and the count of weights and sublayers associated with it.
- For each layer that matches (by path) across the models, we should compare the weight structure (number of weights, weights shapes, dtypes) and highlight any differences.
The edit tool can be a class, e.g.

editor = KerasFileEditor(filepath)
editor.list_layer_paths()  # Return all layer paths
editor.layer_info(layer_path)  # Show weight structure for this layer
editor.edit_layer(layer_path, new_name=..., new_vars=...)
editor.write_out(filepath)

I couldn't implement the changes in the saved weights file, only in a .keras file.

They aren't very different -- the weights file is one of the files present in the .keras file. You only need to implement these features for the weights file, then the same code will also work for the .keras file.

pedro-curto · 2024-05-31T15:49:02Z

Thank you very much for your feedback, we really appreciate it!

We're currently on our degree's final two weeks, so everything is very intensive right now and we barely have any time because of the projects. We would really like to make those changes and get back to you because we enjoyed working on this feature and would like it to be as good as possible and according to your needs and the specification. We will get back to you and make the required changes starting next week, if that is ok with you. Again, thanks for your time, patience and feedback!

fchollet · 2024-05-31T16:44:18Z

Sure -- there's no rush! Thanks for working on this!

Implemented the solicited changes: Changed inspect_file to differentiate between shell mode (with plain text with text color tags) and notebook mode (using HTML) Changed the diff functionality to match the solicited requirements (comparing model structure and weight structure according to specification in PR discussion) and have clearer and better output Reworked the edit tool to be a class and have the solicited methods (listing layer paths, showing weight structure, editing layers and writing out to a path) Co-authored-by: Pedro Curto <pedro.a.curto@tecnico.ulisboa.pt>

pedro-curto · 2024-06-30T00:45:30Z

Hello. Sorry for taking so long.
We've made a commit with the changes that we believe that match your feedback on the things we should do on the tool. We have a link to a colab that we prepared, in case you want to test the functionalities and see what we changed and how in an interactive way: it's this colab. Is this what you had in mind?
Any feedback is greatly appreciated, and thank you for your time and patience with us so far!

fchollet · 2024-07-06T00:33:02Z

Thanks for the update -- the functionality looks great! I think the interface could look more professional though. Maybe we can first focus on the HTML version of the interface (for Colab / notebooks) and then we can figure out later what the CLI / text-only version should look like?

pedro-curto · 2024-07-07T22:32:34Z

Glad you liked it!
I didn't understand what you meant by referring that the interface could look more professional, could you clarify a bit for us to know what we should change? Thanks for guiding us until now!

fchollet · 2024-07-08T05:11:43Z

I didn't understand what you meant by referring that the interface could look more professional, could you clarify a bit for us to know what we should change?

Sure. What you had at the very end of this notebook was quite nice, for instance. Penzai is also reasonably nice.

pedro-curto · 2024-07-08T07:11:50Z

Just to clarify, you would like us to make the compare_models interface look more professional, like the inspect_file interface?

fchollet · 2024-07-08T18:39:04Z

Just to clarify, you would like us to make the compare_models interface look more professional, like the inspect_file interface?

Yes, exactly -- preferably something with interactive HTML. It could list the layers for which there was a discrepancy, and clicking on the layer would reveal the issue. Interactiveness enables greater UX clarity.

pedro-curto · 2024-07-09T21:13:15Z

Okay, thank you for explaining. Me and my friend are currently working so this will only be possible to do in our free time, but we will keep you updated if there is progress!

github-actions · 2024-07-27T01:52:09Z

This PR is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

pedro-curto · 2024-08-08T12:00:05Z

I'd really like to make the required changes but it's been impossible to have time, so I'm commenting for the PR not to close.

fchollet · 2024-09-22T23:22:23Z

Thank you for the contributions here so far -- we merged a subset of this feature in keras/src/saving/file_editor.py. Currently it only really works with the CLI and not HTML/js, but you are more than welcome to contribute improvements in this direction going forward!

google-ml-butler bot added the size:L label May 28, 2024

google-ml-butler bot assigned gbaned May 28, 2024

gbaned requested a review from fchollet June 3, 2024 05:51

google-ml-butler bot added the awaiting review label Jun 3, 2024

gbaned added stat:awaiting response from contributor and removed awaiting review labels Jul 12, 2024

github-actions bot added the stale label Jul 27, 2024

google-ml-butler bot removed stale stat:awaiting response from contributor labels Aug 8, 2024

fchollet mentioned this pull request Sep 5, 2024

Add assert statement to check model structure on model_visualization_test #20208

Merged

fchollet closed this Sep 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement tool for saved Keras model files inspection, diff and patching #19768

Implement tool for saved Keras model files inspection, diff and patching #19768

pmasousa commented May 28, 2024 •

edited

Loading

fchollet commented May 28, 2024

pmasousa commented May 29, 2024

fchollet commented May 29, 2024

pedro-curto commented May 31, 2024

fchollet commented May 31, 2024

pedro-curto commented Jun 30, 2024

fchollet commented Jul 6, 2024

pedro-curto commented Jul 7, 2024

fchollet commented Jul 8, 2024

pedro-curto commented Jul 8, 2024

fchollet commented Jul 8, 2024

pedro-curto commented Jul 9, 2024

github-actions bot commented Jul 27, 2024

pedro-curto commented Aug 8, 2024

fchollet commented Sep 22, 2024

Implement tool for saved Keras model files inspection, diff and patching #19768

Implement tool for saved Keras model files inspection, diff and patching #19768

Conversation

pmasousa commented May 28, 2024 • edited Loading

fchollet commented May 28, 2024

pmasousa commented May 29, 2024

fchollet commented May 29, 2024

pedro-curto commented May 31, 2024

fchollet commented May 31, 2024

pedro-curto commented Jun 30, 2024

fchollet commented Jul 6, 2024

pedro-curto commented Jul 7, 2024

fchollet commented Jul 8, 2024

pedro-curto commented Jul 8, 2024

fchollet commented Jul 8, 2024

pedro-curto commented Jul 9, 2024

github-actions bot commented Jul 27, 2024

pedro-curto commented Aug 8, 2024

fchollet commented Sep 22, 2024

pmasousa commented May 28, 2024 •

edited

Loading