Module Slicing #115

alexander-g · 2020-11-24T07:26:20Z

Experimental support for slicing modules. e.g:

x = jnp.zeros((2, 224, 224, 3))
resnet = elegy.nets.resnet.ResNet18()
submodule = elegy.module_slicing.slice_module_from_to(
            resnet,
            start_module=None,
            end_module=["/res_net_block_1", "/res_net_block_3", "/res_net_block_5", "/res_net_block_7" ],
            sample_input=x,
        )
outputs = elegy.Model(submodule).predict(x)
assert outputs[0].shape == (2, 56, 56, 64)
assert outputs[1].shape == (2, 28, 28, 128)
assert outputs[2].shape == (2, 14, 14, 256)
assert outputs[3].shape == (2, 7, 7, 512)

This currently requires the additional package networkx. This could be removed with some more work if you don't want to introduce another dependency.

Limitations:

All operations between start_module and end_module must be performed by modules
i.e. jax.nn.relu() or x+1 is not allowed but can be converted by wrapping with elegy.to_module()
Only one input module is supported
All modules between start_module and end_module must have a single input and a single output
Resulting module is currently not trainable: .get_parameters() does not return any weights. Need some hints how to fix that.

codecov-io · 2020-11-25T11:57:42Z

Codecov Report

Merging #115 (2c9cf1c) into master (87e18c1) will increase coverage by 0.81%.
The diff coverage is 98.56%.

@@            Coverage Diff             @@
##           master     #115      +/-   ##
==========================================
+ Coverage   85.04%   85.86%   +0.81%     
==========================================
  Files         131      133       +2     
  Lines        6963     7300     +337     
==========================================
+ Hits         5922     6268     +346     
+ Misses       1041     1032       -9

Impacted Files	Coverage Δ
elegy/module_test.py	`99.30% <ø> (ø)`
elegy/module_slicing.py	`97.76% <97.76%> (ø)`
elegy/module_slicing_test.py	`99.33% <99.33%> (ø)`
elegy/hooks.py	`85.61% <100.00%> (ø)`
elegy/hooks_test.py	`100.00% <100.00%> (ø)`
elegy/model/model.py	`96.41% <100.00%> (ø)`
elegy/model/model_core.py	`90.04% <100.00%> (+0.09%)`	⬆️
elegy/module.py	`95.52% <100.00%> (+0.26%)`	⬆️
elegy/types.py	`94.35% <100.00%> (+0.08%)`	⬆️
... and 3 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 87e18c1...2c9cf1c. Read the comment docs.

alexander-g · 2020-11-25T11:59:37Z

The resulting module can now be retrained.

cgarciae · 2020-11-25T17:25:13Z

Hey @alexander-g ! This is amazing 😃 Can you share a bit about what strategy you are using to construct the new model?
I think this is a key feature to enable easier Transfer Learning and thus making Elegy more appealing for real use-cases.

cgarciae · 2020-11-25T17:26:07Z

cc @charlielito

alexander-g · 2020-11-25T18:51:47Z

I collect information about the main module and its submodules from the summaries feature. Then I construct a directed graph with the modules representing the edges and the inputs/outputs of the modules representing the nodes. If the outputs of module A are the same as the inputs of module B (as returned by id() ^1) then they are connected.
Then I simply search for the shortest path between start_module and the end_module and finally go along this path and execute the corresponding edges/modules.

^1: luckily this seems to work well in JAX, even when you do a x+=1, the value returned by id() changes, whereas in numpy it stays the same.

elegy/module.py

…now possible

alexander-g · 2020-12-04T10:15:47Z

Modules with multiple inputs between start_module and end_module should now work too (e.g. skip connections in ResNet or U-Net). However this is getting more complex than I initially thought. I try to add as many comments as possible and cover everything with test cases but I sometimes get confused by this code myself.
What would be a good API for this functionality? Maybe add this as a method: Module.slice()?

cgarciae · 2020-12-14T14:05:34Z

I'll try to test this out so I can give a better opinion about toe API. In previous version we actually had something in the spirit of slice but it was only for the parameters dictionary, Module.slice seems more intuitive IMO.

alexander-g · 2020-12-23T16:36:50Z

Added the Module.slice() method

One ugly detail: I import the module_slicing python module inside the function in module.py:line 622 because of a circular dependency. Was not able to fix that.
Moreover, because Module is the parent class of many other Modules like Conv, BatchNorm etc. mkdocs wants to add this method to the docs although it doesn't really make sense. Can I prevent that? Or should I add them anyway?

Apart from that, I think this is usable and can be merged. I'd like to use it in #126

cgarciae · 2020-12-23T17:50:42Z

One ugly detail: I import the module_slicing python module inside the function in module.py:line 622 because of a circular dependency. Was not able to fix that.

I ended moving all hooks like add_loss, add_summary, etc from elegy.hooks to module because of this. An alternative strategy is to set a dummy reference of module_slice on module and patch it on creation:

# module.py
module_slice = None
...

# modole_slice.py
import sys
from . import module

current_module = sys.modules[__name__]
module.module_slice = current_module
...

Not sure this pattern is strictly better.

charlielito · 2020-12-26T01:15:54Z

elegy/module.py

+        - all operations between `start_module` and `end_module` must be performed by modules
+            i.e. `jax.nn.relu()` or `x+1` is not allowed but can be converted by wrapping with `elegy.to_module()`
+        - only one `start_module` is supported
+        - all modules between `start_module` and `end_module` must have a single output


based on your comment this is not a limitation now?

Still is.
It's now possible to have inner modules that have multiple inputs but the result module still must have only one input. Single output limitation also holds.

charlielito · 2020-12-26T01:18:34Z

elegy/module.py

        ...
    ```

    Arguments:
        module_or_name: The name of the summary or alternatively the module that this summary will represent.
            If a summary with the same name already exists a unique identifier will be generated.
        value: The value for the summary.
+        input_values: The input arguments for the module, required for slicing.


can you elaborate on the structure of the tuple?

alexander-g · 2021-01-15T13:09:17Z

Requesting review/testing @cgarciae
This is usable and #126 depends on it.

alexander-g · 2021-02-20T16:25:08Z

Updated to 0.6.0

@cgarciae What is your opinion on this PR? Shall we continue or do you have other ideas on how to do transfer learning

cgarciae · 2021-02-20T16:50:55Z

My current opinion about this feature is this.

Pros

When it works its really simple
You can potentially use any intermediate layer

Cons

Can be brittle / silently fail if the author of the Module is not careful.
Currently only properly supported in Elegy Modules.

The only alternative I can think right now is to have an optional flag (e.g. return_multiple) that returns a dictionary with multiple outs that you would like to expose for the user e.g:

def call(...):
    if self.return_multiple:
        return dict(layer1=out1, layer2=out2, ...)
    else:
        return out

This solution is framework agnostic and easy to implement but is not automatic and requires effort from the author of the Module.

alexander-g · 2021-02-20T16:58:39Z

The only alternative I can think right now is to have an optional flag (e.g. return_multiple) that returns a dictionary with multiple outs that you would like to expose for the user e.g:

This is exactly what I want to avoid. This requires too much ahead-thinking, which is difficult to do especially if the module was written by another author. Say, I want to inspect an arbitrary inner layer of a ResNet, then I'd need to rewrite the call() of this Module

Can be brittle / silently fail if the author of the Module is not careful.

Could you explain what you mean with this or give an example? What should the author take care of? It is indeed brittle but rather because of the inner graph logic which is not yet fully "battle-tested".

cgarciae · 2021-02-20T17:05:02Z

This is exactly what I want to avoid. This requires too much ahead-thinking, which is difficult to do especially if the module was written by another author. Say, I want to inspect an arbitrary inner layer of a ResNet, then I'd need to rewrite the call() of this Module

I think we can add this feature as a show-case of what can be done even if only certain Modules support it.

Could you explain what you mean with this or give an example? What should the author take care of? It is indeed brittle but rather because of the inner graph logic which is not yet fully "battle-tested".

If I remember correctly this works with the hooks.add_summary feature and only works properly if all intermediate operations are properly added to the summaries, so a causal call to e.g. relu will not be registered and might give an incorrect slice.

cgarciae · 2021-02-22T02:56:46Z

I was told that in Pytorch you do this by taking a slice from Sequential. I like this API because its simple, and easy to implement since Sequential already has all the machinery to do this.

alexander-g · 2021-02-22T06:12:25Z

I was told that in Pytorch you do this by taking a slice from Sequential.

Sounds limited.

torchvision creates an additional class IntermediateLayerGetter to extract intermediate feature maps from a ResNet. I don't like this approach at all, it's not simple for the user.

I've started experimenting with a new method based on jax.make_jaxpr. This would allow using arbitrary JAX code like jax.nn.relu(x) or x+1, but start and end targets still need to be Elegy modules.

cgarciae · 2021-02-23T01:08:01Z

I've started experimenting with a new method based on jax.make_jaxpr. This would allow using arbitrary JAX code like jax.nn.relu(x) or x+1, but start and end targets still need to be Elegy modules.

This seems super useful, can you show how it could look?
My main worry with something so raw is that the names might be difficult to find (I am just guessing).

alexander-g added 2 commits November 24, 2020 08:05

experimental module slicing

e622fde

added networkx dependency

83d2c0c

alexander-g force-pushed the slicing_modules branch from 62d3e3d to 83d2c0c Compare November 25, 2020 07:35

alexander-g added 3 commits November 25, 2020 10:53

sliced module is now retrainable

0ffc36f

exception handling

c4ec83d

refactoring

c525614

alexander-g marked this pull request as ready for review November 25, 2020 11:59

resnet50 fix

2f18a9a

cgarciae reviewed Dec 3, 2020

View reviewed changes

elegy/module.py Outdated Show resolved Hide resolved

alexander-g added 3 commits December 4, 2020 10:50

Slicing with multi-input modules between start_module and end_module …

a75c7e3

…now possible

Docs for add_summary

41fa916

black

15943e3

Merge branch 'master' into slicing_modules

250e783

alexander-g added 2 commits December 23, 2020 16:08

Merge branch 'master' into _slicing

f57d41b

fixing poetry

aa02024

alexander-g force-pushed the slicing_modules branch from 64bdaa4 to aa02024 Compare December 23, 2020 15:15

Module.slice()

3b3d88a

alexander-g force-pushed the slicing_modules branch from ec22e53 to 3b3d88a Compare December 23, 2020 16:32

Merge branch 'master' into slicing_modules

06cfd4c

charlielito reviewed Dec 26, 2020

View reviewed changes

alexander-g added 4 commits December 26, 2020 07:10

circular dependency fix

e77a207

can now specify inputs as an output target for Module.slice()

9a59c93

slicing deferred call bugfix

c89df43

Merge branch 'master' into slicing_modules

c2808a4

alexander-g mentioned this pull request Jan 15, 2021

[WIP] U-Net #126

Draft

7 tasks

alexander-g added 3 commits February 10, 2021 09:04

Merge branch 'master' into slicing_modules

f5d669d

update to 0.6.0

d908398

Merge branch 'master' into slicing_modules

330c3e9

alexander-g force-pushed the slicing_modules branch from 14046c2 to e9ae2c2 Compare February 20, 2021 16:17

test fixes and black

2c9cf1c

alexander-g force-pushed the slicing_modules branch from e9ae2c2 to 2c9cf1c Compare February 20, 2021 16:20

cgarciae closed this Feb 22, 2021

cgarciae reopened this Feb 22, 2021

alexander-g mentioned this pull request Feb 28, 2021

Module Slicing 2 #169

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Module Slicing #115

Module Slicing #115

alexander-g commented Nov 24, 2020

codecov-io commented Nov 25, 2020 •

edited

Loading

alexander-g commented Nov 25, 2020

cgarciae commented Nov 25, 2020

cgarciae commented Nov 25, 2020

alexander-g commented Nov 25, 2020

alexander-g commented Dec 4, 2020

cgarciae commented Dec 14, 2020 •

edited

Loading

alexander-g commented Dec 23, 2020

cgarciae commented Dec 23, 2020

charlielito Dec 26, 2020

alexander-g Dec 26, 2020

charlielito Dec 26, 2020

alexander-g commented Jan 15, 2021

alexander-g commented Feb 20, 2021

cgarciae commented Feb 20, 2021 •

edited

Loading

alexander-g commented Feb 20, 2021

cgarciae commented Feb 20, 2021 •

edited

Loading

cgarciae commented Feb 22, 2021

alexander-g commented Feb 22, 2021

cgarciae commented Feb 23, 2021 •

edited

Loading

Module Slicing #115

Are you sure you want to change the base?

Module Slicing #115

Conversation

alexander-g commented Nov 24, 2020

codecov-io commented Nov 25, 2020 • edited Loading

Codecov Report

alexander-g commented Nov 25, 2020

cgarciae commented Nov 25, 2020

cgarciae commented Nov 25, 2020

alexander-g commented Nov 25, 2020

alexander-g commented Dec 4, 2020

cgarciae commented Dec 14, 2020 • edited Loading

alexander-g commented Dec 23, 2020

cgarciae commented Dec 23, 2020

charlielito Dec 26, 2020

Choose a reason for hiding this comment

alexander-g Dec 26, 2020

Choose a reason for hiding this comment

charlielito Dec 26, 2020

Choose a reason for hiding this comment

alexander-g commented Jan 15, 2021

alexander-g commented Feb 20, 2021

cgarciae commented Feb 20, 2021 • edited Loading

Pros

Cons

alexander-g commented Feb 20, 2021

cgarciae commented Feb 20, 2021 • edited Loading

cgarciae commented Feb 22, 2021

alexander-g commented Feb 22, 2021

cgarciae commented Feb 23, 2021 • edited Loading

codecov-io commented Nov 25, 2020 •

edited

Loading

cgarciae commented Dec 14, 2020 •

edited

Loading

cgarciae commented Feb 20, 2021 •

edited

Loading

cgarciae commented Feb 20, 2021 •

edited

Loading

cgarciae commented Feb 23, 2021 •

edited

Loading