[Tripy] Permit eval while tracing, but do not update trace. #443

Closed · wants to merge 2 commits

Conversation

slyubomirsky (Collaborator)

Addresses #409. Evaluation while tracing still gives a warning but does not alter the graph, so it does not produce any errors.

slyubomirsky added the tripy label (Pull request for the tripy project) on Dec 12, 2024
@slyubomirsky (Collaborator, Author)

I am not fully certain how the storage mechanism described in the subsequent comment on #409 should work. If the main issue is recompilation caused by the fact that we do not update the trace graph when we evaluate while tracing, perhaps one approach could be to insert the Storage op anyway but restore the original graph after compilation.
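To make the idea concrete, here is a minimal sketch of what that could look like (hypothetical helper names; only `Storage.build_internal`, `trace_tensor`, and `producer` come from the actual diff):

```python
# Hypothetical sketch, not the PR's actual code: eval during tracing still
# inserts a Storage op (so repeated evals reuse the materialized data), but
# we remember each displaced producer so it can be restored after compile.
reverted_producers = []  # (trace_tensor, original_producer) pairs

def eval_during_trace(tensor, data):
    reverted_producers.append((tensor.trace_tensor, tensor.trace_tensor.producer))
    Storage.build_internal([], [tensor.trace_tensor], data)

def restore_graph_after_compile():
    # Undo the substitutions so the compiled trace matches the original graph.
    for trace_tensor, original_producer in reverted_producers:
        trace_tensor.producer = original_producer
    reverted_producers.clear()
```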

```diff
@@ -265,8 +269,15 @@ def __repr__(self) -> str:
         data_list = self.tolist()

-        assert isinstance(self.trace_tensor.producer, Storage)
-        data_shape = self.trace_tensor.producer.shape
+        if isinstance(self.trace_tensor.producer, Storage):
```
Collaborator:

Does the memref have shape information? If so, it would probably be cleaner to use the memref directly rather than tolist().

Comment on lines 230 to 231:

```python
if not self.trace_tensor.is_compile_tracer:
    Storage.build_internal([], [self.trace_tensor], data)
```
Collaborator:

The problem with this is that if we evaluate multiple tensors in a compiled graph, we'll get quadratic time complexity. What I suggested in #409 (comment) is to store the evaluated result but use it only for evaluation, not for tracing. I'm not sure exactly how that would work, but I expect it would require changes to how we trace during eval().
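For intuition about the quadratic blow-up, here is a toy calculation (pure Python, no Tripy APIs): if eval results are never cached in the trace, evaluating each of N intermediate tensors re-executes its entire prefix of the graph.

```python
# Toy model of a linear chain of N ops where eval results are not cached:
# evaluating the k-th tensor re-runs all k ops that lead up to it.
ops_executed = 0
N = 100

for k in range(1, N + 1):
    ops_executed += k  # eval of tensor k re-executes ops 1..k

# Total work is 1 + 2 + ... + N = N * (N + 1) / 2, i.e. O(N^2).
assert ops_executed == N * (N + 1) // 2
```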

@pranavm-nvidia (Collaborator)

> I am not fully certain how the storage mechanism described in the subsequent comment on #409 should work. If the main issue is recompilation caused by the fact that we do not update the trace graph when we evaluate while tracing, perhaps one approach could be to insert the Storage op anyway but restore the original graph after compilation.

Could you elaborate on this approach? Sounds promising.

@slyubomirsky (Collaborator, Author) commented Dec 12, 2024

I haven't tried implementing this yet, but if we could record which ops we changed during the trace, we could avoid recompilation during the trace and then change them back afterwards.

slyubomirsky force-pushed the no-error-for-eval-while-tracing branch from dbeccd0 to f89fd44 on December 17, 2024 at 04:39
Comment on lines +235 to +236:

```python
if REVERT_GRAPH_AFTER_COMPILING is not None:
    REVERT_GRAPH_AFTER_COMPILING.append((self.trace_tensor, self.trace_tensor.producer))
```
slyubomirsky (Author):

This might be a bit of a clumsy approach to keeping the old producers around (it required adding a global variable to the compiler). I wasn't sure where else the data structure could live, since evaluation is handled as a method on tensors. It's a tricky issue because the tensor being evaluated could be anywhere in the middle of the graph, and the interface to compile is an opaque Callable.
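For context, a rough sketch of how the compiler side might drive this global (only `REVERT_GRAPH_AFTER_COMPILING` and `producer` come from the diff; `trace_function` and `lower_and_compile` are placeholders, not Tripy's real API):

```python
# Hypothetical sketch of the compiler-side bookkeeping.
REVERT_GRAPH_AFTER_COMPILING = None

def compile(func, *args):
    global REVERT_GRAPH_AFTER_COMPILING
    REVERT_GRAPH_AFTER_COMPILING = []  # tensors evaluated mid-trace append here
    try:
        trace = trace_function(func, *args)
        executable = lower_and_compile(trace)
    finally:
        # Restore the producers that eval() replaced with Storage ops, so the
        # user-visible graph is unchanged once compilation finishes.
        for trace_tensor, original_producer in REVERT_GRAPH_AFTER_COMPILING:
            trace_tensor.producer = original_producer
        REVERT_GRAPH_AFTER_COMPILING = None  # disabled outside compilation
    return executable
```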

Collaborator:

What about storing it on the trace tensors themselves? original_producer or something like that?
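A hedged sketch of that alternative, with `original_producer` stored directly on the trace tensor (`Storage.build_internal`, `trace_tensor`, `producer`, and `is_compile_tracer` follow the diffs above; the rest is illustrative):

```python
# Illustrative sketch: stash the displaced producer on the trace tensor
# itself instead of in a compiler-owned global.
def eval_during_trace(tensor, data):
    tt = tensor.trace_tensor
    if tt.is_compile_tracer and getattr(tt, "original_producer", None) is None:
        tt.original_producer = tt.producer  # remember what eval displaces
    Storage.build_internal([], [tt], data)
```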

Collaborator:

I guess in that case we would need to do a DFS from the output and swap the producer back before stepping further.
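Something like the following (an illustrative sketch; the `inputs` attribute on ops and `original_producer` on trace tensors are assumptions):

```python
# Illustrative DFS: walk back from the outputs, swap each tensor's producer
# back to its original_producer, then continue through the restored op's
# inputs so the traversal sees the pre-eval graph.
def restore_original_producers(outputs):
    seen = set()
    stack = list(outputs)
    while stack:
        trace_tensor = stack.pop()
        if id(trace_tensor) in seen:
            continue
        seen.add(id(trace_tensor))
        original = getattr(trace_tensor, "original_producer", None)
        if original is not None:
            trace_tensor.producer = original  # undo the Storage substitution
            trace_tensor.original_producer = None
        if trace_tensor.producer is not None:
            stack.extend(trace_tensor.producer.inputs)
```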

slyubomirsky (Author):

Yeah, searching the graph might be necessary if there isn't anywhere else we could record the tensors that have been evaluated.

@slyubomirsky (Collaborator, Author)

Closing per discussion: we couldn't think of any cases where evaluating while tracing is actually useful, especially since evaluating without using the result (e.g., just printing) already worked. We can revisit if such a case does present itself.
