
Ownership of gpu tasks #686

Open
devreal opened this issue Oct 29, 2024 · 3 comments
Labels
enhancement New feature or request

Comments

@devreal
Contributor

devreal commented Oct 29, 2024

Description

The GPU task structure is allocated by the DSL but freed by the device management thread. This breaks ownership semantics and prevents certain optimizations, such as eliding the allocation by embedding the GPU task structure into a higher-level task object.

Describe the solution you'd like

The memory holding the GPU task should be released by the DSL that allocated it. Remove the free of that structure from the device task handling and move it into the release callback provided by the DSL (i.e., DTD and PTG).
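A minimal sketch of the intended ownership model (all type and function names below are hypothetical and only illustrate the idea, they are not the actual PaRSEC API): the DSL owns the storage, so it can embed the GPU task inside its own task object and free everything in its release callback, while the device management thread never calls free itself.

```c
#include <stdlib.h>

/* Placeholder for the device-side bookkeeping structure. */
typedef struct gpu_task_s {
    int stage;                 /* stand-in for the real fields */
} gpu_task_t;

/* A DSL-level task that embeds the GPU task instead of allocating it
 * separately -- the optimization that becomes possible once allocation
 * and release live on the same side. */
typedef struct dsl_task_s {
    /* ... DSL-specific task state ... */
    gpu_task_t gpu_task;       /* elided allocation: lives inside the task */
} dsl_task_t;

/* Release callback provided by the DSL (DTD/PTG). Only the DSL knows
 * whether gpu_task was embedded or separately allocated, so the free
 * happens here, not in the device management thread. */
static void dsl_release_task(dsl_task_t *task)
{
    /* nothing extra to free for the embedded gpu_task */
    free(task);
}

/* The device management thread merely hands the task back to its owner. */
static void device_thread_complete(dsl_task_t *task,
                                   void (*release)(dsl_task_t *))
{
    release(task);
}
```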

Describe alternatives you've considered

Adding another callback dedicated to freeing the structure, but that adds no value.

Additional context

@devreal devreal added the enhancement New feature or request label Oct 29, 2024
@bosilca
Contributor

bosilca commented Oct 30, 2024

I agree, this has bugged me for a while, but I was not able to find a clean solution. The problem is that we cannot free the gpu_task in task_release because we don't have access to the gpu_task there, and the task itself does not have a pointer back to the gpu_task.

We could make the gpu_task derive from parsec_object_t (which would give us access to a constructor/destructor), but the assumption there is that the object is automatically freed (after the destructors are called) when the refcount reaches zero. Not a great solution either.
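For reference, a sketch of what that alternative could look like with the PARSEC_OBJ-style refcounted object system (the macro and header names are assumed from parsec/class/parsec_object.h; the surrounding code is illustrative only):

```c
#include "parsec/class/parsec_object.h"

typedef struct my_gpu_task_s {
    parsec_object_t super;          /* refcount + class information */
    /* ... device task fields ... */
} my_gpu_task_t;

static void my_gpu_task_construct(my_gpu_task_t *gtask) { (void)gtask; /* init fields */ }
static void my_gpu_task_destruct(my_gpu_task_t *gtask)  { (void)gtask; /* tear down fields */ }

PARSEC_OBJ_CLASS_INSTANCE(my_gpu_task_t, parsec_object_t,
                          my_gpu_task_construct, my_gpu_task_destruct);

/* The catch: releasing the last reference frees the storage itself, which
 * assumes the object was allocated on its own (PARSEC_OBJ_NEW) and rules
 * out embedding it inside a larger, DSL-owned task structure. */
void device_thread_done(my_gpu_task_t *gtask)
{
    PARSEC_OBJ_RELEASE(gtask);      /* destructor runs, then the memory is freed */
}
```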

bosilca added a commit to bosilca/parsec that referenced this issue Oct 30, 2024
This makes the lifecycle of the device tasks symmetric: they are
allocated by the DSL and therefore shall be freed by the DSL. This
addresses the request from ICLDisco#686.

Signed-off-by: George Bosilca <gbosilca@nvidia.com>
@devreal
Contributor Author

devreal commented Oct 30, 2024

Ahh, I forgot that the task doesn't point back to the GPU task. Maybe a callback on the chore would do? The GPU task is created in a chore callback, so we might as well have a callback to destroy it. There is no use for it for host-side tasks, but I don't see that as an issue; it would just be ignored.
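A hypothetical sketch of such a create/destroy pair at the chore level; the existing fields loosely mirror a chore entry, but the added release callback and all names here are invented for illustration, not a proposal of the exact API:

```c
typedef struct illustrative_chore_s {
    int    type;                                /* device type mask                         */
    int  (*evaluate)(const void *task);         /* chore selection                          */
    int  (*hook)(void *es, void *task);         /* for device chores: creates the GPU task  */
    void (*release_gpu_task)(void *gpu_task);   /* NEW: destroys what the hook created;
                                                   NULL (ignored) for host-side chores      */
} illustrative_chore_t;
```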

@bosilca
Contributor

bosilca commented Oct 30, 2024

Look at #688. I'm trying to test it, but the new MPI detection sucks on all our machines.
