
Remove CUDA synchronizations by slicing input tensor with int instead of CUDA tensors in nn.LinearEmbeddingEncoder #432

Merged (4 commits) on Aug 12, 2024

Conversation

akihironitta
Member

The start_idx and end_idx values used in feat.values[:, start_idx:end_idx] are CUDA tensors, so each slice forces a device-to-host synchronization. Converting them to Python ints before slicing removes these synchronizations.
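A minimal sketch of the pattern (the variable names below are illustrative stand-ins, not the actual torch_frame code): slicing with 0-dim tensor bounds calls `__index__` on each bound, which on a CUDA tensor blocks until the value is copied to the host, while converting the offsets to Python ints once up front avoids that per-slice sync.

```python
import torch

# Stand-in for feat.values: 2 rows, 6 feature columns.
values = torch.arange(12.0).view(2, 6)
# Column boundaries per feature group (would be a CUDA tensor on GPU).
offsets = torch.tensor([0, 2, 6])

# Before: slicing with 0-dim tensors. On a CUDA tensor, reading each
# bound via __index__ triggers a host synchronization.
cols_before = [values[:, offsets[i]:offsets[i + 1]] for i in range(2)]

# After: convert the offsets to Python ints once (a single sync on GPU),
# then slice with plain ints inside the loop.
bounds = offsets.tolist()
cols_after = [values[:, bounds[i]:bounds[i + 1]] for i in range(2)]

# Both approaches produce identical slices.
for a, b in zip(cols_before, cols_after):
    assert torch.equal(a, b)
```

On CPU tensors the two versions behave identically; the benefit only appears on CUDA, where the `tolist()` version issues one synchronization instead of one per slice bound.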

@akihironitta akihironitta changed the title Remove CUDA synchronizations by slicing input tensor with int instead of CUDA tensors Remove CUDA synchronizations by slicing input tensor with int instead of CUDA tensors in nn.LinearEmbeddingEncoder Aug 10, 2024
@akihironitta akihironitta requested a review from rusty1s August 10, 2024 15:13
Contributor

@yiweny yiweny left a comment


Good catch. Thank you!

torch_frame/nn/encoder/stype_encoder.py (resolved)
@akihironitta akihironitta merged commit 1f4c4b8 into master Aug 12, 2024
13 checks passed
@akihironitta akihironitta deleted the aki-rm-cuda-sync branch August 12, 2024 09:02