Replies: 2 comments
-
Previously, PyTorch 2 would occasionally core dump during inference in my environment (regardless of whether torch.compile was used), so I rolled back to 1 for the time being.
-
v4 will include a configuration option to enable compile.
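A minimal sketch of how such a toggle could be wired up, assuming hypothetical key names (enable_compile, compile_mode) rather than the actual v4 option:

# Hypothetical gating logic; the config key names here are illustrative only,
# not the real v4 option names.
import torch

cfg = {"enable_compile": True, "compile_mode": "reduce-overhead"}

def maybe_compile(module: torch.nn.Module) -> torch.nn.Module:
    # Wrap the module with torch.compile only when the config asks for it.
    if cfg["enable_compile"]:
        return torch.compile(module, mode=cfg["compile_mode"])
    return module

# e.g. mortal = maybe_compile(mortal)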
-
I recently tried the compile feature in PyTorch 2.x; with it enabled, training speed improved by roughly 10%. One small regret is that test_play cannot be optimized for now, since its input and output batch sizes are not fixed.
PyTorch version: 2.1.0.dev20230315 py3.10_cuda11.8_cudnn8.7.0_0
Mode:
mortal = torch.compile(mortal, mode="reduce-overhead")
current_dqn = torch.compile(current_dqn, mode="reduce-overhead")
next_rank_pred = torch.compile(next_rank_pred, mode="reduce-overhead")
Result: ~6.8 batch/s → ~7.5 batch/s
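One possible direction for the test_play limitation, stated as an assumption rather than a verified fix: torch.compile accepts dynamic=True, which asks the compiler to generate shape-tolerant kernels instead of recompiling for every new batch size.

# Untested sketch: compile with dynamic shape support so that varying batch
# sizes do not trigger a recompile per shape. Whether this actually speeds up
# test_play here has not been measured.
import torch
model = torch.nn.Linear(8, 8)          # stand-in for the actual network
model = torch.compile(model, dynamic=True)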