Actions: NVIDIA/TransformerEngine

Build

3,657 workflow runs

Improving communication overlap for the case of multi kernel queue usage
Build #6103: Pull request #1308 synchronize by pre-commit-ci bot
November 2, 2024 15:55 Action required youngeunkwon0405:fdl_for_merge
[JAX] Collective GEMM custom op with nvte_cublas_gemm (no comm. overlap)
Build #6101: Pull request #1307 synchronize by pre-commit-ci bot
November 2, 2024 02:30 Action required denera:jax-collective-gemm
[PyTorch] Userbuffers support in operation-based API
Build #6098: Pull request #1142 synchronize by timmoon10
November 1, 2024 21:17 1h 13m 44s timmoon10:ub-ops
[JAX] Expose cp params to jax DPA api
Build #6096: Pull request #1292 synchronize by mgoldfarb-nvidia
November 1, 2024 20:40 1h 15m 44s kocchop:faysal/expose-cp-to-jax-dpa
[JAX] Fix for Disable FusedAttn with FFI by default
Build #6095: Pull request #1304 synchronize by phu0ngng
November 1, 2024 19:43 1h 10m 51s phu0ngng:fused_attn_ffi
[JAX] Fix for Disable FusedAttn with FFI by default
Build #6094: Pull request #1304 opened by phu0ngng
November 1, 2024 15:49 1h 13m 42s phu0ngng:fused_attn_ffi
[PyTorch] Make FP8 MHA work with RoPE when CP is on
Build #6093: Pull request #1297 synchronize by yaox12
November 1, 2024 04:32 1h 9m 28s yaox12:xiny/fp8_mha_with_rope_cp
[PyTorch] Make FP8 MHA work with RoPE when CP is on
Build #6092: Pull request #1297 synchronize by yaox12
November 1, 2024 04:30 1h 6m 42s yaox12:xiny/fp8_mha_with_rope_cp
Support CUDA Graph for MoE models
Build #6091: Pull request #1233 synchronize by buptzyb
November 1, 2024 01:51 -1s buptzyb:cudagraph_moe
[PyTorch] Userbuffers support in operation-based API
Build #6090: Pull request #1142 synchronize by pre-commit-ci bot
October 31, 2024 23:05 1h 10m 16s timmoon10:ub-ops
[PyTorch] Userbuffers support in operation-based API
Build #6089: Pull request #1142 synchronize by timmoon10
October 31, 2024 23:04 1h 15m 1s timmoon10:ub-ops
[JAX] Expose cp params to jax DPA api
Build #6088: Pull request #1292 synchronize by mgoldfarb-nvidia
October 31, 2024 22:21 1h 14m 14s kocchop:faysal/expose-cp-to-jax-dpa
[PyTorch] Add heuristics for initializing FP8 params
Build #6087: Pull request #1300 synchronize by timmoon10
October 31, 2024 21:54 1h 9m 33s timmoon10:fp8-heuristic
Support using fp16 master weights and fp16/fp8 optimizer states in FusedAdam
Build #6086: Pull request #1078 synchronize by timmoon10
October 31, 2024 20:46 1h 13m 14s kunlunl:mx_fp16