Skip to content

Actions: NVIDIA/TransformerEngine

Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,273 workflow runs
3,273 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[PyTorch] Miscellaneous fixes for FA3 attention
Documentation #5070: Pull request #1174 synchronize by cyanguwa
October 7, 2024 23:52 1m 11s cyanguwa:add_descales
October 7, 2024 23:52 1m 11s
[PyTorch] Miscellaneous fixes for FA3 attention
Documentation #5069: Pull request #1174 synchronize by cyanguwa
October 7, 2024 23:42 1m 3s cyanguwa:add_descales
October 7, 2024 23:42 1m 3s
[PyTorch] Miscellaneous fixes for FA3 attention
Documentation #5068: Pull request #1174 synchronize by pre-commit-ci bot
October 7, 2024 23:40 55s cyanguwa:add_descales
October 7, 2024 23:40 55s
[PyTorch] Miscellaneous fixes for FA3 attention
Documentation #5067: Pull request #1174 synchronize by cyanguwa
October 7, 2024 23:40 57s cyanguwa:add_descales
October 7, 2024 23:40 57s
[TE/JAX] Enabling CudaGraph for custom calls with FFI
Documentation #5063: Pull request #1228 opened by phu0ngng
October 7, 2024 19:53 56s phu0ngng:jax_cuda_graph
October 7, 2024 19:53 56s
[PyTorch] Drop FA2 as an installation requirement
Documentation #5060: Pull request #1226 synchronize by pre-commit-ci bot
October 7, 2024 16:45 Action required cyanguwa:make_fa_optional
October 7, 2024 16:45 Action required
[PyTorch] Drop FA2 as an installation requirement
Documentation #5059: Pull request #1226 opened by cyanguwa
October 7, 2024 16:44 53s cyanguwa:make_fa_optional
October 7, 2024 16:44 53s
Fix cuDNN sliding window size
Documentation #5058: Pull request #1212 synchronize by ksivaman
October 7, 2024 16:29 1m 1s cyanguwa:fix_cudnn_swa
October 7, 2024 16:29 1m 1s
Hierarchical CP implementation (Ulysses + Ring)
Documentation #5057: Pull request #1209 synchronize by xrennvidia
October 7, 2024 06:49 1m 5s xrennvidia:xren/cp_a2a_p2p
October 7, 2024 06:49 1m 5s
[PyTorch] Miscellaneous fixes for FA3 attention
Documentation #5056: Pull request #1174 synchronize by cyanguwa
October 6, 2024 18:57 1m 9s cyanguwa:add_descales
October 6, 2024 18:57 1m 9s
[PyTorch] remove duplicate code
Documentation #5055: Pull request #1215 synchronize by ksivaman
October 6, 2024 15:18 52s emmanuel-ferdman:main
October 6, 2024 15:18 52s
[PyTorch] Add description for _extra_state in different TE versions
Documentation #5053: Pull request #1223 synchronize by pre-commit-ci bot
October 6, 2024 04:28 Action required cyanguwa:fix_extra_states
October 6, 2024 04:28 Action required
Small fixes to Float8Tensor
Documentation #5051: Pull request #1225 synchronize by pre-commit-ci bot
October 4, 2024 23:58 1m 4s ptrendx:pr_float8_tensor_fixes
October 4, 2024 23:58 1m 4s
Small fixes to Float8Tensor
Documentation #5050: Pull request #1225 opened by ptrendx
October 4, 2024 23:58 1m 6s ptrendx:pr_float8_tensor_fixes
October 4, 2024 23:58 1m 6s
Tests for distributed
Documentation #5048: Pull request #1196 synchronize by pggPL
October 4, 2024 09:26 1m 5s pggPL:distributed_tests
October 4, 2024 09:26 1m 5s
Hierarchical CP implementation (Ulysses + Ring)
Documentation #5047: Pull request #1209 synchronize by xrennvidia
October 4, 2024 01:47 1m 10s xrennvidia:xren/cp_a2a_p2p
October 4, 2024 01:47 1m 10s