Skip to content

Actions: NVIDIA/TransformerEngine

Deploy nightly docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
536 workflow runs
536 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[PyTorch] Fix AttentionParams comparison logic (#1397)
Deploy nightly docs #769: Commit 7aa8118 pushed by cyanguwa
January 21, 2025 18:21 1m 35s main
January 21, 2025 18:21 1m 35s
[JAX] Consolidate the distributed fused attention test code (#1405)
Deploy nightly docs #768: Commit 6e84892 pushed by mgoldfarb-nvidia
January 17, 2025 04:08 1m 30s main
January 17, 2025 04:08 1m 30s
[PyTorch] te.Linear FP8 DGRAD+RS output bugfix (#1412)
Deploy nightly docs #767: Commit c2937c5 pushed by denera
January 16, 2025 20:32 1m 34s main
January 16, 2025 20:32 1m 34s
Make it an option to compile activation functions with fast math (#1410)
Deploy nightly docs #766: Commit 3d63cbb pushed by ksivaman
January 15, 2025 18:12 1m 33s main
January 15, 2025 18:12 1m 33s
[PyTorch] Adding TP overlap support for te.Linear with `parallel_mo…
Deploy nightly docs #765: Commit 2402406 pushed by denera
January 13, 2025 20:24 1m 31s main
January 13, 2025 20:24 1m 31s
Fix "refractor" typo in the PR template (#1402)
Deploy nightly docs #764: Commit cbc4653 pushed by timmoon10
January 13, 2025 19:28 1m 33s main
January 13, 2025 19:28 1m 33s
[JAX] Test_multiprocessing_encoder with process spawn in bash (#1394)
Deploy nightly docs #763: Commit a65ad37 pushed by phu0ngng
January 11, 2025 00:53 2m 30s main
January 11, 2025 00:53 2m 30s
Take token count quantization of fused attention into consideration f…
Deploy nightly docs #762: Commit 7b861e7 pushed by xrennvidia
January 10, 2025 09:47 1m 24s main
January 10, 2025 09:47 1m 24s
clean CP implementation for flash attention and cuDNN 9.6 (#1387)
Deploy nightly docs #761: Commit 560bccf pushed by xrennvidia
January 8, 2025 18:09 1m 32s main
January 8, 2025 18:09 1m 32s
[JAX] Correct fused attention output after each step of ring attentio…
Deploy nightly docs #760: Commit a4cb1d1 pushed by mgoldfarb-nvidia
January 8, 2025 16:09 1m 38s main
January 8, 2025 16:09 1m 38s
bug fix for using return_layernorm_output=True (#1382)
Deploy nightly docs #759: Commit 61cf102 pushed by timmoon10
January 8, 2025 02:07 1m 23s main
January 8, 2025 02:07 1m 23s
[JAX] Add THD + SWA unit tests (#1390)
Deploy nightly docs #758: Commit b898cbe pushed by zlsh80826
January 8, 2025 00:31 1m 20s main
January 8, 2025 00:31 1m 20s
Update copyright to include 2025 (#1388)
Deploy nightly docs #757: Commit c9ea6be pushed by ksivaman
January 2, 2025 22:21 1m 18s main
January 2, 2025 22:21 1m 18s
[common/PyTorch] Add cuDNN SWA (left, 0) + padding + bottom right cau…
Deploy nightly docs #756: Commit 838345e pushed by cyanguwa
December 20, 2024 05:32 1m 15s main
December 20, 2024 05:32 1m 15s
[JAX] Move parallel encoder tests to L0 distributed test set. (#1356)
Deploy nightly docs #755: Commit a3b32ec pushed by phu0ngng
December 18, 2024 15:47 1m 38s main
December 18, 2024 15:47 1m 38s
[PyTorch] Fix get_swa_mask() for padding masks (#1281)
Deploy nightly docs #754: Commit f033498 pushed by cyanguwa
December 18, 2024 02:15 1m 39s main
December 18, 2024 02:15 1m 39s
[PyTorch] Add weights_only=False for torch.load (#1374)
Deploy nightly docs #753: Commit 83dac8c pushed by cyanguwa
December 18, 2024 02:15 1m 40s main
December 18, 2024 02:15 1m 40s
[JAX] Fused attention unit tests fixes and refinements (#1352)
Deploy nightly docs #752: Commit 7f5c784 pushed by zlsh80826
December 17, 2024 07:41 1m 23s main
December 17, 2024 07:41 1m 23s
[common] Add max_t support for KV in THD (#1370)
Deploy nightly docs #751: Commit f4f35c2 pushed by cyanguwa
December 17, 2024 03:57 1m 20s main
December 17, 2024 03:57 1m 20s
Enabling FP8 all-gather for TE Float8Tensor when using Torch FSDP2 (#…
Deploy nightly docs #750: Commit 0196ed4 pushed by youngeunkwon0405
December 16, 2024 23:39 1m 16s main
December 16, 2024 23:39 1m 16s
[JAX] Bug Fix: Softmax FFIs with correct Encapsulates (#1375)
Deploy nightly docs #749: Commit 1975ace pushed by phu0ngng
December 14, 2024 17:09 1m 23s main
December 14, 2024 17:09 1m 23s
Fix an invalid reference in the doc (#1362)
Deploy nightly docs #748: Commit 1ae8190 pushed by denera
December 14, 2024 02:09 1m 28s main
December 14, 2024 02:09 1m 28s
Add user to CI (#1371)
Deploy nightly docs #747: Commit e7bfc0c pushed by phu0ngng
December 12, 2024 22:16 1m 22s main
December 12, 2024 22:16 1m 22s
[JAX] Bug fix for distributed normalization (#1366)
Deploy nightly docs #746: Commit 0e1d9fa pushed by phu0ngng
December 12, 2024 13:00 1m 41s main
December 12, 2024 13:00 1m 41s
[JAX] Use default factory for not sharing mutable default values (#1364)
Deploy nightly docs #745: Commit e4c99b0 pushed by phu0ngng
December 10, 2024 17:31 1m 38s main
December 10, 2024 17:31 1m 38s