-
Notifications
You must be signed in to change notification settings - Fork 655
Pull requests: NVIDIA/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Core] MXFP8 grouped GEMM + tensor-scaled FP8 fixes
#2748
opened Mar 9, 2026 by
jberchtold-nvidia
Loading…
13 tasks
[NVFP4][MOE] Add unfused quantization fallback when input shape is not aligned
#2747
opened Mar 9, 2026 by
zhongbozhu
Loading…
13 tasks
[JAX] Add bias support for v2 grouped GEMM path
#2744
opened Mar 6, 2026 by
jberchtold-nvidia
Loading…
8 of 13 tasks
[Common] Persistent Grouped NVFP4 quantization kernel
#2743
opened Mar 6, 2026 by
Oleg-Goncharov
Loading…
8 of 13 tasks
Add guard at lowest JAX version that still supports triton kernel calling
#2741
opened Mar 6, 2026 by
tdophung
Loading…
6 of 13 tasks
[Common] Persistent Grouped MXFP8 quantization kernel
enhancement
New feature or request
MoE
#2738
opened Mar 5, 2026 by
Oleg-Goncharov
Loading…
9 of 13 tasks
Feat/cp nvshmem enhanced
community-contribution
PRs from external contributor outside the core maintainers, representing community-driven work.
#2737
opened Mar 5, 2026 by
Knight-of-Thunder
Loading…
1 of 13 tasks
[PyTorch debug] Fix issue with tp_group=None
#2733
opened Mar 4, 2026 by
pggPL
Loading…
8 of 13 tasks
Feature/unswizzle
community-contribution
PRs from external contributor outside the core maintainers, representing community-driven work.
#2732
opened Mar 4, 2026 by
int-smart
Loading…
9 of 13 tasks
fix: scope get_full_cu_seqlens cache key by device and inference mode
#2728
opened Mar 3, 2026 by
DmCarpe93
Loading…
8 of 13 tasks
Add DCP compatibility for FSDP2-TP sharding in TransformerEngine.
#2713
opened Feb 26, 2026 by
cspades
Loading…
3 of 13 tasks
Enable sm120 support for fused attn if cuDNN is 9.18.1+
#2693
opened Feb 20, 2026 by
KshitijLakhani
•
Draft
13 tasks
[JAX] Fix get_seqlens_and_offsets() to accept vmapped seg ids and non vmapped seg offsets
2.14.0
#2692
opened Feb 19, 2026 by
KshitijLakhani
Loading…
7 of 13 tasks
[PyTorch] Error out if constructing Something isn't working
LayerNormLinear with row tensor parallelism
bug
#2688
opened Feb 17, 2026 by
timmoon10
Loading…
6 of 13 tasks
[PyTorch] torch.compile support for permutation functions
#2686
opened Feb 17, 2026 by
pggPL
Loading…
9 of 13 tasks
Previous Next
ProTip!
Adding no:label will show everything without a label.