-
Notifications
You must be signed in to change notification settings - Fork 32
Pull requests: ROCm/TransformerEngine
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
add dsv4 production mxfp8 gemm shapes
ci-level 1
CI test level 1
#636
opened Jun 18, 2026 by
matthiasdiener
Contributor
Loading…
13 tasks
Butterfly all reduce for warp level reductions
#635
opened Jun 18, 2026 by
alextmagro
Contributor
Loading…
[ROCm] Fix biased wgrad with fp32 gradient accumulation
ci-level 3
CI test level 3
#634
opened Jun 18, 2026 by
XinyuJiangCMU
Loading…
Ipanfilo/port fixes to 212
ci-level 2
CI test level 2
#633
opened Jun 17, 2026 by
ipanfilo
Collaborator
Loading…
1 of 13 tasks
Introduce a fused padding + cast transpose kernel grouped linear
ci-level 3
CI test level 3
#632
opened Jun 17, 2026 by
alextmagro
Contributor
Loading…
Added support for AITER JIT native splitkv kernel
ci-level 3
CI test level 3
#631
opened Jun 16, 2026 by
Micky774
Contributor
Loading…
13 tasks
gfx1250 mxfp8 gemm: add NN/NT transpose workaround
ci-level 1
CI test level 1
#630
opened Jun 16, 2026 by
matthiasdiener
Contributor
•
Draft
1 of 13 tasks
Hotfix for Maxtext regression with JAX 0.9 changes
ci-level 2
CI test level 2
#629
opened Jun 16, 2026 by
ipanfilo
Collaborator
Loading…
1 of 13 tasks
gfx1250 mxfp8 gemm: loosen restrictions on K
ci-level 1
CI test level 1
#627
opened Jun 16, 2026 by
matthiasdiener
Contributor
Loading…
1 of 13 tasks
Add gfx1250 support to CK tile group GEMM
ci-level 1
CI test level 1
#626
opened Jun 16, 2026 by
aris134
Contributor
Loading…
1 of 13 tasks
Add ROCm HIP small-seq fused attention via crossattn_hip_kernel
#625
opened Jun 15, 2026 by
VeeraRajasekhar
Contributor
Loading…
13 tasks
[DRAFT] Optimize on lightnling Indexer Triton kernels
#624
opened Jun 12, 2026 by
leonling-ll
•
Draft
[CI] Add resilience to artifacts fetch
#622
opened Jun 9, 2026 by
leo-automation
Collaborator
Loading…
[FEAT] Microbenchmark add visualization
#620
opened Jun 8, 2026 by
Micky774
Contributor
Loading…
13 tasks
Refactored reduction kernels
ci-level 3
CI test level 3
#618
opened Jun 8, 2026 by
Micky774
Contributor
Loading…
13 tasks
Incorporate statistical significance testing to benchmarks
#614
opened Jun 8, 2026 by
Micky774
Contributor
Loading…
13 tasks
enable blockwise FP8 quantization on rocm
ci-level 1
CI test level 1
#609
opened Jun 3, 2026 by
asdfvg123
Loading…
1 of 13 tasks
Fix CI artifact download failures in sGPU/mGPU test jobs
#602
opened May 29, 2026 by
VeeraRajasekhar
Contributor
•
Draft
Update QoLA/AITER
ci-level 3
CI test level 3
#599
opened May 28, 2026 by
Micky774
Contributor
Loading…
13 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-15.