
Documentation for linalg.softmax lowering in lighthouse + IREE attention lowering walkthrough #2

Open
charithaintc wants to merge 42 commits into main from softmax_doc

Conversation

@charithaintc
Owner

No description provided.

@charithaintc charithaintc changed the title Documentation for linalg.softmax lowering in lighthouse. Documentation for linalg.softmax lowering in lighthouse. Apr 3, 2026
@charithaintc charithaintc changed the title Documentation for linalg.softmax lowering in lighthouse. Documentation for linalg.softmax lowering in lighthouse + IREE attention lowering walkthrough Apr 8, 2026
- **Vectorization** via IREE's vector distribution pipeline
- **Mapping to MMA intrinsics** (e.g., MFMA on MI300X) for the two matmuls (Steps 1 and 5)
- **Register-level tiling** and shared memory promotion for GPU targets
- The `scf.for` loop around these ops implements the streaming/online iteration over K/V chunks

How does IREE fuse these scf.for loops?

| 3a | `P = exp(S - new_max)` | Elementwise | `[16, 16]` |
| 3b | `alpha = exp(old_max - new_max)` | Elementwise | `[16]` |
| 4 | `new_sum = alpha * old_sum + Σ P` | Scale + row reduction | `[16]` |
| 5 | `new_acc = alpha * old_acc + P @ V` | Scale + matmul | `[16, 64]` ← `[16, 16] × [16, 64]` |
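The per-chunk update steps in the table can be sketched in NumPy. This is an illustrative reimplementation of the online-softmax recurrence, not IREE's generated code; the chunk size and shapes follow the `[16, 16]` / `[16, 64]` example, and the final normalization by `row_sum` is assumed to happen after the loop:

```python
import numpy as np

def online_attention(Q, K, V, chunk=16):
    """Streaming (online-softmax) attention over K/V chunks."""
    n = Q.shape[0]
    acc = np.zeros((n, V.shape[1]))   # old_acc, [16, 64]
    row_max = np.full(n, -np.inf)     # old_max, [16]
    row_sum = np.zeros(n)             # old_sum, [16]
    for j in range(0, K.shape[0], chunk):
        Kj, Vj = K[j:j + chunk], V[j:j + chunk]
        S = Q @ Kj.T                                   # step 1:  [16, 16]
        new_max = np.maximum(row_max, S.max(axis=1))   # step 2:  running row max
        P = np.exp(S - new_max[:, None])               # step 3a: [16, 16]
        alpha = np.exp(row_max - new_max)              # step 3b: correction factor
        row_sum = alpha * row_sum + P.sum(axis=1)      # step 4:  [16]
        acc = alpha[:, None] * acc + P @ Vj            # step 5:  [16, 64]
        row_max = new_max
    return acc / row_sum[:, None]                      # final normalization

# Check against a plain (non-streaming) softmax(Q K^T) V reference.
rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(16, 64)), rng.normal(size=(64, 64)), rng.normal(size=(64, 64))
S = Q @ K.T
E = np.exp(S - S.max(axis=1, keepdims=True))
ref = (E / E.sum(axis=1, keepdims=True)) @ V
assert np.allclose(online_attention(Q, K, V), ref)
```

The `for` loop here plays the role of the `scf.for` over K/V chunks; each iteration touches only one `[16, 64]` slice of K and V, which is what makes the two matmuls in steps 1 and 5 amenable to per-chunk MMA mapping.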

[MLIR] Fusible Softmax with Following Matrix Multiplication · Issue #1617 · intel-innersource/frame…
describes a high-level idea that decomposes softmax into steps 2/3a/3b/4/5' (with V replaced by the identity I, so using P@I instead of P@V), which allows P@V to be fused. Since the last step has the same loop structure, the second GEMM loop could then be fused into the softmax. But it is not clear how linalg tile/fusion can be enhanced to support this fusion.
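The decomposition suggested in the comment, with V replaced by the identity so that step 5 degenerates to P @ I = P, can be written out in NumPy (illustrative only; function name is an assumption, not code from the issue):

```python
import numpy as np

def softmax_via_attention_steps(X):
    """Softmax expressed with the attention step structure, taking V = I."""
    m = X.max(axis=1, keepdims=True)   # step 2:  row max
    P = np.exp(X - m)                  # step 3a: exponentials
    s = P.sum(axis=1, keepdims=True)   # step 4:  row sum
    acc = P @ np.eye(X.shape[1])       # step 5 with V = I: P @ I == P
    return acc / s

X = np.arange(6.0).reshape(2, 3)
ref = np.exp(X - X.max(axis=1, keepdims=True))
ref /= ref.sum(axis=1, keepdims=True)
assert np.allclose(softmax_via_attention_steps(X), ref)
```

Because step 5 keeps the same loop structure whether V is the identity or a real value matrix, swapping I back for V is what would let a following GEMM fuse into the softmax loop nest.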

**Notes**
- Sets the layout for anchor xegpu ops. Each Wg consists of [8, 1] subgroups
doing an 8x64 softmax slice.
- Only sets the layotu for `store_nd`. Layout propagation does the rest.

typo: layotu
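The layout note above implies a per-subgroup tile; a quick sketch of the arithmetic (the variable names are illustrative, not xegpu attributes):

```python
# Deriving the per-subgroup tile from the workgroup-level layout in the notes.
wg_tile = (8, 64)    # workgroup-level softmax slice
sg_layout = (8, 1)   # [8, 1] subgroups per workgroup
sg_tile = tuple(t // n for t, n in zip(wg_tile, sg_layout))
assert sg_tile == (1, 64)  # each subgroup handles one 64-wide row
```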
