Implementation of tiled attention with bf16 and circular buffers, which reduces memory requirements by 4x for longer contexts on Gemma models. #839

Merged
copybara-service[bot] merged 1 commit into dev from test_864904207 on Feb 24, 2026

Commits

Commits on Feb 24, 2026