Skip to content

[CUDA] New 4bit GEMM kernels for inference#1949

Merged
matthewdouglas merged 8 commits into
mainfrom
gemm4bit
May 21, 2026
Merged

[CUDA] New 4bit GEMM kernels for inference#1949
matthewdouglas merged 8 commits into
mainfrom
gemm4bit

Commits

Commits on May 15, 2026

Commits on May 20, 2026