Skip to content

Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels#1858

Open
TimDettmers wants to merge 11 commits intomainfrom
feature/kbit-quantization
Open

Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels#1858
TimDettmers wants to merge 11 commits intomainfrom
feature/kbit-quantization

Commits

Commits on Feb 14, 2026

Commits on Feb 22, 2026