Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels#1858
Open
TimDettmers wants to merge 11 commits intomainfrom
Open
Add k-bit blockwise quantization (K=2-5) with warp-level CUDA kernels#1858TimDettmers wants to merge 11 commits intomainfrom
TimDettmers wants to merge 11 commits intomainfrom
Commits
Commits on Feb 14, 2026
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted
- andcommitted