Skip to content

Koboldcpp-nocuda 1.116.1 crashing trying to run inference with gemma-4-31B-it-UD-IQ3_XXS.gguf #2298

Description

@8u6man

Kobold version: 1.116.1
RX7900GRE + Windows 11
model: gemma-4-31B-it-UD-IQ3_XXS.gguf

Model loads but fails during first batch of tokens being processed.

Image

Link to Unsloth repo where the model can be found: https://huggingface.co/unsloth/gemma-4-31B-it-GGUF

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions