diffusiongemma
Here are 5 public repositories matching this topic...
Native WinUI 3 control panel for running local llama.cpp and DiffusionGemma backends with model management, Hugging Face downloads, runtime tuning, logs, and resource monitoring.
-
Updated
Jun 11, 2026 - C#
FastMCP fleet MCP server for diffusion LMs (dLLM). DiffusionGemma on Goliath RTX 4090 — batch inference, HLE-shaped reasoning, ~200–400 tok/s. Doc phase; llama-diffusion-cli sidecar next. Complements local-llm-mcp.
-
Updated
Jun 17, 2026
Docker-Compose template to self-host Google DiffusionGemma 26B on an NVIDIA GPU host via llama.cpp
-
Updated
Jun 12, 2026 - Dockerfile
Matrix-style logit conditioning for DiffusionGemma's llama.cpp denoiser
-
Updated
Jun 15, 2026 - Python
Improve this page
Add a description, image, and links to the diffusiongemma topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the diffusiongemma topic, visit your repo's landing page and select "manage topics."