Skip to content

Add mlx-serve to Large Language Model#233

Open
ddalcu wants to merge 1 commit into
zigcc:mainfrom
ddalcu:add-mlx-serve
Open

Add mlx-serve to Large Language Model#233
ddalcu wants to merge 1 commit into
zigcc:mainfrom
ddalcu:add-mlx-serve

Conversation

@ddalcu

@ddalcu ddalcu commented Jun 22, 2026

Copy link
Copy Markdown

Adds ddalcu/mlx-serve to Data & Science → Large Language Model.

A native Zig LLM inference server for Apple Silicon: runs MLX-format models and GGUF (embedded llama.cpp), exposes OpenAI- and Anthropic-compatible HTTP APIs (works with Claude Code), with speculative decoding and KV-cache quantization. Ships MLX Core, a macOS menu-bar app. MIT-licensed.

Entry placed in alphabetical order.

Copilot AI review requested due to automatic review settings June 22, 2026 03:35
@ddalcu ddalcu requested review from jiacai2050 and xihale as code owners June 22, 2026 03:35

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request adds the ddalcu/mlx-serve project, a native LLM inference server for Apple Silicon, to the README.md file. There are no review comments, and I have no feedback to provide.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds ddalcu/mlx-serve to the repository’s curated list under Data & Science → Large Language Model, expanding the set of Zig-based LLM tooling with an Apple Silicon-focused inference server.

Changes:

  • Added a new README entry for ddalcu/mlx-serve in the Large Language Model section.
  • Positioned the entry in alphabetical order within that section.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants