Skip to content

[WIP] Qwen2.5 7B on-device code docstrings generation for macOS Silicon#8

Merged
martysai merged 8 commits intomasterfrom
qwen2.5-coder
Feb 16, 2026
Merged

[WIP] Qwen2.5 7B on-device code docstrings generation for macOS Silicon#8
martysai merged 8 commits intomasterfrom
qwen2.5-coder

Conversation

@martysai
Copy link
Owner

@martysai martysai commented Jan 25, 2026

Summary

Enabling multiple Qwen Coder models.

Changes

  • Model Configuration Registry (src/training/models.py)

    • SamplingConfig and ModelConfig dataclasses
    • Pre-configured support for Qwen 2.5 Coder (7B, 14B, 32B) and Qwen3 Coder 30B-A3B (MoE)
    • Model-specific sampling parameters (Qwen3 uses temp=1.0, top_p=0.95)
  • API Enhancements (src/training/serve.py)

    • Per-request model selection via model parameter
    • New /models endpoint to list available configurations
    • Health endpoint now reports active model info
    • Model-specific keep_alive settings (300s for MoE models)
  • CLI Improvements (scripts/run_ollama.py)

    • --model-key for registry-based model selection
    • --list-models to display available models
    • Model-specific sampling parameters in requests
  • Cross-Platform Support

    • .gitattributes for consistent line endings
    • Windows PowerShell/CMD examples in documentation

Available Models

Model Key Architecture Memory (Q4) Context
qwen2.5-coder-32b Dense ~18GB 32K
qwen2.5-coder-14b Dense ~8GB 32K
qwen2.5-coder-7b Dense ~4GB 32K
qwen3-coder-30b MoE (3.3B active) ~18GB 256K

Test plan

  • 20 new tests for models module
  • 10 new tests for model selection in serve
  • All 48 tests passing
  • Manual testing with Ollama

🤖 Generated with Claude Code

martysai and others added 3 commits January 25, 2026 14:07
- Add model configuration registry (src/training/models.py) with
  SamplingConfig and ModelConfig dataclasses
- Support Qwen 2.5 Coder (7B, 14B, 32B) and Qwen3 Coder 30B-A3B (MoE)
- Add per-request model selection via API and environment variable
- Apply model-specific sampling parameters (Qwen3 uses temp=1.0, top_p=0.95)
- Add /models endpoint to list available configurations
- Update health endpoint to report active model info
- Add --model-key and --list-models CLI options
- Add .gitattributes for cross-platform line endings
- Add Windows PowerShell/CMD examples in documentation
- Add 48 new tests (20 for models, 10 for serve model selection)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
martysai and others added 5 commits February 3, 2026 18:26
Add multi-model support for Qwen 2.5 and Qwen3 Coder
- Add ci.yml workflow that runs on push/PR to master and qwen2.5-coder
- Test on Python 3.12 with pytest and coverage reporting
- Run ruff linter (warnings only, doesn't fail build)
- Add pytest-cov to dev dependencies

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Generate XML coverage report for Codecov
- Upload coverage using codecov-action@v4
- Requires CODECOV_TOKEN secret in repository settings

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
@codecov-commenter
Copy link

codecov-commenter commented Feb 16, 2026

Codecov Report

❌ Patch coverage is 97.47899% with 3 lines in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (master@9302246). Learn more about missing BASE report.

Files with missing lines Patch % Lines
src/training/serve.py 96.55% 3 Missing ⚠️
Additional details and impacted files

Impacted file tree graph

@@            Coverage Diff            @@
##             master       #8   +/-   ##
=========================================
  Coverage          ?   27.44%           
=========================================
  Files             ?       12           
  Lines             ?      430           
  Branches          ?        0           
=========================================
  Hits              ?      118           
  Misses            ?      312           
  Partials          ?        0           
Flag Coverage Δ
unittests 27.44% <97.47%> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines Coverage Δ
src/training/models.py 100.00% <100.00%> (ø)
src/training/serve.py 95.55% <96.55%> (ø)
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@martysai martysai merged commit e0f1828 into master Feb 16, 2026
2 checks passed
@martysai martysai deleted the qwen2.5-coder branch February 16, 2026 18:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants