Generalize deploy runbook: remove Hephaestus-specific dual-daemon policy#2
Open
pdfinn wants to merge 2 commits into
Open
Generalize deploy runbook: remove Hephaestus-specific dual-daemon policy#2pdfinn wants to merge 2 commits into
pdfinn wants to merge 2 commits into
Conversation
The README named the on-hardware validation box, its `/mnt/orin-ssd` layout, and the confidential workloads colocated on it. This repo is public — none of that belongs here. Removes the "Hephaestus disk policy" section in full and genericizes the CI / working-notes mentions.
…ublic tree
This repo is public. Several files named an internal dev-host hostname,
its filesystem layout, and confidential workload names colocated on it.
None of that belongs in a public-facing serving repo intended for
external users.
Changes:
- Rename runbooks/hephaestus-deploy.md -> runbooks/deploy.md and rewrite
as a generic Jetson Orin AGX deployment guide. Drop the entire
dual-Docker-daemon section (that's per-environment internal policy).
- CLAUDE.md: drop the "dual-purpose model" section and table that named
confidential workloads; genericize remaining host references.
- docs/SGLANG-ADOPTION-NOTES.md: replace dev-box hostname and
/mnt/orin-ssd paths with generic placeholders; replace the
"honouring the disk policy" subsection that named confidential
workloads with a non-specific "daemonless extraction" rationale.
- runbooks/lucibridge-routing.md, sglang/orin/{README.md,Dockerfile,
bake-tokenizers.sh,config.py,validate-on-hardware.sh},
sglang/thor/Dockerfile: replace remaining hostname / SSD-path /
internal-runbook references with generic equivalents.
Verified: `git ls-files | xargs grep -iE 'NERVA|\bTAK\b|hephaestus|
orin-ssd'` returns no matches.
Note: git history still contains all of these references in earlier
commits. A history rewrite is a separate operation.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Refactor the SGLang deployment documentation to be hardware-agnostic and remove references to Hephaestus-specific operational constraints. The runbook now targets any Jetson Orin AGX with a single Docker daemon, making it accessible to new contributors and field deployments without requiring knowledge of internal dual-daemon infrastructure.
Key Changes
runbooks/hephaestus-deploy.md→runbooks/deploy.mdto reflect that it now documents the standard field-deployment shape, not a specific internal hostdocker.service+docker-dev.service) is deleted; users now follow a single-daemon path/mnt/orin-ssdas a load-bearing requirement; documentation now uses generic paths like${HF_CACHE:-/var/lib/huggingface}and${LOGS:-/var/log/sglang}/mnt/orin-ssd/huggingface,/mnt/orin-ssd/pdfinn/)docker --host unix:///run/docker-dev.sockinvocations to plaindockervalidate-on-hardware.sh) to use environment variables and generic socket guidanceNotable Details
This change aligns with the repo's public-facing nature (per CLAUDE.md) and removes internal hostname/workload references that shouldn't appear in public documentation.
https://claude.ai/code/session_01QTQBMe39u2tE33QriAMLgo