[AMD] add dsv4 sglang disagg by billishyahao · Pull Request #1818 · SemiAnalysisAI/InferenceX

billishyahao · 2026-06-18T01:51:32Z

Note

Medium Risk
Touches shared disagg launch paths (server_sglang.sh, models.yaml) for all models, not only DSv4; behavior changes when EP is disabled and MoE auto-sizing is partially commented out.

Overview
Adds dsv4-fp4-mi355x-sglang-disagg to the AMD master benchmark matrix (8k/1k, non-MTP) with sweeps over pure TP8, DEP8 (MoRI KV + MoE a2a), and dp-attention + TP-MoE, plus a new workflow runner dsv4_fp4_mi355x_sglang-disagg.sh and a perf-changelog entry.

The multi-node harness is extended for DSv4 PD: a DeepSeek-V4-Pro block in models.yaml (dsv4 attention backend, mori disagg, prefill disable_cuda_graph) and matching MoRI/kernel env overrides in env.sh; the bench client uses --dsv4 framing instead of chat templates.

server_sglang.sh / models.yaml refactor MoE CLI so ep_flags (mori a2a, deepep, fake dispatch) apply only when EP is on—ep=1 stays TP-MoE even with dp-attention—and prefill can honor per-model disable_cuda_graph, context_length, and optional MORI_NUM_MAX_DISPATCH_TOKENS_PER_RANK_* overrides. submit.sh threads DRY_RUN for previewing composed launch commands on a real allocation.

^{Reviewed by Cursor Bugbot for commit f56f8de. Bugbot is set up for automated code reviews on this repo. Configure here.}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit c22652b. Configure here.}

github-actions · 2026-06-18T02:41:28Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=27731746053
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=27731746053

github-actions · 2026-06-18T12:35:30Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=27731746053
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=27731746053

# Conflicts: # perf-changelog.yaml

github-actions · 2026-06-21T04:34:05Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=27731746053
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=27731746053

functionstackx · 2026-06-21T06:24:17Z

+  description:
+    - "init submission of dsv4 sglang disagg "


can u also ur ai agent include descriptions + links of some of the bug fix PRs in here like

[AMD] Support unified_kv_triton for disaggregation sgl-project/sglang#27935

[AMD] fix moriep quant kernel not implemented issue sgl-project/sglang#27855

[PD][MoRI] Align hybrid state transfer with per-component schema sgl-project/sglang#26539

github-actions · 2026-06-21T07:14:57Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=27893589025
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=27893589025

github-actions · 2026-06-21T15:30:57Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=27896968169
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=27896968169

functionstackx · 2026-06-21T20:31:26Z

hi @billishyahao there seems to be an accuracy issues with TP8+TP8. codex has narrowed it down to conc=4, here is the bug report for when u wake up, please take a look

sgl-project/sglang#28851

https://github.com/SemiAnalysisAI/InferenceX/actions/runs/27896968169/job/82550287079?pr=1818

functionstackx

can u remove https://github.com/SemiAnalysisAI/InferenceX/blob/main/benchmarks/multi_node/amd_utils/patches/mori_conn.py too

this is no longer needed now that sgl-project/sglang#26525 is fixed in sgl-project/sglang#26539

InferenceX/benchmarks/multi_node/amd_utils/job.slurm

Lines 73 to 80 in 6a07901

    
           _MORI_PATCH_FILE="$DI_REPO_DIR/benchmarks/multi_node/amd_utils/patches/mori_conn.py" 
        
           _MORI_PATCH_TARGET="/sgl-workspace/sglang/python/sglang/srt/disaggregation/mori/conn.py" 
        
           if [[ "${MORI_CONN_PATCH:-auto}" != "skip" ]] \ 
        
              && [[ -f "$_MORI_PATCH_FILE" ]] \ 
        
              && [[ "${DOCKER_IMAGE_NAME:-}" == *"v0.5.12.post1"* ]] \ 
        
              && [[ "${EXTRA_DOCKER_MOUNTS:-}" != *"$_MORI_PATCH_TARGET"* ]]; then 
        
               EXTRA_DOCKER_MOUNTS="${EXTRA_DOCKER_MOUNTS:-} -v ${_MORI_PATCH_FILE}:${_MORI_PATCH_TARGET}:ro" 
        
               export EXTRA_DOCKER_MOUNTS

billishyahao added 8 commits June 12, 2026 03:56

[AMD] add dsv4 sglang disagg

1e5f3a1

fix image

89961e8

fix the image

03ffadf

fix

e875933

fix

dc23512

bump image

b687779

add more sweeps

4e01d43

fix

c22652b

billishyahao requested a review from a team June 18, 2026 01:51

billishyahao requested a review from chunfangamd as a code owner June 18, 2026 01:51

github-project-automation Bot added this to InferenceMAX Board Jun 18, 2026

billishyahao requested review from 1am9trash, seungrokj and yctseng0211 as code owners June 18, 2026 01:51

cursor Bot reviewed Jun 18, 2026

View reviewed changes

Comment thread benchmarks/multi_node/amd_utils/server_sglang.sh

billishyahao added 2 commits June 18, 2026 01:56

Merge remote-tracking branch 'inf/main' into amd/dsv4_sgl_di

2897c81

add perf log

fb605ef

billishyahao added AMD full-sweep-enabled labels Jun 18, 2026

functionstackx added the all-evals Expand eval selection to every fixed-sequence config label Jun 21, 2026

Merge remote-tracking branch 'origin/main' into amd/dsv4_sgl_di

9b5299a

# Conflicts: # perf-changelog.yaml

functionstackx reviewed Jun 21, 2026

View reviewed changes

Merge branch 'main' into amd/dsv4_sgl_di

f56f8de

functionstackx mentioned this pull request Jun 21, 2026

[Bug] MultiNode Disagg MI355X AMD DeepSeekv4 Pro Accuracy Issues GSM8k=0.0356 sgl-project/sglang#28851

Open

5 tasks

functionstackx mentioned this pull request Jun 21, 2026

[Klaud Cold] dsv4-fp4-mi355x-sglang-disagg: DeepSeek-V4-Pro SGLang disagg (8k1k conc=1 smoke test) #1708

Closed

5 tasks

functionstackx requested changes Jun 21, 2026

View reviewed changes

Oseltamivir mentioned this pull request Jun 22, 2026

[AMD] dsv4 atom-disagg eval sweep — validate reduced ATOM logging #1882

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMD] add dsv4 sglang disagg#1818

[AMD] add dsv4 sglang disagg#1818
billishyahao wants to merge 12 commits into
mainfrom
amd/dsv4_sgl_di

billishyahao commented Jun 18, 2026 •

edited by cursor Bot

Loading

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

github-actions Bot commented Jun 18, 2026

Uh oh!

github-actions Bot commented Jun 18, 2026

Uh oh!

github-actions Bot commented Jun 21, 2026

Uh oh!

functionstackx Jun 21, 2026

Uh oh!

github-actions Bot commented Jun 21, 2026

Uh oh!

github-actions Bot commented Jun 21, 2026

Uh oh!

functionstackx commented Jun 21, 2026

Uh oh!

functionstackx left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	_MORI_PATCH_FILE="$DI_REPO_DIR/benchmarks/multi_node/amd_utils/patches/mori_conn.py"
	_MORI_PATCH_TARGET="/sgl-workspace/sglang/python/sglang/srt/disaggregation/mori/conn.py"
	if [[ "${MORI_CONN_PATCH:-auto}" != "skip" ]] \
	&& [[ -f "$_MORI_PATCH_FILE" ]] \
	&& [[ "${DOCKER_IMAGE_NAME:-}" == "v0.5.12.post1" ]] \
	&& [[ "${EXTRA_DOCKER_MOUNTS:-}" != "$_MORI_PATCH_TARGET" ]]; then
	EXTRA_DOCKER_MOUNTS="${EXTRA_DOCKER_MOUNTS:-} -v ${_MORI_PATCH_FILE}:${_MORI_PATCH_TARGET}:ro"
	export EXTRA_DOCKER_MOUNTS

Conversation

billishyahao commented Jun 18, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

github-actions Bot commented Jun 18, 2026

Uh oh!

github-actions Bot commented Jun 18, 2026

Uh oh!

github-actions Bot commented Jun 21, 2026

Uh oh!

functionstackx Jun 21, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Jun 21, 2026

Uh oh!

github-actions Bot commented Jun 21, 2026

Uh oh!

functionstackx commented Jun 21, 2026

Uh oh!

functionstackx left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

billishyahao commented Jun 18, 2026 •

edited by cursor Bot

Loading