-
Notifications
You must be signed in to change notification settings - Fork 206
[AMD] Add DSv4-FP4-MI355X ATOMMESH MTP #1855
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
ea33910
027f3f1
2216d11
cd745fa
baf0e06
1485744
4e039bc
0868467
c7d48b0
39e62eb
ba37d04
eb7179f
6893a06
5106002
55c810d
6386657
92746e9
97f0cab
f9a93c4
4d4fe2b
8b4a94c
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -2521,6 +2521,114 @@ dsv4-fp4-mi355x-atom-disagg: | |
| additional-settings: | ||
| - "DECODE_NODES=1" | ||
|
|
||
| dsv4-fp4-mi355x-atom-disagg-mtp: | ||
| image: rocm/atom-dev:nightly_202606181332 | ||
| model: deepseek-ai/DeepSeek-V4-Pro | ||
| model-prefix: dsv4 | ||
| runner: mi355x | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Wrong CI runner for disaggMedium Severity The new multinode disaggregated recipe sets Reviewed by Cursor Bugbot for commit 8b4a94c. Configure here. |
||
| precision: fp4 | ||
| framework: atom-disagg | ||
| multinode: true | ||
| disagg: true | ||
| scenarios: | ||
| fixed-seq-len: | ||
| - isl: 8192 | ||
| osl: 1024 | ||
| search-space: | ||
| # 2P1D TP8+DPA+TBO+MTP1 | ||
| - spec-decoding: "mtp" | ||
| conc-list: [ 256, 512, 768, 1024, 2048 ] | ||
| prefill: | ||
| num-worker: 2 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: true | ||
| additional-settings: | ||
| - "PREFILL_NODES=2" | ||
| decode: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: true | ||
| additional-settings: | ||
| - "DECODE_NODES=1" | ||
| - "DECODE_MTP_SIZE=1" | ||
| # 2P1D TP8+DPA+TBO+MTP3 | ||
| - spec-decoding: "mtp" | ||
| conc-list: [ 256, 512, 768, 1024, 2048 ] | ||
| prefill: | ||
| num-worker: 2 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: true | ||
| additional-settings: | ||
| - "PREFILL_NODES=2" | ||
| decode: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: true | ||
| additional-settings: | ||
| - "DECODE_NODES=1" | ||
| - "DECODE_MTP_SIZE=3" | ||
| # 1P1D TP8+MTP3 | ||
| - spec-decoding: "mtp" | ||
| conc-list: [ 1, 2, 4, 8, 16, 32, 64, 128, 256 ] | ||
| prefill: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: false | ||
| additional-settings: | ||
| - "PREFILL_NODES=1" | ||
| decode: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: false | ||
| additional-settings: | ||
| - "DECODE_NODES=1" | ||
| - "DECODE_MTP_SIZE=3" | ||
| # 1P1D TP8+DPA+TBO+MTP1 | ||
| - isl: 1024 | ||
|
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Missing ISL8192 DPA MTP sweepMedium Severity The Reviewed by Cursor Bugbot for commit 8b4a94c. Configure here. |
||
| osl: 1024 | ||
| search-space: | ||
| - spec-decoding: "mtp" | ||
| conc-list: [ 64, 128, 256, 512, 1024 ] | ||
| prefill: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: true | ||
| additional-settings: | ||
| - "PREFILL_NODES=1" | ||
| decode: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: true | ||
| additional-settings: | ||
| - "DECODE_NODES=1" | ||
| - "DECODE_MTP_SIZE=1" | ||
| # 1P1D TP8+MTP3 | ||
| - spec-decoding: "mtp" | ||
| conc-list: [ 1, 2, 4, 8, 16, 32, 64, 128, 256 ] | ||
| prefill: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: false | ||
| additional-settings: | ||
| - "PREFILL_NODES=1" | ||
| decode: | ||
| num-worker: 1 | ||
| tp: 8 | ||
| ep: 1 | ||
| dp-attn: false | ||
| additional-settings: | ||
| - "DECODE_NODES=1" | ||
| - "DECODE_MTP_SIZE=3" | ||
|
|
||
| # MiniMax-M3 MXFP8 MI355X recipe: | ||
| # https://github.com/vllm-project/recipes/commit/2a3728ed9892debfd767a72a58ebc90b33f186e5 | ||
| # MXFP8 runs from TP=4 on gfx950; block size 128 is mandatory for MSA. | ||
|
|
||


There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
plz use runner
mi355x-disagg