
[plugin][OOT Benchmark] Refine OOT benchmark (manual trigger) to cover key models #409

Open
zejunchen-zejun wants to merge 10 commits into main from zejun/add_benchmark_3.24
Conversation

@zejunchen-zejun (Contributor) commented Mar 25, 2026

Refine the OOT benchmark with the following changes:

  • Manual trigger only for the OOT benchmark: no schedule, no nightly run.
  • Benchmark kimi (TP8, TP4), gptoss (TP1), qwen3.5 (TP8), ds-fp8 (TP8), and ds-mxfp4 (TP8).
  • Benchmark concurrency levels: 4, 8, 16, 32, 64.
  • Benchmark input/output lengths: 1k/1k, 8k/1k, 1k/8k.
  • All model toggles default to false.
  • When the OOT benchmark is launched from the main branch, pull the latest OOT Docker image by default.
  • When launched from the main branch with a specific OOT Docker image given, pull that image directly.
  • When launched from a non-main branch, automatically build the OOT Docker image from that branch and benchmark with it; results are not uploaded to the dashboard.
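The dispatch-only trigger and the all-false model defaults described above could look roughly like the workflow fragment below. This is a hypothetical sketch: except for oot_image (named in the review), the input names are illustrative guesses, not the workflow's actual ones.

```yaml
# Hypothetical sketch; input names other than `oot_image` are illustrative.
on:
  workflow_dispatch:        # manual trigger only: no schedule, no nightly
    inputs:
      oot_image:
        description: "Prebuilt OOT Docker image to benchmark (empty = latest)"
        required: false
        type: string
      run_kimi:
        description: "Benchmark kimi (TP8/TP4)"
        type: boolean
        default: false      # all model toggles default to false
      run_gptoss:
        description: "Benchmark gptoss (TP1)"
        type: boolean
        default: false
      # ...and similar boolean toggles for qwen3.5, ds-fp8, ds-mxfp4
```

Per-model boolean inputs like these are what allow a single model to be refreshed without burning hardware on the others.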

Signed-off-by: zejunchen-zejun <zejun.chen@amd.com>
change to manual trigger
align env and arguments
choice box default false

Copilot AI review requested due to automatic review settings March 25, 2026 09:24
Copilot AI left a comment
Pull request overview

This PR refines the manual OOT vLLM benchmark workflow to target a curated set of key models and benchmark parameter combinations, while switching from building a custom OOT image in-workflow to pulling a prebuilt “latest” image.

Changes:

  • Make the OOT benchmark workflow manual-only with model toggles defaulting to false, and add an oot_image input to pull a prebuilt benchmark image.
  • Change the benchmark execution from an in-job loop over param_lists to a full job matrix over (model × params) and generate per-config artifacts.
  • Update the OOT model config list to adjust env vars and add Qwen3.5-397B-A17B-FP8.
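The loop-to-matrix change the review describes can be sketched as follows. The model names and parameter values follow the PR description, but the matrix keys, labels, and script name are illustrative assumptions, not the workflow's real ones.

```yaml
# Illustrative sketch of a full (model x params) job matrix replacing an
# in-job loop over param_lists; keys and script names may differ from the PR.
jobs:
  benchmark:
    strategy:
      fail-fast: false
      matrix:
        model: [kimi, gptoss, qwen3.5, ds-fp8, ds-mxfp4]
        params:
          - { concurrency: 4, isl: 1k, osl: 1k }
          - { concurrency: 8, isl: 8k, osl: 1k }
          - { concurrency: 16, isl: 1k, osl: 8k }
    steps:
      - name: Run one benchmark configuration
        run: ./benchmark.sh "${{ matrix.model }}" "${{ matrix.params.concurrency }}"
      - name: Upload per-config artifact
        uses: actions/upload-artifact@v4
        with:
          name: ${{ matrix.model }}-c${{ matrix.params.concurrency }}
          path: results/
```

A full matrix gives one job (and one artifact) per configuration, so a single failing combination can be retried without re-running the rest.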

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File | Description
---- | -----------
.github/workflows/atom-vllm-oot-benchmark.yaml | Switch to pulling a prebuilt OOT image; add param matrix expansion; default-disable models for manual selection.
.github/benchmark/oot_benchmark_models.json | Update env vars for existing models and add the Qwen3.5 FP8 model entry.


will not be dispatched

@wuhuikx (Contributor) commented Mar 25, 2026

  1. For each concurrency level, we need to re-launch the container. Do you follow this instruction?
  2. warmup num = 2x concurrency
  3. prompt num = 10x concurrency

Ensure each model can be triggered separately. For example, when we have an optimization on DS, we only need to refresh the data for that model while keeping the others idle, to save hardware resources.

Copilot AI left a comment

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.



@zejunchen-zejun (Author) commented

  1. For each concurrency level, we need to re-launch the container. Do you follow this instruction?
  2. warmup num = 2x concurrency
  3. prompt num = 10x concurrency

Ensure each model can be triggered separately. For example, when we have an optimization on DS, we only need to refresh the data for that model while keeping the others idle, to save hardware resources.

  1. Followed in this PR.
  2. Followed; the warmup count is set via --num-warmups="$(( 2 * CONC ))"
  3. Followed; the prompt count is set via --num-prompts="$(( CONC * 10 ))"
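Taken together, the per-concurrency loop might look like the sketch below. Only the concurrency list and the 2x/10x arithmetic come from this thread; the container relaunch and the benchmark invocation are placeholders, not the workflow's actual commands.

```shell
#!/usr/bin/env bash
# Sketch of the per-concurrency benchmark loop. Only the concurrency list and
# the warmup/prompt arithmetic come from the discussion above; the container
# and benchmark commands are placeholders, not the workflow's real commands.
set -euo pipefail

for CONC in 4 8 16 32 64; do
  NUM_WARMUPS=$(( 2 * CONC ))    # warmup num = 2x concurrency
  NUM_PROMPTS=$(( CONC * 10 ))   # prompt num = 10x concurrency

  # Re-launch the serving container for each concurrency level, e.g.:
  # docker rm -f oot-bench 2>/dev/null || true
  # docker run -d --name oot-bench "${OOT_IMAGE}" ...

  echo "concurrency=${CONC} num_warmups=${NUM_WARMUPS} num_prompts=${NUM_PROMPTS}"
  # benchmark_client --num-warmups="${NUM_WARMUPS}" --num-prompts="${NUM_PROMPTS}"
done
```

Re-launching the container per concurrency level keeps each measurement free of KV-cache and allocator state left over from the previous run.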

@wuhuikx (Contributor) commented Mar 26, 2026

One more thing: we need the benchmark to run on different atom branches for vLLM upgrades. Do we have the functionality for manually selecting the branch now?

@zejunchen-zejun (Author) commented

One more thing: we need the benchmark to run on different atom branches for vLLM upgrades. Do we have the functionality for manually selecting the branch now?

Makes sense; we also need it to run acceptance tests. Let me add it.

for acceptance test when upgrading vLLM

Copilot AI left a comment

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.



Copilot AI left a comment

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.



3 participants